UniProt accession
A0A088F834 [UniProt]
Protein name
Tailspike protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,96
TSP
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MSTGCSDVLTLNDLQIAKKHQIFEAEVITGKQGGVAGGADIGYATNQVTGQTQKTLPAVLRDAGFSPASFNFATGGTLGINDANKAVLWPIEDGGDGNYYAWRGSLPKVIPAASTPLTTGGISDSAWVAFGDITFRAEADKKFKYSVKLSDFTTLQQLADAAVDSVLIDRDYTFSNNETVNFGGKTLTIDCKAKFIGDGMLIWEQLGEGSVVNQPHMQTQTTPYTVYRFDDNGNWVTNPSTVLASVVQRMDKGYKPNINDLDIWGSLPDHIKNQTAGATLRVMSGSNITVNSPEATFGGYVFTLCNRILVKNPRNFIAWESGITFENHHTSAWGYGNWVVGGEIKYGSGCAVLFIRNDGGEDHDGGVRDLISYRVGESGVKTYQNEIGGRSARNYRLVFDNITTIQCYYDGIDINADTGPQVERVDDYPLSQYPWFQLPTEHIIRNIITRDCMGIGAWWDGQRNIIDNVVTYEAHKEGIFDRGTNNDITNVTVIGANKDVVNVNQLTCEGSSRLRGVMIHAYTTQGYAVYAPQSEISAVACAGSGTKKILCTYVSDVQGGNINVQHNENQMTLAMRPAMHGTINPSLLMTADCQVAAPGGEASIVKLSAIQEGVRVGEMQLNRLGFKHMSIPVAPSALPESALEHNSSIGFFFGDDGVLRILIKKPDGTYKTHDLS
Physico‐chemical
properties
protein length:676 AA
molecular weight: 73436,50580 Da
isoelectric point:5,31157
aromaticity:0,08876
hydropathy:-0,23314

Domains

Domains [InterPro]
G3DSA:2.10.10.80
ATT
65–136
IPR040775
RBD
71–132
IPR015331
RBD
138–670
A0A088F834
1 676
Architecture
ATT
STR
ATT 65-136 | STR 137-675 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A088F834
1 676
Domain Start End Length (AA) Confidence
N-terminal 1 155 155 0,9925
Central domain 156 558 404 0,9868
C-terminal 559 676 117 0,9255
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-155
Central
156-558
C-terminal
559-676

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage LSPA1
[NCBI]
1540823 Uroviricota > Caudoviricetes > Sarkviridae > Jerseyvirus > Jerseyvirus LSPA1
Host Salmonella paratyphi
[NCBI]
54388 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AIM41130.1 [NCBI]
Genbank nucleotide accession
KM272358 [NCBI]
CDS location
range 23643 -> 25673
strand +
CDS
ATGTCTACTGGTTGCAGCGATGTACTGACACTTAACGATTTACAAATAGCTAAAAAACACCAGATTTTCGAAGCCGAGGTGATCACCGGCAAACAGGGTGGTGTAGCCGGCGGTGCAGATATCGGCTACGCCACTAACCAGGTAACAGGGCAGACGCAGAAGACGCTGCCGGCGGTCTTACGTGACGCCGGTTTCTCCCCGGCGTCCTTTAACTTCGCAACCGGCGGAACCCTGGGAATTAACGATGCCAATAAAGCTGTTCTTTGGCCTATAGAGGATGGCGGGGATGGGAACTATTACGCATGGCGTGGCTCCCTGCCGAAAGTTATCCCCGCGGCGTCCACCCCTCTAACAACCGGCGGCATTTCTGATTCGGCTTGGGTAGCTTTTGGGGACATTACCTTTCGCGCGGAAGCGGATAAGAAATTTAAATACTCCGTTAAGCTGTCCGACTTTACTACGTTACAACAATTGGCGGATGCCGCTGTTGATAGTGTTCTTATCGACCGCGATTACACTTTCAGCAATAACGAGACCGTTAACTTCGGCGGGAAGACCCTGACCATCGACTGTAAAGCGAAGTTTATCGGCGACGGCATGCTAATATGGGAACAACTCGGCGAAGGGTCTGTTGTGAATCAACCACATATGCAGACACAAACCACACCGTACACGGTGTATAGATTCGACGACAACGGTAACTGGGTGACTAACCCATCAACGGTGCTGGCGTCGGTAGTCCAAAGGATGGATAAGGGGTATAAGCCCAATATTAACGATTTGGATATCTGGGGTAGCCTTCCTGATCACATAAAAAATCAAACAGCCGGTGCGACCCTCCGCGTTATGAGCGGATCAAACATAACCGTAAATTCACCGGAAGCGACTTTCGGCGGTTATGTATTCACTCTATGTAATCGTATATTGGTTAAAAACCCACGAAATTTTATCGCATGGGAGTCGGGTATTACTTTTGAAAACCACCATACATCCGCATGGGGCTATGGTAACTGGGTCGTCGGCGGAGAGATAAAGTACGGTTCAGGGTGCGCCGTTTTGTTTATCCGCAATGACGGCGGTGAAGACCATGATGGCGGGGTCAGGGATTTAATATCATATCGCGTTGGTGAATCTGGAGTTAAAACTTATCAAAACGAGATTGGTGGAAGGTCCGCCCGAAACTACCGTCTGGTGTTTGATAACATTACGACCATACAGTGCTATTACGACGGGATAGATATCAACGCGGATACAGGCCCCCAGGTTGAGCGCGTAGATGATTACCCGCTCTCCCAATACCCCTGGTTTCAGTTGCCGACTGAACACATCATCCGCAATATCATTACACGTGACTGCATGGGTATCGGCGCGTGGTGGGATGGGCAAAGAAATATCATTGATAATGTTGTAACCTACGAGGCCCATAAAGAGGGTATTTTTGATAGAGGTACTAACAACGACATCACTAACGTAACAGTTATCGGCGCAAACAAGGACGTAGTTAACGTTAACCAGCTTACTTGCGAGGGCAGTAGCAGATTGCGGGGCGTTATGATTCATGCCTACACCACGCAAGGGTATGCCGTATACGCACCACAATCGGAAATATCCGCTGTCGCTTGCGCAGGGAGCGGGACCAAGAAAATACTTTGTACCTATGTCAGTGATGTGCAAGGGGGTAACATCAATGTCCAACACAATGAAAACCAGATGACACTCGCTATGCGCCCTGCGATGCACGGTACCATAAATCCATCACTATTGATGACCGCAGATTGTCAAGTGGCGGCACCTGGTGGGGAGGCAAGCATTGTGAAGCTTTCCGCAATCCAGGAGGGGGTGCGCGTGGGCGAGATGCAGCTTAACCGCTTAGGCTTTAAGCATATGAGCATACCAGTAGCCCCATCAGCTCTACCGGAAAGCGCATTAGAGCATAATTCATCTATAGGCTTTTTCTTTGGAGATGACGGGGTGCTGCGAATCCTCATCAAGAAACCAGACGGGACTTACAAAACCCACGACTTATCCTAA

Genome Context

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0044423 virion component Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0051701 biological process involved in interaction with host Biological Process IEA:UniProtKB-ARBA (UniProt)
GO:0019058 viral life cycle Biological Process IEA:UniProtKB-ARBA (UniProt)

Tertiary structure

PDB ID
aabd133a1474d45d86cbee28b48779ace35c36a8db980a0318fe27ee17589755
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7302
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50