Genbank accession
XHB38983.1 [GenBank]
Protein name
tail protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MMADCDNIPTLQTVQDFNTDAKVLTEVVTSDSELTVLPDSAGNTHKTLFGLQSEYRAAIQAAGGVPLGVWTAGVTEFNAYNEYAVYNGIPYKPRASTTLPYTAQSSDPTIVPDSTFVQPYVDSLVSVARTNNAPIDSVVYLATNQVVGVNYFYDIVTQITYYSPVEISGSITDTGSDVDGFRNITVDSAQYVIFNSSFIALLQSSGVSLDYSPSEVSILKESISTSASVLYDESASVRLTAWDTLTTPYVITSWSDNIFGGVDVVTDQGTFEFVRPSVKAARDQGDAAGWGVVIDSVTDSTNQILKAISDLGEGDDLFIPHPCAFGQVTVNKDLNIKGYSGDKSNRLITLIGADAGFITSGRRIKVTYRNLYYLCDGDSVDGLTNRQIPQSVGSGADIDEIEYLDCYAKNCVIGFSMVFESGRTLRKKAIMEDCEVETTYGTSSGEGYGLHAANDKQTHGYDSKIYINRNLIDGATRHSIYASRSGGYVITNNYVKNHRALTFDGNIRPAILVGRASDTVGYGNTYEDCYGVCLAIVPETVESVDYDCANVHFWGETFINPKELGAVAFGLSARNGGSIIGASVKNSSFRMNDNAIPLLLSYYGFDLDFKNNSAVYRGTIPAFNVFVLVGVDDTGYSSLYSDRWNFTENTIDIYSNTGAVTEYIITRNIGAYINLDNEFKNTNTELVLSDGAMTKNRITINGTRYLSAGISTPVGNITPLKTGERIVLLAPDPDTVWLSTGRTNNDWLQLG
Physico‐chemical
properties
protein length:751 AA
molecular weight: 81565,65500 Da
isoelectric point:4,50605
aromaticity:0,10253
hydropathy:-0,11997

Domains

Domains [InterPro]
IPR011050
STR
285–651
XHB38983.1
1 751
Architecture
ATT
STR
RBD
ATT 1-165 | STR 280-651 | RBD 659-751
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
XHB38983.1
1 751
Domain Start End Length (AA) Confidence
N-terminal 1 301 301 0,9930
Central domain 302 682 382 0,9885
C-terminal 683 751 68 0,8972
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-301
Central
302-682
C-terminal
683-751

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage vB_VpS_LMAVpVPP
[NCBI]
3344730 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XHB38983.1 [NCBI]
Genbank nucleotide accession
PQ284950 [NCBI]
CDS location
range 11367 -> 13622
strand +
CDS
ATGATGGCTGATTGTGACAACATTCCTACCCTGCAGACAGTGCAGGACTTTAATACTGACGCTAAGGTTTTGACTGAGGTTGTCACCAGTGATTCCGAACTGACAGTATTACCTGATAGTGCTGGTAATACTCATAAAACCTTATTCGGTTTGCAGTCTGAATACCGAGCTGCAATTCAAGCGGCCGGTGGCGTTCCTCTCGGGGTGTGGACTGCAGGTGTGACAGAATTTAACGCGTACAACGAGTACGCTGTGTACAACGGCATTCCTTACAAGCCCCGCGCATCTACGACTCTCCCATACACCGCACAAAGTTCCGACCCGACCATTGTACCGGATTCGACATTCGTTCAGCCTTATGTAGACTCACTAGTTTCTGTAGCCCGCACCAATAACGCACCGATCGACTCAGTAGTTTACTTAGCAACAAATCAAGTTGTTGGAGTTAATTACTTTTACGACATCGTGACTCAGATTACGTATTACTCTCCTGTAGAAATATCAGGCTCTATTACAGATACGGGTTCTGACGTAGATGGATTCAGAAATATTACAGTAGATTCAGCTCAATACGTTATCTTTAACTCTTCCTTTATAGCTCTATTGCAGTCGTCAGGCGTGTCACTTGATTATTCACCTTCTGAGGTATCAATCCTTAAAGAATCAATATCGACTAGTGCTAGCGTTTTGTATGACGAGAGTGCGAGCGTGAGGCTAACAGCATGGGATACATTAACAACACCGTATGTCATCACATCATGGTCGGATAACATTTTCGGAGGTGTTGATGTTGTAACAGACCAAGGAACATTTGAATTCGTCAGACCTTCCGTTAAGGCCGCGCGTGACCAAGGGGATGCTGCTGGTTGGGGGGTTGTGATTGATTCAGTAACTGACTCAACAAACCAAATTCTTAAGGCAATATCTGACCTTGGTGAAGGGGATGATCTTTTCATTCCGCACCCATGTGCTTTCGGGCAAGTCACTGTAAATAAAGATTTAAACATTAAAGGGTATAGCGGAGACAAGTCAAATAGATTAATAACTTTGATCGGGGCTGATGCCGGGTTCATTACGTCAGGAAGAAGGATTAAAGTCACATACCGGAATCTATATTACCTTTGTGATGGTGATTCGGTTGACGGCCTCACAAACCGACAAATTCCGCAATCCGTTGGTTCTGGAGCTGATATAGATGAAATTGAATATTTAGATTGTTACGCAAAGAACTGTGTGATCGGGTTTTCAATGGTTTTTGAGTCTGGGAGAACTTTGCGCAAAAAAGCAATCATGGAAGATTGCGAAGTGGAAACAACTTATGGAACTTCCAGTGGTGAAGGGTACGGTCTTCATGCTGCAAATGATAAACAAACACATGGTTACGACTCTAAAATATACATCAATCGAAACTTGATTGACGGGGCCACGAGACATTCAATCTATGCCAGCCGTTCTGGCGGATATGTTATCACGAATAATTACGTAAAAAATCACAGAGCGTTGACTTTCGATGGAAATATTAGGCCTGCTATTTTAGTCGGTCGAGCATCGGATACAGTAGGTTACGGAAATACTTATGAAGACTGTTATGGTGTTTGTCTCGCTATTGTACCAGAAACAGTAGAATCTGTTGATTATGATTGCGCCAACGTTCATTTTTGGGGTGAGACATTCATTAATCCTAAAGAATTAGGGGCTGTTGCGTTTGGTCTTTCAGCTAGGAATGGAGGGTCTATAATTGGCGCTAGTGTAAAAAATTCATCTTTCAGAATGAATGACAACGCGATACCTTTACTTTTGTCTTATTACGGGTTTGACCTAGACTTTAAGAATAACTCAGCAGTTTACAGAGGAACCATTCCGGCATTCAATGTCTTCGTACTGGTTGGCGTTGATGATACTGGATACTCTTCTCTTTATTCAGATCGATGGAACTTCACTGAAAATACGATAGATATATACTCAAACACTGGGGCCGTTACAGAATATATTATAACCCGTAACATAGGTGCGTACATAAACTTAGATAATGAGTTTAAGAATACTAACACTGAATTGGTTTTAAGTGATGGAGCTATGACAAAAAATAGAATCACAATTAACGGAACAAGATACTTATCGGCGGGAATATCAACTCCTGTAGGTAACATAACACCGTTAAAAACAGGTGAGAGAATTGTACTACTAGCCCCGGATCCTGATACGGTATGGCTGTCGACAGGAAGGACAAATAATGACTGGCTGCAATTAGGGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
6dbf40d2b487b4b5e48e21e85634c0c250e8d4b19668b400824cad703ee0376b
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7115
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50