GenBank accession: CAL9960477.1
Protein name: tail collar fiber protein
RBP type  Evidence    Probability
TSP       DepoScope   1.00
TSP       RBPdetect   0.89
TSP       RBPdetect2  0.94
Protein sequence
MAITQVGIATLSEELLQYIKDQIKNGTLSFTDLIDGPEYAGQDGKFLQAYSGSGSTGMQWQDLPSAGLNWQVEDTDFLAQDKIGILASNGITITLPSDPEEGNLVAIADHNSEFDVTPVTVVGSGGYLIEDESDLVLDLRNAYVQLIFDGTKWEIAQVNHPFNIQEITEETFPSGQISYTLSRIPPSRSSILVTSGGKIIPTSQYSLVGNVLSFGAVASDIIQVRHIGVPKATEVSDTPVGAMLYFPNSEPVDGWLDCNGSSISASVYPDLVKFLTKNPQAEIAYLPDARGNFIRTFDYGNGTDVIAPHSIDNILKDNKWGRWISTTNEATSENLWDGSTATRTTVLVSQGYIGYRFDAPVTMTDMILNNNDAIGTQHLPDTGIVRASHDGITWVDASNTVTLGATSQGRNTTFTVTTNDAYRFWSIHGTGGQPYADGSGEYWGVTSLYMNGSSTNRIVGGFQEQSVGPMPATTLGGTAAGTLAVASGAGANVLGQGGSGQVSGGGETRPDNQSYVLRIKAFHYQSGDLASTDVTALRDEVSRLSGQVNDGTSYVGPDAPENPSENARWYDTTSGRTYIWFNDGDSYQWVDDSPQAASTAQESLNAAEILSTGSQTAEALNNRFARNATWFDTVADMVSGYGLLAEGYTAITRGYYEAGDGGGAEYLITTGNSGDGYGSHDVGTNTAVLQHNGTVYSKQYGLIEKDSYYDTIDQTDGWISNNLRINAFVRNPEIGHCIFNPSVDTPYITCAGSILIDRGDLHVTIQPQARIYGRRTLPGGVEPSSVGLYSAGGLIVVADYTDPDNGDYTIAGTLSNFTFDGKGQMATIYSTDNDNGGIGIYNHNALSFAQVTNVSLRDFKISECDHVGINFDLDSRDILIENVNVSNTSDENIKVKGDGQVSGAYANVTIRDCALYNSRFGGRNNPMFIWVSNCNATIEGVSMSCTNSSIATKPQGVFVSGCENVTVNSCILNNCSQGVRMYGDSLALTVSNNYMEDCEFIVKRANIVGDSGNSRLTIIDNRCAGDFDALYSTVESVTTLSSLVVKRNDMSSCNSQLDLYLPASFPTATAPNQVDYQNNVPATGFDFNFITAGRVWNSKPGFEYVTVSGTTFTYDYAYYGGGYDKLAIMITGSSRNFPVYYPIWHSWGTNGVQEIYVPTSDDGVTESIQMTVSKSGTILTFTLVNNPLGTNFTLVKAHN
Physico-chemical properties
Protein length: 1199 AA
Molecular weight: 128974.50070 Da
Isoelectric point: 4.40772
Aromaticity: 0.09675
Hydropathy: -0.24020
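
The record does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming the common ProtParam-style definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle value, often called GRAVY), could look like this. The function names and the 20-residue test fragment are illustrative only.

```python
# Kyte-Doolittle hydropathy scale. Assumption: this is the scale behind the
# "hydropathy" value in the record; the record itself does not name it.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq):
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence listed above.
fragment = "MAITQVGIATLSEELLQYIK"
print(len(fragment), aromaticity(fragment), round(gravy(fragment), 5))
```

Running the same two functions on the full 1199-residue sequence is one way to check whether these definitions reproduce the listed values.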

Domains

Domains [InterPro]
InterPro ID  Type      Range
IPR006626    Unmapped  67–89
IPR037053    ATT       236–296
IPR011083    ATT       241–294
IPR011050    STR       917–1125
Architecture
ATT 236-296 | STR 297-1171
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
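
The two ATT hits (IPR037053, 236–296 and IPR011083, 241–294) overlap, and the architecture shows a single ATT segment at 236-296. One plausible way to get there is to merge overlapping ranges of the same type. The sketch below shows that merge; it is an assumption about the procedure, not the database's documented method, and it does not explain the STR segment (297-1171), which is wider than the single STR hit at 917–1125.

```python
def merge_hits(hits):
    """Merge overlapping or directly adjacent (start, end) ranges.

    Coordinates are 1-based and inclusive, as in the record.
    """
    merged = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous range: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


att_hits = [(236, 296), (241, 294)]  # IPR037053 and IPR011083
print(merge_hits(att_hits))  # [(236, 296)]
```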

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout (CAL9960477.1, residues 1–1199)
Domain          Start  End   Length (AA)  Confidence
N-terminal      1      317   317          0.8333
Central domain  318    516   200          0.3451
C-terminal      517    1199  682          0.1586
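
The segmentation table can be checked for internal consistency. The sketch below, with hypothetical helper names, treats Start and End as 1-based inclusive coordinates and tests that the three domains tile residues 1–1199 without gaps. Under that assumption the spans are contiguous, but the listed lengths for the central and C-terminal domains each differ by one from End - Start + 1. The record does not state which convention produced them.

```python
# (name, start, end, listed length) copied from the segmentation table.
DOMAINS = [
    ("N-terminal", 1, 317, 317),
    ("Central domain", 318, 516, 200),
    ("C-terminal", 517, 1199, 682),
]


def inclusive_length(start, end):
    """Length of a 1-based inclusive range."""
    return end - start + 1


def check_segmentation(domains, protein_length):
    """Return (contiguous, mismatches) for a list of domain rows."""
    contiguous = domains[0][1] == 1 and domains[-1][2] == protein_length
    for prev, nxt in zip(domains, domains[1:]):
        # Each domain must start right after the previous one ends.
        contiguous = contiguous and nxt[1] == prev[2] + 1
    mismatches = [
        (name, listed, inclusive_length(start, end))
        for name, start, end, listed in domains
        if listed != inclusive_length(start, end)
    ]
    return contiguous, mismatches


contiguous, mismatches = check_segmentation(DOMAINS, 1199)
print(contiguous)
for name, listed, computed in mismatches:
    print(f"{name}: listed {listed}, inclusive span {computed}")
```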
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue, 1-317), Central (green, 318-516), C-terminal (pink, 517-1199).

Taxonomy

Phage: Vibrio phage D69
Taxonomy ID: 3105312
Lineage: Viruses >
Host: No host information

Coding sequence (CDS)

GenBank protein accession: CAL9960477.1
GenBank nucleotide accession: OZ195729
CDS location: 31162 -> 34761
Strand: +
CDS
ATGGCAATTACGCAAGTAGGTATAGCTACTCTATCGGAAGAGTTGCTGCAATACATCAAAGATCAAATCAAGAACGGCACGCTGTCGTTTACAGACCTTATCGACGGACCTGAGTATGCCGGTCAGGACGGTAAGTTCCTGCAGGCTTATTCAGGCTCAGGATCTACAGGCATGCAATGGCAAGATCTACCATCTGCAGGCCTTAATTGGCAAGTAGAAGATACAGACTTCCTGGCCCAGGATAAGATCGGTATCTTAGCAAGCAATGGAATTACAATAACACTACCTAGTGATCCTGAAGAAGGTAATCTAGTAGCAATAGCGGACCACAACAGTGAGTTCGACGTAACTCCTGTAACGGTAGTAGGTTCCGGAGGCTATCTGATCGAAGATGAATCAGATTTAGTACTAGACCTACGTAATGCTTACGTACAACTAATCTTCGACGGCACTAAGTGGGAGATCGCACAGGTCAACCACCCTTTCAATATCCAGGAGATTACCGAGGAAACTTTCCCAAGCGGTCAGATTTCATATACGCTCAGCCGTATTCCACCAAGCCGTAGCTCTATCCTAGTGACTAGTGGCGGCAAGATCATCCCTACAAGCCAATACAGCTTGGTAGGTAATGTACTTAGTTTCGGAGCAGTAGCATCTGATATTATTCAAGTACGTCATATCGGCGTACCTAAGGCTACTGAGGTTAGCGATACTCCGGTAGGCGCTATGCTTTATTTCCCTAACAGCGAGCCCGTAGATGGGTGGTTGGATTGTAATGGTTCAAGCATTAGTGCCAGTGTATACCCGGACTTAGTCAAGTTCCTTACTAAGAATCCTCAGGCGGAAATCGCATACCTACCTGACGCACGTGGTAACTTTATCCGTACGTTTGATTACGGTAACGGTACTGACGTTATTGCGCCTCACAGTATCGATAATATACTTAAGGATAATAAGTGGGGTCGCTGGATCTCTACGACTAACGAAGCTACGTCAGAGAATCTATGGGACGGCAGCACTGCTACTCGTACCACTGTATTAGTATCTCAAGGTTACATTGGATACCGTTTCGACGCTCCTGTGACTATGACAGATATGATCCTGAATAACAATGATGCTATCGGTACACAACACCTTCCAGATACAGGTATCGTACGTGCCTCTCACGACGGTATTACATGGGTAGACGCATCCAATACTGTTACTCTAGGAGCTACGTCTCAAGGTCGTAATACTACCTTCACAGTTACGACTAATGATGCGTATCGCTTCTGGTCTATTCACGGTACAGGTGGTCAGCCGTATGCTGACGGATCTGGCGAGTACTGGGGTGTAACCAGCCTATACATGAACGGTTCGTCAACTAACCGTATCGTAGGCGGGTTCCAGGAGCAGAGCGTAGGTCCTATGCCTGCAACTACTCTAGGCGGGACTGCGGCAGGTACTCTAGCAGTAGCTAGTGGTGCAGGAGCTAACGTACTAGGTCAAGGCGGAAGCGGTCAGGTATCAGGTGGTGGAGAGACTCGCCCAGATAACCAATCGTACGTCCTACGTATCAAGGCGTTCCATTACCAGTCAGGCGACTTGGCATCTACGGACGTTACTGCACTGCGTGACGAAGTATCTCGTCTAAGCGGGCAGGTTAACGATGGTACATCTTACGTAGGTCCAGACGCTCCAGAGAATCCTAGCGAGAATGCTCGTTGGTATGATACGACTTCTGGTCGTACGTACATCTGGTTCAACGATGGCGATAGCTACCAATGGGTAGATGATAGCCCTCAAGCAGCGTCTACGGCTCAAGAGTCGCTGAACGCAGCAGAGATCTTGTCTACAGGCTCGCAGACCGCAGAGGCGTTAAATAATAGATTCGCACGTAACGCCACGTGGTTTGACACGGTAGCCGATATGGTATCTGGGTATGGCTTACTTGCTGAAGGGTATACTGCGATTACTAGAGGATACTACGAAGCTGGGGACGGAGGTGGAGCAGAGTACTT
GATCACCACCGGTAATAGCGGTGACGGATACGGCTCTCATGATGTCGGTACTAATACGGCAGTCTTACAGCATAACGGTACAGTGTATTCTAAGCAATATGGTCTTATAGAGAAAGATTCTTACTATGACACTATTGATCAGACTGATGGCTGGATATCCAATAATTTAAGGATTAATGCATTCGTAAGAAATCCCGAGATAGGGCATTGTATATTCAACCCGTCAGTAGACACCCCCTATATAACATGCGCAGGCTCTATACTAATAGACAGAGGTGATCTTCACGTAACCATACAGCCTCAAGCACGCATATACGGTCGTAGGACACTACCTGGAGGAGTAGAGCCTAGTTCTGTCGGCTTATACTCAGCAGGAGGTTTAATAGTAGTAGCTGACTACACAGACCCCGATAATGGCGACTATACTATTGCAGGAACCTTAAGTAACTTTACATTTGATGGGAAGGGGCAGATGGCAACTATCTACTCTACAGATAATGATAACGGTGGTATAGGCATCTATAACCACAACGCGTTAAGCTTTGCACAAGTAACCAATGTTTCTTTACGAGATTTCAAGATTTCAGAATGCGACCATGTAGGGATTAATTTCGACCTAGATTCAAGAGATATCCTGATCGAGAATGTCAACGTATCTAACACCAGTGATGAGAATATTAAAGTCAAAGGTGATGGTCAGGTATCAGGTGCGTACGCAAATGTAACTATTCGCGATTGTGCGTTATATAACAGCCGATTCGGGGGCCGTAACAACCCTATGTTCATATGGGTATCTAATTGTAACGCTACTATAGAAGGTGTATCTATGTCTTGTACTAATTCGAGTATAGCTACGAAACCTCAGGGAGTATTCGTATCCGGTTGTGAGAACGTCACTGTAAATAGTTGTATACTCAATAATTGCTCTCAAGGCGTCCGTATGTATGGCGACTCTCTAGCGCTCACAGTTAGCAATAACTACATGGAGGATTGTGAGTTTATAGTTAAACGCGCCAATATCGTCGGGGATAGCGGTAACAGTAGGCTGACCATAATCGACAATCGTTGCGCAGGAGATTTCGATGCACTATATAGTACTGTAGAATCTGTTACTACCTTAAGTTCTCTAGTAGTTAAGAGGAATGATATGTCAAGCTGTAACAGTCAATTAGACCTATACCTCCCTGCTAGTTTCCCTACTGCCACGGCACCTAATCAAGTAGATTATCAAAATAACGTTCCTGCCACAGGGTTTGATTTCAACTTCATTACGGCGGGTAGGGTATGGAATTCTAAACCGGGATTCGAATATGTAACCGTATCCGGAACGACTTTCACGTATGATTACGCATATTATGGAGGTGGTTACGATAAATTAGCTATAATGATAACTGGTTCTAGTAGGAATTTCCCCGTGTATTATCCTATATGGCATAGCTGGGGTACCAACGGGGTCCAGGAGATTTACGTACCTACCTCAGATGATGGTGTGACTGAGAGTATACAAATGACAGTATCTAAATCAGGTACTATACTGACATTTACTCTAGTCAACAATCCTCTAGGGACGAACTTCACCTTAGTTAAAGCACACAACTAG
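
The CDS coordinates and the protein length can be cross-checked: an inclusive range of 31162 to 34761 spans 3600 nucleotides, which is 1200 codons, or 1199 residues plus a stop codon. The sketch below shows that arithmetic and a minimal translation routine. It assumes the standard codon assignments (shared by the bacterial table usually applied to phage genes); the function names are illustrative, and only short fragments from the start and end of the CDS above are translated.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def cds_length(start, end):
    """Nucleotide count of a 1-based inclusive CDS range."""
    return end - start + 1


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_nt = cds_length(31162, 34761)
print(n_nt, n_nt // 3 - 1)                 # 3600 1199
print(translate("ATGGCAATTACGCAAGTAGGT"))  # MAITQVG (first 7 codons)
print(translate("GCACACAACTAG"))           # AHN (last 3 codons, then stop)
```

Note that the CDS is listed on two lines; they must be joined before translating the full sequence, since the line break falls inside a codon.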

Tertiary structure

PDB ID: 2fe12ed45928931093ade5eeebbe7d2e72a771247b9348f182f515bf75725f7d
Source: ColabFold
Method: ColabFold
Resolution: 0.2954
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
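
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch with a hypothetical function name. The legend uses strict inequalities and so leaves the exact values 90, 70 and 50 unassigned; placing them in the lower band is an arbitrary choice made here.

```python
def plddt_band(score):
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    Assumption: boundary values (90, 70, 50) fall into the lower band,
    because the legend does not assign them to either side.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 83.0, 61.5, 42.0):
    print(value, plddt_band(value))
```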