Genbank accession
YP_009140823.1 [GenBank]
Protein name
short tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,83
Protein sequence
MTYSFAPNGQPLYVAEGEFLQFRFKAPNTWDTTRTVTIRIGDLDQFWLLTTIPEDFTPDPFPFTDISADPGAELSTLFTTDVVFNPPDGAPTTSLTGLTPGTQASLFLGCNVSGNENIYAMRIDPLGDGNFGPWIQGDGTQVVENGAKIQVRARSSEFIVSPTRLTLVIGTSNEVWTIFTRAGVINEPNPFPDFTDLDEQDSNTYCYTPEVIRLQGMIDDADISLSAPGEWAVSSTGNTTTDVNGFQILDGATFTNIPGTVANGDYLQLRMLSSSNPITPLTTNLTIGTEVNGSDWTVRTGNNPSENPNSFSFPDIVGAIEDTLIGSEIRPDTVDGITGLGNGISVPVTVVSTDASLVRIKKNNGSVGVFPTTVENGDKIRIYLQSSATFGDTKSLQIKVGDRTISTWQVQTSAGPDTDADWSPPPNKNNQIPSSFVSSNPIAVTGINRPITIQSVAGYNALISIDYDTPVLGPRTFDPLVNSTFYVVVQAADQLGTPEVTTIQLGTGNPNQFQWQVTTYVTVPPSAANVGVWYSKKNSSFDTEGWIAAGSIPENAGDYYTEPKLDGYSIGTVLPILKEGVDNYGDLDGDLSSRYPGFIKCEGQSLDTTQYFMLFDIIGYSYGGSGSNFNLPDYRNRRICGIGPVDNQRGNSAALPTTTGGIDVPGSEGGFWYFNKIGSRGSQPLDQVQGIDSGLTPGSLDSDYFSLGTVRLSGLSTITEIIPFEINPNGFVTAQIGPLQSVKVGVPAHSHMYISAVTEGDRGDPLNRWGGTSRGLMGTNAQASYYETGDNSVSNSEEIWQEWVDWLGTLRNFKQEIIKYLGSEEAFETWVRANFPANDPVNEEPPSFDIDFSPLESSDFGDTSDDEEFDIEFLTWWLSPISGLSGATLVETGIAPRTQGSSRNWGCVFDTQPATFRIDNYLSTASGTETLTHSHLITENPVTNIQADFTGGNENDQGQNSGGFGSGLGGGVAGSVLTFKMRWSGTYVDSNGDKDPDGSDGGGGNGTYFPATAGDWGYRNGGAGYWTLPTDEVTREEDMITTGSSSGSGLRMEITYQAWPTGGDGSSNNDTRIRVNRIISAGSGYAVGDLVTTAFWNDDIPGTAARMLEIAAVGDAGTGGAAAVINVNFTQSDIFMDLSEGLFKYSSAFKRPFPDVIMRPQRQVPILTPFHKSKYIIKAY
Physico‐chemical
properties
protein length:1180 AA
molecular weight: 126843,14880 Da
isoelectric point:4,23686
aromaticity:0,10085
hydropathy:-0,29814

Domains

Domains [InterPro]
DC_0346
STR
1–994
IPR037053
ATT
565–635
IPR011083
ATT
596–637
SSF88874
STR
596–662
YP_009140823.1
1 1180
Architecture
STR
ATT
STR
STR 1-564 | ATT 565-637 | STR 638-994 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009140823.1
1 1180
Domain Start End Length (AA) Confidence
N-terminal 1 748 748 0,8371
Central domain 749 963 216 0,8822
C-terminal 964 1180 216 0,0871
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-748
Central
749-963
C-terminal
964-1180

Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage ACG-2014i
[NCBI]
1493513 Uroviricota > Caudoviricetes > Pantevenvirales > Chalconvirus > Chalconvirus acg2014i
Host Synechococcus sp. WH 7803
[NCBI]
32051 Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009140823.1 [NCBI]
Genbank nucleotide accession
NC_027132.1 [NCBI]
CDS location
range 40880 -> 44422
strand +
CDS
ATGACATATTCGTTCGCACCTAATGGACAACCGTTGTATGTTGCTGAAGGTGAATTCTTACAATTTAGATTCAAAGCTCCTAATACGTGGGATACTACTAGAACGGTAACTATTCGTATTGGTGATCTGGATCAATTCTGGTTGCTCACCACTATTCCTGAGGATTTTACTCCAGATCCATTTCCTTTTACAGATATTTCTGCAGATCCTGGTGCTGAATTGAGCACTTTATTTACTACTGATGTAGTGTTTAATCCACCAGATGGAGCTCCAACTACTTCTTTGACTGGATTAACACCAGGAACACAAGCATCTCTTTTCTTAGGATGTAATGTCTCAGGAAATGAAAATATATACGCGATGCGTATTGACCCTCTTGGTGATGGCAATTTTGGTCCATGGATTCAAGGTGATGGAACACAAGTTGTAGAAAATGGTGCAAAAATTCAGGTACGTGCTAGATCTTCTGAATTTATTGTATCCCCTACAAGATTGACACTTGTCATTGGTACTTCTAATGAAGTATGGACAATTTTTACTAGGGCAGGAGTTATTAATGAACCAAATCCTTTCCCTGATTTTACAGATTTAGATGAGCAAGATTCAAACACATATTGTTATACTCCAGAAGTTATTAGACTTCAAGGGATGATTGATGATGCAGATATTAGTTTATCTGCTCCTGGTGAGTGGGCAGTTTCTAGTACAGGAAACACAACAACTGATGTTAATGGATTTCAAATTTTAGATGGTGCAACATTTACCAATATTCCTGGTACAGTTGCAAATGGCGATTATTTGCAACTTAGAATGTTGTCTTCCTCCAATCCAATTACTCCACTAACTACTAACCTTACTATTGGTACTGAAGTAAATGGTAGTGATTGGACAGTAAGAACAGGAAATAACCCATCAGAAAATCCAAATAGTTTTTCTTTCCCAGATATTGTTGGTGCTATTGAAGATACATTAATTGGATCAGAAATAAGACCTGATACTGTTGATGGAATTACAGGACTTGGAAATGGTATATCTGTTCCAGTGACTGTTGTATCTACAGATGCATCTCTTGTACGTATCAAAAAGAATAATGGATCTGTTGGTGTATTTCCGACTACAGTAGAAAATGGAGATAAAATACGCATTTACCTGCAATCGTCAGCAACATTTGGTGATACAAAAAGTTTACAAATTAAAGTTGGTGATAGAACTATTTCAACATGGCAGGTACAAACTAGTGCTGGACCAGATACTGATGCAGATTGGTCTCCACCACCTAACAAAAATAATCAAATTCCATCATCATTTGTATCAAGTAATCCTATTGCTGTCACTGGTATTAATCGACCAATTACAATTCAAAGTGTAGCAGGATATAATGCTCTAATTTCTATTGACTATGACACTCCTGTACTTGGTCCCAGAACGTTTGATCCTCTTGTCAATTCTACTTTCTATGTTGTTGTTCAAGCAGCAGATCAACTAGGCACTCCAGAAGTAACAACAATTCAATTGGGAACTGGTAATCCTAATCAATTTCAATGGCAAGTAACAACTTATGTAACTGTACCTCCATCAGCAGCAAATGTAGGAGTTTGGTATAGTAAGAAAAATTCTTCTTTCGATACAGAAGGATGGATAGCAGCAGGTTCAATCCCAGAAAATGCTGGTGATTATTATACAGAACCTAAACTAGATGGTTATTCTATTGGAACAGTTTTACCAATCCTAAAAGAAGGTGTCGATAATTATGGTGATCTAGATGGAGATTTAAGCTCTAGATATCCAGGATTTATTAAATGTGAAGGTCAGAGTTTAGACACTACTCAATATTTTATGCTATTTGATATCATTGGATACAGTTATGGTGGATCTGGATCTAATTTTAATCTTCCTGACTATAGAAATAGAAGAATATGTGGTATTGGACCAGTTGACAATCAAAGAGGAAACTCTGCTGCATTACCAACAACTACTGGTGGAATTGATGTTCCAGGATCTGAGGGTGGATTCTGGTACTTTAATAAGATTGGTTCTCGTGGATCTCAACCATTAGATCAGGTTCAAGGTATTGATTCTGGATTAACACCAGGAAGTTTAGATAGTGATTACTTCTCACTAGGAACAGTTAGACTATCTGGATTGAGCACTATAACTGAAATTATTCCTTTTGAAATAAATCCAAATGGATTTGTGACTGCACAAATTGGACCTTTACAGTCAGTTAAAGTAGGTGTGCCTGCACATAGTCATATGTACATATCTGCAGTTACTGAAGGTGATCGTGGTGATCCATTGAATAGATGGGGCGGAACTTCTAGAGGATTGATGGGCACAAATGCACAAGCTAGTTATTATGAAACAGGAGATAATTCAGTTAGCAATTCTGAAGAAATTTGGCAAGAATGGGTTGATTGGCTTGGAACTTTAAGAAATTTCAAGCAAGAAATTATCAAATACTTAGGTAGTGAAGAAGCATTTGAAACATGGGTCAGAGCAAACTTCCCTGCTAATGACCCAGTAAACGAAGAACCACCATCATTCGATATTGACTTTTCACCCCTTGAATCATCTGATTTTGGTGATACTTCTGATGACGAAGAATTTGATATTGAATTCTTGACATGGTGGTTATCACCTATTTCAGGATTGTCAGGTGCTACGTTAGTAGAGACAGGAATAGCACCCCGTACTCAGGGTTCTAGCAGAAACTGGGGTTGTGTATTTGACACACAACCAGCAACATTTAGAATTGATAATTATCTTTCTACTGCATCTGGTACTGAAACATTAACACATTCACATTTAATTACTGAAAATCCTGTTACTAACATCCAAGCAGATTTTACTGGCGGTAATGAAAATGATCAAGGACAGAATAGTGGCGGATTTGGTTCTGGATTGGGTGGTGGAGTCGCTGGATCTGTATTAACATTCAAGATGAGGTGGAGTGGAACATATGTAGACAGCAATGGTGATAAAGATCCAGATGGATCTGATGGCGGTGGTGGAAATGGTACGTATTTTCCTGCAACTGCAGGTGATTGGGGATATAGAAATGGTGGTGCTGGATATTGGACATTACCAACAGATGAAGTGACTCGTGAAGAAGATATGATTACTACTGGTTCTAGTAGTGGATCTGGATTAAGAATGGAGATTACATATCAAGCATGGCCAACGGGTGGAGATGGTTCTTCAAATAATGACACTCGAATACGTGTTAATAGAATTATAAGTGCAGGATCTGGTTATGCAGTTGGTGATTTAGTTACTACTGCATTTTGGAATGATGATATTCCTGGCACTGCAGCGAGAATGCTTGAAATTGCTGCAGTTGGTGATGCCGGAACTGGTGGTGCTGCTGCTGTGATCAATGTAAACTTTACTCAAAGTGATATTTTTATGGATCTTAGTGAAGGTCTATTTAAATATTCCAGTGCTTTTAAAAGACCATTCCCTGATGTTATAATGAGACCACAAAGACAAGTTCCAATTCTGACCCCATTCCACAAATCTAAATACATTATCAAGGCATACTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
f4c0faa9b32f6b0b60755e08978192e6a7a179fb136f5a9034829782cca59a2f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,4908
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Ecological redundancy of diverse viral populations within a natural community Gregory,A.C., LaButti,K., Copeland,A., Woyke,T. and Sullivan,M.B. 2015-03-20 GenBank