Genbank accession
YP_009779660.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,87
TSP
Evidence RBPdetect2
Probability 0,59
Protein sequence
MALTRLKNIITSRTGRIIYVNPDDFDASDAIDNRGNSALRPFKSIQRAFLEVARFSYRVGLSNDEFDAFSIMLYPAEYIVDNRPGEVLYTNVAPIDENSNLDLTSPNNVLHKFNSVEGGIIVPRGCSLVGTDLRRTKIIPKYVPYPTVAGSLGITDEIQVPPRTAVFKVTGGTYFWQFSFFDGAEEGVYYKPDSTDTLAPKYSHHRLTCFEFADGLNPLSTLIASGTVPNQDYSAVANITQRTDLEIYYQKVSKAFASIPDTSGDPAADQIQARVEENRIVGPISDEYRVLQITRNGNTATAVTVDEFDNPRDHGFSVGVNINISGVTGSTGTQSELDAGVYNGSYTVTSASGNIFTYQMTSEPTGNAVGTNITVKTEIDTVDSASPYAFNLSLRSVWGMNGMHADGSKATGFKSMVVAQFTGLSLQKDDRAFVRYNESTGNYDVASAGDGAHLDGFAEYRRGWAHEHIKCSNDSFIQAVSVFAVGYGTHFTAESGADMSITNSNSNFGNTALRAAGFKAKSFSKDKAGEITHIIPPKALSTISTSATGVDGESTITLTNDGSVNGVIQGMQVSGSNIGSGALVTSINTNTRVITLSVPNADTVNGNIIFGEETSVNWVNIDIQRTKVVNQSLSGSGGTPGTRLYLYGYTTEASPPATKVQGYAIGSRQDGTGASAVADKINCLLVSQGAAEATIKNASISPYGPSVSGLSAGVTGSPIQFDSNTYTIGGVSGVVGGWYLSVSSTENSIYTTLSTNTTYNNVNFTPTTFIKRIADARDLQDRTFRIRLKIDKDKTNPLPRDPLSGYVMQPLNSDTTTYNLDKTFYIYDIEKTQEFERGVTDGIYYITLLCASIAPTTSNFNNRKFSQNVNEVYPTFDRDNPVADPDASVSVADNETIGLVYATDGASPTPNKDPKRSITKEGIEFLLTDNGWTQPGTTPNYDSVNKRLSNVELTARSGDEESRKINIRENNDGTVAPIPVEFRRHSILRSGNHTFEYLGFGPGNYSTAFPQTQVETLSTDQIKFSQSIKEEAGVAFYSGLNSNGDLFIGNQVINPVTGQITNEDIAQLNVVGEENTTIETFSELVLTDKLTVVGGASNQLESIFSGPVTFAGQVSSTNNITAKKITYNNQDGTIIKQTLLAPEDANGFPDFTNITGYDTPSDGDLVYNTNWSPGKSLGWIYYAGDWKEFGLTDTKQINIATHNDSNGDPQQHMGLGIATSADHRLNVLGNVKIDGNLLTTGTGGIAADKYITRTYRAGDTDEPNGTRTIFPITTYTGGVKHTASSLLIMLNGVVQVGGTETEVNTDSAASYYVDSNGQNVVFGSTVGDAPLSTDVLHIIELPI
Physico‐chemical
properties
protein length:1343 AA
molecular weight: 144521,87750 Da
isoelectric point:4,81571
aromaticity:0,08786
hydropathy:-0,30886

Domains

Domains [InterPro]

No domain annotations available.

Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009779660.1
1 1343
Domain Start End Length (AA) Confidence
N-terminal 1 88 88 0,9680
Central domain 89 309 222 0,9490
C-terminal 310 1343 1033 0,1929
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-88
Central
89-309
C-terminal
310-1343

Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage ACG-2014b
[NCBI]
1493508 Uroviricota > Caudoviricetes > Pantevenvirales > Nereusvirus > Nereusvirus tusconc4
Host Synechococcus sp. WH 7803
[NCBI]
32051 Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009779660.1 [NCBI]
Genbank nucleotide accession
NC_047718.1 [NCBI]
CDS location
range 24300 -> 28331
strand +
CDS
ATGGCTCTTACTAGACTAAAGAATATTATTACGTCCAGAACTGGACGTATTATCTATGTTAACCCCGACGATTTCGATGCATCCGATGCTATCGATAATAGGGGAAACTCAGCGTTACGACCCTTTAAGTCTATTCAAAGAGCGTTTCTTGAGGTAGCGAGATTCTCGTATCGTGTTGGTTTATCAAATGACGAATTTGATGCCTTCTCGATTATGCTGTATCCAGCAGAATATATTGTTGACAACAGACCTGGTGAAGTTTTATATACAAACGTTGCTCCCATTGATGAGAACTCAAACTTAGATTTGACTTCTCCAAACAATGTTCTACATAAATTTAATTCGGTAGAAGGTGGTATCATTGTTCCTAGAGGTTGTTCCCTCGTTGGTACTGATCTTCGCCGTACAAAAATTATTCCTAAGTATGTTCCATATCCTACAGTAGCAGGTAGTCTTGGAATTACCGATGAAATTCAAGTTCCTCCTCGTACTGCTGTCTTTAAGGTAACTGGTGGTACTTACTTCTGGCAATTCTCATTCTTTGATGGTGCTGAAGAGGGTGTATATTACAAACCTGATAGCACTGATACTCTCGCACCTAAGTATTCTCATCACAGACTTACATGCTTTGAGTTTGCTGATGGTCTCAATCCTTTATCAACTCTTATTGCTAGTGGTACTGTTCCTAATCAGGACTATTCTGCCGTTGCAAATATCACTCAGAGGACAGACTTAGAGATTTATTATCAGAAAGTTTCTAAAGCATTTGCATCAATTCCTGATACATCGGGAGATCCTGCTGCCGACCAAATTCAGGCAAGAGTAGAAGAGAATAGAATTGTTGGTCCTATTTCGGATGAATATAGAGTCCTTCAAATCACTCGTAATGGTAATACTGCAACCGCAGTTACTGTTGATGAATTTGATAATCCTAGAGACCATGGTTTCTCGGTTGGTGTTAACATTAATATTTCTGGAGTTACTGGATCTACTGGAACTCAATCAGAACTAGATGCTGGTGTATACAATGGTTCTTATACTGTAACTTCTGCATCTGGCAATATTTTTACATATCAGATGACATCTGAACCAACTGGTAATGCTGTTGGTACTAACATCACTGTTAAGACTGAGATTGATACTGTTGACTCTGCATCACCATATGCGTTTAACTTGTCACTGAGAAGTGTCTGGGGCATGAATGGTATGCACGCCGATGGATCTAAGGCAACTGGATTTAAGTCCATGGTTGTAGCTCAATTCACGGGATTGTCCCTGCAAAAAGATGATAGAGCATTCGTAAGATATAATGAATCTACTGGTAATTATGATGTTGCTTCTGCTGGTGATGGTGCTCACTTAGATGGTTTTGCTGAATATAGAAGAGGATGGGCACACGAGCATATTAAGTGCTCTAATGACTCCTTCATTCAAGCAGTTTCTGTGTTCGCTGTTGGATATGGCACACATTTCACTGCTGAGAGTGGTGCTGACATGTCTATTACCAACTCAAACTCTAACTTTGGTAACACTGCTCTTCGTGCTGCTGGTTTCAAAGCAAAATCATTCTCAAAAGATAAAGCAGGCGAAATCACACACATCATCCCACCAAAAGCACTTTCAACCATTTCTACATCTGCAACAGGAGTTGATGGGGAAAGCACGATCACACTTACTAATGATGGTTCTGTGAATGGTGTTATTCAAGGTATGCAAGTTAGTGGTAGTAATATTGGTTCTGGTGCTCTTGTAACCTCAATTAACACCAATACTAGAGTTATTACACTATCTGTTCCTAATGCTGATACTGTGAATGGTAATATCATTTTTGGAGAAGAAACTTCAGTTAACTGGGTAAACATTGATATCCAAAGAACTAAAGTAGTTAACCAATCTTTATCTGGTTCTGGTGGTACTCCTGGTACTAGATTGTATCTTTATGGGTATACTACTGAAGCATCCCCTCCAGCAACAAAAGTTCAGGGTTATGCAATTGGTTCTAGACAAGATGGAACAGGTGCTTCTGCTGTTGCGGATAAAATTAACTGCTTACTTGTATCTCAGGGAGCAGCAGAAGCAACAATTAAGAATGCTTCTATTTCGCCATACGGTCCTTCTGTATCTGGTCTTTCTGCTGGTGTAACTGGTTCTCCAATACAATTTGATAGCAATACCTATACAATTGGAGGTGTTTCTGGAGTCGTTGGTGGTTGGTATCTCTCTGTAAGTTCTACAGAAAACTCTATCTACACAACCCTATCTACAAATACTACATACAACAACGTAAACTTTACTCCTACAACTTTTATTAAGAGAATTGCTGACGCAAGAGACTTGCAGGATAGAACTTTCCGTATTCGTCTTAAGATTGATAAGGATAAGACTAATCCTCTTCCTCGCGATCCTCTCTCTGGTTATGTCATGCAACCTTTGAATAGTGATACAACAACGTATAATCTAGATAAAACATTCTATATTTACGATATCGAAAAAACTCAAGAGTTTGAACGAGGTGTTACAGATGGAATCTACTACATTACCCTGCTATGTGCATCTATTGCACCTACAACTTCTAACTTCAACAACAGGAAGTTCTCTCAAAACGTCAACGAAGTGTATCCTACGTTTGACAGAGACAACCCTGTTGCTGACCCTGATGCTTCGGTATCCGTCGCTGACAATGAAACTATCGGTTTAGTATATGCTACTGATGGTGCATCACCTACACCTAATAAAGATCCTAAGCGTTCTATTACTAAAGAAGGAATCGAATTCCTTTTGACTGACAATGGTTGGACACAACCAGGCACAACTCCAAACTATGATTCTGTAAATAAGAGACTGTCTAATGTTGAATTAACAGCACGTTCTGGTGATGAAGAATCCAGAAAAATTAATATTCGTGAAAATAATGATGGAACAGTTGCTCCAATTCCAGTTGAGTTCAGAAGACACTCAATCCTTCGCTCAGGTAACCATACCTTTGAGTATCTTGGATTCGGTCCTGGTAACTACTCAACTGCATTCCCTCAGACTCAAGTAGAGACGCTATCTACAGACCAGATTAAGTTCTCTCAGTCTATTAAAGAAGAAGCAGGTGTTGCATTCTACTCAGGTCTTAACTCTAATGGTGACCTATTCATTGGTAACCAGGTAATCAACCCTGTTACGGGACAGATTACGAACGAAGATATTGCACAACTTAATGTTGTTGGTGAAGAGAATACAACGATTGAAACATTCTCTGAGTTGGTGTTGACTGATAAACTTACTGTTGTTGGTGGTGCATCTAACCAGTTAGAATCTATTTTCTCTGGTCCTGTTACTTTTGCTGGTCAAGTTTCTTCTACTAATAATATTACTGCTAAGAAGATTACTTATAATAACCAAGATGGTACGATCATCAAACAAACTCTACTTGCACCCGAAGATGCAAATGGATTCCCTGATTTCACCAACATTACTGGGTATGATACACCTTCTGATGGAGACTTAGTTTATAATACAAACTGGTCTCCTGGTAAGTCTCTTGGTTGGATTTATTATGCTGGAGATTGGAAAGAATTTGGTTTAACTGATACAAAACAAATTAACATTGCAACTCATAATGATTCAAATGGAGACCCTCAACAACATATGGGTCTTGGTATTGCAACTAGTGCAGATCATAGACTAAATGTTCTTGGTAATGTAAAAATTGATGGTAATTTACTTACCACAGGCACTGGTGGTATTGCTGCTGATAAGTACATCACAAGAACTTATCGTGCGGGTGATACCGATGAACCTAACGGTACTAGAACTATCTTCCCAATTACAACTTATACTGGTGGTGTTAAGCACACTGCAAGTTCTCTTCTTATTATGTTGAATGGTGTCGTACAAGTGGGTGGAACAGAAACTGAAGTAAATACAGATAGCGCAGCAAGTTACTATGTTGATAGTAATGGTCAAAACGTTGTGTTTGGTAGCACTGTTGGGGATGCTCCGTTATCTACTGACGTTCTTCATATTATTGAATTGCCTATCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
a846b669ac6e39a1e5b1bfa83d352b09c59f5c84a3b8877bd390e8bfbc2a5d96
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,3854
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Ecological redundancy of diverse viral populations within a natural community Gregory,A.C., LaButti,K., Copeland,A., Woyke,T. and Sullivan,M.B. 2015-03-20 GenBank