UniProt accession: A0A0E3HGY4 [UniProt]
Protein name: Tail fiber
RBP type Evidence Probability
TF UniProt/TrEMBL 1.00
TF GenBank 1.00
TSP RBPdetect 0.76
TF RBPdetect2 0.96
Protein sequence
MAFKINGVVRIDNSGNGFLGIVTATDANITGVLTATEVDAKVSSKAITEQTDGTVDDVTGADELLLLDAETGGLLRVSIDEFVQGSGIGTLVTDFDNLTVTGITTVATLKGIGSSNISVGSSLSFADGTEIIMGDSGDFSIHHDGDHTYLDETGQGNLKLRTNNFRVTNIAETKPAITAQVTSGVELYFDGNKKIETTNTGVLVSGMLTADRLRSGDIAARNVITLGITTIQDHLEVNDSTGSGTEYNLNVKTNGSSTFGVLGNGNILLGNSSGAPFMATNDHHATSKKYVDDAIDDSVSYIGASAWGSVAADGSLRNGLNCTTSQTSGGVYQVTFTTPLPNANYAITGASSDSAFWVKDGSETVNGFTVYIVDVDGNSINRDLSFAVFSANAIVPPSGVGADAWGTFSGTTGDLQAGMNISSTTRTALGEYDVEFITEMPSDSEYAVTAVSNASQAKFINIRNKSTSGFKVVTRDGSGNISDGAVSFAVHASSTVTPTYTWTRDGTTLKPANDGDDVEVNGRGLFESSQPADYVVRVKNTDTGTINPACLLLDMRDVDASNNYSALRVVGYNTTTGLDINSDGAITAQGQITANVSNQGGENSAIKAIQTNSNGYAVWVGDGPNSTDRTASIGPDGRATFSSGKISLGPTGSGTFADNNIFLQSDGQVRCTNVITSSALAGSGTRPLYVDSSGGLTISSSDRSLKTNITTLPDQIQVVKALNPVSFNWIEKERLGAGLEIGFIAQEVEEIVPEVVRSTAGILSVDYAKLTATLTSALQAALTRIEALEAKVQSLEEGVTPD
Physico-chemical properties
protein length: 802 AA
molecular weight: 83366.67980 Da
isoelectric point: 4.40664
aromaticity: 0.06110
hydropathy: -0.11072
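
Two of the values above have simple definitions that can be recomputed from the sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the record does not say which tool produced its numbers, so both definitions are assumptions. A 20-residue prefix of the protein sequence stands in for the full 802 AA chain, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence listed in this record.
fragment = "MAFKINGVVRIDNSGNGFLG"

print("length:", len(fragment))
print("aromaticity:", aromaticity(fragment))
print("hydropathy:", round(gravy(fragment), 5))
```

Running the same functions on the full sequence gives a quick consistency check against the listed values, provided the assumed definitions match the ones used by the database.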

Domains [InterPro]
Domain Type Range
DC_0499 STR 153–398
DC_0421 STR 377–802
IPR030392 CHP 701–757
IPR030392 CHP 701–792
Architecture
STR 153-802
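
The architecture span STR 153-802 is what one gets by merging the two overlapping STR hits (153–398 and 377–802). Whether the database derives it this way is not stated, so the following is only a minimal sketch of interval merging under that assumption; `merge_intervals` is a hypothetical helper, and treating directly adjacent intervals as mergeable is a design choice made here.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps (or touches) the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# STR hits taken from the domain list in this record.
str_hits = [(153, 398), (377, 802)]
print(merge_intervals(str_hits))
```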
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Start End Length (AA) Confidence
N-terminal 1 354 354 0.3978
Central domain 355 553 199 0.1206
C-terminal 554 802 249 0.8882
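
Assuming the Start and End columns are 1-based and inclusive (consistent with the N-terminal row, where 1 to 354 gives 354 residues), each segment length is End − Start + 1. The sketch below recomputes the lengths from the boundaries and checks that the segments tile the 802 AA protein without gaps or overlaps. The helper names are hypothetical.

```python
PROTEIN_LENGTH = 802

# (name, start, end) boundaries as listed in the segmentation table.
segments = [
    ("N-terminal", 1, 354),
    ("Central domain", 355, 553),
    ("C-terminal", 554, 802),
]


def segment_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive segment."""
    return end - start + 1


def is_contiguous_cover(segs, total: int) -> bool:
    """True if the segments cover 1..total with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in segs:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total + 1


for name, start, end in segments:
    print(name, segment_length(start, end))
print("covers full protein:", is_contiguous_cover(segments, PROTEIN_LENGTH))
```

This kind of check is useful for cross-validating the Length column against the boundaries.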
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-354
Central 355-553
C-terminal 554-802

Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage ACG-2014f [NCBI] 1493511 Uroviricota > Caudoviricetes > Pantevenvirales > Atlauavirus > Atlauavirus tusconc8
Host Synechococcus sp. WH 7803 [NCBI] 32051 Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

GenBank protein accession: AIX29347.1 [NCBI]
GenBank nucleotide accession: KJ019092 [NCBI]
CDS location: range 168510 -> 170918, strand +
CDS
ATGGCATTTAAGATTAATGGCGTAGTACGCATTGATAATAGTGGCAACGGTTTTCTGGGAATTGTAACCGCTACCGACGCTAATATCACAGGAGTTCTCACTGCCACTGAGGTGGATGCGAAAGTATCATCTAAGGCAATAACTGAACAAACAGATGGAACCGTAGATGATGTAACTGGTGCAGATGAACTATTACTATTAGATGCCGAGACTGGTGGATTACTAAGAGTATCAATAGATGAGTTCGTGCAGGGTTCAGGCATTGGAACTCTAGTAACGGACTTTGATAACTTAACTGTTACTGGTATCACAACCGTTGCCACCCTCAAAGGTATTGGATCTAGTAATATCAGTGTAGGAAGTTCACTTTCCTTTGCTGATGGTACAGAGATCATCATGGGTGACTCTGGTGACTTCAGTATTCATCATGATGGTGATCATACTTACCTAGATGAGACGGGGCAGGGTAATCTAAAACTCCGTACAAATAACTTTAGAGTAACAAATATAGCAGAAACAAAACCAGCTATTACAGCACAGGTTACTTCTGGTGTTGAGTTGTATTTTGATGGTAATAAGAAGATAGAAACTACAAATACTGGTGTTCTTGTGAGTGGTATGCTCACTGCTGATAGATTAAGATCAGGTGATATTGCTGCTAGAAATGTAATCACACTGGGCATTACCACTATTCAAGATCACCTAGAGGTGAATGATAGTACTGGTTCTGGTACCGAATACAATCTCAATGTTAAGACCAATGGTAGTTCTACCTTTGGTGTCTTAGGTAATGGAAATATTCTACTTGGAAACAGTTCCGGTGCTCCATTCATGGCGACGAATGATCACCACGCAACTTCCAAGAAGTATGTGGATGATGCTATTGATGATTCCGTCAGCTACATCGGCGCTAGTGCTTGGGGTAGTGTTGCTGCTGATGGATCACTTCGGAATGGACTAAACTGCACAACTTCACAAACAAGTGGAGGGGTTTATCAAGTCACGTTCACAACTCCTCTTCCTAATGCAAACTATGCTATCACTGGAGCATCTAGTGATAGTGCTTTCTGGGTAAAAGATGGTTCAGAAACAGTCAATGGATTTACAGTTTATATAGTTGATGTAGATGGAAATTCAATAAATAGAGATTTATCCTTCGCCGTCTTCAGTGCCAACGCCATCGTGCCGCCGTCAGGCGTAGGCGCTGACGCCTGGGGAACATTCTCTGGAACGACTGGCGACCTGCAAGCTGGGATGAATATCAGCTCAACTACACGTACTGCCCTAGGAGAATATGACGTAGAATTTATAACCGAGATGCCTTCGGACTCAGAATATGCCGTAACCGCAGTATCTAATGCAAGTCAAGCAAAGTTTATTAATATACGCAATAAGTCTACAAGTGGTTTTAAGGTTGTAACTAGAGATGGTTCTGGAAATATTTCTGACGGAGCCGTATCCTTTGCCGTCCACGCATCATCAACAGTCACGCCAACCTATACCTGGACACGCGACGGCACAACGCTTAAGCCTGCTAATGATGGTGATGATGTAGAGGTGAACGGTCGGGGCCTTTTCGAAAGCAGCCAGCCGGCTGACTATGTTGTGAGAGTAAAAAACACCGACACAGGTACTATCAATCCCGCATGTTTGCTGCTGGACATGCGAGATGTTGACGCTTCTAATAATTATTCAGCCTTAAGGGTAGTAGGTTACAACACTACAACAGGTCTTGATATAAATTCAGACGGCGCTATCACGGCTCAGGGGCAGATAACCGCAAACGTCAGTAACCAAGGCGGAGAGAATAGTGCAATCAAGGCAATTCAAACCAACTCCAACGGGTATGCGGTATGGGTTGGTGATGGACCTAACTCTACAGATAGAACCGCTTCTATAGGCCCTGATGGAAGAGCCACGTTTTCCTCCGGCAAAATTTCGCTGGGTCCGACCGGCTCGGGGACGTTTGCAGACAATAACATCTTCCTGCAGTCGGACGG
ACAAGTCAGATGCACGAACGTAATTACATCTTCAGCTTTGGCAGGCTCAGGCACCAGACCATTATATGTCGACTCATCTGGTGGTTTAACTATCAGCTCTTCTGATCGTAGTTTGAAGACGAACATCACTACTCTTCCTGATCAGATACAAGTAGTCAAAGCATTAAATCCCGTTTCGTTTAATTGGATAGAGAAAGAAAGACTAGGTGCAGGTCTTGAAATTGGATTTATTGCTCAAGAAGTAGAAGAAATCGTTCCTGAGGTTGTTCGTAGCACTGCCGGAATACTTTCAGTAGATTATGCAAAATTAACCGCAACTTTAACGTCTGCCCTACAGGCAGCACTCACCCGCATCGAAGCCCTCGAAGCGAAAGTCCAATCACTCGAAGAAGGTGTGACACCTGACTAA
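
The CDS coordinates and the protein length can be cross-checked with a little arithmetic: an inclusive range 168510 to 170918 spans 2409 nt, that is 803 codons, which corresponds to 802 residues plus a stop codon. The sketch below assumes inclusive coordinates and the standard genetic code (the record does not name a translation table), and translates only the first and last few codons of the CDS shown above. `translate` and `cds_length` are illustrative helpers.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed; not stated in the record).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive coordinate range."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3)
    )


n_nt = cds_length(168510, 170918)
print("nucleotides:", n_nt, "codons:", n_nt // 3)
print("5' end:", translate("ATGGCATTTAAGATTAATGGC"))  # first 7 codons of the CDS
print("3' end:", translate("CCTGACTAA"))              # last 3 codons of the CDS
```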

Gene Ontology

Description Category Evidence (source)
GO:0098015 virus tail Cellular Component IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
940083548dd3ff148927d048cfbb1a8dfb78467436ba97c447f8162c1872c0f5
Source ESMFold
Method ESMFold
Resolution 0.6736
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
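
The confidence bands can be applied to per-residue pLDDT values with a small lookup. This is a minimal sketch: the legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so assigning them to the lower band (and 50 to "Low") is an assumption made here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band of the legend.

    Boundary handling is an assumption: 90 -> "High", 70 -> "Low",
    50 -> "Low".
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt >= 50:
        return "Low"
    return "Very low"


# Illustrative scores, not values from this model.
for score in (95.2, 81.0, 63.5, 41.7):
    print(score, plddt_band(score))
```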