UniProt accession
A0A0F6THB5 [UniProt]
Protein name
Long tail fiber proximal subunit
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
Protein sequence
MNYMRKGQTVKIKAAEGDTIASSVALLQFPKRSEYPPDAQWVSVTELEFNGTTSYVPVLELAYIEDTDSDTHYWVVQQNVPTVERVDAGNDATRARLGVIALATQAQANVDLENTPAKEVAITPETLANRTATEARRGIAKIATTAQVNQNSTASFVDDTIVTPKKLNERTATETRRGLAEIATQTETDAGLDDTTIITPKKLQARQGSETLSGIVKYVSTTSATPAETRGAAGTNVYNKTVNNLTISPKALDQYKATYAQQGAVILAVDSEVIAGQSQAGYSHAVVTPETLHKKTSTDGRIGLIEIATQAETNAGTDYTRAVTPKTLNDRKATEGLSGIAELATQVEFDTGTDDTRISTPLKIKTHFDSSDRTSVNSDSGLIEEGTLWNHYTLDISKANETQRGTLRVATQAESNAGTLDDVLITPKKLLGTKSTETSEGVIKVATRAETVTGTSANTAVSPKNLKWIVQSEPTWAATTAIRGFVKTSSGSITFVGNDTVGSTQPLESYEKNSYAISPYELNRVLANYLPLKAKAIDSNLLDGLDSSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTATFGGSVSANSTLTISNSGTATRLIFEKGPQTGTNPAQTMTIRVWGNEFGGGSDTTRSTVFEVGDETSNHFYSQRNKAGNITFSINGTVMPINVNASGTLNANGVATFGRSVTANGEFISKSANAFRAISGDYGFFIRNDGGSTYFMLTASGDQTGGFNGLRPLSINNQSGQITIGEGLIIANGATINSGGLTVNSRIRSQGTKTSDLYTRAPTSDNVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNATIGNLTIRDFLRIGNVRIIPDPVNKSVKFEWIE
Physico‐chemical
properties
protein length:943 AA
molecular weight: 101293,96620 Da
isoelectric point:5,39444
aromaticity:0,06999
hydropathy:-0,39077

Domains

Domains [InterPro]
DC_1209
STR
1–713
IPR048390
ATT
633–746
DC_1209
STR
652–930
A0A0F6THB5
1 943
Architecture
STR
ATT
STR
ATT
STR
STR 1-632 | ATT 633-746 | STR 747-792 | ATT 793-891 | STR 892-930 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia coli O157 typing phage 3
[NCBI]
1508678 Uroviricota > Caudoviricetes > Pantevenvirales > Tevenvirinae > Mosigvirus
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AKE45308.1 [NCBI]
Genbank nucleotide accession
KP869101 [NCBI]
CDS location
range 165596 -> 168427
strand -
CDS
ATGAACTACATGCGTAAAGGCCAAACAGTAAAAATTAAAGCTGCCGAAGGTGATACAATTGCTTCTTCTGTTGCATTGCTTCAGTTCCCTAAACGTTCTGAATATCCACCTGATGCTCAATGGGTCTCTGTTACTGAACTAGAATTCAATGGCACTACTTCATATGTTCCTGTATTAGAATTAGCGTATATCGAAGACACTGATTCTGACACTCACTATTGGGTTGTGCAACAGAATGTCCCTACGGTAGAACGTGTTGATGCTGGAAATGATGCTACACGAGCTCGTTTAGGTGTTATTGCTCTTGCTACTCAAGCGCAAGCAAATGTTGATTTAGAGAATACTCCAGCTAAAGAAGTGGCTATTACTCCGGAAACATTGGCAAATCGTACAGCAACTGAAGCTCGACGTGGTATCGCTAAAATTGCTACAACAGCACAAGTTAACCAGAATTCCACAGCATCATTTGTGGACGATACTATTGTTACGCCTAAAAAACTAAATGAGCGTACAGCAACTGAAACTCGTCGCGGTCTTGCTGAAATTGCTACTCAGACTGAAACCGATGCTGGTCTTGATGACACAACGATTATTACACCTAAGAAATTGCAGGCACGTCAGGGTTCTGAAACACTATCAGGTATAGTTAAATATGTATCAACTACTTCTGCTACTCCTGCTGAAACTCGTGGTGCTGCAGGCACTAACGTTTATAATAAAACCGTAAATAATTTAACTATTTCTCCTAAAGCCCTTGACCAATATAAAGCAACTTATGCTCAACAAGGTGCAGTAATTTTAGCTGTTGATAGTGAAGTAATTGCTGGTCAATCACAAGCAGGTTATTCTCACGCTGTAGTAACTCCTGAAACACTACATAAGAAAACTTCTACTGATGGACGTATTGGTTTAATTGAAATTGCTACGCAAGCAGAAACTAATGCTGGGACTGATTATACACGTGCAGTAACGCCTAAGACGTTAAATGATAGGAAAGCTACGGAAGGATTATCCGGCATAGCCGAACTTGCTACGCAAGTTGAATTTGATACTGGAACTGACGATACTCGTATCTCAACTCCACTGAAAATTAAAACTCATTTTGATTCTTCTGACCGTACCAGTGTTAATTCTGATTCCGGACTTATTGAAGAAGGAACCTTGTGGAACCATTATACTCTTGATATTTCTAAAGCAAATGAAACACAACGTGGTACACTTCGCGTAGCGACTCAGGCAGAATCTAATGCAGGAACTTTAGATGATGTTCTTATTACTCCTAAAAAGCTTTTAGGGACTAAGTCCACTGAAACGTCTGAAGGCGTAATTAAGGTTGCTACTCGGGCTGAAACTGTAACAGGAACTTCTGCTAATACTGCTGTATCTCCTAAGAATTTAAAATGGATTGTCCAGTCTGAACCAACATGGGCTGCTACTACGGCGATTCGTGGATTCGTTAAAACTTCATCTGGTTCTATTACATTCGTTGGTAATGATACAGTTGGTTCAACGCAACCTTTAGAATCATATGAAAAAAATAGCTATGCTATATCTCCATATGAATTAAACCGTGTACTTGCTAACTATTTGCCGTTGAAAGCTAAAGCCATAGATAGTAATTTATTAGATGGTCTAGATTCATCTCAGTTCATTCGTAGGGACATTGCACAGACGGTTAATGGTTCACTAACCTTAACCCAACAAACGAATCTGAGTGCCCCTCTTGTATCATCTAGTACTGCTACGTTTGGTGGTTCAGTTTCGGCAAATAGTACATTAACTATTTCTAACTCTGGAACAGCTACTCGACTGATTTTTGAAAAAGGACCTCAGACCGGAACAAATCCTGCTCAAACTATGACTATCAGAGTTTGGGGAAATGAATTTGGTGGTGGTTCAGATACAACACGTTCTACTGTATTTGAAGTTGGCGATGAAACATCTAATCACTTTTATTCTCAACGTAATAAAGCTGGGAATATAACGTTTAGCATTAATGGTACTGTGATGCCAATAAATGTTAACGCTTCAGGCACGTTAAATGCGAATGGCGTTGCAACATTTGGTCGTTCAGTTACAGCCAATGGTGAATTCATTAGCAAGTCTGCAAATGCTTTCAGAGCAATTAGTGGTGATTATGGATTCTTTATTCGCAATGATGGCGGCAGCACATATTTTATGCTTACTGCATCTGGTGATCAGACCGGTGGATTTAATGGATTACGTCCATTATCAATTAATAACCAATCCGGTCAGATTACAATTGGTGAAGGCTTAATCATTGCCAATGGTGCTACTATAAATTCTGGCGGTTTGACTGTTAACTCGAGAATTCGTTCTCAGGGTACTAAAACATCCGATTTATATACCCGTGCGCCAACATCTGATAATGTAGGATTCTGGTCAATTGATATTAACGATTCAGCCACTTATAACCAGTTCCCAGGTTATTTTAAAATGGTTGAAAAAACTAATGAAGTGACTGGGCTTCCATACTTAGAACGTGGCGAAGAAGTTAAATCTCCTGGTACATTGACTCAGTTTGGTAATACACTTGATTCACTTTACCAAGATTGGATTACTTATCCAACAACCCCAGAAGCACGTACAACCCGTTGGACTCGTACATGGCAGAAAACTAAAAATTCTTGGTCAAGTTTTGTTCAAGTATTTGATGGGGGTAACCCTCCTCAGCCATCTGATATTGGTGCTTTACCTTCTGATAATGCAACAATCGGAAACTTGACAATAAGGGATTTCTTAAGGATTGGTAATGTCCGCATTATTCCAGACCCTGTGAATAAATCTGTTAAATTCGAGTGGATTGAATAA

Genome Context

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0098024 virus tail, fiber Cellular Component IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
646d4bb8365142c15a26742a2997d4520ffc5ee903b97fedb57bf95c71dd38ab
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5700
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50