Genbank accession
ARU12974.1 [GenBank]
Protein name
tail fiber protein and host specificity
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect2
Probability 0,95
Protein sequence
MQIWIHDKSMRKVCALNNNVPGMLPYSNSQWHTYLEYSTSTLDFTIPKIVNGKLHDDLKYINDQMYVSFYYDNSYHVFYVSQLIENDFSFQVTCNNTNLELSAEIERPLASVDGAKTLEWYLQTLDLLGFAGLEVGFNEIPDRTRTITFESQNGTKLEQLHSLMNQFDAEFVFRTDLNRDGTLKKFVIDIYQRPDGNHHGIGKVRGDVVLYYQSGLKGVQVSSDKTQLFNAGLFLGKDGLNLGSVVFEEKNELGQVEFYSFKDSPMVYAPLSADKYPSAMGGANEIDRWTRRDFQTEYSDVDSLKAYALRTIKQYAYPLMTYTVSVQSSFIENYKDINLGDTVKIIDNNFRGGLALEARVSEMIISFDNPTNNSVVFTNFRKLDNKPSSELQQRIDEIVSKSLPYQVEIRTTNGTVFKNGIGRSTVKPILKQGDKIVDATYRFVIDGTIKYSGLTYDMIASEINQPTTLTIAAWVDNKEVASEEITFLNVSDGKQGPKGPQGPQGPKGDRGNDGIAGKDGVGLKTTTIIYGISDSDTAMPTNWTSQPPALIKGKYLWTKTVWTYTDNSSETGYQKTYIAKDGNDGNDGLPGKDGVGIVNTTLRYAKSTDGVNKPSGSVIAAISDKYQPSNSSTDNLMMTGQRVRLEQGKTYILSAETNGTFTNRHNPDQQSDNATIWLVNPSFSTWAVISDSNTANGTKYTHTRPTGDYKIRVNSYTPDNSTWVKNIVFEDGTWSPDIPTVNPGEYLWTRTTWFYSDGTSEQGFSVAKMGEQGPKGDRGDRGPQGVQGLQGPKGDQGIPGPKGADGKTQYTHIAYADTVSGSGFSQTDVNKAYIGMYQDFNAEDSKNPQDYRWSKWKGSDGRDGIPGKAGADGRTPYVHFAYADSADGRTGFSLTQTGNKRYLGVLTNFFKEDSTNPSDYTWNDTAGSVSVGGENLIINSAFPKNLDNWGFWETGLPNENLHIATHDFYYNDTKNLFRLDSDGKGVPASSRRFPVKRNTDYSLNIQTFATGNIKGVTIYFLGRKANETDKTFTKVVHVKTYSGSPSVTQAVKWHLTFNSGDCDEGYIRIDNNGTTDGKTSMLFFAELDCYEGTTDRAWQASTKDLEEEMGTKADAAMTIEQINALNERAAIIKAEMEAKASAEILNNWIKNYQDFVKANETERAAAEKALVNSSQRVSTIAKELGELSDRWNFIDTYMSTSNDGLVIGKNDGSSSIMFNPNGRISMYSAGSEVMYISQGVIHIENGIFSKTIQVGRYREEQYHLNPDMNVIRYVGGF
Physico‐chemical
properties
protein length:1277 AA
molecular weight: 142104,00030 Da
isoelectric point:5,31725
aromaticity:0,10807
hydropathy:-0,57792

Domains

Domains [InterPro]
IPR010572
ENZ
141–382
G3DSA:1.20.5.320
STR
491–540
DC_1971
STR
783–865
ARU12974.1
1 1277
Architecture
STR
STR 1-1277
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage P0091
[NCBI]
1971410 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Streptococcus thermophilus
[NCBI]
1308 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ARU12974.1 [NCBI]
Genbank nucleotide accession
KY705251.1 [NCBI]
CDS location
range 14759 -> 18592
strand +
CDS
ATGCAAATTTGGATTCATGATAAAAGCATGCGCAAGGTGTGTGCACTGAATAATAACGTTCCTGGCATGCTTCCGTACTCAAACAGTCAATGGCACACCTACCTTGAATACTCAACCAGTACACTTGACTTCACAATTCCTAAAATTGTAAATGGAAAACTTCACGATGATTTAAAATATATCAATGATCAGATGTATGTGTCGTTTTACTATGACAATTCCTATCACGTTTTCTATGTTTCTCAACTCATTGAGAACGATTTTAGTTTTCAAGTTACTTGTAACAATACCAACTTGGAACTATCAGCAGAAATAGAGCGTCCGTTAGCTAGTGTTGACGGTGCTAAAACACTTGAGTGGTATCTTCAAACCCTTGATTTACTTGGTTTTGCTGGCCTTGAAGTTGGTTTCAATGAGATTCCTGATAGGACAAGAACTATCACGTTTGAATCTCAAAATGGCACAAAATTAGAACAGCTTCATAGCTTGATGAACCAGTTTGATGCTGAGTTTGTTTTTCGTACTGATTTAAACCGAGATGGCACTTTGAAAAAGTTTGTCATTGACATCTACCAACGACCAGATGGAAATCATCATGGAATTGGTAAGGTTAGAGGTGATGTCGTTCTATACTATCAAAGCGGTCTTAAAGGCGTTCAAGTATCTAGTGATAAGACTCAACTCTTCAACGCTGGTCTTTTCCTCGGAAAAGATGGATTAAACCTAGGAAGCGTTGTGTTTGAGGAAAAGAATGAGTTAGGACAAGTAGAGTTCTACTCATTTAAAGACAGTCCGATGGTTTACGCACCTTTATCAGCAGATAAATATCCATCTGCAATGGGTGGTGCTAATGAAATAGATAGATGGACACGTAGGGATTTTCAGACAGAATACAGTGATGTTGATTCCCTCAAAGCTTATGCCTTGCGCACTATTAAGCAATATGCTTATCCTCTAATGACCTATACCGTAAGTGTTCAATCTAGTTTCATTGAAAACTACAAGGATATTAATCTAGGTGACACTGTTAAAATCATCGATAATAATTTTAGAGGTGGTTTAGCCCTCGAAGCGCGTGTATCTGAAATGATTATCAGCTTTGACAATCCTACAAACAATTCTGTTGTTTTTACTAATTTCAGAAAGTTGGATAATAAACCGTCTAGTGAATTACAACAACGTATCGATGAGATTGTTTCTAAATCATTGCCTTATCAAGTTGAGATAAGGACCACGAATGGAACAGTATTTAAGAACGGCATTGGTCGTTCTACTGTTAAACCAATTTTGAAACAAGGCGATAAAATTGTTGATGCAACTTATCGATTTGTGATTGACGGTACTATTAAATACTCAGGTCTGACCTATGATATGATAGCATCAGAGATTAACCAACCAACAACGTTGACGATTGCTGCGTGGGTAGATAATAAAGAAGTAGCTTCGGAAGAGATTACTTTCTTAAACGTCTCAGATGGTAAACAAGGACCTAAGGGCCCACAAGGACCACAAGGACCTAAAGGAGATAGAGGTAATGATGGAATTGCAGGTAAGGATGGGGTTGGATTAAAGACCACAACTATCATTTATGGAATAAGCGATAGTGACACTGCTATGCCTACTAACTGGACTAGTCAACCACCAGCATTAATTAAAGGGAAATACCTATGGACCAAAACAGTATGGACATATACTGATAATTCATCTGAAACAGGTTATCAAAAAACTTACATTGCCAAAGATGGTAACGATGGAAATGATGGCTTGCCAGGTAAAGATGGCGTTGGGATTGTTAATACTACCTTGCGTTATGCAAAATCAACGGACGGTGTCAATAAGCCGTCTGGTAGCGTAATCGCAGCGATTAGTGATAAATACCAACCATCTAATTCATCGACTGACAACTTAATGATGACTGGTCAACGTGTCCGATTAGAACAAGGTAAGACCTACATCCTATCTGCTGAAACCAATGGAACGTTTACCAATCGGCACAATCCCGACCAACAAAGCGATAATGCTACGATTTGGCTTGTCAATCCAAGTTTCAGTACATGGGCAGTTATTTCTGATAGCAACACGGCTAACGGTACGAAATACACTCATACCCGTCCTACAGGCGATTATAAAATCCGTGTTAACAGTTACACACCAGACAATAGCACTTGGGTTAAGAATATAGTATTTGAAGACGGCACTTGGTCGCCTGACATTCCAACAGTTAATCCCGGGGAATATCTCTGGACAAGGACTACGTGGTTCTATTCAGACGGTACGAGTGAGCAAGGTTTTTCCGTTGCTAAGATGGGTGAACAAGGTCCAAAAGGTGACCGTGGAGACCGTGGACCTCAAGGTGTTCAAGGATTACAAGGACCCAAAGGTGACCAAGGAATACCAGGACCAAAAGGTGCTGACGGAAAAACGCAATACACCCATATAGCTTATGCCGACACTGTTTCTGGTAGTGGTTTTAGTCAAACAGATGTCAATAAAGCCTATATTGGTATGTACCAAGACTTCAATGCCGAAGATAGCAAAAATCCACAAGACTATCGTTGGAGCAAGTGGAAAGGTAGTGATGGTCGTGATGGCATTCCTGGAAAAGCTGGAGCAGACGGACGGACTCCTTACGTCCACTTTGCCTATGCAGACAGCGCCGATGGTAGAACTGGTTTCAGTTTGACCCAAACTGGTAATAAACGCTATTTAGGTGTGCTTACCAACTTCTTCAAGGAAGACAGTACTAATCCTTCTGACTACACGTGGAATGATACGGCTGGCAGTGTTTCGGTTGGTGGTGAGAATCTAATCATTAACTCGGCTTTCCCGAAGAATCTTGACAATTGGGGATTTTGGGAAACGGGATTGCCTAACGAAAATCTTCATATAGCAACACATGATTTTTATTACAATGATACAAAAAATCTATTTAGACTAGATAGTGATGGTAAAGGGGTTCCTGCATCATCAAGACGTTTTCCAGTTAAACGTAACACTGATTATTCTCTCAACATTCAAACGTTTGCGACTGGAAATATCAAAGGTGTAACTATCTATTTTTTGGGTCGGAAGGCAAATGAAACTGACAAGACATTTACTAAAGTCGTGCATGTAAAAACATATTCTGGTTCACCATCGGTGACACAGGCGGTTAAATGGCACTTAACTTTCAACTCTGGAGATTGCGATGAAGGGTACATTCGCATTGATAATAATGGTACTACTGACGGTAAAACATCTATGCTATTCTTCGCTGAGTTGGACTGTTACGAGGGAACAACCGATAGAGCATGGCAAGCGTCAACTAAGGACTTAGAAGAAGAGATGGGAACTAAAGCCGATGCTGCTATGACGATTGAACAGATTAATGCACTTAATGAAAGGGCTGCAATCATTAAAGCAGAGATGGAAGCCAAAGCAAGCGCTGAAATTTTGAATAACTGGATTAAAAATTACCAAGATTTCGTTAAGGCAAACGAGACCGAGAGAGCTGCAGCCGAGAAAGCTTTGGTTAACTCAAGTCAGCGGGTATCAACCATCGCTAAGGAGTTAGGTGAACTGTCTGATCGTTGGAATTTCATCGATACTTACATGAGCACATCGAATGATGGGCTTGTGATTGGAAAGAATGACGGTAGCTCAAGCATTATGTTCAACCCTAACGGTCGAATTTCAATGTATTCAGCAGGGTCTGAAGTCATGTATATCTCGCAAGGTGTAATCCACATCGAAAACGGTATCTTCTCGAAAACAATCCAAGTTGGTCGATATCGTGAGGAGCAATACCATCTCAACCCAGACATGAATGTCATTCGATATGTAGGAGGATTTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
775e404c994acb43ac3a4a80d80f821476f27bf5e4169ad6c854217ee83caada
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7005
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Global Survey and Genome Exploration of Bacteriophages Infecting the Lactic Acid Bacterium Streptococcus thermophilus McDonnell,B., Mahony,J., Hanemaaijer,L., Neve,H., Noben,J.P., Lugli,G.A., Ventura,M., Kouwen,T.R. and van Sinderen,D. 2017 28955321 GenBank