Genbank accession
AYP29049.1 [GenBank]
Protein name
tail fiber protein and host specificity
RBP type
TF
Evidence Phold
Probability 1,00
Protein sequence
MQIWIHDKSMRKVCALNNEIPGMLPYSNSQWHPYLEYSTSTFDFTIPKIVNGKLHDDIKYINDQMHVSFYYDNSYHVFYVSQLVENDFSFQVTCNNTNLELAAEIERPLASVDGAKTLEWYLQKLELLGFAGLEIGFNEIPDRTRTLTFESQSGTKLEQLHSLMNQFDAEFIFRTELNRDGTMKRFIIDIYQEADKNHHGIGKARGDVILYYQSGLKGVQVTSDKTQLFNAGNFIGQDGVNLNDVEFEEKNELGQVEFYSRKGTSFVFAPLSRERYPSTMNPDSADNWTRRDFQTEYKDVESLKAYALRTIKQYAYPLLTYTVDVQSSFLDNYKDINLGDTVKIIDNNFRGGLALEARVSEMIISFDNPTNNSVVFTNFRKLDNKPSSELQQRIDEIVSKSLPYHVEIRTTNGTVFKNGIGRSTVKPILKQGDKIVDATYRFVIDGTIKYVGMTYDMVASEITQPTTLTISAWVDNKEVASEEVTFVNVSDGKQGPKGADGKTPYVHFAYADSADGQKGFSLTQTGTKRYLGVYTDFNQADSTNPADYTWSDTAGSVSVGGDNLIVNSAFPKNLNNWGFWEPELPNENLHIATHVFYYNAARNLFRLDDNSNSGVPAASRRFPVKRNTDYSFNIQTFATGNIKGLTIYFLGRKANETDKAFTNVVNLKTHTGSPSVTQAVKWHLTFNSGDCDEGFIRIDNSGTTDGNTSSLFFAELDCYEGTTDRAWQASSKDLEEEIDTKADDVLTQAQLNRLNETNSIIKAELNAKASLDTLNQWVEAYQNFVNANNANRAQAEKDLADASARVTKLENDLNDMSERWNFIDSYMAASNEGLVIGKKDESSSIMFNPNGRISMFSAGNEVMYISKGVIHIENGIFSKTIQIGRYREEQDLLNPDRNVIRYVGGA
Physico‐chemical
properties
protein length:906 AA
molecular weight: 102226,42470 Da
isoelectric point:5,05790
aromaticity:0,11038
hydropathy:-0,50243

Domains

Domains [InterPro]
DC_0002
STR
1–906
IPR010572
ENZ
141–381
AYP29049.1
1 906
Architecture
STR
STR 1-906
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage SW6
[NCBI]
2419655 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Streptococcus thermophilus
[NCBI]
1308 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AYP29049.1 [NCBI]
Genbank nucleotide accession
MH892351.1 [NCBI]
CDS location
range 15733 -> 18453
strand +
CDS
ATGCAGATTTGGATTCATGACAAAAGCATGCGGAAAGTGTGTGCATTGAATAATGAAATTCCCGGAATGTTACCATACTCTAATAGTCAATGGCACCCTTACCTTGAATATTCAACAAGTACATTTGATTTTACAATTCCTAAAATCGTAAACGGTAAGTTACACGATGATATCAAATATATCAATGACCAGATGCACGTGTCATTTTACTACGACAATTCCTACCACGTTTTCTATGTTTCTCAACTCGTTGAAAATGATTTTAGTTTTCAAGTGACTTGTAATAATACAAACTTGGAACTAGCAGCAGAAATAGAGCGTCCTTTAGCTAGTGTTGACGGTGCTAAAACTCTTGAATGGTATCTTCAAAAACTTGAGTTACTTGGTTTTGCTGGTCTTGAAATTGGTTTTAATGAGATTCCTGATAGAACAAGAACGCTTACTTTTGAATCTCAAAGTGGAACTAAACTAGAGCAACTTCATAGCTTGATGAATCAATTTGATGCAGAATTTATTTTCCGTACCGAATTAAACCGAGACGGAACTATGAAACGTTTCATCATCGACATCTACCAAGAAGCAGATAAAAACCATCACGGTATAGGTAAAGCTAGAGGAGATGTCATTCTCTACTATCAAAGCGGATTGAAAGGCGTTCAAGTTACTAGTGATAAAACGCAACTTTTCAACGCTGGTAATTTCATTGGACAAGATGGCGTTAACCTAAACGATGTCGAATTTGAGGAAAAGAACGAGCTAGGACAAGTAGAGTTCTATTCTCGAAAGGGCACTAGCTTCGTTTTCGCCCCACTGTCAAGGGAACGCTACCCATCTACCATGAATCCAGACAGCGCTGATAACTGGACACGTAGGGATTTTCAGACAGAATACAAGGACGTTGAATCCTTAAAAGCTTACGCCTTGCGTACTATCAAGCAGTATGCTTATCCACTATTGACTTACACAGTAGATGTTCAGTCTAGCTTTCTGGATAACTATAAAGACATCAATCTAGGTGACACTGTTAAAATCATCGATAATAATTTTAGAGGTGGTTTAGCCCTCGAAGCGCGTGTATCTGAAATGATTATCAGCTTTGACAATCCCACAAACAACTCGGTTGTTTTTACTAATTTCAGAAAATTGGATAATAAACCGTCTAGCGAATTACAACAACGTATCGATGAGATTGTTTCTAAGTCATTGCCATATCATGTTGAGATAAGGACCACGAATGGTACAGTATTTAAGAATGGTATTGGTCGTTCTACTGTTAAACCAATTTTGAAACAAGGCGATAAAATTGTTGATGCAACTTATCGATTTGTGATTGACGGAACTATTAAATACGTAGGTATGACTTACGATATGGTAGCGTCAGAGATAACTCAACCAACAACGCTTACTATCTCAGCTTGGGTAGATAATAAAGAAGTAGCTTCAGAAGAAGTTACTTTTGTAAATGTATCAGATGGTAAACAAGGACCTAAAGGTGCTGACGGGAAAACACCTTATGTTCACTTTGCTTATGCCGATAGTGCCGATGGTCAAAAGGGTTTCAGTTTAACCCAGACTGGTACCAAGAGGTATTTAGGTGTGTACACAGATTTCAATCAAGCGGACAGCACTAACCCAGCTGATTATACTTGGAGTGACACGGCTGGCAGCGTTTCGGTTGGTGGTGATAATTTAATCGTTAACTCAGCTTTTCCAAAGAATCTTAACAATTGGGGATTTTGGGAACCGGAATTGCCTAATGAGAATCTTCATATAGCAACACATGTATTTTATTATAATGCTGCAAGAAACCTGTTTAGGCTAGATGATAATAGCAATAGTGGGGTTCCTGCTGCATCAAGACGTTTTCCAGTCAAACGCAACACAGACTACTCGTTCAATATTCAGACATTCGCTACTGGTAATATCAAGGGCTTAACTATCTATTTTTTGGGTCGGAAGGCAAATGAAACTGACAAGGCATTTACTAACGTCGTGAATCTCAAAACACATACAGGTTCACCGTCAGTAACACAAGCTGTTAAATGGCACTTAACGTTTAATTCTGGTGATTGTGATGAAGGATTCATCCGTATAGACAACAGTGGAACGACTGACGGTAATACGTCTAGTCTATTCTTCGCTGAATTAGACTGCTATGAGGGAACCACTGACCGAGCGTGGCAAGCGTCGTCGAAAGATTTGGAAGAGGAAATAGACACCAAAGCCGATGATGTCCTAACACAAGCACAACTCAACAGACTGAACGAAACGAACTCTATTATTAAAGCTGAATTAAACGCTAAAGCATCACTTGATACACTCAATCAGTGGGTGGAAGCCTATCAAAATTTTGTTAACGCAAACAATGCCAATCGTGCACAAGCTGAAAAAGATTTAGCTGATGCAAGTGCTCGTGTAACTAAACTAGAAAACGACTTAAATGATATGTCAGAACGTTGGAATTTTATCGATAGCTACATGGCAGCATCAAATGAAGGTCTTGTTATTGGTAAAAAAGATGAATCAAGCTCTATCATGTTCAATCCAAACGGGCGTATCTCAATGTTCTCAGCTGGAAACGAGGTAATGTACATTTCAAAAGGTGTCATCCATATCGAAAACGGTATTTTCTCTAAAACTATCCAAATCGGACGATATCGAGAGGAACAAGATTTATTGAATCCAGACCGTAATGTCATTAGATACGTAGGAGGTGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
71e85a9be95ee7abfdf891b943f58e79eb156518f6c1e9e1318e22592f211af0
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7636
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Biodiversity of Streptococcus thermophilus Phages in Global Dairy Fermentations Lavelle,K., Martinez,I., Neve,H., Lugli,G., Franz,C., Ventura,M., Bello,F., Sinderen,D. and Mahony,J. 2018 GenBank