Protein

Accession
ARU14756.1 [Not found in db]
Protein name
tail fiber protein and host specificity
RBP type
TF
Evidence Phold
Probability 1,00
Protein sequence
MQIWIHDKSMRKVCALNNEIPGMLPYTNSQWHPYLEYSTSTFDFTIPKIVNRKLHDDIKYINDQMFVSFYFDNSYHVFYVSKLVENDFSFQVTCNNTNLELAMEVARPLADSGGPKTIEWYLQNLELLGFAGLEIGVNEISDRTRTLTFESQSGTKLEQLHSLMNQFDAEFIFRTELNRDGTMKRFIIDIYQEADENHHGIGKARGDVVLYYQSGLKGVQVTSDKTQLFNAGNFIGQDGVNLNDVEFEEKNELGQVEFYSRKGTSFVFAPLSRERYPSTMNPDSADNWTRKDFQTEYKDVESLKAYALRTIKQYAYPLLTYTVDVQSSFLDNYKDINLGDTIKIVDNNFRGGLALEARVSEMIISFDNPTNNSVVFTNFRKLDNKPSSELQQRIDEIVSKSLPYHVEIRTTNGTVFKNGIGRSTVKPILKQGDKIVDATYRFVIDGTIKYSGMTYDMVASEINQPTTLTISAWVDNKEVASEEVTFLNVSDGKQGPQGPQGPKGADGKTPYVHFAYADSADGQKGFSLTQTGSKRYLGVYTDFNQAGSTNPADYTWSDTAGSVSVGGDNLITNSSFPKNLDNWGYWDAGSPNENLHIAKHGFYYNGTRTLFRLDCDNSYGVPAASRRFPVKRNTDYSLNIQMFATENIKGVNIYFLGRKSNETNEMFTKAVNIKHLEKSPSTTDVVKFHFTFNSGECDEGFIRVDNYGATYGTSSLFFTELDCYEGTTDRSWQASPEDLKDEIDTKADNALTQAQLNKLSEINSVMKAELEAKASLATVNQWIKAYQDFVNANSADRAQAQKALADASARVVKLENNLNDMSERWNFIDNYMAASNDGLVIGKKDNSSSIMFNPNGRISMFSAGNEVMYISKGVIHIENGIFSKTIQIGRFREEQDFINPDRNVIRYAGGI
Physico‐chemical
properties
protein length:911 AA
molecular weight: 102783,31640 Da
isoelectric point:5,20562
aromaticity:0,11416
hydropathy:-0,50670

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage P9902
[NCBI]
1971448 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ARU14756.1 [NCBI]
Genbank nucleotide accession
KY705289.1 [NCBI]
CDS location
range 14945 -> 17680
strand +
CDS
ATGCAAATCTGGATTCATGATAAAAGTATGCGTAAAGTGTGTGCTTTGAATAATGAAATTCCCGGAATGTTGCCATATACGAACAGTCAATGGCATCCATACCTTGAATACTCAACAAGTACGTTTGATTTTACAATTCCTAAAATTGTGAACAGGAAACTGCACGATGATATCAAATATATCAATGACCAGATGTTTGTATCATTCTATTTCGATAATTCCTATCATGTTTTTTATGTATCAAAACTCGTTGAGAATGATTTTAGTTTTCAAGTCACTTGTAATAACACCAACCTTGAATTGGCAATGGAAGTTGCACGACCACTTGCAGACAGTGGCGGTCCCAAAACTATTGAATGGTATCTTCAAAATCTTGAGTTGCTTGGTTTTGCAGGTCTGGAAATAGGTGTCAATGAAATTTCTGATAGAACAAGAACGCTTACTTTTGAATCTCAAAGTGGAACTAAACTAGAGCAACTTCATAGCTTGATGAATCAATTTGATGCAGAATTTATTTTCCGTACCGAATTAAACCGAGACGGAACTATGAAACGTTTCATCATCGACATCTACCAAGAAGCAGATGAAAACCATCACGGTATAGGTAAGGCAAGAGGAGATGTTGTTCTCTACTACCAAAGCGGATTGAAAGGCGTTCAAGTTACTAGTGATAAAACGCAACTTTTCAACGCTGGTAATTTCATTGGACAAGATGGCGTTAACCTAAACGATGTCGAATTTGAGGAAAAGAACGAGCTAGGACAAGTAGAGTTCTATTCTCGAAAGGGCACTAGCTTCGTTTTCGCCCCACTATCAAGGGAACGCTACCCATCTACCATGAATCCAGACAGCGCTGATAACTGGACACGTAAGGATTTTCAAACAGAATACAAGGACGTTGAATCCTTAAAAGCTTACGCCTTGCGTACTATCAAGCAGTATGCTTATCCACTATTGACTTACACAGTAGATGTTCAGTCTAGCTTTCTGGATAACTATAAAGACATCAATCTAGGTGATACCATCAAGATTGTGGATAACAATTTTAGAGGTGGTTTAGCCCTCGAAGCGCGTGTATCTGAAATGATTATCAGCTTTGACAATCCCACAAACAACTCGGTTGTTTTTACTAATTTCAGAAAATTGGATAATAAACCGTCTAGCGAATTGCAACAACGTATCGATGAGATTGTTTCTAAGTCATTGCCATATCATGTTGAGATAAGGACCACAAACGGTACAGTATTTAAAAATGGTATTGGTCGCTCTACTGTTAAACCAATTTTAAAGCAAGGCGATAAAATTGTTGATGCAACTTATCGATTTGTGATTGACGGTACTATTAAATATTCAGGTATGACTTATGATATGGTAGCGTCAGAGATTAACCAACCAACCACGCTTACTATCTCAGCTTGGGTAGATAATAAAGAAGTAGCTTCAGAGGAAGTTACTTTTTTAAATGTTTCAGATGGGAAACAAGGCCCACAAGGACCACAAGGACCTAAAGGTGCTGACGGGAAAACACCTTATGTTCACTTTGCTTATGCCGATAGTGCCGATGGTCAAAAGGGTTTCAGTTTGACACAGACTGGAAGTAAACGCTATTTAGGTGTGTACACCGATTTCAATCAGGCGGGCAGCACTAACCCAGCTGATTATACTTGGAGTGACACGGCTGGCAGTGTTTCGGTTGGTGGTGATAATTTAATCACTAACTCATCTTTCCCAAAAAATCTTGACAATTGGGGATATTGGGATGCTGGATCGCCTAATGAAAATCTTCATATAGCAAAACATGGTTTTTATTACAATGGCACAAGAACACTTTTTAGACTAGATTGTGATAATAGTTACGGGGTCCCTGCAGCATCAAGACGTTTTCCAGTTAAACGTAACACTGATTATTCTCTCAATATTCAGATGTTTGCAACTGAAAACATCAAAGGTGTAAATATCTATTTCCTCGGGCGCAAGTCAAATGAAACTAACGAGATGTTTACTAAAGCAGTTAATATCAAACATTTGGAAAAATCGCCGTCAACGACTGATGTCGTGAAGTTCCACTTCACATTTAATTCTGGTGAATGTGATGAAGGTTTCATCCGAGTTGACAACTATGGAGCGACTTACGGAACGTCTTCGCTATTCTTCACAGAACTAGACTGCTATGAGGGTACTACTGATAGATCGTGGCAAGCGTCACCGGAAGACTTAAAGGACGAGATAGACACTAAAGCTGACAATGCATTAACGCAGGCTCAGCTTAATAAATTAAGTGAAATTAATTCGGTGATGAAAGCAGAGCTCGAAGCAAAAGCATCCCTTGCCACTGTTAATCAATGGATTAAGGCTTATCAAGATTTTGTTAATGCAAACAGCGCAGATCGTGCGCAGGCTCAAAAGGCTTTGGCAGATGCCAGTGCACGAGTAGTGAAACTAGAAAATAACTTAAACGATATGTCAGAGCGTTGGAATTTTATCGATAACTACATGGCAGCATCAAATGATGGTTTGGTTATCGGGAAGAAAGATAACTCCAGCTCTATTATGTTCAATCCAAATGGGCGTATATCAATGTTTTCAGCTGGTAATGAGGTAATGTATATCTCTAAGGGTGTGATTCACATTGAGAACGGTATTTTTTCTAAGACTATTCAAATTGGACGATTCCGAGAAGAACAAGATTTTATTAATCCGGATCGCAACGTCATCCGCTATGCAGGAGGTATTTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
735262e28b112ccf9d0d59d7c5b0d24c6e73e2a6991eb95b5aba0f5008e616fe
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7715
Evidence 0,7715

Literature

No literature entries available.