Protein

Genbank accession
YP_012026695.1 [GenBank]
Protein name
side tail fiber protein
RBP type
TF
Evidence RBPdetect
Probability 0,87
TF
Evidence RBPdetect2
Probability 0,66
TF
Evidence Phold
Probability 1,00
Protein sequence
MAITTKIIVQQILNIDDTKATASKFPRYTVTLGNSISSITANELVSSIEAAAKSAAAAKDSEIAAKTSELNAKNSEQEAAISAGASEASATQSATSATQSATSATKSAESAAAAKVSETNAKASETKAKTSETNAKTSETNAKSSETKAKASETNAKASETNAAASTAAAKISEDKAKISETNAAQSAAASNGFRNEAEIFSEQAAASASAAKISETNAKTSETKAKASETNAAGSATSASQSVTTIQGLKSDVEQLKSDTQAIKNSAVTETTALKADVEQLKTDTQAIKNSAVTETTALKADVEQLKIDTQGIKDSAVSETTTLKNQAAASATNAANSATEAGKQATNAANSANTAKTEADRSKTEADRSEAAANSTPDIQPLPDVWIPFNDSLDMITGFAPGYKKITVGDEEITLPSDKIVSFTRASTATYINKSGILTIAEVDEPRFERDGLLIEGQRTNYFLNSNSPALWNKTASIGIEESNDGRFNYGRVTVTNEAPNAAGYQILSMVIGNAISGTTGDYITVSCRAKAGSNSRLRMRLAKVTDGVAIFHSDSVLDLDTGKVTSGPLYTSRSVKDGDWWYFETTFGFDTDLSAVCRFEIVCPENESVIQTGASLNLATPQVETGSCASSFIITGGAPATRASDMVLIPTDCNQPSSIPLSLLVEVNRNWDIPPNSAPRIVHVANLPEDQLLVAFRAPSSDTVEPLPYSQLGVSQSYTPVSTKTSGKMVTGFVCNKNSELRCVTNAVFGAPVKTTWKAGLSKNLRIGGISADGGKHLFGHVRNFRIWHKELTDRQMRESV
Physico‐chemical
properties
protein length:804 AA
molecular weight: 84358,41180 Da
isoelectric point:5,39501
aromaticity:0,04726
hydropathy:-0,34950

Domains

Domains [InterPro]
Coil
247–267
YP_012026695.1
1 804
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EC100
[NCBI]
2894397 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_012026695.1 [NCBI]
Genbank nucleotide accession
NC_105587.1 [NCBI]
CDS location
range 81486 -> 83900
strand -
CDS
ATGGCAATAACTACTAAGATTATTGTACAACAAATATTAAATATTGATGATACTAAAGCTACTGCTAGTAAATTTCCTAGATACACAGTAACTCTTGGAAATTCTATTAGCTCTATTACTGCTAATGAGTTAGTATCCTCTATAGAGGCCGCTGCTAAATCTGCTGCGGCTGCAAAAGATTCTGAAATAGCAGCTAAAACCTCAGAACTTAATGCTAAGAATTCGGAGCAGGAAGCTGCTATTTCCGCTGGAGCTTCTGAAGCATCGGCTACCCAATCAGCTACTTCTGCTACTCAGTCTGCCACTTCTGCTACTAAATCTGCAGAATCAGCTGCAGCAGCTAAAGTATCCGAGACTAATGCGAAAGCTAGTGAAACTAAGGCTAAAACCTCCGAGACTAATGCAAAAACATCAGAAACTAATGCAAAGTCTAGTGAAACCAAAGCTAAGGCCTCCGAGACTAATGCTAAGGCTAGTGAAACTAATGCTGCTGCTTCGACTGCAGCTGCAAAAATTAGTGAAGATAAAGCAAAAATCAGTGAGACTAATGCAGCTCAATCAGCTGCTGCTTCTAACGGTTTTAGGAATGAGGCGGAAATATTCTCTGAGCAAGCTGCTGCATCGGCATCTGCGGCAAAAATCTCTGAAACCAATGCAAAAACCTCGGAAACAAAAGCTAAGGCTAGTGAAACTAATGCTGCAGGATCTGCAACTTCCGCCAGTCAATCTGTAACTACTATTCAAGGACTTAAATCAGATGTTGAACAGTTAAAATCTGATACCCAAGCCATTAAAAATAGTGCTGTAACAGAGACAACAGCTTTAAAAGCAGATGTTGAGCAATTAAAAACAGATACCCAAGCCATTAAAAATAGTGCTGTAACAGAGACAACAGCTTTAAAAGCAGATGTTGAGCAATTAAAAATAGATACACAAGGTATTAAGGATAGCGCGGTATCTGAGACAACAACTTTAAAAAACCAAGCTGCTGCTTCTGCAACTAATGCTGCAAATTCTGCTACTGAAGCTGGAAAACAAGCTACTAATGCTGCCAATAGTGCTAATACTGCTAAAACTGAAGCCGACCGTTCAAAAACTGAGGCTGATAGATCAGAAGCTGCTGCTAATTCTACCCCCGACATTCAACCTCTTCCAGATGTATGGATACCGTTTAACGATTCTCTAGATATGATCACCGGCTTTGCACCAGGCTATAAAAAAATAACAGTCGGTGATGAGGAAATAACACTGCCTAGCGACAAGATTGTTAGCTTTACCCGTGCATCAACTGCGACATATATTAATAAGTCCGGTATATTGACCATTGCTGAAGTTGACGAACCGCGATTTGAACGAGATGGTTTGCTTATTGAAGGGCAAAGAACTAATTATTTCTTAAATTCCAACTCACCGGCATTATGGAATAAAACAGCATCGATAGGTATCGAGGAGTCAAACGATGGTAGATTCAATTATGGGCGTGTAACTGTAACAAACGAAGCACCAAACGCTGCCGGCTATCAAATTTTGAGCATGGTGATAGGTAATGCTATTAGCGGAACAACGGGTGATTATATTACAGTGTCATGTCGTGCTAAAGCAGGGTCTAATTCTAGGCTTCGAATGAGGCTTGCAAAAGTAACTGATGGTGTAGCAATTTTTCATTCTGATTCAGTTCTCGATCTTGATACTGGTAAGGTAACTTCTGGTCCTTTATACACATCAAGATCGGTAAAAGATGGTGATTGGTGGTACTTTGAAACGACATTCGGATTTGATACAGATCTATCTGCTGTTTGTCGTTTTGAGATTGTATGCCCTGAAAATGAAAGTGTTATACAAACAGGTGCATCGCTTAACTTAGCTACACCTCAAGTAGAGACCGGATCGTGCGCTTCTTCATTCATAATCACAGGAGGTGCGCCAGCAACTCGCGCTAGTGACATGGTTTTAATACCAACTGATTGCAATCAACCATCATCTATACCATTAAGTCTACTTGTTGAGGTAAACAGAAACTGGGATATACCTCCAAACTCAGCCCCAAGGATAGTACATGTAGCAAATTTACCAGAAGACCAGTTATTAGTTGCTTTCAGAGCTCCATCAAGTGATACAGTAGAACCGCTGCCTTATTCTCAGTTGGGAGTAAGTCAGTCATATACGCCAGTATCAACAAAAACAAGCGGGAAAATGGTGACAGGTTTCGTTTGTAACAAAAACTCAGAATTAAGATGCGTAACAAACGCTGTGTTCGGTGCGCCTGTAAAAACAACATGGAAAGCTGGTTTATCTAAAAATCTTCGTATTGGCGGCATTAGTGCAGATGGTGGGAAACATCTTTTCGGGCATGTCAGAAACTTTAGAATCTGGCATAAAGAATTAACAGATCGTCAAATGAGGGAGTCTGTATGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
c9705f57ea72be6b116cc1c0824dfeeb5bcb694fbf7f77401589ca1be1b06aff
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6971
Evidence 0,6971

Literature

Title Authors Date PMID Source
Complete genome sequences of 17 Escherichia coli bacteriophages isolated from wastewater, pond water, cow manure and bird feces Vitt,A.R., Ahern,S.J., Gambino,M., Holst Sorensen,M.C. and Brondsted,L. 2022-10-20 GenBank