Protein

Genbank accession
YP_010659586.1 [GenBank]
Protein name
tail protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MNSHNPFNTGGNGCPNDFRSGDQLVDRIIGDAYHVVKEVYLALGNLTYIYNYLQKYGLIITVDSEEAIKDIPLSIGKFARIYNKSETAGYYFTDYLYVDDDTSGIKPNEPGATGSWVSTKATGSNASFVRIWKYHAVTDGETTIQLPTDLPIVGVQTIYVQGIRQDLNEGFTYNEGDATITLADELETGNLVTVIIGITDPDMDIDIFAALKGTDGASNIGTSFGITVEDALKRNAIVYATEFGVTGDGVACGYACVAAIEYIMQNGGTLVFPKGEVNWGTTRHKFAIYNGKPCTIKGSDGGTTWTFDNVDPVANAPGASWPFSEPFLVEFGGQPTTQGVFVEPVTVENLTIDYTRQANKGGPTHELMGVGAHPTPYSDGTLGLRFSYCRSPVVRNVRMNEIYGSGIQFWKCSMALAENNYLYNVSANQPLGSSGNESVDHFGWAIWSGASPKTIIRRNTAINKRVFVCDPALKSPNNNVVYNGTLCGYIGIFAEYGSNGGTSTIYPPDFEWQSDSSADKRTYVTIEDNLVWGYTMSVKSEAATSVRILNNTLLNHYIGVSVQASADVVGNYINGLQADLQKCPQNGFESQRGGIQLSWWASSAQDHRQYVAQNYVYSAAYNCISIGKAYATIDDNQFTITGSARFINSVTSFNVDLLEMRGNKLLLSENCTQTVPMRITNCSKVVMEANFVENKSTTVRMSVQLVNPVIRNNTFKGWVQLYLASAGSIVEDNTATDPTNNKGLVIQVASADDCTIRRNRITKYNADDEDQIIFLSKAARTVIEGNIVNYDAVTGAGRTKSIPIIKTFGTCFYTRILNNRVVGDSSATFGFTLFDGVSGARVLECRGNSTDNATRQMFTMYSQSGPWFISGNDWAQTYSSEINTAANIYASYKPVQGEKVPYLRPQAGGAEGIVYTASGWLTYGSIASS
Physico‐chemical
properties
protein length:929 AA
molecular weight: 101256,91860 Da
isoelectric point:5,06313
aromaticity:0,10764
hydropathy:-0,20366

Domains

Domains [InterPro]
YP_010659586.1
1 929
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoP_SP5M
[NCBI]
2750853 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_010659586.1 [NCBI]
Genbank nucleotide accession
NC_070869 [NCBI]
CDS location
range 64724 -> 67513
strand -
CDS
ATGAATTCTCATAACCCATTTAACACAGGCGGTAATGGTTGCCCTAATGACTTTCGGTCTGGTGATCAATTGGTTGACCGAATCATTGGTGATGCCTACCACGTAGTAAAGGAGGTATACCTTGCACTGGGTAACCTTACTTACATCTATAACTACCTACAAAAGTATGGCCTGATTATTACTGTAGATAGTGAGGAAGCAATCAAGGACATTCCACTCTCTATTGGGAAGTTTGCTCGTATTTATAATAAATCAGAGACGGCAGGTTACTACTTCACTGACTACCTTTATGTAGATGATGATACTAGTGGTATTAAGCCTAACGAACCCGGTGCTACTGGTTCTTGGGTTAGTACTAAAGCCACTGGTTCCAACGCTTCGTTTGTTCGTATTTGGAAATACCATGCAGTTACTGATGGTGAGACTACTATTCAGTTACCTACAGATTTGCCAATTGTTGGTGTACAAACCATCTATGTACAAGGCATTCGACAAGACCTCAATGAAGGTTTTACTTATAATGAGGGTGATGCAACTATTACCCTTGCTGACGAGTTAGAGACTGGTAACTTGGTTACAGTGATTATTGGCATCACTGACCCAGACATGGATATCGATATTTTTGCTGCACTTAAAGGAACCGACGGGGCTTCTAACATTGGTACATCATTCGGTATCACGGTAGAGGATGCCTTAAAGCGTAACGCTATAGTTTATGCTACTGAGTTCGGCGTGACTGGGGATGGTGTTGCCTGTGGTTATGCATGTGTGGCGGCTATTGAATATATTATGCAGAATGGTGGAACGCTAGTGTTCCCTAAAGGTGAGGTGAATTGGGGAACTACTCGCCATAAATTTGCTATCTATAATGGTAAACCTTGTACTATCAAAGGTTCCGATGGCGGAACCACCTGGACATTCGATAACGTTGATCCAGTTGCTAATGCCCCCGGTGCGTCATGGCCGTTCAGTGAACCGTTCCTTGTTGAATTTGGGGGACAGCCGACCACACAAGGCGTCTTTGTGGAGCCAGTAACTGTTGAAAATCTCACTATTGACTATACCCGTCAAGCTAACAAAGGTGGGCCTACCCATGAGCTGATGGGGGTAGGGGCGCATCCTACACCATATTCGGATGGCACTCTTGGTCTGCGTTTTAGTTACTGCCGTTCGCCAGTAGTACGAAATGTTAGGATGAACGAGATTTATGGCTCAGGTATCCAGTTCTGGAAATGTTCTATGGCTCTGGCAGAGAACAATTATCTGTACAACGTTTCAGCTAACCAACCTTTAGGTTCCAGTGGCAATGAGTCTGTCGATCACTTTGGTTGGGCTATCTGGTCTGGTGCAAGCCCGAAAACGATTATCCGACGTAATACTGCGATTAATAAGCGTGTTTTCGTATGTGATCCAGCATTGAAATCCCCTAATAATAACGTGGTGTATAACGGTACTCTTTGTGGGTATATAGGTATTTTCGCAGAGTACGGGAGTAATGGGGGTACTTCCACCATCTACCCACCGGACTTTGAATGGCAGTCAGATTCTTCCGCTGACAAGAGAACCTATGTAACTATAGAGGACAACCTCGTATGGGGTTACACCATGTCAGTGAAGTCTGAGGCAGCTACTTCAGTTCGAATTCTGAATAATACTTTGTTAAATCATTATATTGGTGTATCTGTTCAGGCATCAGCTGATGTTGTAGGCAACTACATCAATGGATTACAGGCTGATTTACAAAAATGCCCACAGAATGGATTTGAATCGCAGCGTGGTGGAATTCAGCTTTCTTGGTGGGCATCATCTGCACAAGACCACCGTCAGTATGTAGCTCAAAACTATGTTTACTCGGCTGCGTATAACTGCATTTCCATTGGTAAGGCTTACGCCACAATTGATGATAACCAGTTTACGATCACAGGGTCAGCTAGGTTTATAAACTCGGTGACTTCCTTCAATGTAGACTTGTTGGAGATGAGAGGAAACAAGCTCCTGTTGTCTGAAAATTGCACACAGACAGTCCCCATGCGTATTACCAATTGTTCGAAGGTGGTGATGGAAGCTAACTTCGTAGAAAATAAAAGCACCACAGTTCGTATGTCAGTGCAGCTGGTCAACCCTGTAATACGTAACAATACTTTCAAAGGGTGGGTTCAGTTATATCTGGCATCGGCTGGCAGTATAGTTGAAGATAACACAGCTACCGACCCCACTAATAATAAGGGATTAGTAATCCAAGTGGCATCGGCGGATGACTGCACTATCCGCAGGAACCGGATTACAAAATATAATGCTGATGACGAAGATCAGATAATCTTCTTGTCAAAGGCTGCTAGAACGGTCATCGAAGGCAACATCGTAAACTATGATGCGGTGACTGGTGCGGGGAGAACGAAGAGCATCCCGATAATCAAAACATTTGGTACGTGCTTTTACACTCGGATCCTTAACAACCGGGTTGTTGGTGATAGTAGTGCTACATTTGGTTTCACTCTTTTTGATGGTGTGTCTGGGGCGAGAGTACTAGAATGCCGTGGGAATAGCACAGATAATGCAACTCGGCAGATGTTCACAATGTATTCTCAGAGCGGGCCTTGGTTCATTTCTGGAAATGATTGGGCACAGACATATAGCTCTGAGATAAACACCGCAGCCAATATCTATGCATCCTATAAGCCAGTCCAAGGGGAGAAAGTACCCTATCTTCGACCACAAGCAGGGGGTGCTGAGGGGATAGTTTATACTGCATCTGGATGGTTAACTTATGGTAGTATCGCATCTTCTTGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4c4ad095d46e28918c5d005d2120a44274017da950e57e333dd6a46dd75896b6
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,2346
Evidence 0,2346

Literature

Title Authors Date PMID Source
Complete genome sequences of eight phages infecting swine Enterotoxigenic Escherichia coli Ferreira,A., Oliveira,H., Silva,D., Almeida,C., Burgan,J., Azered,J. and Oliveira,A. 2020-09-03 GenBank