Protein

Genbank accession
UPW39087.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence RBPdetect
Probability 0,84
TF
Evidence RBPdetect2
Probability 0,73
Protein sequence
MAITTKIIVQQILNIDDTKATASKFPRYTVTLGNSISSITANELVSSIEAAAKSAAAAKDSEIAAKTSELNAKNSEQEAAISAGASEASATQSATSATQSAASANKSAESAAAAKISETNAKTSETNAKTSETNAKTSETNADASAAAAKISETNAKASETNAAQSATDAKLSEGNAKVSETNAAQSAADSSGFRNEAEIFSGQAAASASAAKISETNAKTSETKAKASETNAAGSAASANQSVTTIQGLKSDVEQLKSDTQAIKNSAITETTALKADVEQLKSDTQAIKNSAVTETTALKAEVEQLKTDTQGIKDSAVSETTTLKNQAAASATQAGNSAVEAGQQASNAAGSANSSKVEADRAKAEADRAEIAANRAPDLQPFPDVWIPFNDSLDMLAGYSPGYKKITVGEDVITMPSDKVVSFSRASNATYINKHGEFCIANIDEPRFEKQGLLIEGQRTNHITFSNDPASLNTDRHRIDVTYGVDKYGFTYTTATVNQNGQGDTPNLFYCETANAINCQKNEYVSLSIRVKAGQDIYITPQFYLVGEDGGLILGARSVISCGTGEISSVVEGSGTIAHRIYREDNGWLKVEAMCKFVEIGGNSIGSVNYCRVDGQPLQVGDEISFCNPQFEKGFCASSFIITGSTPATRALDYVTIPARNNFFGTNISLLAEVSVNWDSFHLDNIYPMIVDNNHYFVRGKAFVAEFDRTSVTPYTYVVTEDGSTILSRGYSFEKQFSPHVFGFILRGDVEVTSFVNGNRGETSHGFTWKGTDADALVEIGGRLSDSTKLYGHIRNLRIWNRVLTDSQMREKV
Physico‐chemical
properties
protein length:815 AA
molecular weight: 86429,15570 Da
isoelectric point:5,08132
aromaticity:0,06626
hydropathy:-0,36061

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoS_ESCO40
[NCBI]
2918880 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UPW39087.1 [NCBI]
Genbank nucleotide accession
OM386660 [NCBI]
CDS location
range 72399 -> 74846
strand -
CDS
ATGGCAATAACTACTAAGATTATTGTACAACAAATATTAAATATTGATGATACTAAAGCTACTGCTAGTAAATTTCCTAGATACACAGTAACTCTTGGAAATTCTATTAGCTCTATTACTGCTAATGAGTTAGTATCCTCTATAGAGGCCGCTGCTAAGTCTGCTGCGGCTGCAAAAGATTCTGAAATAGCAGCTAAGACTTCAGAGCTTAATGCTAAGAACTCTGAACAGGAAGCTGCTATTTCTGCTGGAGCTTCAGAAGCTTCTGCTACTCAGTCTGCTACCTCTGCTACTCAGTCCGCAGCGTCAGCCAATAAGTCTGCAGAGTCCGCTGCCGCAGCTAAAATATCCGAGACTAATGCGAAGACTAGTGAAACTAATGCGAAGACTAGTGAAACTAATGCGAAGACTAGTGAAACTAATGCTGACGCATCAGCTGCTGCTGCAAAAATTAGTGAGACTAATGCAAAGGCAAGTGAGACTAATGCAGCTCAATCAGCTACTGATGCAAAACTTAGCGAGGGTAATGCAAAGGTCAGTGAGACTAATGCAGCTCAATCAGCAGCTGATTCTAGCGGTTTTAGGAATGAGGCGGAAATATTTTCTGGGCAAGCTGCTGCATCGGCATCTGCGGCAAAAATCTCTGAAACCAATGCAAAAACCTCGGAAACAAAAGCTAAGGCTAGCGAAACTAATGCGGCAGGGTCTGCAGCCTCCGCCAACCAATCTGTAACTACTATTCAAGGACTTAAATCAGATGTTGAACAATTAAAATCTGATACCCAGGCCATTAAAAATAGTGCTATAACAGAGACAACAGCTTTAAAAGCGGATGTCGAACAGTTAAAATCTGATACCCAGGCCATTAAAAATAGTGCTGTAACAGAGACAACAGCTTTAAAAGCAGAGGTTGAGCAATTAAAAACAGATACACAAGGTATTAAGGATAGCGCGGTATCTGAGACAACAACTTTAAAAAACCAAGCTGCTGCTTCTGCTACACAAGCAGGTAATAGTGCTGTTGAGGCCGGGCAACAAGCTAGCAATGCTGCTGGTAGTGCAAATAGCTCTAAAGTAGAGGCAGACCGTGCAAAAGCAGAAGCAGACCGTGCGGAAATTGCAGCTAACAGGGCTCCGGATCTTCAACCATTCCCTGACGTATGGATTCCCTTTAATGACTCGCTTGATATGCTTGCAGGTTACTCGCCAGGATACAAGAAAATAACAGTTGGTGAAGATGTTATTACAATGCCCTCGGATAAGGTTGTCAGTTTCTCCCGCGCATCAAATGCAACATATATAAACAAACACGGTGAGTTTTGTATTGCCAATATCGATGAGCCTAGATTTGAAAAGCAAGGACTCTTGATTGAAGGGCAGAGGACAAATCACATTACTTTCAGTAACGACCCGGCTTCTTTAAATACGGATAGACATCGTATCGATGTTACATATGGTGTTGATAAGTATGGTTTTACATACACAACGGCAACTGTAAACCAAAATGGTCAAGGTGATACGCCAAACCTATTTTATTGTGAAACAGCAAATGCGATTAACTGTCAGAAAAATGAATATGTCTCTTTATCCATACGAGTAAAAGCAGGCCAGGATATCTATATTACCCCTCAATTTTATTTAGTAGGTGAGGACGGAGGACTTATTCTCGGCGCTAGATCTGTCATAAGTTGCGGAACCGGCGAAATTTCTTCTGTAGTAGAAGGGAGCGGGACTATAGCACATAGAATATATAGAGAAGACAACGGATGGTTGAAAGTTGAAGCTATGTGTAAATTTGTGGAGATTGGCGGAAATTCTATTGGTTCGGTGAATTATTGTCGGGTGGATGGTCAGCCTTTACAAGTTGGGGATGAAATATCATTTTGTAACCCGCAATTTGAAAAGGGTTTTTGTGCATCCTCCTTCATTATTACGGGAAGCACGCCTGCAACAAGGGCTCTAGACTATGTTACTATACCAGCAAGGAATAATTTCTTCGGGACAAATATTTCATTACTTGCTGAGGTAAGTGTTAACTGGGATAGCTTTCACTTAGATAATATATACCCAATGATAGTGGATAACAACCACTACTTCGTAAGAGGAAAAGCATTTGTTGCGGAATTTGACAGAACCTCCGTTACACCTTACACGTATGTGGTAACTGAGGATGGATCAACTATACTGTCCAGAGGTTATTCATTTGAAAAACAATTTTCTCCACATGTATTTGGTTTTATTCTCCGTGGCGATGTAGAGGTCACCTCGTTTGTTAATGGTAATAGGGGTGAGACCTCACACGGTTTTACATGGAAAGGAACGGACGCGGACGCACTAGTAGAAATAGGCGGTAGACTTTCAGATTCAACTAAATTGTATGGTCATATTCGTAACCTTAGAATATGGAATAGAGTATTAACAGATAGTCAAATGCGGGAGAAAGTTTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
3e6987a04e60809ac64612c3afbdfb096764b25bbfffc6281210087cc3035129
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7081
Evidence 0,7081

Literature

Title Authors Date PMID Source
Avian Pathogenic Escherichia coli bacteriophages Nicolas,M., Trotereau,A. and Schouler,C. 2015-06 GenBank