Protein

Genbank accession
QHR70449.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TSP
Evidence RBPdetect
Probability 0,87
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MAENMITGSKGGSSKPYVPKEMEDNLISINKIKILLAVSDGECDPSFTLRDLYLDDVPVIADDGTVNYQGVKAEFRPGTQTQDYIQGFTDTSSEVTLARDITTSNPYVISVTNKTLSAIRIKMLMPTGIKQEDNGDLVGVKVTYAVDMAVDGDSYKEVLLDTIEGKTRSGYDRSRRIDLPAFNDRVLLRVRRVTADSASSRVTDLIKLQSYAEVIDAKFRYPLTGLVYVEFDSELFPNQIPNISIKKKWKLINVPSNYDPVMREYHGSWDGTFKKAWSNNPAWVLYDIITNQRYGLDQRELGVQVDKWSLYEAAQYCDQKVPDGKGGTEPRYLCDVVIQSQIEAYQLIRDICSIFRGMSFWNGESLSIVIDKPRDPSYIFTNDNVVDGDFQYTTASEKSMYTQCNVTFDDEQNMYQQDVEGVFDTEAALRFGYNPTSITAIGCTRRSEANRRGRWILKTNLRSTTVNFATGLEGMIPSIGDVIAIADNFHSSNLKLNLSGRVMEVSGLQVFAPFKIDARPGDFIIINKPDGKPVKRTISKVSGDGKTIELNIGFGFEVKPDTVFAIDRTDIALQQYVVTSIGKGDDDDEFTYSITAVEYDPNKYDEIDYGVNIDDRPTSIVQPDTMAAPENVQISSYSRIVQGASVETMVVSWDKVPYASLYEMQWRKGDGNWLNTPQTANKEIEVEGIYSGNYQVRVRSVSASGSTSPWSRIVTASLTGKVGEPGAPVNLTASDNEVFGIRVKWGMPEGSGDTAYIELHQSPDGTAENSSLLTLIPYPQYEYWHGTLPAGHVVWYRIRSVDRIGNVSGWTDFVRGMASDDVEAVLGNILDKIFDTEAGKDLKENAIDSANKIKDQAQSIIQNALANDADVRIMRKENGKRKAEFRQSIQLIADETEARVTAMTQLKAEFDEEITSEVTRLDQAIATESETRATAIEELKSQIGDDIQGQLTRVEEAIASETEARVSADTALTAKFGDVESALTEKLDSWAGVNGVGAQYAMKLGLTYNGQKYSAGMVMQLSQGSSGLISQILFDANRFAIMTSSTGGSYTLPFVVENNQVFINSLLVKNGSITNAMIGNFIQSNNYVWNQTGWRLDKNGTFENYGSTPGEGAMKMTNETISVRDANGVLRVQIGRLTGTW
Physico‐chemical
properties
protein length:1141 AA
molecular weight: 126473,23250 Da
isoelectric point:4,77865
aromaticity:0,08764
hydropathy:-0,36188

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage egaa
[NCBI]
2696393 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QHR70449.1 [NCBI]
Genbank nucleotide accession
MN850607 [NCBI]
CDS location
range 24375 -> 27800
strand -
CDS
ATGGCTGAAAATATGATAACGGGCAGTAAGGGTGGATCATCAAAACCTTATGTTCCGAAAGAGATGGAAGATAACCTGATCTCAATCAACAAAATCAAAATACTTCTTGCCGTTTCCGATGGTGAGTGCGATCCAAGTTTTACACTTCGCGATCTGTATCTTGATGATGTTCCGGTAATTGCAGACGACGGAACTGTTAACTACCAGGGTGTGAAAGCTGAATTTCGCCCTGGAACGCAGACGCAAGATTACATCCAGGGATTTACTGACACATCAAGCGAAGTGACGCTGGCGCGTGACATTACTACATCAAATCCTTATGTAATTTCTGTAACAAACAAAACATTATCGGCTATCAGAATCAAAATGCTAATGCCAACAGGCATTAAGCAAGAGGATAACGGCGATCTTGTCGGCGTTAAGGTTACTTATGCTGTTGATATGGCTGTTGACGGAGACTCTTACAAAGAAGTATTGCTAGACACCATCGAAGGTAAAACGCGTTCCGGTTACGACAGAAGCCGAAGGATTGACCTTCCGGCATTTAATGATCGCGTATTGCTTAGGGTTAGAAGGGTTACGGCAGACAGCGCATCCTCTCGCGTTACTGATCTGATTAAGCTACAAAGTTACGCTGAGGTTATTGATGCAAAATTCCGTTATCCTCTGACTGGTCTTGTATACGTTGAATTTGACAGTGAGTTGTTCCCTAACCAGATCCCTAACATTTCAATCAAGAAGAAATGGAAGTTAATTAATGTTCCGAGCAATTACGATCCGGTAATGCGAGAGTATCACGGTTCATGGGATGGTACTTTCAAGAAAGCGTGGTCGAACAATCCGGCGTGGGTTCTTTATGACATTATCACAAACCAGCGATACGGATTAGATCAGCGAGAACTTGGCGTGCAGGTTGACAAATGGAGTCTTTACGAAGCGGCGCAATACTGCGATCAGAAAGTGCCGGATGGAAAAGGCGGTACAGAGCCGCGTTATCTATGCGACGTTGTGATTCAAAGCCAGATTGAGGCTTATCAGCTTATTCGTGATATTTGCTCAATCTTCCGAGGCATGAGCTTTTGGAATGGCGAGAGCTTGTCAATCGTCATTGATAAGCCGCGCGATCCGTCGTACATCTTCACCAATGACAACGTTGTTGATGGTGATTTTCAGTACACAACAGCAAGCGAAAAGAGCATGTACACGCAGTGCAACGTGACGTTCGACGACGAACAAAACATGTACCAACAGGACGTAGAGGGCGTATTCGACACCGAGGCAGCATTGCGATTTGGATACAATCCTACAAGCATAACAGCGATTGGATGTACACGCAGGAGTGAGGCTAATCGGCGCGGTCGATGGATACTAAAAACTAACTTGCGCAGCACTACGGTAAACTTTGCTACTGGACTGGAAGGCATGATCCCATCAATAGGTGATGTGATTGCTATTGCTGACAACTTTCACAGCAGCAACCTTAAGTTAAACCTATCAGGGCGCGTGATGGAAGTTTCAGGCTTGCAGGTGTTCGCTCCGTTTAAGATTGACGCGCGACCAGGTGATTTCATTATCATCAACAAGCCAGACGGGAAGCCAGTTAAGCGCACAATCTCAAAAGTAAGCGGTGACGGAAAAACCATTGAGCTAAATATTGGGTTTGGATTTGAGGTTAAACCTGACACGGTTTTTGCAATCGACCGCACTGATATTGCATTGCAGCAATACGTTGTAACGAGTATCGGCAAAGGTGATGATGATGATGAATTTACATACTCCATCACGGCTGTTGAATATGACCCGAACAAATACGACGAGATTGATTATGGAGTAAACATTGACGACAGGCCAACTTCAATTGTCCAGCCTGACACAATGGCAGCGCCTGAAAATGTGCAAATATCCTCCTACTCGCGAATTGTCCAGGGTGCAAGCGTTGAAACAATGGTTGTGTCGTGGGATAAAGTACCTTACGCATCGCTGTATGAAATGCAGTGGCGAAAAGGTGATGGCAACTGGCTGAATACACCACAGACTGCAAACAAAGAAATTGAGGTTGAAGGTATTTATTCAGGAAACTACCAGGTAAGGGTTAGATCTGTTTCTGCTTCAGGTTCGACGTCGCCGTGGTCCAGAATTGTAACAGCTTCACTGACTGGTAAGGTAGGAGAGCCAGGCGCGCCAGTTAACTTAACTGCATCCGACAATGAGGTGTTTGGCATTCGTGTTAAGTGGGGGATGCCAGAAGGCAGCGGAGACACGGCATACATTGAGCTTCATCAGTCGCCGGATGGAACGGCTGAAAACTCAAGCCTGTTAACTCTGATCCCATATCCACAATATGAATACTGGCACGGTACGCTTCCGGCTGGTCATGTTGTATGGTATAGAATCCGCAGTGTTGACAGAATCGGCAACGTTTCCGGATGGACTGATTTTGTTAGAGGCATGGCTTCAGATGATGTGGAGGCTGTTTTAGGCAATATTCTTGATAAGATTTTTGATACCGAAGCAGGGAAGGATCTGAAAGAGAATGCCATTGATAGCGCAAACAAAATCAAGGATCAGGCGCAAAGCATCATTCAAAACGCATTGGCGAATGATGCAGACGTTAGGATTATGAGGAAGGAAAACGGAAAACGCAAAGCCGAATTCAGGCAATCAATACAGTTGATCGCAGATGAAACTGAGGCGCGCGTTACCGCAATGACGCAACTCAAGGCTGAATTTGACGAGGAAATAACTAGCGAAGTAACGAGGCTTGATCAGGCAATTGCAACAGAATCGGAAACGAGAGCAACAGCCATTGAGGAATTGAAATCACAGATTGGAGATGATATTCAGGGGCAGTTAACGAGAGTTGAGGAGGCGATTGCAAGCGAAACAGAGGCGCGCGTTTCTGCTGACACAGCATTAACAGCGAAGTTTGGAGATGTTGAATCAGCGCTGACGGAAAAACTTGATTCATGGGCTGGCGTTAATGGAGTTGGCGCACAGTACGCAATGAAACTTGGATTGACATACAACGGTCAGAAGTACAGTGCTGGTATGGTGATGCAGCTTTCGCAGGGTTCATCCGGCCTTATTTCGCAGATTTTGTTTGATGCGAACAGGTTCGCTATCATGACAAGTTCGACCGGAGGGTCGTATACATTGCCTTTTGTGGTTGAAAATAACCAAGTTTTCATTAACAGCCTGTTAGTGAAAAACGGATCAATCACAAACGCCATGATCGGTAACTTCATCCAATCAAATAATTACGTTTGGAATCAAACAGGATGGAGGCTGGATAAAAACGGAACGTTTGAAAATTATGGTTCTACTCCTGGAGAGGGGGCCATGAAAATGACAAACGAAACGATCAGCGTTAGGGATGCAAACGGCGTTTTGCGTGTTCAGATCGGTAGGCTTACTGGTACGTGGTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
0d885daf2d7e59d3b25646ea090fa82cf1f63e872bb74077af4aa97f30093e15
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,2334
Evidence 0,2334

Literature

Title Authors Date PMID Source
Exploring the Remarkable Diversity of Culturable Escherichia coli Phages in the Danish Wastewater Environment Olsen,N.S., Forero-Junco,L., Kot,W. and Hansen,L.H. 2020 GenBank