Protein

Genbank accession
QHR65240.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence RBPdetect
Probability 0,78
TF
Evidence RBPdetect2
Probability 0,96
TF
Evidence Phold
Probability 1,00
Protein sequence
MAENMITGSKGGSSKPYVPKEMEDNLISINKIKILLAVSDGECDPDFTLRDLYLDDVPVIADDGTVNYQGVSAEFRPGTQTQDYIQGFTDTSSEVTLARDITTSNPYVISVTNKTLSAIRIKMLMPTGIKQEDNGDLVGVKVTYAVDMAVDGDSYKEVLRDTIEGKTRSGYDRSRRIDLPSFNDRVLLRVRRITADSASSRVTDLIKLQSYAEVIDAKFRYPLTGLVYVEFDSELFPNQIPNISIKKKWKLINVPSNYDPVSREYRGSWDGTFKKAWSNNPAWVLYDIITNQRYGLDQRELGVQVDKWSLYEAAQYCDQKVPDGKGGTEPRYLCDVVIQSQIEAYQLIRDICSIFRGMSFWNGESLSIVIDKPRDPSYIFTNDNVVDGDFQYTTASEKSMYTQCNVTFDDEQNMYQQDVEGVFDTEAALRFGYNPTSITAIGCTRRSEANRRGRWILKTNLRSTTVNFATGLEGMIPSIGDVIAIADNFHSSNLKLNLSGRVMEVSGLQVFVPFKIDARPGDFIIINKPDGKPVKRTISKVSDDGKTIELNIGFGFEVKPDTVFAIDRTDIALQQYVVTSIGKGDDDDEFTYSITAVEYDPNKYDEIDYGVNIDDRPTSIVQPDTMAAPENVQISSYSRIVQGASVETMVVSWDKVPYASLYEMQWRKGDGNWLNTPQTANKEIEVEGIYSGNYQVRVRSVSASGSTSPWSRIVTASLTGKVGEPGAPVNLTASDNEVFGIRVKWGMPEGSGDTAYIELHQSPDGTAENSSLLTLIPYPQYEYWHGTLPAGHVVWYRIRSVDRIGNVSGWTDFVRGMASDDVEAVLGDILDKIFDTEAGQDLKENAIDSANKIKDQAQAIIQNALANDADIKWTRVQNGKRKAEYGHALELIATETEARVTQIEELRASIDEDIVSSIKTVQEAIATESETRATQINALDSKFTTEIDGVKRDTAASINQVNQTIANESEARAQAVNALDAKFTKEIEDLNGVIKTEVEANISEVKQAIANETEARVQADQALTAKFGDVESALAQKLDSWAGVSSVGAKYSMKLGLTYNGQQYSAGMIMQLSQSSSGLISQILFDANRFAIMTSSTGGVYTLPFVVENNQVFINSLLVKNGSITNAMIGNYIQSNNFVANQQGWRLDKNGRFENYGSTSGEGAMKLTNETISVRDANGRLRVQIGRLTGTW
Physico‐chemical
properties
protein length:1192 AA
molecular weight: 132034,06720 Da
isoelectric point:4,77467
aromaticity:0,08473
hydropathy:-0,37240

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage grams
[NCBI]
2696401 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QHR65240.1 [NCBI]
Genbank nucleotide accession
MN850567.1 [NCBI]
CDS location
range 27364 -> 30942
strand +
CDS
ATGGCTGAAAATATGATAACTGGCAGTAAGGGTGGATCATCAAAACCTTATGTTCCAAAAGAGATGGAAGATAACCTGATCTCAATCAATAAAATCAAAATCCTTCTTGCAGTTTCCGATGGCGAGTGCGATCCAGATTTTACACTTCGCGATCTGTATCTTGATGATGTTCCGGTAATTGCAGACGACGGAACTGTTAACTACCAGGGTGTGAGCGCAGAGTTTCGCCCAGGCACACAGACTCAAGATTACATCCAGGGATTTACTGACACATCAAGCGAAGTGACGCTGGCGCGTGATATTACTACGTCAAATCCTTATGTGATTTCCGTAACCAACAAAACATTATCTGCTATCAGAATCAAAATGCTAATGCCAACAGGCATTAAGCAAGAGGATAACGGCGATCTTGTCGGCGTTAAGGTTACTTATGCTGTTGATATGGCTGTTGACGGAGACTCCTACAAAGAAGTATTGCGAGACACTATCGAAGGTAAAACTCGTTCCGGTTACGACAGAAGCAGAAGGATTGACCTTCCGTCATTCAATGATCGCGTATTACTTAGGGTTAGAAGGATTACGGCAGACAGCGCATCTTCTCGTGTTACTGATCTGATTAAGCTACAAAGTTACGCTGAGGTTATTGATGCAAAATTCCGTTATCCTCTGACTGGTCTTGTATACGTTGAATTTGACAGTGAGTTGTTTCCTAACCAGATCCCAAATATTTCAATCAAGAAAAAGTGGAAACTGATTAATGTACCAAGCAACTATGATCCTGTATCTCGAGAGTATAGAGGTTCATGGGATGGAACATTCAAAAAAGCCTGGTCAAATAATCCAGCATGGGTGCTTTACGACATCATTACAAATCAGCGTTATGGATTAGATCAGAGAGAGCTTGGTGTACAGGTTGATAAGTGGAGTCTTTACGAAGCTGCGCAATACTGCGATCAGAAAGTTCCAGACGGAAAAGGAGGCACAGAGCCACGCTATCTATGTGATGTTGTTATCCAGAGTCAGATTGAGGCTTATCAGCTTATTCGTGACATTTGCTCTATCTTCCGTGGAATGAGCTTTTGGAATGGCGAGAGTTTGTCAATCGTTATTGATAAACCACGCGATCCATCTTACATCTTCACAAATGATAACGTTGTTGATGGTGATTTCCAGTATACGACGGCAAGCGAAAAGAGCATGTACACGCAGTGCAACGTGACGTTCGACGACGAACAAAACATGTATCAACAGGACGTAGAGGGCGTATTCGACACCGAGGCAGCATTGCGCTTTGGATACAATCCTACAAGCATAACCGCGATCGGATGTACACGCAGGAGTGAGGCTAATCGGCGCGGTCGATGGATACTAAAAACCAACTTGCGCAGCACTACGGTAAACTTTGCTACTGGACTGGAAGGCATGATCCCATCAATAGGTGATGTGATTGCTATTGCTGACAATTTTCACAGCAGCAACCTTAAATTAAACCTATCAGGGCGCGTGATGGAAGTTTCCGGCTTGCAGGTGTTCGTTCCGTTTAAGATTGACGCGCGACCAGGTGATTTCATTATCATCAACAAGCCGGACGGAAAGCCAGTTAAGCGCACAATCTCAAAAGTAAGCGATGACGGAAAAACCATTGAGCTAAACATTGGGTTTGGGTTTGAAGTTAAGCCGGATACAGTTTTTGCAATCGACCGCACTGACATTGCGTTGCAGCAATATGTTGTAACGAGTATCGGAAAAGGTGATGATGATGATGAATTTACATACTCCATCACGGCTGTTGAATATGACCCTAACAAATACGACGAGATTGATTACGGAGTAAACATTGACGACAGGCCAACTTCAATTGTCCAGCCTGACACGATGGCAGCACCGGAAAACGTGCAAATATCCTCATACTCGCGAATTGTCCAGGGTGCAAGCGTTGAAACAATGGTTGTGTCGTGGGATAAAGTACCTTACGCATCACTGTATGAAATGCAATGGCGAAAAGGTGATGGCAACTGGCTGAATACACCACAGACCGCAAACAAAGAAATTGAGGTTGAAGGTATTTATTCAGGAAACTACCAGGTAAGGGTTAGATCTGTTTCTGCTTCAGGTTCTACGTCTCCGTGGTCCAGAATTGTGACAGCTTCACTGACTGGTAAGGTAGGAGAGCCAGGCGCGCCAGTTAACTTAACCGCATCCGACAATGAAGTTTTTGGCATTCGTGTTAAGTGGGGGATGCCAGAAGGGAGCGGAGACACGGCATACATTGAGCTTCATCAGTCGCCAGATGGAACGGCTGAAAACTCAAGCCTGCTAACGCTGATCCCGTATCCACAATATGAATACTGGCATGGAACTCTTCCGGCTGGTCATGTTGTATGGTATCGGATCAGGAGCGTAGACAGGATCGGCAACGTTTCCGGTTGGACTGATTTTGTTAGAGGTATGGCTTCAGATGATGTGGAGGCTGTTTTAGGAGATATTCTTGATAAGATTTTTGATACTGAAGCAGGTCAGGATCTGAAAGAGAACGCCATTGACAGTGCAAACAAGATAAAGGACCAGGCGCAAGCAATCATCCAGAATGCTCTTGCTAATGATGCTGATATTAAGTGGACGCGAGTACAAAACGGAAAGAGAAAGGCTGAATATGGTCACGCTCTTGAGCTTATAGCCACAGAGACTGAGGCACGAGTTACGCAGATTGAAGAATTGAGGGCGTCAATCGACGAGGATATTGTTTCAAGCATAAAGACAGTGCAGGAGGCTATTGCCACAGAATCTGAAACGCGAGCAACTCAAATAAACGCTTTGGATTCAAAATTTACAACTGAAATTGACGGTGTAAAACGCGATACAGCAGCAAGCATTAATCAGGTGAATCAAACAATTGCCAACGAATCTGAAGCGAGAGCGCAGGCCGTTAACGCTCTTGATGCGAAGTTCACAAAGGAGATCGAAGATTTAAACGGAGTTATTAAGACTGAGGTTGAGGCTAACATCTCTGAAGTGAAACAGGCTATCGCTAACGAGACAGAGGCAAGGGTGCAGGCTGACCAGGCGTTAACAGCTAAGTTTGGAGACGTTGAATCTGCATTAGCTCAAAAACTTGATTCATGGGCTGGCGTGTCGTCTGTTGGTGCTAAATACTCAATGAAACTTGGGTTAACTTACAACGGACAGCAGTACAGCGCAGGCATGATCATGCAGCTTTCGCAGAGTTCATCCGGCCTTATCTCGCAGATTTTGTTTGATGCGAACAGGTTCGCCATCATGACAAGTTCGACTGGCGGGGTGTATACGCTGCCTTTCGTGGTGGAAAATAACCAGGTATTCATTAACAGCCTGTTAGTGAAAAACGGTTCAATCACCAATGCGATGATCGGTAATTATATTCAGTCGAATAACTTTGTTGCTAATCAGCAGGGGTGGAGGCTGGATAAAAACGGCAGATTTGAGAACTACGGTTCTACATCTGGAGAAGGGGCCATGAAGTTGACCAACGAAACAATAAGTGTACGAGACGCAAACGGGCGCTTGCGTGTTCAGATTGGTAGGCTTACTGGTACATGGTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
788496d502c14e8074420d32c43ce51eeac83f79e2fae682d084d8a0de9d5c20
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7814
Evidence 0,7814

Literature

Title Authors Date PMID Source
Exploring the Remarkable Diversity of Culturable Escherichia coli Phages in the Danish Wastewater Environment Olsen,N.S., Forero-Junco,L., Kot,W. and Hansen,L.H. 2020 GenBank