Protein

Genbank accession
QHR67350.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TSP (Evidence: RBPdetect; Probability: 0.84)
TF (Evidence: RBPdetect2; Probability: 0.96)
Protein sequence
MAKYMISGSKGGSKKPYVPKEMEDNLISINKIKVLLAVSDGECDPDFTLRDLYLDDVPVIASDGTVNYEGVTAEYRPGTQTQDYIQGFTDTSSEVTVSRDITTDNPYIISVTNKNLSAIRIKILMPVGIKQEDNGDLVGVRVEYAVDMAIDGGSYSEVMRDVIDGKTRSGYDRSRRIDLPKFDERVLIRVKRLTPDSTSSKVTDKIKLQSYAEVVDAKFRYPLTGLVFVEFDSELFPTQIPNISIKKKWKIINVPSNYDPISREYHGSWDGTFKKAWSNNPAWVLYDLVTNQRYGLDQRELGIQIDKWSLYEAGVYCDQKVPDGKGGTEPRYLCDVVIQNQVEAYQLIRDICSIFRGMSFWNGESLSIVIDKPRDPSYVFTNENVINGDFQYINASEKSMYTQCNVTFDDEQNMYQQDVEGVFDTEAALRFGYNPTSITAIGCTRRSEANRRGRWVLKTNLRSTTVNFATGLEGMIPSIGDVIAIADNFQSSNLTLNLSGRVMEVSGLQVFVPFKVDARPGDFIIINKPDGKPVKRTISKVSADGKTIELNIGFGFDVKPDTVFAIDRTDLALQQYVVTTISKGDDENEFTYSITAVEYDPNKYDEIDYGVNIDDRPTSIVQPDVMAAPENVKISSYSRVVQGVSVETMVVSWDKVPYASLYEMQWRKGDGNWLNTPQTANKEIEVEGIYSGNYQVRVRSVSASGNASPWSKIATATLTGKVGEPGAPINLTASDNEVFGIRVKWGMPEGSGDTAYIELHQSPDGTVENSSLLTLIPYPQYEYWHSTLPAGQVVWYRIRSVDRIGNVSSWTDFVRGMASDDVESVLGDILDKIFDTEAGQEIKENAIDSANKIKDQAQSIIQNALANDADVKWTRVQNGKRKAEYGHALELIANETEARVTQIEELRASIDGEITSSIKTVQEAIATESETRATQIQQLDSKFTKEIDGVRKDTSASISDVRQTITNESEARAQAVQQLDAKFTKEINDLDGVIKTEVEANISEVKQAIANETEARVQADQALTARFGDVESALVEKLDSWASVDSVGAKYAMKLGLTYKGQQYSAGMVMQLSQGSSGLISQILFDANRFAIMTSSTGGTFTLPFVVENNQVFINSLLVKNGSITNAMIGNVIQSNNFVQNQQGWRLDKNGIFENYGSTPGEGATKFTNEGLKVKDANGVLRVEVGRITGSW
Physico-chemical properties
protein length: 1192 AA
molecular weight: 132244.60630 Da
isoelectric point: 4.81963
aromaticity: 0.08557
hydropathy: -0.37366
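
The length, aromaticity, and hydropathy values above can be recomputed from the protein sequence. The sketch below is a minimal illustration, assuming that aromaticity is the fraction of F, W, and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the record does not state which method the database uses, and the function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# N-terminal fragment of the protein sequence above, used only as a small
# demo input; paste the full sequence to compare with the listed values.
fragment = "MAKYMISGSKGGSKKPYVPKEMEDNLISINKIKVLLAVSDGECDPDF"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Small deviations from the listed figures would be expected if the database uses a different scale or rounding.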

Domains

Domains [InterPro]

Taxonomy

Phage
Name: Escherichia phage ityhuna [NCBI]
Taxonomy ID: 2696410
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
No host information

Coding sequence (CDS)

Genbank protein accession
QHR67350.1 [NCBI]
Genbank nucleotide accession
MN850582 [NCBI]
CDS location
range 25175 -> 28753
strand +
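
As a consistency check, the CDS range can be related to the protein length reported for this entry. The sketch below assumes 1-based, inclusive coordinates (the GenBank convention) and a single terminal stop codon; the helper name is hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


start, end = 25175, 28753            # CDS location from this record
nt = cds_length(start, end)          # 3579 nucleotides
codons, remainder = divmod(nt, 3)    # 1193 codons, remainder 0
residues = codons - 1                # drop the stop codon: 1192 residues

print(nt, codons, remainder, residues)
```

The result, 1192 residues, agrees with the protein length given in the physico-chemical properties.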
CDS
ATGGCTAAATATATGATAAGCGGCAGTAAGGGCGGAAGCAAAAAGCCATACGTGCCAAAAGAGATGGAAGATAACCTGATCTCGATAAACAAGATTAAAGTTTTGCTGGCTGTATCTGATGGCGAGTGCGATCCAGATTTCACGTTGCGCGATCTTTATCTTGATGATGTTCCGGTTATTGCCAGCGATGGCACTGTTAACTACGAGGGAGTTACGGCTGAATATAGACCAGGCACACAGACGCAAGATTACATCCAGGGGTTTACTGACACATCAAGCGAGGTGACAGTTTCGCGAGACATTACAACAGACAATCCCTATATTATCTCTGTGACAAACAAAAATCTTTCTGCAATAAGAATCAAGATTCTGATGCCAGTTGGCATAAAACAGGAGGATAACGGCGATCTTGTTGGCGTAAGGGTTGAGTATGCCGTAGATATGGCTATTGATGGCGGTTCTTATAGCGAGGTTATGAGAGATGTAATTGACGGCAAGACAAGATCAGGATACGACCGCAGCAGAAGGATTGATCTTCCTAAGTTTGATGAGCGCGTTTTAATCCGAGTAAAGCGACTGACTCCAGACAGCACATCTTCAAAGGTGACTGATAAAATCAAGCTGCAAAGTTACGCTGAGGTTGTTGATGCAAAATTCCGTTATCCTCTGACTGGCCTTGTATTCGTAGAATTTGACAGCGAATTGTTTCCTACGCAAATCCCTAACATTTCTATAAAAAAGAAATGGAAGATTATTAATGTGCCAAGCAACTATGATCCAATATCAAGAGAATATCACGGGTCATGGGATGGGACTTTTAAAAAAGCGTGGTCAAATAATCCGGCTTGGGTTCTTTATGATCTGGTGACAAATCAGCGTTACGGACTTGATCAGCGAGAGTTAGGAATACAGATCGACAAGTGGAGCTTATACGAGGCGGGCGTTTACTGTGATCAGAAAGTTCCAGACGGCAAGGGTGGTACAGAGCCTCGCTACCTATGCGATGTGGTGATTCAGAATCAAGTTGAGGCTTATCAGCTAATCCGTGACATTTGCTCAATCTTTCGCGGAATGAGTTTTTGGAATGGTGAGAGCTTATCAATCGTGATTGATAAGCCGCGCGATCCATCATACGTGTTTACTAATGAAAACGTCATCAACGGTGATTTTCAGTACATAAACGCAAGCGAAAAAAGCATGTACACGCAGTGTAACGTGACGTTTGACGACGAACAAAACATGTATCAGCAGGACGTAGAGGGGGTTTTTGATACTGAGGCTGCATTACGATTTGGATACAATCCAACAAGCATTACAGCGATCGGTTGTACACGCAGGAGCGAAGCGAATCGTCGCGGTCGGTGGGTTTTGAAAACAAACCTTAGAAGCACTACTGTAAACTTTGCTACCGGACTAGAGGGGATGATTCCATCAATAGGTGATGTGATTGCTATCGCTGATAATTTTCAGAGCAGCAACCTAACGTTAAACCTATCGGGCCGAGTGATGGAAGTTTCAGGATTGCAGGTTTTCGTTCCGTTTAAGGTTGATGCTCGCCCTGGTGATTTTATTATCATCAACAAGCCGGACGGCAAGCCAGTTAAGCGCACGATCTCAAAGGTTAGCGCAGACGGAAAAACCATTGAGTTAAATATTGGATTTGGTTTTGATGTTAAGCCTGATACTGTTTTTGCGATTGACCGTACTGATCTTGCGTTGCAGCAATACGTTGTGACAACCATCAGCAAGGGTGATGACGAAAACGAGTTTACCTATTCAATCACGGCTGTAGAGTACGATCCGAACAAATACGACGAGATTGATTATGGAGTAAACATTGATGACAGACCGACTTCAATTGTTCAGCCTGACGTGATGGCAGCGCCTGAGAACGTTAAGATCTCATCTTATTCTCGCGTCGTGCAGGGTGTTAGCGTTGAGACTATGGTTGTTTCATGGGATAAGGTTCCTTACGCATCGCTTTATGAAATGCAGTGGCG
AAAAGGTGATGGTAACTGGCTGAATACGCCGCAGACCGCTAACAAAGAGATAGAGGTAGAAGGGATTTACTCTGGCAACTACCAAGTAAGGGTGAGATCCGTTTCTGCAAGCGGTAACGCTTCCCCGTGGTCAAAGATTGCAACCGCCACTCTGACAGGTAAAGTTGGCGAGCCAGGAGCGCCGATTAATCTTACGGCTTCTGATAATGAAGTTTTTGGCATTCGTGTCAAATGGGGTATGCCGGAAGGATCAGGCGATACGGCTTACATTGAGCTTCACCAATCGCCAGACGGAACGGTTGAAAACTCAAGTCTGCTTACGCTGATTCCATATCCTCAATATGAGTATTGGCATAGCACGTTACCAGCGGGGCAAGTTGTATGGTATAGAATCCGCAGCGTTGACAGAATAGGCAACGTTTCCAGTTGGACTGACTTTGTTCGCGGTATGGCGTCAGATGATGTTGAATCTGTTTTGGGCGACATTCTGGACAAGATTTTTGATACAGAAGCTGGTCAAGAAATCAAAGAGAACGCCATAGACAGCGCCAATAAAATCAAAGACCAGGCACAATCAATCATACAGAACGCGTTGGCAAATGATGCAGATGTGAAGTGGACGCGAGTGCAAAACGGAAAGCGCAAGGCTGAATATGGTCATGCTCTTGAGCTTATCGCCAATGAAACAGAAGCGCGCGTAACTCAAATCGAAGAGTTAAGGGCATCAATTGATGGCGAGATAACATCAAGCATCAAGACAGTGCAGGAGGCAATTGCCACTGAATCAGAGACGCGAGCGACTCAAATTCAGCAGCTTGATTCTAAATTCACAAAAGAAATCGACGGCGTGCGCAAGGATACTTCTGCAAGCATTAGCGATGTAAGGCAGACAATCACTAACGAGTCAGAAGCGCGCGCTCAGGCCGTTCAGCAGCTTGACGCTAAGTTCACGAAAGAGATAAACGACCTTGACGGAGTTATCAAAACAGAAGTCGAGGCTAACATCTCAGAAGTGAAACAGGCGATCGCCAATGAGACAGAGGCAAGGGTTCAGGCTGACCAGGCATTAACAGCACGATTTGGCGACGTTGAATCTGCATTGGTTGAAAAGTTGGATTCTTGGGCGAGCGTTGATTCAGTTGGCGCTAAATACGCTATGAAACTTGGCCTTACTTACAAAGGCCAGCAATACAGCGCAGGAATGGTGATGCAGCTTTCGCAGGGTTCATCCGGCCTTATCTCGCAAATTTTGTTTGATGCTAACAGGTTCGCCATTATGACTAGCTCTACTGGAGGGACTTTTACTTTGCCTTTCGTGGTTGAGAATAATCAGGTTTTCATTAATAGTCTTTTGGTGAAGAACGGTTCAATCACTAATGCGATGATTGGTAATGTGATTCAGTCAAACAACTTTGTTCAAAACCAGCAAGGATGGAGGCTTGATAAAAACGGAATCTTTGAGAATTACGGATCAACGCCAGGAGAAGGAGCTACTAAATTCACCAATGAGGGATTGAAGGTAAAAGATGCAAACGGAGTATTGAGGGTTGAAGTCGGAAGGATTACCGGAAGCTGGTAA
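
The CDS above can be translated to check it against the protein sequence. The sketch below is a minimal illustration using the standard codon assignments; it assumes an uppercase DNA sequence on the plus strand, ignores any whitespace or line breaks inside the sequence, and stops at the first stop codon. The function name is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()   # drop whitespace and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":                    # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS above.
print(translate("ATGGCTAAATATATGATAAGC"))
print(translate("GGAAGCTGGTAA"))
```

Passing the full CDS should reproduce the protein sequence given for this entry if the record is internally consistent.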

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
c194497c66696bd839f7449e20f0f845006b27cac66e7efa96bd8d752a67b502
ColabFold
Source ColabFold
Method ColabFold
Resolution 0.2368
Evidence 0.2368

Literature

Title: Exploring the Remarkable Diversity of Culturable Escherichia coli Phages in the Danish Wastewater Environment
Authors: Olsen, N.S., Forero-Junco, L., Kot, W. and Hansen, L.H.
Date: 2020
PMID: not listed
Source: GenBank