Protein

GenBank accession
QHR68082.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP (Evidence: RBPdetect, Probability: 0.87)
TF (Evidence: RBPdetect2, Probability: 0.96)
TF (Evidence: Phold, Probability: 1.00)
Protein sequence
MTIIYDVTGHKGGGGKQHTPQETPDSLHSLAKIRILLALGEGEFESITSASELRQRVYLDGTPIQNADLSENFPGARVEFRPGTQHQDVIHGFSAVESEQSVGVKLENGTPWVRQINDTSLDAVRIRIGIPALYTNEDNGDLVGGRIDYKIVVYTDNADPREFRFAAVGKTMSLYERDHRIELPPNVNTGWRVEVHRITADSTSAKVVNDIRVQSITEIIDARLRYPLTALLFVEFDAKAFQNIPRVSIKCKGRKVLIPNNYDPINHTYSGDWDGTFKRAWTDNPAWHWYDICITERFGLGRRIKPQMLNRYALYQIAQRCDQLVSDGNGGREIRFKNDMYIQSQTDAWTVLKDLAAIFAGMTWWGNQMLNIVSDQPVAAVSHTITNASVIDGRFDYASGSQKTRYSTFAVAYGNPKNHYEDAIATGQRVELVRRHKINRLDITAIGCTRESEAQRRGHWALISNQLDQQVSFKVGMEGLFFIPGSVVAIADTNISGGFETRGGRLLSDPGTRTVLNTDSEITFRPGDKFLVRTDSGNVETREIASVNGNKVTLKTALDADPIPDQPFCVDGDDIQLQKFRITDLEYDDSTSTFSVRGIEYNDSKYDAVDNGARLDPGIFTQVPDGVMKGPESVTITPSQISSQGQLITNVDIVFPPVKDAVVYEIQWRRTSLQNMEIQWGNDWVNIPRTASNGAHIPNVFSGNYQARVRAIGMGEISSPWVSSAITPVEGRLGGLNAPIITNAISGLHQILWKWNHNNAATDISYTELEVRKTGETEWKFLTNVPYPGAEYAQTSLEFGIYQQLRARVADKIGNLSDWSAPFEGQVSDKVDEYMKGLDDEFLTSEDGKRFQEAINTIPQGIYEAMLTDAQQLFNARAEYKGIYAEITVAYNVAADAHKAVAQLETLIGTRLDDAEASIHTLQTAQSTHEQAFAQYQQTVAAKFSEQEAAIQQVQTATADVAGALAEYKTQVAAQFGQQSAAIEQKMTSSFNHAGGSATYSLKAGVTYNGTYYDAGMQLSVVTSGNAVKSRIAFKADQFYIMHPSNGTLSSAFIVDGGEVYIDTARIKNASINFAQITDTLQSNNYDGSTRGWRLGKDGTFINLGTGSGGGMKQTNTRISVKDGNGVLRVQIGELTGSW
Physico-chemical properties
protein length: 1139 AA
molecular weight: 125816.97020 Da
isoelectric point: 5.42644
aromaticity: 0.08955
hydropathy: -0.37779
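
The record does not say how aromaticity and hydropathy were computed. The sketch below assumes the common definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The function names and the 30-residue prefix used as input are illustrative only; the full protein sequence would be needed to compare against the values listed above.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 30 residues of the protein sequence in this record (illustrative input).
prefix = "MTIIYDVTGHKGGGGKQHTPQETPDSLHSL"
print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

A negative GRAVY value, like the -0.37779 reported for the full protein, indicates a net hydrophilic sequence under this scale.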

Domains

Domains [InterPro]

Taxonomy

Phage name
Escherichia phage welsh [NCBI]
Taxonomy ID
2696462
Lineage
Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
No host information

Coding sequence (CDS)

GenBank protein accession
QHR68082.1 [NCBI]
GenBank nucleotide accession
MN850589.1 [NCBI]
CDS location
range 6564 -> 9983
strand +
CDS
ATGACGATTATCTACGACGTCACAGGGCATAAAGGCGGCGGCGGCAAACAGCACACCCCACAGGAGACGCCAGATAGCCTGCATTCGCTGGCTAAAATCCGCATCTTGCTTGCACTGGGCGAGGGTGAATTCGAAAGTATTACCAGCGCCAGCGAATTGCGGCAGCGTGTTTACCTGGACGGAACACCGATCCAGAACGCTGACCTGTCCGAAAACTTCCCTGGCGCGCGTGTGGAGTTCCGCCCCGGCACACAGCACCAGGATGTGATCCACGGATTTTCAGCGGTGGAAAGTGAGCAATCTGTCGGCGTAAAACTGGAAAACGGCACGCCGTGGGTGCGCCAGATTAACGATACCAGTCTGGACGCTGTTCGCATTCGCATCGGCATTCCAGCCCTGTACACTAACGAAGATAATGGCGACCTGGTGGGCGGGCGCATCGACTATAAGATAGTTGTGTATACGGATAACGCCGACCCGCGTGAGTTCAGATTCGCCGCCGTTGGTAAAACAATGTCGCTATATGAGCGCGACCACCGCATCGAATTACCTCCGAACGTTAACACCGGCTGGCGCGTGGAGGTGCACCGCATAACGGCAGACTCCACATCGGCGAAAGTGGTTAATGATATCCGGGTGCAGTCCATCACTGAGATTATTGACGCCCGCCTGCGTTACCCGCTAACCGCGCTGTTGTTTGTGGAGTTCGACGCCAAAGCGTTCCAGAACATCCCGCGCGTGTCCATCAAGTGCAAAGGCCGTAAAGTTCTAATACCGAACAACTACGACCCGATTAATCATACCTATTCCGGGGACTGGGACGGCACGTTTAAACGCGCATGGACGGATAACCCTGCGTGGCACTGGTACGATATTTGTATTACTGAGCGCTTCGGCCTCGGTCGGCGTATCAAACCGCAAATGTTAAACCGGTACGCGCTCTACCAGATTGCGCAGCGCTGCGATCAGTTGGTCAGCGACGGCAACGGTGGTCGAGAAATCCGCTTTAAGAATGATATGTACATCCAGTCGCAGACAGACGCCTGGACCGTGCTTAAGGATTTGGCGGCCATCTTCGCCGGAATGACCTGGTGGGGTAACCAGATGTTGAATATCGTCAGTGACCAACCGGTCGCGGCGGTGTCGCACACTATCACCAACGCCTCGGTAATTGATGGGCGATTCGACTACGCATCTGGTAGCCAGAAAACGCGCTATTCCACATTCGCGGTAGCATACGGCAATCCGAAAAACCACTATGAAGATGCTATCGCAACGGGGCAACGTGTCGAACTGGTACGCCGCCATAAGATTAACCGTCTTGATATTACGGCGATCGGCTGTACGCGTGAATCTGAAGCGCAACGCCGGGGGCACTGGGCGCTAATCTCAAACCAGCTTGACCAGCAAGTTAGTTTTAAGGTGGGCATGGAGGGGTTATTCTTTATCCCCGGTAGCGTAGTTGCGATCGCAGATACTAATATTTCTGGCGGATTCGAGACTCGTGGCGGTCGCCTGTTGTCGGACCCTGGAACGCGTACCGTGCTGAACACGGACAGCGAAATCACATTCCGCCCTGGCGATAAGTTCCTGGTACGCACCGATAGCGGTAATGTGGAGACTCGCGAGATCGCCAGCGTCAACGGCAACAAGGTTACGCTAAAAACCGCACTTGATGCCGACCCGATTCCAGACCAGCCGTTTTGCGTTGATGGCGACGATATCCAGTTGCAAAAATTCCGCATCACCGACCTGGAATATGACGACTCTACGAGCACTTTCTCGGTGCGCGGGATTGAATACAACGATAGCAAGTATGATGCCGTTGATAATGGCGCTCGCCTTGACCCTGGCATCTTTACGCAAGTGCCTGACGGTGTAATGAAGGGGCCGGAATCCGTGACCATCACCCCGTCGCAGATTTCATCGCAAGGCCAGCTAATCACCAACGTGGATATTGTATTCCCTCCGGTGAAGGATGCCGTGGTGTATGAAATCCA
GTGGCGACGTACCAGCTTGCAGAATATGGAAATCCAGTGGGGTAACGACTGGGTGAATATCCCACGCACGGCGTCGAACGGCGCGCACATCCCCAACGTGTTCTCCGGTAACTATCAGGCACGCGTCCGCGCGATCGGTATGGGCGAGATTTCATCTCCGTGGGTGTCGTCCGCCATCACGCCGGTGGAAGGTCGTCTCGGTGGGCTTAACGCACCAATCATCACCAACGCTATTTCGGGCCTTCACCAGATCTTGTGGAAGTGGAACCACAACAACGCCGCTACTGATATCTCGTACACCGAGCTTGAAGTGCGCAAGACCGGTGAGACTGAATGGAAATTCCTTACCAACGTCCCATATCCGGGGGCGGAGTATGCGCAAACATCGCTGGAGTTCGGAATTTACCAGCAGTTGCGCGCCCGTGTAGCGGATAAAATCGGCAACCTGTCGGACTGGTCGGCCCCGTTTGAAGGGCAGGTGAGTGACAAAGTTGACGAGTACATGAAGGGGCTTGATGACGAGTTCTTGACTTCCGAGGATGGTAAACGCTTCCAGGAAGCAATCAACACGATCCCGCAGGGTATTTACGAGGCGATGCTCACCGACGCGCAGCAATTGTTCAATGCCCGCGCCGAGTATAAAGGGATTTATGCGGAAATCACGGTGGCGTATAACGTGGCGGCGGACGCGCATAAAGCCGTAGCGCAACTGGAAACGTTGATCGGCACGCGCCTTGATGACGCTGAAGCATCAATCCACACGTTGCAGACTGCGCAAAGCACACATGAACAAGCGTTCGCCCAGTACCAGCAAACTGTTGCCGCTAAGTTTTCGGAACAGGAAGCAGCCATCCAGCAGGTACAAACGGCAACGGCGGACGTGGCGGGCGCGCTGGCAGAGTATAAGACCCAGGTCGCGGCGCAGTTCGGACAGCAGTCCGCAGCTATCGAGCAGAAGATGACGTCTTCGTTTAACCATGCTGGTGGCAGCGCCACGTACAGTCTTAAGGCTGGCGTGACGTATAACGGGACTTACTATGACGCCGGTATGCAGCTTTCAGTTGTGACGTCTGGCAACGCGGTTAAATCCCGCATCGCATTCAAGGCGGACCAGTTCTACATCATGCATCCGTCCAATGGGACGCTGTCGTCTGCGTTTATTGTGGACGGTGGCGAGGTGTATATCGACACGGCACGCATCAAGAACGCGTCCATCAACTTCGCGCAGATAACAGACACGCTGCAATCGAATAACTACGACGGCAGCACGCGCGGCTGGCGCTTAGGGAAAGATGGAACGTTTATCAACCTGGGCACCGGTAGCGGCGGCGGGATGAAACAGACGAACACAAGAATAAGCGTGAAGGACGGAAACGGCGTTCTCCGCGTGCAGATTGGTGAGCTTACTGGTAGCTGGTAG
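
The CDS coordinates can be cross-checked against the reported protein length. The sketch below assumes the range is 1-based and inclusive (the GenBank feature convention) and that the standard codon assignments apply; the helper names are illustrative. Only the first and last few codons of the CDS are translated here, to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


n_nt = cds_length(6564, 9983)   # 3420 nucleotides
n_aa = n_nt // 3 - 1            # 1139 residues after dropping the stop codon
print(n_nt, n_aa)

# First five codons and last four codons of the CDS in this record.
print(translate("ATGACGATTATCTAC"))   # MTIIY
print(translate("GGTAGCTGGTAG"))      # GSW*
```

The 1139 residues obtained this way agree with the protein length listed under the physico-chemical properties, and the translated ends match the start (MTIIY) and end (GSW) of the protein sequence.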

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
5fc3666bfe74c73483594b5c01ec5592cbe563a24d07b84d6bfb577b5bf4fac1
ESMFold
Source ESMFold
Method ESMFold
Resolution 0.7848
Evidence 0.7848

Literature

Title
Exploring the Remarkable Diversity of Culturable Escherichia coli Phages in the Danish Wastewater Environment
Authors
Olsen, N.S., Forero-Junco, L., Kot, W. and Hansen, L.H.
Date
2020
PMID
(none listed)
Source
GenBank