Protein

Genbank accession
QLF82388.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence RBPdetect
Probability 0,88
TF
Evidence RBPdetect2
Probability 0,96
TF
Evidence Phold
Probability 1,00
Protein sequence
MIQKVISGSKGGSPKPHNPVEMEDNLISINKIKILLAVSDGEIDETFNLKQLMFNSVPVQNEDGSFNFEGVTAEFRSGTQTQEYIKGMEDSSSEVTVNREVTTDNPYTISVTNKTLSAIRIKMFMPRGVRIESNGDKNGVRVEYEVQQAVDGGSFEKVLSDVIEGKTMSGYDRSRRINLPSFNNQVIFRVVRKTPDSSDSNVVDAIQVKSYAEVIDAKFRYPLTGLLFVEFDSKMFPNQLPTISIRKRWKIVNVPSNYDPESRTYNGNWDGTFKKAWTNNPAWVLYDLMINQRYGLDQKELGISVDKWALYEAAQYCDQMVPDGKGGTEPRYLCDVIIQSQTDAYKVIRDICSIFRGMSFWNGESISVIIDRPREPAYIFTNDNVVNGDFSYTFASEKSMYTTCNVMFDDEQNMYQQDVEPVFDREATLRFGNNVTSITAIGCTRRSEANRRGRWILKTNLRSTTVNFATGLEGMIPTIGDVVAIADNFWSSNLTMNLSGRLLEVSGSQIFLPFRVDARAGDFIIVNKPDGKPVKRTISSVSADGKTIEVNIGFGFPVKPNTVFAIDRTDIALQQYIVTKIDKGDDDEEFTYKITAVEYDPNKYDEIDYGVNIDDRPTSIVEPDQIPKPENVKVSSESRIVQGMSVETMIVSWDKVPYAVFYDVQWRKDNGNWQNVPQTANKEVYVEGIYAGNYQVRVRSVAGSGTTSGWSNIVAATLTGKQGEPGRPINLTATDDVVFGIRTKWGFSDGSGDTAYTELQQSPDGTVDSASLLSLIPYPQHEYYHSPMPGGNIVWYRVRTVDRIGNVSQWTDFVRGMASTNVDDIIGEISVDIENSPGYEWLVDNATDNAAQNAANAEAAIENALANDKDAIYMKKENGKRKAEYTKSLKLIADETQARVTSIEQLKASFDDQISASNSELREVIATETGAISREIDQLKAQIGDDIQASLTDIREAIANETEARTQADLTLNARLGNNEAALAQKLDSWSNASSTGAMYGVKLGLKYNGQEYSAGMAMSLIGSGAAVKAQILFEASRFAIMTGMNGKTQYPFVVENGQVILSSAIIKNGFITNAMIGNVIQSNNFSSGSAGWMVNKNGSAEFNAVTVRGKLYASNGEFAFNGTGNTVQINNNGITVNLPGGGKVVVGRW
Physico‐chemical
properties
protein length:1150 AA
molecular weight: 127126,83740 Da
isoelectric point:4,91796
aromaticity:0,09043
hydropathy:-0,36870

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoS_Chapo
[NCBI]
2750856 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QLF82388.1 [NCBI]
Genbank nucleotide accession
MT682715.1 [NCBI]
CDS location
range 26077 -> 29529
strand +
CDS
ATGATTCAAAAAGTGATAAGTGGTTCGAAAGGCGGCTCTCCTAAGCCTCACAATCCAGTTGAAATGGAAGATAACCTGATCTCAATCAACAAAATAAAGATTCTTCTGGCAGTATCTGATGGCGAGATTGACGAAACATTCAACCTCAAGCAGCTGATGTTTAACTCAGTCCCGGTGCAAAACGAGGATGGCTCTTTCAACTTTGAAGGCGTAACGGCAGAGTTCAGGTCGGGCACGCAAACTCAGGAATATATCAAGGGGATGGAAGATAGCTCTAGTGAGGTAACTGTAAACCGTGAGGTTACTACAGATAACCCATACACGATCTCAGTAACCAACAAAACGCTTTCGGCAATCCGTATCAAAATGTTTATGCCTCGCGGCGTACGAATTGAGAGCAACGGAGACAAAAACGGCGTTCGAGTTGAGTACGAAGTGCAGCAAGCGGTTGATGGCGGTTCGTTTGAAAAGGTTCTATCTGACGTAATCGAAGGCAAAACAATGTCAGGTTATGATCGAAGCCGACGCATAAACCTTCCGAGCTTCAACAATCAGGTGATATTCAGAGTCGTTCGTAAAACTCCAGACTCTAGCGACTCGAACGTTGTTGACGCGATTCAGGTAAAGAGCTATGCCGAGGTGATTGATGCAAAATTCCGTTACCCTCTGACTGGTCTTCTTTTCGTCGAGTTCGATTCGAAGATGTTCCCAAACCAGTTACCTACGATCTCAATTCGTAAGCGATGGAAGATTGTAAACGTTCCTTCGAACTACGATCCAGAATCACGAACTTATAACGGGAATTGGGATGGAACTTTTAAGAAGGCGTGGACGAATAATCCGGCTTGGGTGCTTTATGACCTGATGATTAATCAGCGTTATGGTCTGGATCAGAAGGAGCTTGGAATATCTGTCGATAAATGGGCGCTATATGAGGCGGCGCAATATTGCGACCAGATGGTCCCTGACGGCAAAGGCGGCACAGAGCCTCGCTACCTTTGCGACGTGATAATTCAATCTCAGACTGATGCGTACAAGGTGATTCGAGATATTTGCTCAATCTTCCGTGGAATGAGTTTTTGGAATGGTGAAAGCATTTCGGTAATCATCGACAGGCCACGTGAGCCTGCGTACATCTTCACTAACGACAACGTTGTTAACGGTGACTTCTCCTATACTTTCGCAAGCGAGAAAAGTATGTACACGACGTGCAACGTGATGTTTGATGATGAACAAAACATGTATCAGCAGGACGTTGAGCCAGTATTCGATCGAGAAGCCACTCTACGGTTCGGGAATAACGTAACAAGCATTACAGCGATCGGTTGCACACGTCGAAGTGAAGCTAACCGTCGCGGACGATGGATTCTGAAAACAAACCTTCGCAGCACTACGGTAAACTTTGCTACCGGGCTTGAGGGTATGATTCCGACAATTGGAGATGTTGTAGCGATAGCTGATAACTTCTGGTCAAGTAACTTGACTATGAACCTGTCAGGGCGTTTGCTCGAAGTGTCTGGAAGTCAGATTTTCCTGCCGTTCCGCGTTGATGCTCGCGCAGGTGACTTTATTATCGTAAATAAGCCCGATGGCAAGCCCGTGAAGCGCACAATCTCAAGTGTTAGTGCGGATGGTAAGACTATAGAGGTTAACATCGGCTTTGGCTTTCCTGTGAAGCCTAACACGGTATTCGCAATCGACCGTACCGACATTGCGTTGCAGCAGTACATCGTGACCAAAATCGACAAGGGTGATGATGATGAGGAATTTACCTACAAAATCACGGCAGTGGAGTACGATCCTAACAAGTACGATGAGATTGATTACGGAGTTAACATCGACGACCGACCGACGAGCATCGTTGAGCCAGATCAGATCCCTAAACCGGAAAACGTGAAAGTGTCCTCAGAGTCGAGAATCGTTCAGGGGATGAGCGTAGAAACGATGATTGTTAGCTGGGATAAAGTGCCTTACGCAGTTTTCTATGACGTCCAGTGGCGAAAGGATAACGGCAACTGGCAAAATGTACCGCAGACGGCAAATAAAGAGGTATACGTTGAAGGCATTTACGCTGGAAACTATCAGGTTCGCGTGCGATCCGTCGCTGGTTCAGGCACGACTTCAGGTTGGTCAAATATCGTAGCTGCGACGCTGACGGGTAAACAAGGTGAGCCGGGACGACCGATTAACCTTACAGCTACGGATGATGTTGTTTTTGGTATCCGTACAAAATGGGGGTTCTCTGATGGTTCTGGAGATACGGCCTACACGGAGTTGCAGCAGTCACCGGATGGAACAGTGGATAGCGCAAGTTTGCTTTCTTTGATTCCGTATCCGCAGCATGAGTATTATCACTCACCGATGCCTGGAGGGAATATTGTGTGGTATCGGGTAAGGACGGTTGACAGGATCGGTAACGTATCTCAGTGGACTGATTTTGTCAGAGGCATGGCATCAACAAACGTTGACGATATCATCGGAGAGATTTCTGTCGATATCGAAAACTCGCCGGGTTACGAGTGGCTTGTTGATAACGCAACAGACAACGCGGCGCAGAACGCAGCTAACGCAGAGGCAGCAATAGAAAACGCGCTCGCCAATGACAAAGATGCGATCTACATGAAGAAGGAGAACGGAAAACGAAAAGCTGAGTACACGAAATCACTTAAACTTATTGCTGATGAGACGCAGGCGCGAGTGACGTCGATCGAGCAGTTGAAGGCAAGTTTTGACGATCAGATTAGCGCAAGCAATAGCGAGTTGCGTGAAGTTATAGCGACCGAGACTGGAGCGATATCGCGTGAAATTGACCAACTCAAGGCTCAGATTGGTGACGATATTCAGGCGAGCTTGACAGATATCCGAGAGGCTATCGCAAACGAGACTGAGGCCAGAACTCAGGCCGACTTAACGTTAAACGCGAGGCTTGGAAATAACGAGGCGGCACTTGCTCAAAAACTTGACTCTTGGAGTAATGCGAGTTCTACCGGGGCGATGTACGGCGTTAAGCTGGGGCTGAAATACAACGGGCAGGAATACAGCGCAGGAATGGCTATGTCCCTGATTGGTTCCGGCGCGGCTGTTAAGGCTCAGATTCTTTTTGAGGCGTCACGATTTGCAATCATGACCGGGATGAATGGAAAGACACAATACCCGTTTGTTGTTGAGAATGGTCAGGTTATTTTAAGTAGCGCGATCATTAAAAACGGATTCATCACTAATGCTATGATTGGAAACGTTATTCAATCGAATAACTTCTCTTCAGGTAGTGCTGGATGGATGGTTAACAAGAATGGATCGGCTGAGTTTAATGCGGTTACTGTCAGAGGCAAACTTTACGCAAGTAACGGTGAGTTTGCATTCAACGGAACGGGTAATACCGTTCAGATTAACAATAACGGCATCACGGTAAACTTACCTGGTGGCGGGAAAGTTGTAGTGGGGAGGTGGTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
92467c15a03873cb168804c8dc34777cc700ae41b6f340fc71539a6fa57b5a04
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7703
Evidence 0,7703

Literature

Title Authors Date PMID Source
Genome Sequences of Four Bacteriophages Infecting Toxigenic Escherichia coli (STEC) Dias,C., Almeida,C., Lobocka,M. and Oliveira,H. 2020-09-03 GenBank