Protein

Genbank accession
QLF82471.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence RBPdetect
Probability 0,86
TF
Evidence Phold
Probability 1,00
Protein sequence
MADIKVVRIESLPATTAVTEDDYLVVQQPDLTRRVKIGDVVHVDGTVSHVISFKEGGKLNGPMDFAYFEEEDLYLRWKGEFPHTVPALSSPYADGGITDAAWMVYTDPSLREELESTIGASMIMTAEGQSVQDVMDVTVQTANDAKALAQRVDFGTVHTVGDVIHLDNFVGPAVIEEGRTTNYPSVAAGEKFLNGVVSRRDTTTVDGIFRGATTGAMYTIAVTNGVATTKRIALRDEFKRLEANNPTRTVIRAGDDLNTAGYLQLDATGRWGMWNQQTASWQPLAIEQGGTGARDAAGVRINIGAFYKQRAALEPNFNINNLTGNQDGVYYQPMTAYATEANGYPAGSGAGHLIVWQNNANGGTGCRQEYYPFSNVDVWYLRTYQANTNQWTAWQPMVRPRNDDTFRSHIGLGSRNSPTFGHLYLSQSSADVKSASGIVNGDKYSADGVLEHGYRIYSEVRNDNKAWLTIHLHKGAKGSETHRYLGFREDGVLDCPKYMQVGDLTGQLTNWGLGEWIRSSGAERGFWGSKKAAKMVIWDGGMDEQGEGTLEWGVYNNRKAKWEPLPIACGGTSARTVGDAQAQFRIPLAAGARPYVNLPRTAAMQDGKYYPIIVRTDPNFAATIGTDLTIVTRSSSGGDPMNCATLQCHYRTGGWTDRGDSFQGVINFYQNEQAILGCVSPTRGKQEYVAFYVEARAFPVTVYAGATVIEVSTREADWQVGNVTGNQDGVKFCAPLESADLSLAIKGDDVTNTRPLVEFKGTSGFYTGGGTMWHYLGTPERYAVMSKMNMPKVELWADGIDYICPGSQTTRKALFSNAGFQAASEGTEDLINNTFTSKCGNGARLQGQAEFRSTPEAGQVIVRDVVGTAHRFYNFNKDGTFSAPGGFVCHTGADWNNQFGPNNPSKILAGNVNGPEGSMVVGGLSVAFSGNYAFQMAGRLDQLYTRSIEQGDHRAWNKVIQHRGQGLGTNDLNDYKADREGIYHQEANANATAERNYPPGQQMAGTLIVLRNSANEGTGCIQIYKMYLGGTWERYYNNLGSGMAWSPWKRTSFPENTTAPVMPDLWLPLTSNLKPALGEGEMVFSRPSTATYFTKRGVMAVAQANQPRFERDGLLIEGQRTNLMLNSEDPSKWGAQQAITVGSTVTNTNGTKGARFTVNTTSGVETTALNLATVPATRGADVTGAEKFCTGSIIARGGKAHQRLRVRFDMYNGSTTVFQGDAYVNLSTLEVKTTGDAAGRIKVKAERWQTAGPAWIRIAATFEAVASDNKIGCQFQIAPPEGAQHAVGDWVDVAIPQFELGSCESSFIPTGSSSVTRAADLCKFPMTDNLAPRPFTIAATVDANWRGWGKAPNAAPRVIDTEGHQSGAAFIMGFGSATNIADDGYPYCDIGGSNRRVYEMAKARKLKIGFRIKGDGKTCSFANGLVSTETQSSWEFLAGGAFIRIGGQTATGERHLFGHIKDIRVWNSALTDTQLMMESVE
Physico‐chemical
properties
protein length:1481 AA
molecular weight: 160975,01110 Da
isoelectric point:6,07458
aromaticity:0,09656
hydropathy:-0,37353

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoM_Gotham
[NCBI]
2750849 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QLF82471.1 [NCBI]
Genbank nucleotide accession
MT682716.1 [NCBI]
CDS location
range 65673 -> 70118
strand -
CDS
ATGGCTGATATTAAAGTTGTCAGAATTGAATCTCTTCCTGCCACTACCGCAGTGACAGAGGATGATTACCTGGTTGTTCAGCAACCAGACCTGACCCGTCGTGTAAAAATTGGCGATGTTGTCCATGTGGATGGGACTGTTTCTCATGTAATCTCCTTTAAGGAAGGTGGTAAGTTAAACGGCCCAATGGATTTTGCCTACTTCGAAGAGGAAGACCTCTACCTGCGTTGGAAAGGCGAATTTCCGCACACTGTTCCTGCACTGTCTTCACCGTACGCTGATGGTGGGATTACTGACGCTGCATGGATGGTATATACTGACCCGTCCCTAAGGGAGGAGCTGGAATCTACCATTGGCGCATCTATGATCATGACAGCCGAAGGGCAGTCTGTCCAGGATGTAATGGATGTTACTGTTCAGACAGCTAACGATGCTAAAGCACTTGCACAGCGTGTAGATTTTGGTACAGTGCACACCGTAGGCGATGTTATTCACCTTGATAACTTTGTTGGCCCTGCAGTTATTGAAGAGGGAAGAACCACCAACTACCCTTCTGTAGCAGCTGGTGAAAAATTTCTGAACGGTGTGGTATCTCGCCGTGATACTACAACTGTTGATGGTATTTTCCGTGGCGCTACCACCGGAGCCATGTACACCATTGCAGTAACCAACGGTGTAGCAACAACGAAAAGAATAGCACTTCGTGATGAATTTAAACGCCTAGAGGCAAACAACCCAACAAGAACAGTTATCCGTGCTGGAGATGATTTAAATACGGCAGGGTACTTGCAGCTTGATGCAACAGGCCGTTGGGGTATGTGGAACCAACAAACAGCTTCGTGGCAACCTCTTGCTATAGAGCAAGGCGGCACAGGGGCTAGAGATGCCGCTGGAGTCCGTATCAATATCGGTGCTTTCTATAAGCAACGTGCAGCCCTTGAGCCAAATTTCAATATCAATAACTTGACTGGTAATCAGGATGGTGTATACTACCAGCCGATGACTGCTTATGCAACTGAGGCAAATGGTTACCCTGCAGGTTCTGGTGCTGGTCACTTGATTGTTTGGCAGAACAATGCTAACGGCGGTACAGGTTGTCGTCAGGAATACTACCCATTCTCTAACGTAGATGTTTGGTATTTGAGAACCTATCAGGCCAACACAAATCAGTGGACTGCATGGCAGCCGATGGTTAGACCTCGTAACGATGATACCTTTAGATCGCACATCGGTCTCGGGTCTAGAAATTCTCCTACCTTCGGGCACCTTTACCTGTCTCAAAGTTCTGCTGATGTTAAGTCAGCATCCGGTATTGTTAACGGGGACAAATACAGCGCTGACGGTGTCCTTGAGCATGGATATAGGATCTACTCTGAGGTAAGAAACGACAATAAGGCTTGGTTGACAATCCACCTCCACAAAGGTGCAAAAGGATCTGAAACTCATAGATATTTAGGTTTCCGCGAAGACGGTGTATTAGACTGCCCTAAATATATGCAAGTTGGGGATCTTACTGGTCAGCTGACAAACTGGGGACTTGGAGAGTGGATCCGTAGTTCAGGGGCAGAAAGAGGTTTCTGGGGGTCCAAGAAAGCCGCCAAGATGGTCATCTGGGATGGCGGTATGGACGAGCAAGGTGAAGGAACGCTTGAGTGGGGTGTTTATAACAACCGGAAAGCCAAATGGGAACCACTACCGATAGCTTGTGGTGGTACCAGTGCCAGGACTGTAGGGGATGCTCAAGCACAATTCAGAATCCCTCTGGCAGCAGGTGCTAGACCATATGTAAACCTGCCTAGAACTGCAGCTATGCAGGACGGTAAGTATTATCCTATTATCGTCAGAACTGACCCCAACTTTGCAGCAACTATCGGGACAGATTTAACCATTGTCACCAGATCTTCTTCTGGCGGCGACCCAATGAACTGTGCTACACTACAGTGTCACTACAGAACTGGTGGCTGGACAGACAGAGGTGATTCTTTCCAAGGGGTTATTAATTTCTACCAAAACGAGCAAGCAATCCTCGGATGTGTATCTCCAACCAGAGGTAAGCAGGAGTATGTTGCATTCTATGTAGAGGCACGTGCATTCCCTGTTACAGTATATGCTGGCGCTACTGTAATAGAAGTGTCAACCAGGGAGGCTGATTGGCAGGTAGGCAACGTGACAGGTAACCAAGATGGTGTCAAGTTCTGTGCTCCTCTGGAATCTGCAGATTTGAGTCTTGCAATCAAAGGTGATGATGTAACAAACACCAGACCTCTGGTTGAGTTCAAAGGAACATCAGGTTTCTACACTGGTGGTGGAACTATGTGGCACTACCTGGGGACTCCTGAACGCTATGCAGTGATGAGCAAAATGAACATGCCCAAAGTCGAGCTTTGGGCAGACGGTATTGACTACATTTGTCCTGGTAGTCAGACCACTAGAAAAGCACTGTTCTCTAACGCAGGGTTCCAAGCTGCATCGGAAGGAACAGAGGATCTGATCAACAACACCTTTACCTCTAAATGTGGTAATGGTGCAAGACTGCAAGGCCAGGCAGAGTTCAGATCTACTCCGGAAGCTGGTCAAGTTATCGTTCGCGATGTTGTAGGCACAGCTCATAGATTCTACAACTTCAACAAAGATGGCACTTTCTCAGCACCGGGTGGTTTTGTATGTCATACTGGTGCAGACTGGAACAACCAGTTTGGGCCTAACAACCCATCCAAAATACTGGCTGGCAACGTCAACGGACCTGAAGGTTCAATGGTTGTTGGCGGGTTGTCTGTGGCATTTTCTGGAAACTACGCTTTCCAGATGGCAGGTCGTTTAGACCAGCTGTATACTCGTTCCATAGAGCAGGGAGACCACAGAGCGTGGAACAAAGTTATTCAGCACCGTGGTCAAGGGTTGGGAACTAATGACCTTAACGACTACAAGGCAGATCGTGAAGGGATTTACCATCAAGAGGCAAATGCCAACGCCACAGCGGAAAGAAATTACCCACCAGGACAGCAGATGGCTGGTACGCTGATCGTTCTTAGAAACTCCGCTAATGAAGGAACAGGTTGTATCCAGATATATAAAATGTATCTTGGTGGAACATGGGAAAGGTATTACAATAACCTTGGCAGCGGAATGGCTTGGAGTCCTTGGAAGAGAACAAGCTTCCCAGAAAACACAACTGCTCCAGTTATGCCTGACTTGTGGTTGCCTTTAACCTCTAACCTAAAACCTGCATTGGGTGAAGGTGAGATGGTCTTCTCCAGACCTTCCACAGCAACCTATTTCACTAAGCGGGGTGTTATGGCTGTTGCACAAGCAAACCAACCACGTTTTGAAAGAGATGGGTTGCTTATCGAGGGGCAAAGGACAAACCTCATGCTCAACAGTGAGGACCCAAGCAAGTGGGGTGCACAGCAAGCGATTACTGTCGGTAGCACTGTAACAAATACTAACGGCACAAAAGGCGCAAGATTTACAGTAAACACCACATCCGGTGTAGAAACAACAGCGTTAAACCTTGCCACTGTTCCAGCCACTAGGGGTGCCGATGTGACAGGTGCTGAGAAATTCTGTACTGGGTCTATTATTGCAAGAGGTGGTAAAGCCCACCAGAGACTGCGTGTAAGATTCGACATGTACAATGGATCCACCACGGTGTTCCAAGGTGACGCTTATGTTAACCTGTCAACACTTGAAGTTAAAACAACAGGGGATGCAGCTGGGAGAATTAAAGTAAAAGCTGAGAGATGGCAAACAGCAGGTCCTGCTTGGATAAGGATTGCTGCTACGTTTGAAGCAGTGGCCTCTGACAATAAGATCGGGTGCCAGTTCCAAATTGCTCCACCAGAAGGTGCTCAACATGCCGTAGGAGATTGGGTTGATGTTGCAATACCTCAATTTGAGTTAGGGTCCTGTGAGTCCTCGTTTATCCCTACAGGGTCTTCATCTGTAACCAGGGCGGCAGATCTTTGTAAATTCCCAATGACGGATAACTTAGCGCCTAGACCATTCACTATCGCTGCTACGGTGGATGCCAACTGGAGAGGTTGGGGTAAAGCTCCTAACGCAGCTCCTAGGGTTATCGATACAGAGGGTCATCAATCTGGTGCTGCATTTATCATGGGGTTTGGCTCTGCAACAAATATTGCGGATGATGGCTATCCATACTGTGATATTGGAGGGTCAAACAGACGTGTTTACGAAATGGCTAAAGCCCGTAAATTGAAAATCGGGTTCAGGATAAAAGGTGACGGTAAAACCTGCTCTTTTGCTAACGGACTGGTGAGCACGGAAACGCAGTCTTCTTGGGAATTCCTAGCAGGAGGCGCTTTCATCCGCATTGGTGGGCAAACGGCGACAGGTGAAAGACACTTATTTGGTCATATTAAGGACATAAGAGTTTGGAACTCTGCACTGACAGATACCCAGCTTATGATGGAGAGTGTTGAATAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4352a3fc8183351c75519ce6658471c0b07bce63d6a9503d3b0d8c438f78c97b
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6118
Evidence 0,6118

Literature

Title Authors Date PMID Source
Genome Sequences of Four Bacteriophages Infecting Toxigenic Escherichia coli (STEC) Dias,C., Almeida,C., Lobocka,M. and Oliveira,H. 2020-09-03 GenBank