Protein

Genbank accession
QFG06204.1 [GenBank]
Protein name
tail protein
RBP type
TSP
Evidence RBPdetect
Probability 0,74
TF
Evidence Phold
Probability 1,00
Protein sequence
MKKILDSARNYLKNNSRIKTASLISLELPGSTGTSTAFIYLTDYFRDVLYNGILYQAGKVKSISSHKQNRDLSIGSLSFTITGTAQDEVLKLVQNGVSFLDRTVSIHQAIITEDGSILPVDPDTNGPLLYFRGRITGGGIKDNISTSGVGTSTITWNCSNQFYDFDRVNGRYTDDASHRGLEVVAGQLVPSNGAKRPEYQEDYGFFHSNKSISILAKYQVQEERYKLKSKKKLFGLSRSYSLKKYYETVTKEVDIDFNLAAKYIPVVYGVQKIPGIPIFADTELHNPNIVYVVYAFAEGEIDGFLDFSFGDNPMICVDANDSSARTCFGTKKIAGDTMQRIASGTSSSSPSVHGQEYKYNDGNGDIRIWTYHGKSDQTASEVLVDIAKERGFYLQNMNGNGPEYWDARYKLLDTAYAVVRFTINENRTEIPEVSAEIQGKKVKVYHSDGRVTANSTSLNGIWQTLDYLTSDRYGANITIDQFPLQQLIQEAAILDIIDESYQVSWQPYWRYVGWTDPLAENRQIVQMNTILDTSESVFKNVQGLLESYGGAINNLSGQYRVTVEKYSNTPLEINFLDTYGDLELSDTTGRNKFNSVQASIVDPALSWKTNSITFYNSKYKEQDKNLDKKLQLSFANITNYYTARSFADRELKKSRYSRTLSFSLPYQFIGIEPNDAIAFTYDRYGWNKKYFLVDEVENSREGKINVTLQEYGEDVFINSEQVDNSGNDIPDISNNVLPPRDFKYTPTPGGLVGSIGKNGELSWLPSLTNNVVYYSIVHSGHAEPYIVQQLETNPNERMIQEIIGEPAGLAIFEIRAVDINGRRSSPVTLSIELNSAKNLSVVSNFRVTNTASGDVTEFVGPDVKLAWDRIPEEDIIESIFYTLEIHDSQNRMLRSVRIENQYTYDYLLTYNKADFALQNSGALGINRKLYFRIRAEGDDGEQSVEWASI
Physico‐chemical
properties
protein length:949 AA
molecular weight: 106908,96910 Da
isoelectric point:5,19676
aromaticity:0,11170
hydropathy:-0,43604

Domains

Domains [InterPro]
QFG06204.1
1 949
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage VEc33
[NCBI]
2847072 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QFG06204.1 [NCBI]
Genbank nucleotide accession
MN316588 [NCBI]
CDS location
range 86748 -> 89597
strand -
CDS
ATGAAGAAAATACTAGATAGTGCTAGAAACTACTTAAAAAATAATAGCAGAATAAAAACTGCTAGTCTAATTTCCCTAGAATTACCTGGCTCTACTGGTACTAGTACTGCTTTTATCTATTTAACTGATTATTTTAGAGATGTACTATATAATGGCATCTTATACCAGGCGGGTAAAGTTAAGTCTATTAGCTCACATAAACAAAATAGAGATTTATCTATTGGTAGTCTATCTTTTACTATTACTGGTACAGCACAGGATGAAGTACTAAAACTAGTACAAAATGGTGTATCCTTCTTAGATAGAACCGTATCAATTCATCAAGCAATTATTACTGAAGATGGTTCTATTCTGCCAGTAGACCCAGATACAAATGGTCCTTTACTATACTTTAGGGGGAGGATTACTGGAGGGGGTATTAAAGATAACATTAGTACCTCTGGAGTAGGAACCTCTACAATTACCTGGAATTGTTCTAACCAATTCTATGACTTTGATAGAGTTAATGGTAGATATACTGATGACGCTTCCCATAGGGGGCTTGAAGTTGTAGCAGGACAATTAGTTCCATCTAACGGGGCTAAAAGACCTGAGTACCAAGAAGACTATGGGTTCTTCCACTCTAATAAAAGTATCTCTATCCTAGCAAAATATCAGGTTCAAGAAGAAAGATACAAGCTAAAATCAAAGAAAAAATTATTTGGTTTATCTAGAAGCTACAGCCTTAAAAAATACTATGAAACTGTCACTAAGGAAGTGGATATAGATTTTAACCTTGCTGCTAAATACATACCGGTGGTTTATGGAGTTCAAAAAATTCCTGGAATACCCATTTTTGCTGATACAGAACTACACAATCCCAACATAGTTTATGTCGTATACGCCTTTGCTGAGGGGGAGATTGATGGTTTTCTTGATTTCTCCTTTGGTGATAACCCTATGATCTGCGTAGATGCTAATGACAGCTCCGCTAGAACCTGCTTTGGTACTAAAAAAATAGCAGGAGATACAATGCAGAGGATAGCATCTGGAACCTCTTCTAGTAGTCCATCCGTCCATGGTCAAGAGTATAAGTATAATGATGGAAATGGTGACATAAGGATTTGGACTTATCACGGAAAATCTGATCAAACAGCCTCTGAAGTACTAGTAGATATAGCTAAAGAACGTGGATTCTACCTTCAGAATATGAATGGCAATGGGCCGGAGTACTGGGATGCTAGATATAAGCTACTAGATACTGCATACGCAGTGGTGCGCTTCACTATTAATGAAAATAGGACTGAGATTCCAGAAGTTAGTGCTGAAATTCAAGGTAAAAAAGTAAAAGTCTATCATTCTGATGGTAGAGTAACTGCTAATAGTACTAGTTTAAATGGTATTTGGCAAACACTTGATTACTTAACCTCTGATAGATATGGCGCTAATATTACCATTGATCAGTTCCCTCTTCAGCAATTAATACAGGAGGCAGCTATTTTAGATATTATAGATGAATCCTATCAGGTATCTTGGCAGCCATATTGGAGATACGTTGGGTGGACTGATCCACTAGCAGAAAATAGACAAATAGTACAAATGAATACTATTTTGGATACATCTGAATCAGTATTTAAAAATGTGCAAGGTTTGTTAGAGTCCTATGGTGGGGCTATTAACAATTTATCTGGCCAGTATAGGGTTACTGTAGAAAAATACTCTAATACTCCCTTAGAGATTAATTTTCTAGATACTTACGGTGATTTGGAGCTATCAGATACTACTGGTAGAAATAAATTCAACTCAGTTCAAGCATCTATCGTAGATCCCGCCCTTAGCTGGAAAACTAATTCTATTACATTCTATAATTCCAAGTATAAGGAACAGGACAAGAACCTAGATAAAAAATTACAACTATCTTTTGCTAATATTACTAACTACTATACTGCAAGAAGTTTTGCGGATAGAGAACTTAAGAAATCCAGATACTCAAGAACACTTTCATTCTCATTACCATACCAGTTCATAGGGATTGAACCCAATGATGCTATTGCATTCACGTATGACCGTTACGGGTGGAACAAAAAGTATTTCCTAGTTGATGAGGTTGAAAACTCTAGAGAGGGTAAGATAAATGTTACCTTACAGGAGTATGGAGAAGATGTATTCATCAACTCTGAGCAGGTTGATAATAGCGGTAATGATATTCCCGATATTAGTAATAATGTCCTTCCTCCTAGGGACTTTAAGTATACCCCTACTCCTGGCGGTTTAGTAGGCTCTATAGGTAAAAATGGTGAGTTATCCTGGCTTCCGAGTCTAACCAATAATGTAGTTTATTACTCTATTGTGCACTCAGGCCATGCCGAACCTTACATAGTACAACAGTTAGAGACCAATCCTAACGAACGTATGATCCAAGAAATAATTGGGGAACCAGCAGGTCTGGCTATATTTGAGATAAGAGCAGTAGATATTAATGGTAGAAGAAGTTCCCCGGTGACTCTGTCTATAGAACTTAACTCTGCTAAAAACCTTAGTGTAGTATCTAATTTTAGGGTAACTAATACTGCTTCTGGAGATGTAACTGAGTTTGTTGGCCCAGATGTAAAACTAGCTTGGGATAGAATACCTGAAGAAGATATAATAGAGAGTATATTTTATACACTTGAAATACACGATTCGCAAAATAGGATGTTAAGAAGTGTACGTATTGAAAATCAGTATACTTATGACTATTTATTAACATATAATAAGGCAGACTTTGCTCTCCAGAACAGCGGTGCTCTAGGTATAAATAGAAAATTGTATTTTCGTATTAGGGCAGAAGGGGATGATGGAGAACAGTCTGTGGAGTGGGCATCCATTTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
7830fa77923f6d37d4ce1e6f7bca502db17a93b8d24ba204856b023904706177
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8026
Evidence 0,8026

Literature

Title Authors Date PMID Source
Complete genome sequence of Escherichia coli bacteriophage VEc33 Denisenko,E., Kislichkina,A., Krasilnikova,V., Verevkin,V. and Volozhantsev,N. 2025 GenBank