Genbank accession
QXV80353.1 [GenBank]
Protein name
receptor-binding tail tip protein
RBP type
TSP (evidence: DepoScope, probability 0.98)
TSP (evidence: RBPdetect, probability 0.89)
Protein sequence
MGFFAGKYSDGKTVLSLNTESGGDINRHYSPNANSIFHSDMPFVLVDGTYEAGLGNAGNGFFVCQMPSDIINIKSNDPGRVILTAIEINGTHRAFLNGTQSKVGQTIVATQADPFRSFASVSQTSGFAFGNSLASGTYNYNPSIGHEESISRSGTGGTTLHSTYHGIVRPGAGAPVGVTVAEAFAQLGFPTNSSTVPVDGNNPYYWDPGWMSPLGAAHRGHDWFYVCNSNIRGYGGVRQGLPGNVNTMYHDGGNRFVCRGSTTNLANQSGDSTILQDWYNITPTKVIWYVLNLRYSNGGMSISGNPFTGSDILISPSNFIIKGVSLPNTGYKFINQNAFGNLGYRPDMEYIGNNAAYTGVFGDTTARCELVGSSNGGLWSPVDYGGAKSQISIYKFGVGKQWYVNSNNNTIGNEHGVVWGPSAVPLRLFGGNVGSSYMGDDITPHYPGTGDRYVGLSTIGLGIPGGNATVILTTEVISGNLNCAGVPANTWNNGVFQVQGRRAYSYSGGDAIFHQILTLPVGYLVPFHTTSAFRYTPNNALSRNSFIYTVKNLGNGNVELGVVMHVSLGSAVFLPRLRVTVQRLT
Physico-chemical properties
protein length: 585 AA
molecular weight: 62393.76170 Da
isoelectric point: 8.14469
aromaticity: 0.11282
hydropathy: -0.15829
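
Two of the properties above can be recomputed from the protein sequence with a few lines of Python. The sketch below assumes that "aromaticity" means the fraction of Phe, Trp and Tyr residues and that "hydropathy" means the mean Kyte-Doolittle value (GRAVY); the record does not state which definitions it uses, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy value for each of the 20 standard residues.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Calling `aromaticity(seq)` and `gravy(seq)` with the full 585-residue sequence from the "Protein sequence" field gives values that can be compared with the ones listed above. Molecular weight and isoelectric point need residue mass and pKa tables and are left out of this sketch.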

Domains

Domains [InterPro]
IPR059609: RBD, residues 1–585
Architecture
RBD 1–585

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain          Start  End  Length (AA)  Confidence
N-terminal      1      10   10           0.0854
Central domain  11     209  200          0.4716
C-terminal      210    585  375          0.8232
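
The record does not say which boundary convention the Start, End and Length columns follow. The sketch below reads Start and End as 1-based inclusive positions (an assumption) and compares the resulting span with the reported length of each row. Under that reading the central and C-terminal spans come out as 199 and 376 residues, against reported lengths of 200 and 375; both sets of numbers sum to 585. The names `DOMAINS`, `inclusive_length` and `check_domains` are hypothetical.

```python
# Rows copied from the table: (domain, start, end, reported length in AA).
DOMAINS = [
    ("N-terminal", 1, 10, 10),
    ("Central domain", 11, 209, 200),
    ("C-terminal", 210, 585, 375),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


def check_domains(domains, protein_length):
    """Return (domain, inclusive span, reported length) for each row.

    Raises ValueError if the rows are not contiguous or do not cover
    the whole protein under the 1-based inclusive assumption.
    """
    expected_start = 1
    report = []
    for name, start, end, reported in domains:
        if start != expected_start:
            raise ValueError(f"{name} starts at {start}, expected {expected_start}")
        report.append((name, inclusive_length(start, end), reported))
        expected_start = end + 1
    if expected_start - 1 != protein_length:
        raise ValueError("domains do not cover the full protein")
    return report
```

`check_domains(DOMAINS, 585)` returns one tuple per row, so any row whose span and reported length differ is easy to spot before slicing the sequence by domain.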
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1–10
Central: 11–209
C-terminal: 210–585

Taxonomy

       Name                                   Taxonomy ID  Lineage
Phage  Escherichia phage IrisVonRoten [NCBI]  2852004      Uroviricota > Caudoviricetes > Demerecviridae > Tequintavirus > Tequintavirus irisvonroten
Host   Escherichia coli K-12 [NCBI]           83333        Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Genbank protein accession
QXV80353.1 [NCBI]
Genbank nucleotide accession
MZ501075 [NCBI]
CDS location
range 110472 -> 112229
strand +
CDS
ATGGGTTTTTTCGCTGGAAAGTATAGCGATGGTAAGACCGTACTATCCTTAAATACTGAATCTGGTGGTGACATTAATCGTCACTATAGTCCAAATGCCAATAGTATTTTTCATAGTGATATGCCATTTGTCCTAGTTGATGGTACTTACGAGGCTGGGCTAGGTAATGCCGGAAATGGGTTTTTTGTATGTCAGATGCCTTCTGACATAATAAATATTAAATCCAATGACCCAGGTAGAGTTATACTAACTGCTATTGAAATTAATGGTACTCACAGAGCTTTTCTTAATGGTACTCAGAGTAAAGTTGGTCAAACTATAGTTGCCACTCAGGCAGATCCCTTTAGATCTTTTGCTAGTGTTTCTCAAACCAGTGGGTTTGCATTTGGTAATAGTCTAGCATCTGGCACTTATAATTATAATCCCTCGATAGGGCATGAAGAATCTATCTCTAGGAGTGGTACAGGGGGTACTACGTTACATAGTACCTATCATGGGATAGTTAGGCCCGGTGCGGGAGCTCCTGTAGGTGTTACTGTTGCAGAAGCCTTTGCACAATTAGGTTTTCCTACTAATAGTAGCACAGTACCTGTAGATGGCAATAACCCGTACTATTGGGATCCTGGATGGATGTCGCCTCTAGGAGCAGCGCATAGAGGGCATGATTGGTTTTATGTCTGCAATTCTAATATACGTGGATATGGTGGGGTTAGACAAGGGCTCCCAGGTAATGTAAATACTATGTACCACGACGGTGGGAACAGATTTGTTTGTAGGGGATCTACTACTAATTTAGCTAACCAGTCGGGTGACTCTACAATACTACAGGATTGGTATAATATAACTCCTACTAAGGTTATTTGGTATGTTCTTAATTTAAGGTACTCTAATGGTGGAATGAGTATTTCAGGCAACCCCTTTACTGGTTCCGATATTCTTATATCTCCTTCTAATTTCATAATTAAAGGGGTAAGTCTCCCCAATACCGGGTATAAATTTATAAATCAGAATGCTTTTGGTAACCTAGGTTACCGTCCTGATATGGAGTATATAGGGAATAACGCAGCGTATACTGGGGTTTTTGGGGATACTACTGCAAGGTGTGAACTTGTAGGGTCTAGTAATGGAGGATTATGGTCTCCTGTAGACTATGGTGGAGCTAAATCACAGATTAGCATTTATAAGTTCGGAGTAGGTAAACAGTGGTATGTAAACTCAAATAATAATACTATTGGTAATGAACATGGGGTTGTTTGGGGGCCCTCAGCAGTTCCACTGCGACTTTTTGGTGGAAATGTAGGTAGTTCTTATATGGGAGATGATATTACACCCCACTATCCAGGAACTGGGGATAGATACGTTGGTTTATCAACTATTGGGCTGGGTATACCAGGCGGAAATGCTACAGTAATTCTTACTACTGAAGTTATATCAGGTAATCTTAATTGTGCAGGTGTTCCCGCTAATACATGGAATAATGGTGTGTTTCAGGTGCAAGGAAGAAGAGCATATAGCTATAGCGGCGGGGATGCAATATTCCACCAGATTTTAACGCTACCTGTGGGCTATCTAGTACCTTTTCATACTACATCTGCTTTTAGATACACGCCTAACAATGCTCTTAGTAGAAACAGTTTTATATATACCGTTAAAAATCTAGGAAATGGAAATGTAGAGTTAGGAGTAGTTATGCACGTGAGTTTGGGTAGCGCAGTTTTCCTACCTAGATTAAGAGTAACAGTTCAACGCCTTACCTAA
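
The CDS coordinates and the protein length can be cross-checked: the range 110472 -> 112229 spans 1758 nucleotides, or 586 codons, which corresponds to 585 residues plus one stop codon. The sketch below translates a plus-strand CDS with the standard codon assignments and is only an illustration; the helper names are hypothetical and no external library is used.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive genomic range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)
```

For example, `translate("ATGGGTTTTTTCGCTGGAAAGTATAGCGAT")`, the first 30 bases of the CDS above, returns `"MGFFAGKYSD"`, which matches the start of the listed protein sequence.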

Genome Context

Tertiary structure

PDB ID
a7c3bfb4d15c6da87c275a12680ec68e33a99ab809e282bf305ccd0211cda817
Source ESMFold
Method ESMFold
Resolution 0.2353
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
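
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch: the function name is hypothetical, and because the legend does not say where scores of exactly 90, 70 or 50 belong, the code places them in the lower band as an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"
```

Applied to a list of per-residue scores, for example `[plddt_band(s) for s in scores]`, this gives the band for each residue, which can then be used for colouring or for counting how many residues fall into each category.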

Literature

Title: Systematic exploration of Escherichia coli phage-host interactions with the BASEL phage collection
Authors: Maffei, E., Shaidullina, A., Burkolter, M., Heyer, Y., Estermann, F., Druelle, V., Sauer, P., Willi, L., Michaelis, S., Hilbi, H., Thaler, D.S. and Harms, A.
Date: 2021
Source: GenBank