Protein
- Genbank accession
- XXQ42890.1 [GenBank]
- Protein name
- tail spike protein
- RBP type
-
TFTSPTSPTSP
- Protein sequence
-
MAYSWQEQILPAGAQTVPVEINYLDRSYIYLYVNREEVFDFTWDSDTRIRFDKPLDAESTVLIVRRTAKEFLYIMFSEGAAFIKENLDTQNTQFLHLAQELVEGRSIEGFFGTINMNGFRIINLAAGKDATDAVNKSQLDEVSTRVDNIEGAFNGVTSSYPMYFNTTEPTKEFRIALNFTKAAVYINGVCQTPGYSYTIRDNTVILADPVPAGTHVFMRLGEDVLEGGYATTESLAALSNALTRIDGPEVNPEYYHPHWRDSFDVRGWGVAGDAVTDDTAALQAMLAAAAPNQYIDGKGLTFKVTALPDLSRFRNAAFKYERLAGQPLTYVSEGYFEAGLTKITDTPFYNAWTQDKSFVYDNVIYAPFMAGERHGVQNLHVAWVRSGDDGQTWTMPEWLTPIHPNYASGVNYHCMSMGVSNNRLYMLVETRNLSDARRVRAEMWSRPMPYARRPTGGISTVAGDNFATVVLPMHGLKVGDTINFSNSGVTGVSGNTVVRTVVDANTITVPLTNAAASTLTNAGVTWSFGTRFWDCQWEVTLLPGIAYSTNADLAVTETHSFTALEGNAVAVGYHNGDVSPRRLGILYFPDVYNNPGVFERRTVSQEYANDASEPCIRYYDGELYLTTRGTSATSAGSTLARSTNSGLTWEYLRFPNKVHHSNLPFAKVGDELYIFSSERAANEWEADTPDNRYNGNYARTFMCKVNVRNWPSSLDEVQWFNVTDQIYQGNIVNSSVGVGSVCVKDGWLYYLFGGEDFLSPWSIGGNSAKRWYVQDGHPTDLYSYRIKVGPQQHVSRDFKYGAAPNRTVPVSMGVDGLRHVSAPMVFDESVEVNSLRVTGTGHNGIPAVRSEVLLDGDYGQIYKSVPTGNPAQQRLILSGGSGNSAASGAIVQLYGSNHSTPNRAVLYATGGVYTQNNLLPYNDSSVALGGASNRWSTVYAVTGTINTSDGTLKTAPEEVESALLDAWADVHIISFKWLDSLATKGDAARTHFGVIAQQVRDVLVTHGLMEPDATSCKYGFLCHDEYPEMVEHDEEGNVSVITPAGSRWGIRPDQMFFIEMLYQRRELARLKEQVKSLAHNKE
- Physico-chemical properties
  - protein length: 1082 AA
  - molecular weight: 119744.20150 Da
  - isoelectric point: 5.24774
  - aromaticity: 0.11183
  - hydropathy: -0.28494
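The record does not say how these values were computed. As a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), the two descriptors could be reproduced as below. The function names and the 20-residue test fragment (the first 20 residues of the sequence above) are illustrative choices, not part of the record.

```python
# Kyte-Doolittle hydropathy scale (per-residue scores).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Illustrative input: the first 20 residues of the protein sequence.
fragment = "MAYSWQEQILPAGAQTVPVE"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Running the same functions on the full 1082-residue sequence should give values comparable to the ones listed, provided the page used the same definitions.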
Domains
Domains [InterPro]
| ID | Type | Range |
|---|---|---|
| DC_1609 | STR | 1–345 |
| IPR005604 | ATT | 14–102 |
| IPR011049 | STR | 115–168 |
| IPR008635 | RBD | 120–159 |
| IPR024428 | ENZ | 339–788 |
| IPR001724 | Unmapped | 523–546 |

Sequence range: 1–1082
Architecture
STR 1-13 | ATT 14-102 | STR 103-298 | ATT 299-334 | STR 335-947 | RBD 948-1079 |
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
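The architecture string above can be checked for coverage of the full sequence. The sketch below assumes 1-based inclusive coordinates and a protein length of 1082 AA; the helper names `parse_architecture` and `uncovered` are hypothetical.

```python
import re

ARCHITECTURE = (
    "STR 1-13 | ATT 14-102 | STR 103-298 | ATT 299-334 | "
    "STR 335-947 | RBD 948-1079 |"
)
PROTEIN_LENGTH = 1082  # from the physico-chemical properties


def parse_architecture(text):
    """Split 'TYPE start-end | ...' into (type, start, end) tuples."""
    segments = []
    for field in text.split("|"):
        field = field.strip()
        if not field:
            continue  # skip the empty field after the trailing pipe
        match = re.fullmatch(r"(\w+)\s+(\d+)\s*[-–]\s*(\d+)", field)
        if match is None:
            raise ValueError(f"unrecognised segment: {field!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def uncovered(segments, length):
    """Return inclusive (start, end) ranges not covered by any segment."""
    gaps = []
    position = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > position:
            gaps.append((position, start - 1))
        position = max(position, end + 1)
    if position <= length:
        gaps.append((position, length))
    return gaps


segments = parse_architecture(ARCHITECTURE)
gaps = uncovered(segments, PROTEIN_LENGTH)
print(gaps)  # [(1080, 1082)]
```

Under these assumptions the six segments are contiguous, and only the last three residues (1080 to 1082) carry no label.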
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout (sequence positions 1–1082)
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 278 | 278 | 0.9963 |
| Central domain | 279 | 608 | 330 | 0.9279 |
| C-terminal | 609 | 1082 | 474 | 0.5848 |
Legend: N-terminal, Central domain, C-terminal
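With inclusive 1-based coordinates (an assumption about how the page numbers residues), a domain's length is `end - start + 1`. The sketch below recomputes the lengths from the listed boundaries and checks that the three domains tile the 1082-residue protein without gaps or overlaps. The function names are illustrative.

```python
PROTEIN_LENGTH = 1082

# (name, start, end) taken from the domain boundaries listed above.
DOMAINS = [
    ("N-terminal", 1, 278),
    ("Central domain", 279, 608),
    ("C-terminal", 609, 1082),
]


def span_length(start: int, end: int) -> int:
    """Length of an inclusive 1-based residue range."""
    return end - start + 1


def tiles_sequence(domains, total_length: int) -> bool:
    """True if the domains cover 1..total_length with no gap or overlap."""
    expected_start = 1
    for _, start, end in sorted(domains, key=lambda d: d[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


lengths = {name: span_length(start, end) for name, start, end in DOMAINS}
print(lengths)
# {'N-terminal': 278, 'Central domain': 330, 'C-terminal': 474}
print(sum(lengths.values()), tiles_sequence(DOMAINS, PROTEIN_LENGTH))
# 1082 True
```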
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
- N-terminal: 1-278
- Central: 279-608
- C-terminal: 609-1082
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Escherichia phage vB_EcoP_P64441 [NCBI] | 3403485 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | Escherichia coli [NCBI] | 562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
- Genbank protein accession
- XXQ42890.1 [NCBI]
- Genbank nucleotide accession
- PQ847473.1 [NCBI]
- CDS location
- range 37197 -> 40445, strand: minus (-)
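The CDS coordinates can be cross-checked against the protein length. This is a minimal sketch, assuming the range is 1-based and inclusive on the nucleotide record and that the final codon is a stop codon (the CDS shown below ends in TAA). The `reverse_complement` helper illustrates how a minus-strand feature would be read from the forward-strand genome sequence; the CDS printed on this page already starts with ATG, so it appears to be in coding orientation.

```python
CDS_START, CDS_END = 37197, 40445
STRAND = "-"

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in an inclusive 1-based range."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


nucleotides = cds_length(CDS_START, CDS_END)
codons, remainder = divmod(nucleotides, 3)
residues = codons - 1  # drop the terminal stop codon

print(nucleotides, codons, remainder, residues)  # 3249 1083 0 1082
print(reverse_complement("ATGC"))  # GCAT
```

The result, 1082 residues, agrees with the protein length reported in the physico-chemical properties.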
CDS
ATGGCCTATAGCTGGCAAGAACAAATTCTACCAGCTGGTGCACAGACTGTCCCAGTAGAGATCAACTATCTCGACCGTTCCTACATTTACTTGTACGTGAACCGTGAAGAGGTATTTGATTTTACTTGGGACAGTGATACGCGCATCCGCTTTGACAAGCCTCTCGATGCAGAGAGCACTGTACTGATTGTACGTCGCACTGCTAAAGAGTTTCTTTACATCATGTTCTCAGAAGGCGCTGCCTTTATTAAGGAGAACTTAGACACTCAGAATACCCAGTTCCTGCACTTAGCTCAGGAACTGGTAGAAGGTCGCAGTATCGAAGGTTTCTTTGGTACTATCAACATGAACGGCTTCCGCATCATTAATCTGGCAGCAGGTAAGGACGCTACGGATGCAGTTAATAAGTCACAGCTGGATGAGGTGAGTACCAGGGTAGATAACATCGAAGGTGCATTTAACGGTGTTACCTCCAGCTACCCTATGTACTTCAACACTACTGAACCTACTAAAGAGTTCCGAATTGCCCTTAACTTCACAAAGGCAGCAGTGTACATTAACGGGGTGTGTCAAACTCCGGGGTACAGTTACACCATTAGGGACAATACAGTAATACTAGCCGACCCTGTTCCCGCAGGTACTCACGTATTCATGCGACTAGGTGAGGATGTTCTGGAAGGAGGTTATGCTACTACTGAGAGTTTAGCTGCACTGAGTAACGCCTTAACTCGCATAGACGGGCCTGAGGTTAATCCTGAGTATTACCACCCTCATTGGAGAGATTCCTTTGACGTACGTGGATGGGGAGTAGCAGGTGATGCAGTTACGGATGATACGGCAGCACTTCAAGCCATGCTGGCAGCAGCGGCCCCTAATCAGTACATAGACGGTAAAGGTTTAACTTTTAAAGTAACTGCCTTACCTGACCTCAGCCGCTTCCGCAATGCCGCGTTTAAGTATGAGCGCTTAGCGGGTCAGCCTCTTACTTATGTATCGGAGGGTTATTTCGAAGCAGGTCTTACTAAGATTACAGACACCCCGTTCTACAATGCGTGGACTCAGGATAAGTCTTTCGTATATGATAATGTAATATACGCACCATTCATGGCAGGTGAACGGCATGGCGTACAGAACCTGCATGTAGCATGGGTACGTTCAGGTGACGATGGTCAGACGTGGACGATGCCAGAATGGCTTACTCCTATCCACCCTAACTATGCCTCAGGTGTTAACTACCATTGTATGAGCATGGGTGTTTCTAATAACCGCTTGTACATGCTAGTAGAGACCCGTAATCTATCTGACGCTAGGAGGGTACGGGCGGAGATGTGGTCGCGCCCAATGCCGTATGCTCGTAGGCCTACTGGGGGTATCAGCACAGTGGCTGGGGATAACTTCGCTACTGTAGTACTTCCTATGCATGGGCTTAAGGTAGGAGATACTATTAACTTCTCGAACTCTGGGGTTACGGGTGTTTCAGGTAATACTGTAGTACGTACCGTAGTTGACGCTAATACTATTACAGTACCTTTAACTAACGCAGCTGCCTCAACGTTGACCAACGCCGGGGTTACTTGGAGCTTTGGTACACGTTTCTGGGACTGCCAGTGGGAAGTGACCTTACTGCCGGGAATCGCATACTCTACTAACGCTGACTTAGCTGTAACTGAAACCCATAGCTTTACCGCCCTTGAAGGTAACGCAGTTGCAGTAGGTTATCATAACGGTGACGTCTCTCCTAGACGTTTAGGTATTCTGTACTTCCCTGATGTGTACAACAATCCCGGAGTATTTGAACGTCGTACAGTGTCTCAGGAGTACGCGAACGATGCCTCCGAACCTTGCATACGCTATTACGACGGGGAGTTGTATCTTACTACTCGGGGAACATCCGCCACCTCCGCAGGTTCTACCTTGGCACGTAGTACTAACTCAGGCTTGACTTGGGAGTACTTACGTTTCCCTAACAAGGTACATCACAGTAACCTACCTTTTGCTAA
GGTGGGGGACGAGTTGTATATCTTCTCAAGCGAGCGGGCTGCAAACGAGTGGGAAGCGGATACCCCTGATAACCGTTACAACGGTAATTACGCACGCACCTTTATGTGCAAGGTTAACGTCCGTAACTGGCCTAGCTCATTAGATGAGGTGCAGTGGTTTAATGTTACCGACCAGATTTATCAGGGTAACATAGTTAACTCCTCAGTAGGGGTAGGTTCTGTATGCGTTAAGGATGGGTGGCTGTACTATCTGTTCGGCGGAGAGGATTTCTTATCCCCGTGGTCCATTGGAGGTAACTCTGCTAAGCGCTGGTACGTGCAGGATGGTCATCCTACTGACCTGTACAGCTACCGTATTAAGGTAGGTCCGCAGCAGCATGTGTCTCGCGACTTTAAATACGGGGCAGCACCTAACCGTACTGTACCTGTGTCCATGGGGGTAGACGGGTTACGCCATGTTTCAGCGCCTATGGTCTTTGATGAGAGCGTAGAGGTTAACTCCTTACGTGTTACAGGTACTGGGCATAACGGAATCCCTGCTGTACGGTCTGAAGTACTTCTGGACGGGGATTACGGTCAGATTTACAAGTCAGTGCCTACTGGTAACCCAGCACAGCAACGGCTTATTCTGAGCGGGGGGTCAGGTAACTCCGCTGCCTCTGGCGCTATTGTGCAACTCTATGGATCTAATCACAGTACGCCTAATAGGGCAGTCCTGTACGCAACGGGAGGTGTCTACACTCAAAACAACCTGCTACCTTATAACGACTCCTCTGTAGCATTAGGGGGCGCGTCTAATAGATGGAGTACGGTGTATGCTGTTACAGGCACTATCAATACTTCTGACGGTACTCTTAAAACAGCCCCGGAGGAAGTGGAATCGGCACTGTTAGATGCTTGGGCGGATGTACACATCATTAGCTTTAAGTGGCTTGACAGTCTAGCTACTAAAGGGGACGCTGCTCGTACTCACTTTGGTGTAATAGCACAGCAGGTGCGTGACGTGCTCGTTACGCACGGACTTATGGAGCCTGATGCTACTTCATGTAAGTACGGGTTCCTGTGCCACGATGAGTACCCTGAGATGGTGGAACATGACGAGGAAGGTAATGTAAGTGTAATTACTCCTGCAGGAAGTCGTTGGGGTATTCGACCTGACCAGATGTTCTTCATTGAGATGCTGTACCAGCGTAGAGAGCTTGCCCGTCTAAAGGAGCAGGTAAAATCGCTGGCACATAATAAGGAGTAA
Genome Context
Tertiary structure
PDB ID
25270fcf5c50e91ac786ce3c36501e01edeadf60eed753a95a98becae1c33374
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
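The confidence bands above map directly onto a small lookup function. The page does not say which band the exact boundary values 90, 70 and 50 belong to; the sketch below assumes they fall into the lower band, and the function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the page's confidence label.

    Assumption: boundary values (90, 70, 50) are assigned to the lower band,
    because the page only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 80.0, 61.5, 42.0):
    print(value, plddt_band(value))
```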
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Characterization of the novel phage vB_EcoP_P64441 and its potential role in controlling UPEC in disposable catheters and inhibiting biofilm formation | Xiaoyue,L. | 2025-08 | — | GenBank |