Genbank accession
XSB43047.1 [GenBank]
Protein name
side tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,67
Protein sequence
MWGNMAVKISGVLKDGTGKPVQNCSIQLKAKRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEGSRPGTLNDFLGAMTEDDVMPEALRRFEEMVEEVARNAEAASQSAAAAKKSETAAASSKNAAKTSETNAANSAQAAATSKTASANSATAAKKSETNAKNSETAAKTSETNAKSSQTAAKTSETNAKASETAAKNSQVAAAQSESAAAGSATSAAGSATAAANSQKAAKTSETNAKSSQTAAKTSETNAKASETAAKNSQDAAAQSESAAAGSASAAAASATASANSQKAAKTSETNAKTSETAAANSAKASAASQTAAKASEDAAREYASQAAEPYKYVLQPLPDVWIPFNDSLDMITGFSPSYKKIVIGDDEITMPGDKVVKFKRASTATYINKSGVFSVAKIDEPRFEKEGLLIEGQRTNYFVKSNTPAEWTSTSNIDKTNNGVYEFGFSYAKMRTKDNMTGQSSALSLHTCSASRGIDVSGDNKYCTVSCRVKAPDGLRCRLRFEKYDGSVYTFLGDAYLTFGTLIIEKTGGAANRIAATATKDPVTGWIFYEATIEAVEGETLIGAMIQYAPKKGGITEAGDYIYLATPQFENGGCASSFVITTTVPATRSSDMVTIPTENNIYNRPLTCLVEVNRNWGDIPPNVAPRIFDFSGVPPIESITYAFNTTEKYYGQLYMQTYKASTSTYVSSVFAGRADVRKFIGGFNIYSDGTKRVVSNGEATKTMKTEWTGVKTRTFIRIGGQATSGTRHLFGHLRNLRLWHKELTDAQMGESIK
Physico‐chemical
properties
protein length:802 AA
molecular weight: 84815,14780 Da
isoelectric point:8,46839
aromaticity:0,07481
hydropathy:-0,42294

Domains

Domains [InterPro]
IPR013609
ATT
5–135
IPR008969
ATT
8–85
G3DSA:2.60.40.1120
STR
9–102
XSB43047.1
1 802
Architecture
ATT
STR
ATT 5-247 | STR 260-802
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
XSB43047.1
1 802
Domain Start End Length (AA) Confidence
N-terminal 1 132 132 0,9880
Central domain 133 362 231 0,7025
C-terminal 363 802 439 0,7272
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-132
Central
133-362
C-terminal
363-802

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage PhiR49_1_star
[NCBI]
3416491 Viruses >
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XSB43047.1 [NCBI]
Genbank nucleotide accession
PV340569.1 [NCBI]
CDS location
range 37068 -> 39476
strand +
CDS
TTGTGGGGGAATATGGCAGTAAAGATTTCAGGTGTACTGAAAGACGGCACAGGAAAACCGGTACAGAACTGCTCAATCCAGCTGAAAGCAAAACGTAACAGCACCACGGTGGTGGTGAACACGGTGGCCTCTGAAAATCCGGATGAAGCCGGGCGTTACAGCATGGATGTTGAGTATGGCCAGTACAGCGTCATCCTGCTGGTTGAAGGTTTTCCGCCTTCACATGCCGGGACCATTACCGTCTATGAAGGTTCCAGACCAGGTACGCTGAATGATTTTCTCGGTGCCATGACGGAAGATGATGTCATGCCGGAGGCATTGCGTCGTTTTGAGGAAATGGTGGAAGAAGTGGCACGCAACGCCGAAGCCGCCTCTCAGAGCGCAGCGGCGGCAAAGAAATCCGAAACTGCAGCGGCATCATCGAAGAACGCGGCGAAAACCTCAGAAACGAATGCAGCTAATAGTGCACAGGCGGCAGCGACCTCAAAGACTGCATCGGCAAACTCCGCGACAGCAGCCAAAAAATCAGAAACCAACGCGAAAAATAGCGAGACAGCCGCAAAGACGAGCGAAACCAACGCAAAGTCCAGCCAGACGGCAGCGAAAACCAGCGAAACGAATGCCAAAGCCAGTGAAACTGCGGCAAAAAACAGCCAGGTTGCAGCAGCCCAAAGCGAGAGCGCGGCAGCCGGTTCTGCGACTTCAGCAGCTGGATCAGCAACTGCTGCGGCTAACAGCCAGAAAGCTGCGAAGACGAGTGAAACTAACGCAAAGTCCAGCCAGACGGCAGCGAAGACCAGCGAAACGAATGCCAAAGCCAGCGAAACTGCGGCGAAAAACAGTCAGGATGCAGCAGCCCAAAGCGAGAGTGCTGCAGCTGGTTCTGCAAGCGCGGCAGCTGCTTCTGCCACTGCATCAGCCAACAGTCAAAAAGCAGCAAAAACCAGTGAAACCAATGCAAAGACAAGCGAGACTGCAGCGGCGAACTCGGCGAAAGCATCGGCAGCAAGCCAGACAGCAGCTAAAGCAAGTGAAGACGCAGCCAGAGAGTATGCAAGTCAGGCAGCAGAGCCGTATAAATATGTCTTACAGCCGCTGCCTGATGTGTGGATACCGTTTAACGATTCACTGGATATGATTACGGGCTTTTCGCCGTCATATAAAAAAATTGTTATTGGTGATGATGAAATAACGATGCCTGGCGACAAGGTTGTTAAGTTTAAACGCGCATCAACTGCCACATATATCAATAAATCAGGCGTATTTAGTGTTGCTAAAATTGATGAGCCACGATTTGAAAAAGAAGGTTTATTGATTGAAGGACAGCGCACTAACTATTTTGTTAAATCCAATACTCCCGCTGAATGGACGAGTACCAGCAATATCGATAAAACTAATAATGGTGTTTATGAATTTGGTTTTTCATATGCCAAAATGCGAACAAAAGATAATATGACAGGACAATCATCTGCACTTAGTCTGCATACATGCAGTGCATCCCGGGGGATTGATGTTAGTGGCGATAATAAGTATTGCACTGTTTCATGCAGGGTTAAAGCTCCTGATGGTCTTCGTTGTCGTTTGCGTTTTGAAAAATACGATGGGTCGGTTTATACATTTTTAGGAGATGCTTATTTAACTTTCGGAACTCTGATAATAGAAAAAACTGGCGGAGCAGCCAATAGAATAGCAGCTACTGCAACTAAAGATCCGGTTACAGGGTGGATTTTCTATGAGGCAACTATAGAAGCTGTTGAAGGTGAAACCTTAATTGGCGCAATGATTCAGTATGCGCCGAAAAAAGGTGGTATAACTGAAGCGGGAGATTATATTTACCTTGCAACACCACAATTTGAAAACGGCGGATGTGCTTCATCTTTTGTTATTACGACAACTGTACCCGCAACCCGCTCCAGTGATATGGTGACGATCCCAACTGAAAATAATATCTATAATAGACCGCTTACGTGTCTTGTCGAGGTTAATAGAAATTGGGGCGATATTCCTCCTAATGTAGCACCGCGTATTTTTGATTTTTCTGGTGTGCCACCTATTGAGTCAATTACATACGCTTTTAACACAACTGAGAAATATTACGGTCAGCTTTATATGCAAACTTATAAAGCGTCGACAAGTACTTACGTTTCTAGTGTGTTTGCTGGTCGAGCTGATGTTCGAAAATTCATTGGTGGTTTTAATATTTATTCTGATGGTACTAAACGAGTAGTTTCTAACGGTGAGGCTACTAAAACTATGAAAACGGAGTGGACGGGCGTAAAAACACGGACCTTTATTCGAATTGGAGGTCAAGCCACATCGGGAACTCGTCATCTATTCGGCCATTTGAGAAATCTTCGTCTCTGGCATAAAGAATTAACTGATGCGCAAATGGGGGAGAGTATTAAATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
addd734a86ab84be9492e12e9bd8ce692292c13c1cab9f122c403207ca1e8f2b
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7511
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
E. coli prophages encode an arsenal of defense systems to protect against temperate phages Brenes,L.R. and Laub,M.T. 2025 GenBank