Genbank accession
WBF54159.1 [GenBank]
Protein name
lateral tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MANFPTYPATTLEQGVDLVIFSSNQMHDVINGDATATVETETGLIPTLRKALVDNFFFKSPVAWSAGSTETVFNQLRYFENGILSGYYYAPSATTANPVPMQGTPVGDSNWTLYALKTEQLASDVYPWYFKGATGYETEISPPYIFDNAIVTINGVIQIQGEAFTIKDSKIILAEPLGLDPSTGLPNKLFAFIGKTTASTSYVEKNLLSSTTGAAMVGLPSGGNLLQAQYFVTPEQFGAIGDGVTDDTQAILKTITFANTNNIQVRADKNYRFTSSIAMSGIRWYGGTFTGNGGTMISTVSCWIENVRFEKCYVKMLGGDCRFYRNIFSNATSTAAFLMQAMTSEGTLDFSYNEMYGCKYAILQQGTGEVMTYGRYSNNYIHDIKGDAIELNVVQRHYTEGLIIENNHIANVDASGQGANWGIGIGVAGSGPYGVDAPDSQYVRNFSIIGNRVYNCRQCLHVEMGKNFTIRDNEVYPNTAVSTGTGLTTCGVALYGCQDFEVDGLTGYLLNDPSVSTRMVFIDWGVNGGRYAGPPINFTIKNLDIPESSIEIATAGSDDWENSTIVSNINCNDFKWRGLPSSSTFNNIRCRSIDFIGQHGSGEGSGGGFYARSQFTYMKWVGCTALSGDETTVSFAKIYTDRCDQVGNNFGVPTAVDGTGHRGPVLTTISEQYFTTYDEFPGGREFPTGTVIHCASGKKHVVTVGGAFFSDNEKIKATVTGQTYLQSNALNWASNSYAKAAGTKIVIPGAGANGGDLVTTIARATYVTNSLYTIDIADPIVTPTAENTKIKALNPVTFVTVNNA
Physico‐chemical
properties
protein length:804 AA
molecular weight: 86850,10490 Da
isoelectric point:4,97270
aromaticity:0,11194
hydropathy:-0,13259

Domains

Domains [InterPro]
IPR011050
STR
227–480
IPR012334
STR
234–514
WBF54159.1
1 804
Architecture
ATT
STR
RBD
ATT 10-114 | STR 157-514 | RBD 515-802 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WBF54159.1
1 804
Domain Start End Length (AA) Confidence
N-terminal 1 246 246 0,9930
Central domain 247 676 431 0,9922
C-terminal 677 804 127 0,8500
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-246
Central
247-676
C-terminal
677-804

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EC_OE_11
[NCBI]
3017778 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WBF54159.1 [NCBI]
Genbank nucleotide accession
OQ108497.1 [NCBI]
CDS location
range 68375 -> 70789
strand +
CDS
ATGGCGAATTTTCCAACATATCCTGCTACGACACTAGAACAAGGTGTTGATCTGGTTATATTCTCGTCTAACCAGATGCATGATGTTATTAATGGGGATGCTACAGCAACTGTAGAAACCGAAACAGGTTTAATACCTACACTGCGTAAAGCACTTGTTGATAACTTCTTCTTTAAGAGTCCTGTTGCTTGGTCAGCAGGATCAACTGAAACTGTATTTAACCAGTTACGTTATTTTGAAAATGGTATCTTGAGTGGGTACTATTATGCTCCATCTGCCACTACTGCGAACCCTGTACCTATGCAGGGTACACCTGTAGGAGACAGTAACTGGACTCTGTACGCTCTTAAAACTGAACAGTTAGCTTCAGATGTGTATCCTTGGTATTTTAAAGGTGCTACAGGATATGAAACAGAAATCAGTCCTCCTTATATCTTTGATAATGCTATTGTCACCATTAACGGTGTAATTCAGATTCAAGGTGAAGCATTTACTATCAAAGATAGTAAAATTATTCTGGCAGAACCATTAGGTTTAGATCCGTCTACTGGTCTACCTAACAAATTGTTTGCTTTTATTGGTAAAACAACAGCCTCTACTTCTTACGTAGAGAAAAACCTTTTGTCATCTACTACTGGTGCAGCAATGGTTGGTCTTCCATCTGGAGGCAACCTGTTACAAGCTCAATACTTTGTAACGCCTGAGCAGTTTGGTGCAATTGGCGATGGAGTTACCGATGATACTCAAGCAATTTTAAAAACCATTACTTTCGCTAACACGAATAACATTCAAGTACGTGCAGATAAGAATTATAGATTCACAAGCTCAATTGCTATGTCTGGTATTCGTTGGTATGGTGGTACATTCACTGGTAACGGTGGAACAATGATTTCTACTGTATCCTGCTGGATTGAAAACGTTCGCTTTGAAAAATGTTATGTTAAGATGTTAGGTGGGGATTGCCGATTCTATCGTAATATCTTCTCCAATGCAACATCTACAGCAGCATTCTTAATGCAAGCAATGACTAGTGAAGGTACGTTGGACTTCAGTTACAACGAAATGTATGGTTGTAAATATGCGATCCTGCAACAAGGTACTGGAGAAGTAATGACCTATGGGCGTTACTCTAACAACTATATTCACGATATCAAAGGTGATGCTATTGAACTTAATGTAGTTCAAAGACACTACACGGAAGGTTTGATTATCGAGAATAACCACATTGCTAACGTAGATGCTTCTGGACAAGGTGCAAACTGGGGTATTGGTATTGGTGTAGCAGGTAGTGGCCCATATGGTGTTGATGCTCCTGATTCACAATATGTACGTAATTTCAGTATCATTGGGAATAGGGTTTACAATTGCCGTCAATGTTTACACGTTGAAATGGGTAAAAACTTCACAATACGTGATAATGAAGTTTATCCTAACACAGCAGTTTCAACAGGTACAGGTTTAACTACTTGTGGTGTTGCATTATACGGATGCCAAGACTTTGAAGTTGATGGTTTAACTGGTTATCTGCTTAATGATCCGTCTGTTTCAACACGCATGGTCTTTATTGACTGGGGTGTTAACGGTGGAAGATATGCGGGGCCACCAATTAACTTTACAATTAAGAATTTGGATATTCCAGAATCTTCGATTGAGATTGCAACGGCTGGATCAGACGACTGGGAAAACTCTACAATTGTTAGTAACATCAATTGTAATGATTTTAAATGGCGTGGGCTACCATCTAGCTCTACTTTTAATAATATTCGTTGCCGTAGTATTGACTTTATTGGTCAACACGGAAGCGGAGAAGGAAGTGGTGGCGGTTTCTATGCTCGCAGCCAATTCACGTATATGAAGTGGGTAGGATGTACAGCACTTAGCGGAGATGAAACAACAGTTTCTTTTGCTAAAATCTACACGGATCGTTGCGATCAGGTTGGAAATAACTTTGGTGTTCCAACTGCTGTTGATGGTACAGGTCATCGTGGGCCAGTCTTAACCACTATTTCTGAACAATATTTTACAACTTATGATGAATTCCCAGGTGGGCGTGAATTCCCTACTGGAACAGTTATTCATTGTGCAAGCGGTAAAAAACATGTTGTTACAGTAGGTGGTGCTTTCTTTAGTGACAACGAGAAAATAAAAGCAACTGTTACAGGTCAAACCTATCTACAGTCTAATGCTTTAAACTGGGCTAGTAATAGCTACGCTAAAGCGGCAGGTACTAAGATTGTTATTCCAGGTGCAGGAGCAAATGGTGGCGATCTTGTGACAACTATAGCACGTGCAACATATGTGACAAATAGTTTGTACACAATAGATATTGCAGATCCAATTGTAACACCTACAGCAGAGAATACAAAAATTAAAGCTCTGAATCCTGTTACTTTTGTCACTGTCAATAATGCTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
ce741cb59021cd1c94524093734182ed2d113de05d3553ec0146c43019b292d5
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7355
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50