Genbank accession
WMI32669.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,88
Protein sequence
MAAMTPQQSVRAVYKAEQEINSTENKNTSASTVKGDNGFCILASVYNSSSDQYIQNFKAITFDSVPEIGVTREADVTSFPVESGSDVSDHVQIKNNRFKLSGIITETPIRLMRDQLYSAGVNGARISQAIEYLDQIFESRQPIVLLTEHRTFENVILKGYSYDYKSEFAMQFDLEFEQIRLVSKATTNAIAVKTAPNKSTGGDVKGQVSSDGPKSDITVRTGKATNTPTNNGG
Physico‐chemical
properties
protein length:233 AA
molecular weight: 25527,12620 Da
isoelectric point:5,51687
aromaticity:0,07725
hydropathy:-0,43820

Domains

Domains [InterPro]
DC_0628
STR
42–233
IPR048494
ATT
62–188
WMI32669.1
1 233
Architecture
STR
ATT
STR
STR 42-61 | ATT 62-188 | STR 189-233
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WMI32669.1
1 233
Domain Start End Length (AA) Confidence
N-terminal 1 18 18 0,8795
Central domain 19 217 200 0,0433
C-terminal 218 233 15 0,9327
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-18
Central
19-217
C-terminal
218-233

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage iGC_PHA_EC001
[NCBI]
3049681 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WMI32669.1 [NCBI]
Genbank nucleotide accession
OR437326.1 [NCBI]
CDS location
range 53628 -> 54329
strand +
CDS
ATGGCAGCAATGACTCCACAACAGTCGGTGAGAGCCGTCTATAAGGCAGAACAGGAAATCAACTCTACCGAGAATAAAAATACCTCGGCATCTACTGTAAAGGGTGATAATGGTTTCTGCATTTTAGCAAGTGTTTATAATAGTTCAAGTGATCAATACATTCAAAACTTTAAAGCAATTACTTTCGATTCAGTCCCTGAAATAGGTGTAACTCGTGAGGCAGATGTTACAAGTTTTCCAGTAGAATCAGGATCGGATGTAAGTGACCACGTTCAAATAAAAAACAACAGATTTAAACTTTCAGGTATAATTACCGAAACACCAATTCGGCTTATGCGAGATCAATTATACAGTGCAGGTGTAAATGGTGCAAGGATTTCTCAAGCAATAGAATATCTAGATCAAATATTTGAGTCAAGACAACCAATAGTTCTTTTGACAGAACACAGAACTTTTGAAAATGTTATCCTCAAAGGTTATTCGTATGATTATAAATCTGAATTTGCTATGCAATTTGATTTAGAGTTTGAACAAATCCGCCTCGTTTCTAAGGCCACTACTAACGCTATTGCTGTAAAAACAGCACCAAATAAGTCAACTGGTGGTGATGTGAAAGGTCAAGTTTCTTCGGATGGTCCTAAGTCAGATATTACAGTAAGAACTGGCAAAGCTACTAATACCCCAACAAACAACGGAGGTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
ad70e10ec1bc6f451afea46f0663dfe4618b2f52df2e1f042f6e27624721ff89
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7162
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete Genome Sequence of Escherichia phage iGC_PHA_EC001 Khan,T., Haider,A., Rahman,S., Moon,S.B., Mahmud,I., Mondal,S.I., Begum,A., Biswas,S.K., Jubair,M. and Rahman,M. 2024 38019277 GenBank