Genbank accession
AFO72153.1 [GenBank]
Protein name
cell wall hydrolase
RBP type
TSP
Evidence RBPdetect
Probability 0,90
Protein sequence
MIEANIDKFNVIEKGTITLNVMFEEGSNLINTSFSESMENVKNKVLVVDQYGNKISEKINDSIFKDVGVIMQKVIQQQENSTVDIESEFKGIEQTCNLKGYGDVSCITGRGVKVKDSYTGLVGLFYIDTDKHNWDSNGNYEIDLDLNFQNIMDEKTAGQDEQKEESSSLNGEGTLNGREVKAEFTAYYPSNNAMEGGYYQAMDGKRLVPSNNTCAAPSKLKFKTKIQAKCPGTKIDGKTYTVTDRGGAIDLKNGVYRIDILMSSEKECNDFGRRKGTIIIGDGTGYTNAIGKAKELISIAKSKLGCKYVWGATGENTFDCSGFTQWCYKKIGISIPRTASAQSKAGKPVDLNDRSKWKAGDLLCRVSGGSNNHVVMYIGNNQIIHSPQTGDVVKIQSVDSYRKGKAYTHVRRYL
Physico‐chemical
properties
protein length:414 AA
molecular weight: 45744,07360 Da
isoelectric point:8,18628
aromaticity:0,08213
hydropathy:-0,53720

Domains

Domains [InterPro]
IPR056937
ATT
3–146
IPR059180
RBD
182–280
IPR000064
ENZ
290–414
IPR038765
STR
292–413
IPR000064
ENZ
305–412
AFO72153.1
1 414
Architecture
ATT
RBD
STR
ATT 3-146 | RBD 151-291 | STR 292-414
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
AFO72153.1
1 414
Domain Start End Length (AA) Confidence
N-terminal 1 193 193 0,7651
Central domain 194 403 211 0,5082
C-terminal 404 414 10 0,2847
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-193
Central
194-403
C-terminal
404-414

Taxonomy

  Name Taxonomy ID Lineage
Phage Clostridium phage phiMMP04
[NCBI]
1204535 Uroviricota > Caudoviricetes > Sherbrookevirus >
Host Clostridioides difficile
[NCBI]
1496 cellular organisms > Bacteria > Bacillati > Bacillota > Clostridia > Peptostreptococcales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AFO72153.1 [NCBI]
Genbank nucleotide accession
JX145342 [NCBI]
CDS location
range 11380 -> 12624
strand +
CDS
ATGATAGAGGCTAATATAGATAAATTTAATGTCATTGAAAAAGGGACTATTACACTAAATGTTATGTTTGAAGAAGGGTCTAATCTTATTAACACGAGCTTTTCAGAGAGTATGGAGAATGTAAAAAATAAAGTATTAGTTGTAGACCAGTATGGCAATAAGATTAGTGAAAAGATAAATGACTCTATATTTAAAGATGTTGGGGTAATTATGCAAAAAGTTATACAACAACAAGAAAATAGTACTGTAGATATAGAAAGCGAATTTAAAGGAATAGAGCAGACTTGCAACCTAAAAGGTTATGGTGATGTAAGTTGTATAACTGGTAGAGGTGTAAAGGTTAAGGATAGCTATACAGGGCTTGTAGGTCTATTTTATATAGATACAGATAAACACAACTGGGACAGTAACGGAAATTATGAGATAGATTTAGATTTAAATTTTCAAAATATCATGGATGAAAAGACAGCAGGACAGGACGAACAAAAGGAAGAAAGTTCTAGTTTAAATGGAGAAGGTACTTTAAATGGAAGAGAAGTAAAAGCAGAATTTACAGCGTATTATCCTTCAAACAATGCCATGGAGGGTGGATACTATCAAGCTATGGATGGTAAAAGACTTGTACCTTCAAACAATACTTGTGCTGCACCTAGTAAACTTAAATTTAAAACAAAAATTCAAGCAAAATGTCCTGGAACTAAAATTGATGGTAAAACTTATACAGTAACAGATAGAGGCGGAGCGATTGACTTAAAAAATGGAGTGTATAGAATAGACATATTAATGTCTAGTGAAAAAGAATGTAATGATTTTGGAAGAAGAAAAGGAACAATAATAATTGGAGATGGTACAGGATATACAAATGCGATAGGAAAAGCAAAAGAATTAATTAGCATAGCAAAAAGTAAATTAGGTTGTAAGTATGTTTGGGGGGCAACTGGGGAGAATACATTCGATTGCAGCGGGTTTACTCAGTGGTGCTACAAAAAGATAGGGATAAGTATTCCTCGTACTGCTTCCGCACAAAGCAAAGCAGGTAAACCAGTAGATTTGAATGATAGAAGCAAGTGGAAAGCAGGAGATTTATTGTGTAGGGTCAGCGGAGGAAGTAACAACCATGTTGTGATGTACATTGGAAATAATCAAATAATCCATTCTCCACAAACAGGTGATGTGGTGAAAATACAGTCTGTTGACTCATATAGAAAAGGGAAAGCATATACACATGTCAGAAGATATTTATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
15545f3a6bb4349dae52799ffa776446574609cfe366c06a810a5b2f4e002c94
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8466
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50