Genbank accession
XLV08213.1 [GenBank]
Protein name
colanic acid degradation protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MAVFSNFEGTMQPKFILGKKGASINFINDSVSIKDYLGEQFLPLKVGTSSDPNSAVNVKYLEDNPNALFGVVPPTDDVGKNNNTYFQIDEDGIVDIFAKSNGTWVKSLYINKKVLSSKNGGDFIGLITGGTVQQAQFYITPEEFGAKGDGETDDTAAVNLCAAYAKENKIQIQAGGNYRIKTAIYLEDVTWFGGTITGNGGTMVSVINSQIQFATLTGCYLKFFGGDGKFYFNKFINITSTAAFLMQGMTQPGTIDFCFNEMTQCKYGVLQQGTGEKMKVGRYSYNHIHDIYGDAIELNVINSHYPDGFVIEGNVIENVDGSNAPIPLSNWGIGIGVAGSGPYGLDAPDTQYVRNFVIKNNRLYNVRQCIHVELGMDFKILNNEVYPSSLVSVGTGLTTAGVITYGSKEFIIDGLTGHLLNDPSITNRMVSINWGVTSGRFAGPPRNFKISNLYIPESSILVYTSGSDNWLNNTELSNITCSKILWRGLPSSSKFSDIRTKELDVIGQHLSTEGEGGGIYTRSKYTYTNWTNCVVQDDCNVSKYSFSKMYVDRIDQSSNNFKVTTAIDGTGHRGPVLIPVVEQYYIPYDTFPGGRYFSEGTIIHKQSGGKYIVTVGGSFFGKYDTIRQTVAGQTYIEALGVSWGENQYAKAAGTEIIISGAGENGGDLRTYITRAVYVVNNVYRIDIADPIITATAAGAALKAAQPVTYITLP
Physico‐chemical
properties
protein length:713 AA
molecular weight: 77701,51320 Da
isoelectric point:5,46339
aromaticity:0,11360
hydropathy:-0,15442

Domains

Domains [InterPro]
IPR011050
STR
131–476
IPR012334
STR
142–504
DC_0045
RBD
432–713
XLV08213.1
1 713
Architecture
ATT
STR
RBD
ATT 1-170 | STR 171-504 | RBD 505-713
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
XLV08213.1
1 713
Domain Start End Length (AA) Confidence
N-terminal 1 154 154 0,9878
Central domain 155 582 429 0,9908
C-terminal 583 713 130 0,8957
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-154
Central
155-582
C-terminal
583-713

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage SHEFM2K
[NCBI]
3378341 Viruses >
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XLV08213.1 [NCBI]
Genbank nucleotide accession
PQ390715 [NCBI]
CDS location
range 271391 -> 273532
strand +
CDS
ATGGCAGTATTTTCTAATTTTGAAGGAACTATGCAGCCCAAATTCATATTGGGGAAAAAAGGTGCTTCAATAAATTTTATCAATGATTCAGTTTCTATAAAAGATTATCTAGGAGAACAGTTCCTACCACTGAAGGTAGGAACTTCATCTGACCCTAATTCAGCAGTTAACGTAAAATATCTTGAAGATAACCCTAATGCATTATTTGGTGTTGTACCCCCTACCGATGATGTAGGAAAGAACAATAATACGTATTTTCAAATTGATGAAGATGGTATTGTTGATATTTTTGCAAAAAGTAATGGTACATGGGTAAAATCATTATACATTAATAAAAAAGTTCTATCCAGTAAAAATGGTGGCGATTTCATCGGACTTATTACTGGAGGTACAGTTCAACAGGCCCAGTTTTATATAACACCAGAAGAATTTGGTGCTAAAGGTGATGGCGAAACTGATGACACTGCTGCTGTAAATTTATGTGCTGCTTATGCTAAAGAAAATAAAATCCAAATTCAGGCTGGTGGTAATTACCGTATCAAAACTGCAATATACCTGGAAGATGTAACTTGGTTTGGGGGAACTATTACTGGTAATGGTGGTACTATGGTATCCGTTATTAATTCCCAAATACAATTTGCAACACTTACTGGATGTTATTTAAAATTCTTCGGTGGCGATGGTAAGTTTTATTTTAATAAATTTATAAACATCACAAGTACTGCTGCATTTTTAATGCAAGGTATGACACAACCAGGAACAATAGATTTTTGCTTTAATGAAATGACCCAATGTAAGTATGGTGTTCTTCAACAAGGTACTGGTGAGAAAATGAAGGTTGGTAGATATTCATATAACCACATACATGATATATATGGTGATGCAATTGAACTAAACGTAATTAACTCACATTATCCTGATGGTTTTGTGATTGAAGGAAACGTAATTGAAAACGTTGATGGTTCTAACGCTCCAATCCCACTTTCAAACTGGGGCATTGGTATTGGCGTTGCTGGTTCTGGTCCTTATGGTTTAGATGCACCAGATACACAATATGTCAGAAACTTTGTTATTAAAAATAACAGACTATATAATGTTAGACAATGTATCCACGTTGAATTAGGAATGGACTTTAAAATACTTAATAATGAAGTTTATCCATCATCTTTAGTTTCTGTTGGTACTGGATTAACCACTGCTGGTGTTATAACTTATGGTTCGAAAGAATTCATTATTGATGGGTTGACTGGTCATTTGCTTAATGATCCTTCTATCACAAATAGAATGGTTTCAATAAACTGGGGTGTAACTTCTGGTCGTTTTGCTGGACCACCTAGAAACTTTAAAATTTCTAATCTCTATATTCCAGAGTCTTCTATACTTGTTTATACTTCCGGTTCTGATAATTGGTTAAATAATACAGAATTGAGCAATATCACCTGTTCTAAAATCCTTTGGCGTGGTTTGCCATCATCATCTAAATTTAGTGATATTAGAACTAAAGAATTAGATGTAATTGGTCAACACCTCTCTACCGAAGGCGAAGGTGGTGGTATCTATACTCGCTCTAAATATACTTATACTAATTGGACTAACTGTGTTGTTCAAGATGATTGTAATGTTTCAAAATATAGTTTTTCTAAAATGTATGTAGACAGAATAGACCAATCATCTAATAACTTTAAAGTTACAACTGCAATCGATGGTACAGGACACCGTGGCCCAGTACTGATACCTGTAGTTGAGCAATATTATATTCCATACGATACTTTTCCTGGGGGACGCTATTTCTCTGAAGGCACGATTATTCATAAACAATCGGGTGGAAAATATATTGTAACTGTTGGTGGTTCTTTTTTCGGAAAATATGACACAATTAGACAAACGGTAGCAGGACAAACATACATAGAAGCATTGGGTGTGTCATGGGGTGAAAACCAATATGCAAAAGCTGCTGGTACTGAGATCATTATATCCGGTGCTGGTGAAAATGGTGGTGATTTGAGAACGTATATAACTCGTGCAGTGTATGTTGTAAACAACGTATATAGAATTGATATTGCAGATCCTATCATAACAGCAACTGCCGCTGGTGCAGCATTAAAAGCTGCCCAACCCGTAACTTACATAACCCTACCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
a2d60778024a50d68ac86d93925a26d48fa046f7bf446d317f4eb7620c4c722c
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7235
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50