Genbank accession
XKV17373.1 [GenBank]
Protein name
colanic acid degradation protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MANFPTYPATTLEQGVDLVIFSSNQMHDVINGDATATVETETGLIPTLRKALVDNFFFKSPVAWSAGSTETVFNQLRYFENGILSGYYYAPSATTANPVPMQGTPVGDSNWTLYALKTEQLASDVYPWYFKGATGYETEISPPYIFDNAIVTINGVIQIQGEAFTIKDSKIILAEPLGLDPSTGLPNKLFAFIGKTTASTSYVEKNLLSSTTGAAMVGLPSGGNLLQAQYFVTPEQFGAIGDGVTDDTQAILKTITFANTNNIQVRADKNYRFTSSIAMSGIRWYGGTFTGNGGTMISTVSCWMENVRFEKCYVKMLGGDCRFYRNIFSNATSTAAFLMQAMTSEGTLDFSYNEMYGCKYAILQQGTGEVMTYGRYSNNYIHDIKGDAIELNVVQKHYTEGLIIENNHIANVDASGQGANWGIGIGVAGSGPYGVDVPDSQYVRNFSIVGNRVYNCRQCLHVEMGKNFTIRDNEVYPNTAVSTGTGLTTCGVALYGCQDFEVDGLTGYLLNDPSVSTRMVFIDWGVNSGRYAGPPINFTIKNLDIPESSIEIATSGSDAWENSTIVSNINCNVFKWRGLPSSSTFNNIRCRSIDFIGQHGSGEGSGGGFYTRSQFTYMKWVGCTALSGDETTVSFAKIYTDRCDQVGNNFGVPTAVDGTGHRGPVLTTISEQYFTAYDEFPGGREFPTGTVIHCASGKKHVVTVGGAFFSANEKIKATVTGQTYLQSNALNWASNGYAKAAGTKIVIPGAGANGGDLVTTIARATYVTNSLYTIDIADPIVTPTAENTQIKALNPVTFVTVNNA
Physico‐chemical
properties
protein length:804 AA
molecular weight: 86766,13730 Da
isoelectric point:5,04852
aromaticity:0,11194
hydropathy:-0,11244

Domains

Domains [InterPro]
IPR011050
STR
227–480
IPR012334
STR
234–514
XKV17373.1
1 804
Architecture
ATT
STR
RBD
ATT 10-114 | STR 157-514 | RBD 515-802 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
XKV17373.1
1 804
Domain Start End Length (AA) Confidence
N-terminal 1 246 246 0,9927
Central domain 247 673 428 0,9923
C-terminal 674 804 130 0,9273
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-246
Central
247-673
C-terminal
674-804

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage Eryne
[NCBI]
3378211 Viruses >
Host Escherichia coli UTI89
[NCBI]
364106 Pseudomonadota > Gammaproteobacteria > Enterobacterales > Enterobacteriaceae > Escherichia > Escherichia coli

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XKV17373.1 [NCBI]
Genbank nucleotide accession
PQ529420 [NCBI]
CDS location
range 27612 -> 30026
strand +
CDS
ATGGCGAATTTTCCAACATATCCTGCTACGACACTAGAACAAGGTGTTGATCTGGTTATATTCTCGTCTAACCAGATGCATGATGTTATTAATGGAGATGCTACAGCAACTGTAGAAACCGAAACAGGTTTAATACCTACACTGCGTAAAGCACTTGTTGATAACTTCTTCTTTAAGAGTCCTGTTGCTTGGTCAGCAGGATCAACTGAAACTGTATTTAACCAGTTACGTTATTTTGAAAACGGTATCTTGAGTGGGTACTATTATGCCCCATCTGCCACTACTGCGAACCCTGTACCTATGCAGGGCACACCTGTAGGGGACAGTAACTGGACTCTGTACGCTCTTAAAACTGAACAGTTAGCTTCAGATGTGTATCCTTGGTATTTTAAAGGTGCTACAGGATATGAAACAGAAATCAGTCCTCCTTATATCTTTGATAATGCTATTGTCACCATTAACGGTGTAATTCAGATTCAAGGTGAAGCATTTACTATCAAAGATAGTAAAATTATTCTGGCAGAACCATTAGGTTTAGATCCGTCTACTGGTCTACCTAACAAATTGTTTGCTTTTATTGGTAAAACAACAGCCTCTACTTCTTACGTAGAGAAAAACCTTTTGTCATCTACCACTGGTGCAGCAATGGTTGGTCTTCCATCTGGAGGCAACCTGTTACAAGCTCAATACTTTGTAACGCCTGAGCAGTTTGGTGCAATTGGTGATGGAGTTACCGATGATACTCAAGCAATTTTAAAAACTATTACTTTCGCTAACACGAATAACATTCAAGTACGTGCAGATAAGAATTATAGATTCACAAGCTCAATTGCTATGTCTGGTATTCGTTGGTATGGTGGTACATTCACGGGTAACGGTGGAACAATGATTTCTACTGTGTCCTGTTGGATGGAAAACGTTCGCTTTGAAAAATGTTATGTTAAGATGTTAGGTGGTGATTGCCGATTCTATCGTAATATCTTCTCCAACGCAACATCTACAGCAGCATTCTTAATGCAAGCAATGACAAGTGAAGGTACGTTGGACTTCAGTTACAACGAAATGTATGGTTGTAAATATGCGATCCTACAACAAGGTACTGGAGAAGTAATGACCTACGGGCGTTACTCTAACAACTATATTCACGATATCAAAGGTGATGCTATTGAACTTAACGTAGTTCAAAAACACTACACTGAAGGTTTGATTATCGAGAATAACCACATTGCTAACGTAGATGCTTCTGGACAAGGGGCAAACTGGGGTATTGGTATTGGTGTAGCAGGTAGTGGCCCATATGGTGTTGATGTTCCTGATTCACAATATGTACGTAATTTCAGTATCGTTGGGAATAGGGTTTACAATTGCCGTCAATGTCTACACGTTGAAATGGGTAAAAACTTCACGATACGTGATAATGAAGTTTATCCTAACACAGCAGTTTCAACAGGTACAGGTTTAACTACTTGTGGTGTTGCATTATACGGATGCCAAGATTTTGAAGTTGATGGTTTAACTGGTTATCTGCTTAATGATCCATCTGTTTCAACACGCATGGTCTTTATTGACTGGGGTGTTAACAGTGGAAGATATGCGGGGCCACCAATTAACTTTACAATTAAGAATTTGGATATTCCAGAATCTTCAATTGAGATTGCAACATCTGGATCAGATGCCTGGGAAAACTCTACAATTGTTAGTAATATCAATTGTAATGTTTTCAAATGGCGTGGGCTACCATCTAGCTCTACCTTTAATAATATTCGTTGCCGTAGTATTGACTTTATTGGTCAACACGGAAGCGGAGAAGGAAGTGGTGGCGGTTTCTATACTCGCAGCCAATTCACGTATATGAAGTGGGTAGGATGTACAGCACTTAGCGGAGATGAAACAACAGTTTCTTTTGCTAAAATCTACACTGATCGTTGCGATCAGGTTGGAAATAACTTTGGTGTTCCAACTGCTGTTGATGGTACAGGTCATCGTGGGCCAGTTTTAACTACTATTTCTGAACAATATTTTACAGCTTATGATGAATTCCCAGGTGGGCGTGAATTCCCTACTGGAACAGTTATTCATTGTGCAAGCGGTAAAAAACATGTTGTTACAGTAGGTGGTGCTTTCTTTAGTGCCAACGAGAAAATAAAAGCAACTGTTACAGGTCAAACCTATCTACAGTCTAATGCTTTAAACTGGGCTAGTAATGGCTACGCTAAAGCAGCAGGTACTAAGATTGTTATTCCAGGTGCAGGTGCAAATGGCGGCGATCTTGTGACAACTATAGCACGTGCTACATACGTGACAAATAGTTTGTACACAATAGATATTGCAGATCCAATTGTAACACCTACAGCAGAGAATACACAAATTAAAGCTCTGAATCCTGTTACTTTTGTTACTGTCAATAATGCTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
6d4c72665e2550d0cb870a90353d2d190b0cf31782390732f93a0069b1f40581
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,2721
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50