Genbank accession
CAM0052648.1 [GenBank]
Protein name
tail spike protein with colonic acid degradation activity
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,94
Protein sequence
MTTKVPNTMIEGSTINVKDFGAKGDGVTDDTAAIQDAINAGRRIYIPTGVYLINEVAIPDNRIIEGDGVESKLVVPVGVTHTRGMFYNAMSRWSNTTIRDLHFDGSNNYPTDKSVYYGDVAINNVMIRKSGACTDVTIQDCYFTKASTSSIYVSDGDHTTSLKIINNKFNDGNYKLKTIGIYGSNNVSDAKAPRGIEVSGNVINGGGSRIHKDGRIEGFTSSTDAIHLDNCRHSIISKNIIRENSGDAIRVEQSKYIMVSDNSIYRSGSAGITVYHSSQRCSIIGNTIDGWGYTIQAYCIRSHGGKYYICREFPDATHAVLPTDPSTVSWIIECPYNLTGIDTSTILPYSSTDYYSSGSSTGILPFRGSSAISVTSSSYAVKIIGNICNGNTSKDASNKYHTASEHGYSNKHTVNSPVGVTGDSNTVSGNAFSNCQGHELYAGEYQDPINQRGKSGLQYISDDNSYSAHRGHGKNTDKYYTIHDNLSITSGGGEFTPTLTPSTSGSITLTGAYNALSWYRVGQMVTISGQIRVGSVSSPVGGVKIEGLPFTQLNLADGAERVAGVALCNNVEAATPPITQFFVSGAGNVMWISGTTGTTTRNIGDLIKSGTVIDVNFTYRTQI
Physico‐chemical
properties
protein length:623 AA
molecular weight: 66844,34300 Da
isoelectric point:6,46245
aromaticity:0,08347
hydropathy:-0,30321

Domains

Domains [InterPro]
IPR012334
STR
14–316
IPR012334
STR
14–310
IPR024535
ENZ
15–272
IPR006626
Unmapped
93–119
CAM0052648.1
1 623
Architecture
STR
STR 12-623
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
CAM0052648.1
1 623
Domain Start End Length (AA) Confidence
N-terminal 1 28 28 0,8660
Central domain 29 295 268 0,9918
C-terminal 296 623 327 0,6081
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-28
Central
29-295
C-terminal
296-623

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage K452
[NCBI]
3105720 Viruses > unclassified bacterial viruses >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CAM0052648.1 [NCBI]
Genbank nucleotide accession
OZ196320.1 [NCBI]
CDS location
range 87000 -> 88871
strand -
CDS
ATGACAACAAAAGTTCCAAATACAATGATTGAAGGTTCGACCATAAACGTAAAAGATTTTGGCGCTAAAGGTGATGGTGTAACTGATGATACTGCTGCTATTCAAGATGCCATTAACGCAGGCAGGCGTATATATATCCCAACAGGCGTATATCTAATAAATGAGGTTGCTATACCCGACAACCGCATCATTGAAGGTGACGGCGTTGAGTCAAAACTGGTAGTTCCTGTTGGCGTTACTCACACCAGAGGCATGTTCTATAATGCCATGTCTCGTTGGTCGAATACTACAATTCGAGACTTACATTTTGATGGTAGTAATAATTACCCTACCGACAAAAGCGTATATTACGGCGACGTGGCCATAAACAATGTTATGATTCGAAAATCTGGAGCGTGTACAGATGTTACTATCCAAGACTGCTATTTCACAAAGGCGTCAACTAGTAGCATTTATGTATCGGACGGTGACCACACCACTAGTTTGAAAATCATCAATAACAAATTCAATGACGGTAACTATAAACTGAAAACTATAGGAATTTATGGTTCTAACAACGTGTCTGACGCTAAAGCCCCTCGCGGTATTGAGGTTAGTGGTAATGTTATCAATGGAGGCGGGTCACGAATACACAAAGACGGGCGTATTGAGGGGTTCACTTCGTCAACTGATGCTATCCACTTAGACAACTGTAGACATTCAATAATCAGTAAGAATATCATTCGTGAGAATAGCGGTGATGCAATTCGGGTTGAACAATCGAAGTATATCATGGTATCTGACAATAGCATTTACCGTTCTGGTTCTGCTGGTATCACAGTGTACCATTCATCACAACGTTGTTCTATTATAGGAAACACTATCGATGGTTGGGGTTACACTATCCAAGCGTATTGTATCCGATCGCACGGCGGGAAGTATTACATATGTCGAGAGTTCCCCGACGCAACTCACGCAGTATTACCTACTGACCCTAGTACTGTGTCTTGGATTATAGAGTGCCCATACAACCTCACAGGGATTGATACGTCGACTATTTTACCTTATAGTTCTACTGATTATTACTCTAGCGGTTCGTCCACTGGTATCCTCCCGTTCCGAGGGTCAAGTGCTATATCAGTAACAAGTTCATCTTACGCAGTGAAAATTATAGGTAATATATGTAATGGTAACACGAGCAAAGATGCTAGTAATAAGTACCACACCGCGAGCGAACATGGATACTCAAATAAACATACAGTGAATAGTCCGGTTGGCGTTACAGGTGATTCGAATACTGTGTCAGGTAATGCGTTTTCTAATTGTCAAGGGCATGAACTATACGCGGGTGAATATCAAGACCCCATAAACCAACGCGGGAAATCAGGCTTGCAATATATCAGTGATGATAATTCATACAGCGCACATCGCGGCCATGGTAAGAATACTGACAAATATTATACGATTCATGACAACTTATCAATAACATCAGGCGGGGGTGAGTTCACCCCAACATTGACGCCGTCAACATCAGGTAGCATTACATTGACGGGGGCGTACAACGCTTTATCGTGGTATCGTGTCGGACAGATGGTGACAATATCAGGCCAAATACGCGTGGGGAGTGTTAGCTCGCCAGTAGGCGGCGTCAAGATTGAAGGTCTACCATTTACACAATTGAACCTTGCTGATGGGGCTGAACGCGTGGCGGGTGTGGCGTTATGCAATAATGTAGAAGCGGCCACGCCGCCTATCACCCAATTCTTTGTATCGGGAGCGGGTAATGTTATGTGGATATCAGGAACAACAGGAACAACAACACGCAACATCGGCGATTTGATAAAATCTGGAACCGTCATCGATGTGAACTTCACATATCGTACACAGATATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
34e1c83328c0143a60c3305b5c4bd8d8ee2601f0a615ec0ae9019b46dd2dbade
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7266
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50