GenBank accession
UKL29997.1 [GenBank]
Protein name
NeuD/PglB/VioB family sugar acetyltransferase protein
RBP type
TSP
Evidence DepoScope
Probability 0.97
Protein sequence
MKKIVIFGSGGFAREVHQIIDDINFHIELANIQYEFLGFLDGNAENHGKEVHGYPVLGGINWLVENPSVEVVVAIGNPAVKRKVVSEIKSRTSNSFVTLIHPSVIIGQNVTVGEGSVLCANTTITTDISIGEHVILNLDCTVGHDAVIEDYVTAAPSVNVSGNVRVGEGCDLGTNSVVIQGKEIGEWSIVGAGAVVIRDVPANTTSVGNPSKVIKEREEHWQLQY
Physico-chemical properties
protein length: 225 AA
molecular weight: 24105.98610 Da
isoelectric point: 5.09246
aromaticity: 0.05778
hydropathy: 0.11956
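
The length, aromaticity, and hydropathy values listed above can be recomputed from the protein sequence alone. The sketch below assumes that aromaticity is the fraction of F, W, and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the record does not state which definitions were used, and the function names are illustrative rather than part of the source database.

```python
# Kyte-Doolittle hydropathy scale (assumption: the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of UKL29997.1, copied from the record.
SEQUENCE = (
    "MKKIVIFGSGGFAREVHQIIDDINFHIELANIQYEFLGFLDGNAENHGKE"
    "VHGYPVLGGINWLVENPSVEVVVAIGNPAVKRKVVSEIKSRTSNSFVTLI"
    "HPSVIIGQNVTVGEGSVLCANTTITTDISIGEHVILNLDCTVGHDAVIED"
    "YVTAAPSVNVSGNVRVGEGCDLGTNSVVIQGKEIGEWSIVGAGAVVIRDV"
    "PANTTSVGNPSKVIKEREEHWQLQY"
)


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print("length:", len(SEQUENCE))
    print("aromaticity:", round(aromaticity(SEQUENCE), 5))
    print("hydropathy:", round(gravy(SEQUENCE), 5))
```

Under these assumptions the computed values agree with the listed length, aromaticity, and hydropathy. Molecular weight and isoelectric point need residue mass and pKa tables and are left out of this sketch.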

Domains

Domains [InterPro]
Accession         Type      Range
IPR011004         STR       1–214
G3DSA:3.40.50.20  Unmapped  1–91
IPR041561         ATT       3–87
IPR020019         STR       3–212
IPR020019         STR       6–210
IPR050179         Unmapped  47–219
Architecture
ATT 1–87 | STR 88–224
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

The protein is segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout
Domain          Start  End  Length (AA)  Confidence
N-terminal      1      41   41           0.0112
Central domain  42     214  174          0.1916
C-terminal      215    225  10           0.8395
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1–41
Central: 42–214
C-terminal: 215–225

Taxonomy

Role   Name                       Taxonomy ID  Lineage
Phage  Bacillus phage PK1 [NCBI]  2912239      Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host   Bacillus sp. [NCBI]        1409         cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

GenBank protein accession
UKL29997.1 [NCBI]
GenBank nucleotide accession
OM112209 [NCBI]
CDS location
range 38537 -> 39214
strand +
CDS
ATGAAAAAGATTGTTATATTTGGCAGCGGTGGTTTTGCGAGAGAAGTTCATCAAATTATTGATGATATTAACTTTCATATAGAGTTAGCAAACATTCAATATGAATTTTTAGGTTTTTTAGATGGGAACGCTGAAAATCACGGAAAAGAAGTGCATGGTTATCCTGTTTTAGGTGGTATTAACTGGCTTGTGGAAAACCCGAGTGTTGAAGTGGTTGTTGCCATCGGGAATCCTGCTGTAAAACGTAAGGTTGTGTCGGAAATTAAAAGTCGTACGAGCAACTCGTTTGTGACTTTGATCCATCCTTCAGTCATAATTGGGCAAAACGTTACAGTAGGAGAAGGTTCTGTGTTGTGCGCCAACACTACTATTACTACGGATATTTCAATAGGAGAGCACGTAATCTTAAATCTTGATTGTACAGTGGGTCATGATGCAGTTATAGAGGATTACGTCACAGCAGCACCTAGTGTTAATGTATCCGGGAATGTAAGAGTAGGTGAAGGTTGTGATTTGGGCACAAACTCTGTTGTCATTCAAGGTAAAGAAATCGGAGAATGGTCTATTGTAGGAGCGGGTGCGGTTGTGATTAGAGATGTGCCAGCAAATACTACATCAGTTGGAAACCCTTCAAAAGTCATAAAAGAAAGAGAAGAACACTGGCAATTACAATATTAA
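
The CDS coordinates and the protein length can be cross-checked with a little arithmetic and a codon-by-codon translation. The sketch below assumes the standard codon assignments (NCBI translation table 1, which shares its amino-acid assignments with the bacterial table 11); `cds_length` and `translate` are hypothetical helper names, not functions of the source database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first position varies slowest, third position fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a CDS given 1-based inclusive coordinates."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS on the + strand, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n = cds_length(38537, 39214)
    # Codons including the stop codon, then residues without it.
    print(n, n // 3, n // 3 - 1)
    # First eight codons of the CDS listed in the record.
    print(translate("ATGAAAAAGATTGTTATATTTGGC"))
```

The range 38537 -> 39214 spans 678 nucleotides, or 226 codons including the stop codon, which is consistent with the 225-residue protein. Passing the full CDS string to `translate` gives a direct comparison against the listed protein sequence.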

Tertiary structure

PDB ID
ab9cc6374868f3c38e1f5fa29fc5d0e7a84bf989596d52a5ec164971839b9a3b
Source ESMFold
Method ESMFold
Resolution 0.9621
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
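
The confidence bands above translate into a simple per-residue binning rule. The following is a minimal sketch; `plddt_category` is a hypothetical helper, and because the legend uses strict inequalities, assigning the exact boundary values 90, 70, and 50 to the lower band is an assumption.

```python
def plddt_category(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:  # assumption: exactly 90 falls into "High"
        return "High"
    if plddt > 50:  # assumption: exactly 70 falls into "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 falls into "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_category(score))
```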