Genbank accession
AKO59190.1 [GenBank]
Protein name
putative carbohydrate binding protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MTAKPGNARFALAASADFNGSIDAVSVSQVLSLAAFRSADGAWWEIAEPVVNVLAFGARPDNATNAGPSFQEALDYIAAVGYNRELYIPPVGTYLFTSGVTIGDGSAQRLALTIRGGGSGTRLVHSADKALFTILGDCTEFIARDFEVWAGVDQNDPTKGVFYIPNGLSRSSFYDIQGQQNGPNTRLSSFFYSEAGVPMDEITFHNIVEVHNHTGFRLGAGSSVWFVGGRSVGNYPATVCTGLDIVGGMGGVFVWGTDFIINHYNARVRNLTGSTNREIFFSQSCFDGGFIGLYIEDESYVNIVGCWAASADQACIECSFSGNGGVLNISGGTVYNAGVVSLNAGDKIGVIFGGLGRIDMSGVTVRENRNRGIYLTNPARTQFSNIRANNFFGNGTTGTNPCDIFVSGAAVVENNTVTNGIIRGSGDTIIRNNVGDAATITTPTIPASGTPITNQTGRTVEVFISGGTVSQVTKRGVPVFSSSNVSLLLQPGDAIAVTYTAAPTWAWVLP
Physico‐chemical
properties
protein length:510 AA
molecular weight: 53695,29540 Da
isoelectric point:5,04033
aromaticity:0,10196
hydropathy:0,08784

Domains

Domains [InterPro]
IPR011050
STR
41–328
IPR011050
STR
50–328
IPR012334
STR
51–418
AKO59190.1
1 510
Architecture
STR
RBD
STR 41-418 | RBD 419-509 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
AKO59190.1
1 510
Domain Start End Length (AA) Confidence
N-terminal 1 65 65 0,9822
Central domain 66 435 371 0,9967
C-terminal 436 510 74 0,9833
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-65
Central
66-435
C-terminal
436-510

Taxonomy

  Name Taxonomy ID Lineage
Phage Brucella phage 11sa_141
[NCBI]
1667370 Uroviricota > Caudoviricetes > Perisivirus >
Host Brucella abortus S19
[NCBI]
430066 Pseudomonadota > Alphaproteobacteria > Hyphomicrobiales > Brucellaceae > Brucella > Brucella abortus

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AKO59190.1 [NCBI]
Genbank nucleotide accession
KJ133691 [NCBI]
CDS location
range 23602 -> 25134
strand +
CDS
ATGACGGCAAAACCAGGCAATGCGAGGTTTGCCCTTGCGGCATCGGCTGATTTTAACGGGAGCATAGATGCTGTTTCGGTGTCCCAGGTTTTGAGCCTCGCAGCATTCAGATCAGCCGATGGGGCTTGGTGGGAAATTGCTGAACCAGTTGTCAATGTATTGGCATTTGGTGCAAGGCCCGACAATGCGACAAACGCTGGGCCGTCGTTTCAGGAAGCCCTCGATTACATTGCAGCCGTTGGGTACAACCGTGAACTTTATATACCGCCTGTTGGCACATATTTGTTCACATCAGGTGTCACGATTGGTGACGGGTCCGCGCAGAGGCTAGCCCTCACTATTCGCGGAGGTGGCTCCGGAACTCGGTTGGTTCACAGCGCTGATAAAGCATTGTTCACAATACTAGGTGATTGCACAGAGTTTATAGCGCGTGATTTTGAGGTGTGGGCGGGGGTCGACCAAAATGATCCGACTAAGGGCGTCTTCTACATACCTAATGGCCTATCACGTTCATCGTTTTACGACATTCAGGGGCAGCAAAACGGGCCGAATACTCGATTGTCGTCATTCTTCTATAGTGAAGCTGGCGTACCTATGGATGAAATTACATTCCACAATATTGTGGAGGTTCATAATCACACTGGTTTTCGGCTAGGTGCCGGGTCGTCAGTGTGGTTCGTTGGGGGGAGGTCAGTCGGCAACTATCCTGCCACAGTATGCACAGGTCTTGATATAGTCGGCGGAATGGGTGGGGTATTTGTTTGGGGAACAGACTTCATTATCAACCATTATAACGCTCGGGTAAGAAACCTAACTGGTTCCACGAACCGCGAAATTTTCTTTTCTCAATCATGCTTTGATGGTGGATTTATCGGTCTATACATCGAAGACGAGAGCTATGTGAATATTGTTGGATGTTGGGCGGCGTCTGCCGACCAAGCTTGCATTGAATGCTCTTTCTCAGGAAACGGCGGTGTCTTGAACATCTCCGGCGGTACAGTCTATAACGCCGGAGTTGTATCACTTAATGCTGGAGACAAAATCGGCGTTATTTTCGGCGGCTTGGGACGTATTGATATGTCTGGGGTTACAGTACGTGAAAACCGCAACCGAGGGATATACTTGACAAATCCAGCAAGGACGCAGTTCTCTAACATTCGGGCGAATAATTTCTTTGGTAATGGTACAACAGGTACAAACCCATGCGATATATTCGTCAGTGGGGCAGCGGTTGTCGAAAATAACACTGTTACCAACGGGATTATACGTGGTTCGGGCGATACTATAATTAGAAATAATGTCGGGGATGCTGCGACTATAACAACACCGACTATACCCGCAAGCGGCACACCGATAACAAATCAAACGGGCCGTACCGTTGAAGTTTTCATATCTGGGGGGACGGTTTCGCAGGTAACTAAACGGGGCGTTCCTGTATTTTCTTCGTCAAACGTTAGTCTACTACTACAACCCGGCGATGCCATAGCGGTTACGTACACAGCGGCCCCAACATGGGCATGGGTGCTGCCTTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
3122cee2ce48f6deb869ea47a0f5a4c0467872bfa9eddc5bf37413820539286b
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8684
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50