Genbank accession
CAH1081742.1 [GenBank]
Protein name
sulfate
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MSQRSRVLPGFGLSLGFTLAYVSFIVLIPLAAVFIKSFGIGWDGLWEILTSERILKSLQLSFSSALIAAFINVVFGLLLAWCLVRYNFPGKRLVDALVDLPFALPTAVAGIALTSLYAPTGWIGQYLEPLGIQVAYTPIGITLALVFIGIPFIVRTVQPVLSDIETELEEAASALGANRWQTITKIILPILLPALFTGFALAFARGVGEYGSVIFIAGNQPFKTEIAPLMIISRLEEYDYAGATTIAAVMLVLSFIILFVINLLQAWANRRTGRNVT
Physico‐chemical
properties
protein length:277 AA
molecular weight: 30114,32610 Da
isoelectric point:7,80172
aromaticity:0,11913
hydropathy:0,87545

Domains

Domains [InterPro]
IPR005667
Unmapped
5–272
IPR005667
Unmapped
5–266
IPR011865
Unmapped
7–270
G3DSA:1.10.3720.10:FF:000004
Unmapped
12–270
IPR035906
STR
12–271
IPR035906
STR
14–264
IPR035906
STR
20–271
IPR000515
STR
54–261
IPR000515
STR
58–255
IPR000515
STR
73–272
CAH1081742.1
1 277
Architecture
STR
STR 12-272 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
CAH1081742.1
1 277
Domain Start End Length (AA) Confidence
N-terminal 1 273 273 0,9276
Central domain 274 273 1 0,0640
C-terminal 274 277 3 0,9415
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-273
Central
274-273
C-terminal
274-277

Taxonomy

  Name Taxonomy ID Lineage
Phage Acinetobacter phage MD-2021a
[NCBI]
2899278 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CAH1081742.1 [NCBI]
Genbank nucleotide accession
CAKLQF020000007 [NCBI]
CDS location
range 114905 -> 115738
strand +
CDS
ATGTCGCAGCGATCCCGAGTGCTGCCTGGGTTTGGTCTATCATTGGGCTTCACCCTCGCTTACGTTTCTTTTATTGTGCTTATTCCATTAGCCGCAGTCTTTATCAAATCATTTGGAATCGGATGGGACGGACTATGGGAAATTTTAACCTCTGAACGTATTTTAAAATCGCTTCAACTCAGCTTTAGTTCAGCCTTAATTGCGGCGTTTATTAATGTTGTTTTCGGGCTACTTTTAGCTTGGTGCCTTGTCCGTTATAACTTTCCCGGAAAGCGTCTGGTTGATGCCTTAGTGGACTTACCTTTTGCACTTCCAACAGCAGTTGCAGGTATTGCGTTGACCTCACTCTATGCGCCTACAGGCTGGATCGGTCAATATTTAGAACCCCTTGGTATTCAGGTTGCATATACCCCCATCGGGATTACGCTAGCTTTAGTGTTTATCGGCATTCCTTTTATTGTTCGAACCGTTCAACCGGTATTAAGCGATATTGAAACTGAACTTGAAGAGGCGGCTTCCGCACTCGGTGCAAATCGTTGGCAGACCATTACCAAAATTATTTTACCAATATTACTCCCTGCCCTATTTACAGGCTTTGCTTTAGCATTTGCTCGCGGTGTAGGTGAATATGGTTCTGTAATTTTCATTGCAGGTAACCAGCCGTTTAAAACTGAAATTGCTCCGCTTATGATCATTTCTCGCCTCGAAGAATATGACTATGCAGGTGCAACGACTATTGCAGCAGTGATGTTGGTTCTCTCTTTCATTATTTTATTTGTCATTAACTTACTTCAAGCATGGGCAAACCGTCGTACAGGGAGAAATGTCACATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
02be7a78604058f67edab4eb3737cd03df15c3014e20e9364438510602c7ea59
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,9614
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50