Protein
View in Explore- Genbank accession
- CAH1082654.1 [GenBank]
- Protein name
- allantoicase
- RBP type
-
TSP
- Protein sequence
-
MATLHAPAFELPEILNTKTNLADARIGAQVIECSDDFFAEAKRMLQFEAPIFVEDKFDDYGKWMDGWETRRKRHAGYDWCIVKLGVSGKISALDIDTTFFTGNYPASASLEACYAPNDDLTGAKWQSILENTELGPSQHHIFMVNNDAIFTHIRLNIFPDGGVARLRVYGDVHIQVTDHEQTLDLLALENGGRVIAYSDAHFGHPRNLINPGRGVNMGDGWETKRRRAPGYDWCILALGKSGKIEKIEIDTAHFKGNFPAEVSIQAVYLENATDAQLIPQSMFWSYLLEAQPMQMDHIHEYMNEILQHEKASHIRINMIPDGGISRVRLWGKIAKS
- Physico‐chemical
properties -
protein length: 336 AA molecular weight: 37901,53650 Da isoelectric point: 5,59071 aromaticity: 0,09821 hydropathy: -0,31042
Domains
Domains [InterPro]
G3DSA:2.60.120.260
STR
12–173
STR
12–173
IPR005164
Unmapped
13–334
Unmapped
13–334
IPR005164
Unmapped
16–334
Unmapped
16–334
IPR005164
Unmapped
17–334
Unmapped
17–334
IPR008979
STR
17–172
STR
17–172
IPR005164
Unmapped
19–332
Unmapped
19–332
G3DSA:2.60.120.260
STR
20–173
STR
20–173
IPR015908
STR
27–172
STR
27–172
1
336
Architecture
STR 12-335 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
336
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 324 | 324 | 0,1484 |
| Central domain | 325 | 325 | 2 | 0,0070 |
| C-terminal | 326 | 336 | 10 | 0,9948 |
Note: Constraints were applied during segmentation.
Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-324
1-324
Central
325-325
325-325
C-terminal
326-336
326-336
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Acinetobacter phage MD-2021a [NCBI] |
2899278 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
CAH1082654.1
[NCBI]
Genbank nucleotide accession
CAKLQH020000008
[NCBI]
CDS location
range 17629 -> 18639
strand +
strand +
CDS
ATGGCAACATTGCACGCTCCAGCTTTTGAACTTCCTGAAATTCTGAATACTAAAACAAATTTGGCAGATGCCCGAATTGGTGCACAAGTGATTGAGTGTTCAGATGATTTCTTTGCAGAAGCAAAGCGTATGCTGCAATTTGAAGCACCTATTTTTGTTGAAGATAAATTTGATGATTATGGCAAATGGATGGACGGTTGGGAGACCCGCCGTAAACGTCATGCAGGCTATGACTGGTGTATTGTGAAACTAGGTGTGAGTGGAAAAATCAGTGCACTTGATATCGACACCACGTTTTTTACAGGAAATTATCCTGCATCTGCCTCATTAGAGGCATGTTATGCACCAAATGATGACCTTACTGGGGCAAAGTGGCAGAGTATTTTAGAAAATACCGAGCTAGGGCCGAGTCAACATCATATTTTTATGGTCAATAATGATGCAATTTTCACGCATATACGCCTCAATATTTTCCCAGATGGTGGTGTTGCCCGTTTACGCGTTTATGGTGACGTTCATATTCAGGTGACCGACCACGAGCAGACTCTCGATTTATTGGCCTTAGAAAATGGCGGTCGTGTAATTGCTTATAGCGATGCGCACTTTGGACATCCACGTAATTTGATTAACCCAGGCCGTGGCGTCAACATGGGCGATGGGTGGGAAACCAAACGCCGCCGTGCACCAGGTTATGACTGGTGTATTCTTGCATTGGGTAAAAGCGGAAAAATTGAAAAAATTGAAATTGATACGGCGCATTTTAAAGGTAACTTTCCTGCTGAAGTTTCTATTCAGGCTGTCTACCTTGAAAATGCAACCGATGCACAGCTGATTCCACAAAGTATGTTTTGGTCTTACTTACTTGAAGCCCAACCTATGCAAATGGACCATATTCATGAATATATGAATGAAATTTTACAGCATGAAAAAGCCTCGCATATCCGTATTAATATGATTCCGGATGGTGGTATTAGCCGTGTCCGTTTATGGGGAAAAATTGCCAAGTCATGA
Genome Context
Genome Context
Tertiary structure
PDB ID
f25ffae038099fb628b6cadda48ab99c46afed3703614af26b44de2c627e88be
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50