Protein
View in Explore- Genbank accession
- AGK86857.1 [GenBank]
- Protein name
- tail spike protein
- RBP type
-
TFTFTSPTSPTSP
- Protein sequence
-
MTNGWMYFIDFDQFGIKDDGTDATSTTSGFVAAFKRAVELGHSAVYVPEGTYLIDAVGVGDYLPEYGGGLQFPSNIEVILHEKALFKVQPNDSTGYACFNLEGVENVTIRGGHIVGDRHEHNYRQDVNENRRTHEWGFGIQVRGSKNVTIENVTIEDCTGDNIWVTSKGMMNWPGVYIPSESVTIRKCRTLRGRRNNIAAGASVGLLIDDCDIIEAGGDEIGPQLGIDLEGYADNSIKYQHPYEINVINCRFKDNGRGSMNINVSGKVNAIGNFCDDYIGYGFSTDVTISNNVITNETGVHKKFGIDSIRKSTSETANRAVVTGNVIRGFQTGIAARGKTVTVSNNILEDISSIGIYPYLCDQAVVSSNIIDSDCLHIWVRESKDIKVSDNKGTGAANNVSIKVDASKDVLLSDNEVSGKGGVRVSRSTNVRIVDNDIDMIGPDYGIYFDKQSEVHLRDNLVKNAAFTAIRGYADQYSSYIKGNIIQDCKYMIAIHIDGGSKHMIKDNDITFRRGSNAGYGVYLIGANDSRLHNNDIRVMDGFGLINSFYTIQSTNTKLIGNTYDTGEMKTNDTDFLRYNEKLPK
- Physico‐chemical
properties -
protein length: 585 AA molecular weight: 64398,14100 Da isoelectric point: 5,39302 aromaticity: 0,08718 hydropathy: -0,34889
Domains
Domains [InterPro]
IPR012334
STR
12–374
STR
12–374
IPR011050
STR
29–418
STR
29–418
IPR051550
Unmapped
86–539
Unmapped
86–539
IPR006626
Unmapped
104–144
Unmapped
104–144
IPR039448
ENZ
136–272
ENZ
136–272
DC_0453
RBD
359–580
RBD
359–580
IPR012334
STR
392–581
STR
392–581
1
585
Architecture
STR 12-581 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
585
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 23 | 23 | 0,9394 |
| Central domain | 24 | 574 | 552 | 0,9957 |
| C-terminal | 575 | 585 | 10 | 0,4471 |
Note: Constraints were applied during segmentation.
C-terminal too short, adjusted boundary
C-terminal too short, adjusted boundary
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-23
1-23
Central
24-574
24-574
C-terminal
575-585
575-585
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Bacillus phage SIOphi [NCBI] |
1285382 | Uroviricota > Caudoviricetes > Herelleviridae > Siophivirus > Siophivirus SIOphi |
| Host |
Bacillus subtilis [NCBI] |
1423 | cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
AGK86857.1
[NCBI]
Genbank nucleotide accession
KC699836.1
[NCBI]
CDS location
range 34503 -> 36260
strand +
strand +
CDS
TTGACTAATGGGTGGATGTACTTCATTGATTTTGATCAGTTCGGTATCAAGGATGACGGGACAGATGCTACATCAACAACTAGTGGGTTCGTAGCAGCGTTTAAAAGGGCCGTAGAACTTGGTCACTCTGCTGTATATGTACCAGAAGGTACTTACTTAATTGACGCTGTAGGCGTTGGTGACTACCTCCCTGAGTACGGAGGAGGTCTTCAGTTCCCGTCCAACATTGAAGTTATCCTACATGAGAAAGCCCTTTTTAAAGTCCAACCGAATGACTCTACCGGGTACGCTTGTTTTAACTTAGAAGGCGTAGAAAATGTAACAATCCGAGGCGGACATATTGTAGGTGATCGTCATGAACACAACTACAGACAAGACGTTAACGAGAACAGACGTACTCACGAATGGGGATTCGGTATCCAAGTTCGTGGAAGTAAAAATGTTACAATCGAAAATGTAACAATTGAAGACTGCACCGGAGATAACATCTGGGTAACATCTAAAGGTATGATGAACTGGCCGGGAGTGTACATTCCATCTGAAAGCGTAACAATCAGGAAATGTAGAACACTTAGAGGAAGACGAAATAACATTGCTGCCGGAGCAAGCGTAGGTCTACTTATCGATGACTGCGATATTATTGAAGCAGGCGGGGACGAAATTGGACCGCAGCTAGGTATTGACCTTGAAGGATACGCAGACAATAGCATCAAGTACCAGCATCCTTACGAGATAAATGTGATTAACTGCCGATTTAAAGACAATGGAAGAGGTTCTATGAACATCAACGTGTCTGGTAAAGTTAATGCTATCGGTAACTTCTGCGATGATTACATCGGTTACGGGTTTTCAACTGATGTAACTATTAGTAACAATGTTATCACAAACGAAACAGGGGTTCACAAAAAGTTCGGGATAGACTCTATTCGTAAGTCTACTTCGGAAACAGCTAACAGAGCAGTTGTTACTGGCAATGTAATTCGTGGATTTCAAACGGGTATCGCTGCTAGAGGTAAAACGGTTACTGTAAGCAACAACATCCTAGAAGATATTAGTTCTATTGGAATCTACCCTTATCTATGCGATCAAGCAGTAGTATCGAGTAATATCATAGACAGCGACTGTCTACACATTTGGGTTAGAGAATCGAAGGATATAAAAGTTAGTGACAATAAAGGTACTGGCGCAGCTAACAATGTATCTATAAAAGTAGACGCCTCTAAAGACGTATTACTAAGCGATAATGAAGTTTCTGGAAAAGGTGGAGTTCGAGTAAGTCGCTCAACAAATGTGAGAATTGTGGATAATGACATTGATATGATCGGCCCAGACTACGGCATCTACTTTGATAAACAGTCTGAAGTACATCTTAGAGATAACCTTGTTAAAAACGCCGCTTTCACCGCAATCAGGGGTTATGCAGATCAGTACAGCAGCTACATAAAAGGAAACATCATTCAAGATTGTAAGTACATGATCGCTATTCATATTGACGGTGGGTCAAAACATATGATTAAAGACAATGATATTACATTCCGCAGAGGGTCTAATGCAGGCTACGGTGTCTATTTAATCGGAGCAAATGACTCTCGTTTACATAATAATGACATTAGAGTAATGGATGGGTTTGGCCTTATCAACTCTTTCTACACGATCCAGTCTACGAACACTAAGTTGATAGGGAATACATATGATACAGGTGAAATGAAAACAAACGATACTGACTTCTTACGATACAACGAGAAGCTACCTAAGTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
191e6ba4fac17a0d1f37e83350f2153bf111f300fad8da5638cc01cc29d76946
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50