GenBank accession
YP_009910526.1 [GenBank]
Protein name
tail spike protein
RBP type Evidence Probability
TF GenBank 1,00
TF Phold 1,00
TSP DepoScope 1,00
TSP RBPdetect 0,91
TSP RBPdetect2 0,95
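The RBP type calls above can be tallied per class. The following is a minimal sketch, not the way the database itself combines evidence (the record does not describe that). The helper names `to_float` and `summarize` are illustrative, and the decimal commas are converted to points before parsing.

```python
from collections import defaultdict

# (RBP type, evidence source, probability as written in the record)
calls = [
    ("TF", "GenBank", "1,00"),
    ("TF", "Phold", "1,00"),
    ("TSP", "DepoScope", "1,00"),
    ("TSP", "RBPdetect", "0,91"),
    ("TSP", "RBPdetect2", "0,95"),
]

def to_float(text: str) -> float:
    """Convert a decimal-comma number such as '0,91' to a float."""
    return float(text.replace(",", "."))

def summarize(rows):
    """Return {rbp_type: (number of sources, mean probability)}."""
    grouped = defaultdict(list)
    for rbp_type, _source, prob in rows:
        grouped[rbp_type].append(to_float(prob))
    return {t: (len(p), sum(p) / len(p)) for t, p in grouped.items()}

for rbp_type, (n, mean_prob) in summarize(calls).items():
    print(rbp_type, n, round(mean_prob, 3))
```

A plain mean treats every source as equally reliable, which is an assumption of this sketch only.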
Protein sequence
MSNVKKVGILPTDPYRRYLPSAFDESMNIYEQIITCIEYVNNLGISFNELVDWLDKVVLQQNEKLKEQDQKIDKLRDEWHIFEDYIVNILLKEKVVEILKEWLADGTLAEIINKDVFDMKADTEWVKSEFTKRGVHYKDFGAKLDGITDDSDAIIAAHNYANEHNYPVIVKNEKFVLNKNVTVKTSTDLTGSILTTTYVDPATIEYERTFNLFNIEGADLIDLSTRAINPEFIKGATRIPSLKNQPSGALIIKTEQTDIIRDNGGVQSNIMKAECNIMMKNQYGDLAYPLTKNYVDALKFQVFLRPFEHQLEFKFPKVEVKGRIYGIAKVHRNNTSFSGLLMEEINPSATISSIYTLFEYEDCADMEATNISCPIIGREVKTGENGLGYFLLMTRSAKFRGSNLQQISGWSGINGNWMRDISVVDSNMLVVGGHANVYDLTVDRSVIQKNIIAHGGGVIQLLNSQVIGSASPPNNLSGTGAVQTRWDYDGEFEGEIIVENVVLHNASYVVEYSPSTYNCGRTIVLPKTTIRNVHMRNLLKKKGAGVWFRGYRGEYAGNYPQVTIDSLSWDFVGTYTTRFVEFESDVANSLATNKDFKFYFRNIHPPRLSYGDVFNPITAFIHVPKVTNNDTVVYYDIQNCTVNMGLGSTANLDVTIDNSDFYAVNLLAPESVTTNGQPAFINVKNSTVHRGVTNFNVGANTYNRVRLTIASSIFKRLRKTDGSYDPQIGFPIEDFVSYTADNIADARAEIRGDNSARLFGYIDETIWKIKKDPLRIFV
Physico-chemical properties
protein length: 778 AA
molecular weight: 87797,34700 Da
isoelectric point: 5,56536
aromaticity: 0,10540
hydropathy: -0,26272
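The aromaticity and hydropathy values above look like two common sequence-derived indices: the fraction of F, W and Y residues, and the mean Kyte-Doolittle hydropathy (GRAVY). The record does not say which tool produced them, so the following is a minimal sketch under that assumption. The function names are illustrative, and the example peptide is only the first 30 residues of the protein sequence above.

```python
# Kyte-Doolittle hydropathy scale (per-residue values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)

def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)

# First 30 residues of YP_009910526.1, used only as a short demo input.
prefix = "MSNVKKVGILPTDPYRRYLPSAFDESMNIY"
print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Running the same functions on the full 778-residue sequence would give values comparable with the record, provided the record uses the same definitions.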

Domains [InterPro]
DC_0396 ATT 1–225
Coil Unmapped 58–78
IPR011050 STR 138–550
YP_009910526.1 1–778
Architecture
ATT 1-225 | STR 226-550 | RBD 552-778
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
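The architecture string can be parsed into labelled spans and checked for coverage of the 778-residue protein. This is a minimal sketch assuming 1-based inclusive coordinates and the `LABEL start-end | ...` layout shown above. The function names `parse_architecture` and `uncovered` are illustrative.

```python
import re

def parse_architecture(text: str):
    """Parse 'ATT 1-225 | STR 226-550' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        # Accept either a hyphen or an en dash between the coordinates.
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments

def uncovered(segments, length: int):
    """Return 1-based residue positions not covered by any segment."""
    covered = set()
    for _label, start, end in segments:
        covered.update(range(start, end + 1))
    return [pos for pos in range(1, length + 1) if pos not in covered]

segments = parse_architecture("ATT 1-225 | STR 226-550 | RBD 552-778")
print(segments)
print(uncovered(segments, 778))  # [551]
```

With the coordinates as given, residue 551 falls between the STR and RBD spans and is not assigned to any segment.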

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: an N-terminal domain, a central domain, and a C-terminal domain.

Domain Layout (YP_009910526.1, residues 1–778)
Domain Start End Length (AA) Confidence
N-terminal 1 150 150 0,9921
Central domain 151 767 618 0,9857
C-terminal 768 778 10 0,4471
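Span lengths can be cross-checked from the Start and End columns. The sketch below assumes 1-based inclusive coordinates, which the record does not state. Under that assumption the three spans evaluate to 150, 617 and 11 residues. The Length column (618 and 10 for the last two rows) therefore appears to place the boundary between the central and C-terminal segments one residue later than the Start and End columns do.

```python
def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive span."""
    return end - start + 1

# (domain, start, end) as listed in the segmentation table.
rows = [
    ("N-terminal", 1, 150),
    ("Central domain", 151, 767),
    ("C-terminal", 768, 778),
]

lengths = {name: inclusive_length(start, end) for name, start, end in rows}
print(lengths)                # per-domain lengths
print(sum(lengths.values()))  # 778, the full protein length
```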
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-150
Central 151-767
C-terminal 768-778

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage DK3 [NCBI] 2500810 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Bacillus cereus [NCBI] 1396 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

GenBank protein accession
YP_009910526.1 [NCBI]
GenBank nucleotide accession
NC_049969.1 [NCBI]
CDS location
range 19184 -> 21520
strand +
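The CDS range can be checked against the protein length. This is a minimal sketch assuming 1-based inclusive nucleotide coordinates and a terminal stop codon that is not translated. The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1

def encoded_residues(nt_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nt_length // 3
    return codons - 1 if has_stop else codons

n = cds_length(19184, 21520)
print(n, encoded_residues(n))  # 2337 778
```

The result, 2337 nt or 779 codons including the stop, agrees with the 778 AA protein length reported above.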
CDS
ATGAGTAATGTTAAAAAAGTTGGTATATTACCAACAGACCCTTATCGTAGATATTTACCAAGCGCTTTTGATGAATCAATGAATATTTACGAACAAATTATTACATGTATTGAATATGTAAACAATCTAGGTATTTCTTTCAATGAACTTGTAGACTGGCTAGATAAAGTTGTATTACAACAAAATGAAAAATTAAAAGAACAAGATCAAAAGATTGACAAGTTACGTGATGAGTGGCATATCTTTGAAGATTACATTGTGAATATTCTTTTAAAAGAAAAAGTAGTAGAAATTTTAAAAGAATGGTTAGCTGATGGAACCTTAGCAGAAATTATCAATAAAGATGTGTTTGATATGAAAGCCGATACAGAATGGGTAAAATCTGAATTCACTAAACGTGGTGTTCACTATAAAGATTTTGGAGCTAAATTAGATGGTATCACAGATGATTCAGACGCTATCATAGCAGCTCATAATTACGCAAATGAACATAACTATCCTGTTATCGTAAAGAATGAGAAGTTTGTTTTAAATAAAAACGTTACTGTAAAAACTTCTACTGATTTAACAGGAAGTATTTTAACAACAACTTATGTTGACCCGGCTACCATTGAATATGAAAGAACATTTAACTTATTCAATATTGAAGGTGCTGATTTAATTGATTTATCAACTAGAGCTATTAATCCCGAATTCATTAAAGGAGCAACTAGAATTCCTAGTTTGAAAAATCAACCTTCTGGCGCTTTAATTATTAAAACGGAACAAACTGATATTATTCGTGATAATGGTGGGGTCCAATCTAATATTATGAAAGCTGAATGTAATATTATGATGAAAAACCAATATGGTGATCTTGCGTATCCATTAACGAAAAACTATGTTGATGCTTTAAAATTCCAAGTATTCTTGAGACCTTTCGAACATCAATTAGAATTCAAATTTCCTAAAGTTGAAGTAAAAGGTCGCATTTATGGTATTGCTAAAGTTCACCGAAATAACACATCATTTAGTGGATTATTAATGGAAGAAATTAATCCTAGTGCTACAATATCAAGTATTTATACATTGTTTGAATATGAAGATTGTGCTGATATGGAAGCAACAAATATTTCGTGTCCGATTATTGGTAGAGAAGTTAAAACTGGTGAAAATGGTTTGGGTTACTTCTTGTTAATGACTCGATCAGCTAAATTTAGAGGTTCAAACCTTCAACAAATTTCTGGTTGGTCTGGTATCAATGGTAACTGGATGAGAGATATTAGTGTAGTTGATTCTAATATGCTTGTTGTTGGTGGACATGCGAATGTTTATGATTTAACTGTTGATCGTTCGGTAATACAGAAAAACATTATTGCTCATGGTGGAGGGGTAATACAGTTACTTAATTCTCAAGTTATTGGTAGTGCTAGTCCTCCGAATAACTTATCTGGTACAGGAGCTGTGCAAACACGATGGGATTATGATGGCGAATTTGAAGGTGAGATCATTGTTGAAAACGTGGTACTTCATAATGCTTCTTATGTTGTTGAATATAGTCCTTCTACTTATAACTGTGGTAGAACAATAGTATTACCGAAAACAACAATTAGAAATGTTCATATGAGAAACCTTCTTAAAAAGAAAGGCGCTGGTGTTTGGTTCCGTGGTTACCGTGGAGAATATGCCGGTAACTATCCACAAGTAACGATTGATTCTCTATCTTGGGATTTCGTGGGAACATACACAACAAGATTTGTTGAATTTGAAAGCGATGTTGCAAATAGTTTAGCTACGAATAAAGATTTCAAGTTTTACTTTAGAAATATTCACCCACCACGCTTGTCATATGGTGATGTATTCAATCCAATCACAGCTTTCATTCATGTTCCTAAAGTAACAAATAATGATACAGTGGTGTATTATGATATTCAAAACTGTACTGTTAATATGGGGCTTGGATCGACTGCAAACTTAGATGTCACAATTGATAATTCTGATTTCTATGCTGTTAACTTATT
AGCTCCTGAAAGTGTTACAACGAATGGGCAACCAGCATTCATTAATGTTAAGAATTCTACAGTTCATCGTGGTGTAACTAACTTTAATGTTGGCGCAAACACTTATAACCGTGTTCGTTTAACAATTGCTAGTTCTATCTTTAAAAGATTAAGAAAGACTGATGGTTCTTATGATCCACAGATTGGTTTTCCTATTGAAGATTTCGTTTCTTATACCGCTGATAATATTGCTGATGCTAGAGCTGAAATACGTGGGGATAACTCCGCTAGATTGTTTGGATACATTGATGAAACAATATGGAAAATTAAGAAAGATCCGTTGAGAATATTTGTATAG

Tertiary structure

PDB ID
8517d3e4bc4d4cc7111cdddf905d053437cb2d8cb020bd6e85a46fa58f23ffa3
Source ESMFold
Method ESMFold
Resolution 0,6677
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
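The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. This is a minimal sketch. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so assigning them to the lower band is an assumption of this example, and `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band of the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"

# Hypothetical scores, for illustration only (not taken from the model).
for value in (95.2, 81.0, 63.5, 42.0):
    print(value, plddt_band(value))
```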