Genbank accession
YP_009151108.1 [GenBank]
Protein name
tail spike protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TSP
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
TSP
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MLERPDYFMIGDDIKQRFVDILDNLTKVNNRNAYYAIPEAFGAVGDGVADDTKALQDTINAVQGTGAIIILRPGAKYKITATLEITGGIVFKGDSQNRPRIFSTTQTFTGINIAGTLVGSTSLAASATINTNYIDVADASNIKAGNLIEIVSNESWYHDAREESTDARRAELHRVESVTDKRVYLNDALFDSYVLPTETVTVTIWTPIRVDMRDMELALTKYGGNTDSIRKVGIQLDHTIDAHLENVYVTDAQNAGITIKHSYRPVVKGGITSGANNYFSGYGVQIVGCTLARVHDRFITASRRGVDVSGFSIPSHHTVVEGCTVVGSGYNSMGTKYGFLDNHGTGAYCGGIGTHGPADHTIIRNNNFQYLHTAIIDRSRNTVAEENYFIGDFAKPLIDASFGENNIYRNNICVDNLAGLKKTVVSDGGANINSRKAPAFIRMQATALANGSTGGFTHIEGNFAMVQDTFIEFYGEAADPTVPTLKNYTIKDNTVYFSPVAGADPAYFINNKTVNNASILMSGSIFKNNSWKRTSGSGAVYMFGKVDPRQATEVDGPKAYSFYMTDDSVSSVFLGNSNLFYARMIVDAGGSTGGAYGCVRIGQANTGTNDIGTSNNIAAVAGVPNGTTGTDGKLNLAIQDGILYVENRLGSTQRIMVTVMNAV
Physico‐chemical
properties
protein length:663 AA
molecular weight: 71378,01570 Da
isoelectric point:5,83966
aromaticity:0,08899
hydropathy:-0,16290

Domains

Domains [InterPro]
IPR012334
STR
39–542
IPR011050
STR
39–416
YP_009151108.1
1 663
Architecture
STR
RBD
STR 35-542 | RBD 543-663
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009151108.1
1 663
Domain Start End Length (AA) Confidence
N-terminal 1 50 50 0,9610
Central domain 51 542 493 0,9910
C-terminal 543 663 120 0,9706
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-50
Central
51-542
C-terminal
543-663

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage Mater
[NCBI]
1540090 Uroviricota > Caudoviricetes > Herelleviridae > Matervirus > Matervirus mater
Host Bacillus megaterium
[NCBI]
1404 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009151108.1 [NCBI]
Genbank nucleotide accession
NC_027366.1 [NCBI]
CDS location
range 83833 -> 85824
strand -
CDS
ATGTTAGAAAGACCAGATTACTTTATGATCGGGGACGATATTAAACAAAGGTTTGTTGACATCCTCGACAACCTTACCAAGGTAAACAATCGAAATGCCTACTACGCTATTCCTGAAGCATTCGGAGCTGTAGGGGACGGAGTAGCAGATGATACAAAGGCACTACAAGATACAATTAATGCAGTGCAAGGTACCGGGGCGATCATTATTCTTCGTCCCGGTGCTAAGTATAAAATCACAGCAACTTTAGAAATTACTGGTGGAATCGTTTTCAAAGGTGACTCACAGAACAGACCGAGAATTTTCTCCACTACTCAAACGTTTACAGGTATTAACATTGCTGGAACACTAGTTGGCTCTACCTCTCTAGCAGCTTCAGCTACTATTAATACAAACTATATTGACGTAGCAGATGCCTCTAACATTAAAGCAGGTAACTTAATTGAAATCGTGTCTAATGAGTCATGGTACCATGACGCTCGAGAAGAGTCAACAGATGCGCGTAGAGCGGAGCTTCACCGAGTAGAAAGCGTTACAGATAAACGAGTGTACTTGAATGACGCATTATTCGATAGCTATGTCCTCCCTACTGAAACAGTGACTGTAACTATTTGGACTCCTATTCGAGTAGATATGCGGGATATGGAGCTAGCGCTTACTAAGTACGGAGGTAACACGGATTCTATCCGTAAAGTAGGCATTCAGTTAGACCATACGATCGATGCTCACTTAGAGAACGTTTATGTAACGGATGCACAGAATGCAGGTATCACAATTAAGCATAGCTACCGTCCAGTAGTTAAGGGTGGTATCACTTCTGGAGCAAATAACTACTTCTCTGGTTACGGTGTTCAAATCGTAGGCTGTACACTAGCCAGAGTGCATGACCGCTTTATAACTGCTTCCAGACGTGGAGTAGACGTAAGTGGTTTCAGTATCCCTTCTCACCACACAGTTGTTGAAGGCTGTACAGTAGTAGGTAGCGGATACAATAGTATGGGTACAAAATATGGCTTCCTAGATAATCACGGTACAGGCGCTTACTGCGGAGGTATTGGTACTCACGGACCAGCAGACCATACGATTATTCGAAACAATAACTTCCAATACTTACATACAGCGATCATTGACCGCTCACGTAATACGGTAGCAGAAGAGAACTATTTCATTGGAGACTTTGCTAAACCACTTATCGATGCATCTTTCGGAGAGAACAACATCTACAGAAATAACATTTGTGTAGATAACTTGGCAGGTCTGAAGAAAACTGTAGTCTCTGATGGTGGAGCTAATATTAACTCACGTAAAGCCCCTGCCTTCATTCGTATGCAGGCTACAGCACTAGCTAATGGTTCTACAGGAGGGTTTACACACATCGAAGGTAACTTTGCAATGGTTCAGGACACATTCATCGAGTTCTATGGTGAGGCAGCAGACCCTACTGTACCAACTCTAAAGAACTATACAATTAAAGACAATACTGTATACTTCTCTCCAGTCGCAGGTGCAGACCCAGCTTACTTTATTAATAACAAGACAGTGAACAACGCCTCTATTCTAATGAGTGGTTCTATCTTTAAAAACAACTCTTGGAAGCGTACAAGCGGCTCTGGCGCTGTTTATATGTTTGGTAAAGTAGACCCTCGACAAGCTACGGAAGTCGATGGACCTAAAGCTTACTCGTTCTACATGACAGATGATTCAGTTAGCTCAGTCTTCCTAGGAAACTCTAACCTATTCTACGCTCGTATGATTGTCGATGCAGGGGGTTCTACAGGAGGAGCTTATGGTTGTGTACGTATTGGTCAGGCCAATACAGGGACCAATGACATCGGAACATCTAATAATATTGCTGCTGTAGCAGGAGTACCAAACGGAACTACAGGTACAGATGGTAAGCTAAACCTAGCTATCCAAGACGGTATACTCTATGTAGAAAATCGACTAGGCTCCACTCAGCGTATTATGGTTACCGTAATGAATGCAGTTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
b27c3cae6f655a6111b30bdc38527337955102e0c5342fd22450539c229ee93b
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7666
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50