Genbank accession
YP_007517572.1 [GenBank]
Protein name
tail spike protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence GenBank
Probability 1,00
TSP
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
TSP
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MGNATRSDNLGLYSWESQDRLSESIPQVSSNFQALDEILSRAWIDIKSYGYDAKGNYNLSTGNGTDDAPAFQKALDRAQNRTSSVTIFVPDGTYRLGSELRIYSNTAIIMSPNATLVRDHSKYLLFNGVRATDGGSPISGYNGDGNILIQGGTLTGQGNKQTAKASIVHFGHAQNITFDNVTLKDCSNSHHIEFNACKDVYVKNCKFLGWFGTVDTYNEAIQLDLATPELTTVAGGDYTPCKNVYIDNTYFGKSTTAGSKPIGRGIGSHSGAINRFHENIHVTNCTFDSTVEWACRAYAYQDFFFTNNKIVNCGRGVNVRSNISSDDKDTINADTGAQTGKSQNISRFVISNNTFSGRMDAGRAIEIYGEETGRIYMTTITGNVVNLSSFAGSANEVIYLNYVRYANISGNNVGGANVGTCIGLNGNTTEVTIGNNICAFGDRGIAVYGANVVMQNISILGNTIRGMQRSGIHLDSIDGFSCVGNTVFDCNKAGGDENHIRVVVGNKNGTVSGNLCTVACPTSIYVSNTNSRINVTGNVLSGGLTNNSSGGASSNNI
Physico‐chemical
properties
protein length:557 AA
molecular weight: 59561,13680 Da
isoelectric point:6,21156
aromaticity:0,08438
hydropathy:-0,28061

Domains

Domains [InterPro]
IPR024535
ENZ
59–317
IPR011050
STR
61–357
IPR006626
Unmapped
197–218
YP_007517572.1
1 557
Architecture
ATT
STR
RBD
ATT 1-41 | STR 44-440 | RBD 441-557
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_007517572.1
1 557
Domain Start End Length (AA) Confidence
N-terminal 1 49 49 0,9678
Central domain 50 546 498 0,9931
C-terminal 547 557 10 0,4125
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-49
Central
50-546
C-terminal
547-557

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage Curly
[NCBI]
2880541 Uroviricota > Caudoviricetes > Ehrlichviridae > Andromedavirus bolokhovo > Andromedavirus curly
Host Bacillus pumilus BL8
[NCBI]
1189615 Bacillota > Bacilli > Caryophanales > Bacillaceae > Bacillus > Bacillus pumilus

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_007517572.1 [NCBI]
Genbank nucleotide accession
NC_020479.1 [NCBI]
CDS location
range 18947 -> 20620
strand +
CDS
ATGGGTAACGCTACTAGATCAGACAACTTGGGCTTATACAGTTGGGAATCACAAGACAGACTTTCCGAGTCTATCCCTCAGGTAAGCTCAAACTTCCAAGCACTAGATGAAATATTGAGCCGTGCATGGATTGATATAAAGTCTTATGGCTATGACGCAAAAGGAAACTACAATTTAAGCACTGGAAATGGTACAGATGACGCTCCAGCATTCCAAAAGGCTTTAGACAGAGCACAAAACAGAACGAGTAGCGTCACTATCTTTGTGCCAGATGGAACATACCGACTAGGTAGTGAACTGAGAATATACAGCAATACAGCCATCATAATGAGCCCTAACGCCACCCTAGTCAGAGATCACTCAAAGTATCTCTTGTTCAACGGAGTAAGAGCCACAGACGGTGGGTCTCCTATCAGTGGGTACAACGGAGATGGTAACATTCTTATCCAAGGTGGAACCCTCACAGGTCAAGGTAATAAGCAAACAGCCAAAGCAAGTATTGTTCACTTTGGTCACGCTCAAAATATTACCTTCGACAACGTAACCCTTAAGGACTGTTCTAACTCGCATCATATCGAGTTTAACGCTTGTAAGGATGTATACGTTAAGAATTGTAAATTCCTCGGCTGGTTTGGAACAGTTGATACATACAACGAAGCTATCCAGCTAGACTTGGCAACTCCAGAGTTAACAACCGTGGCTGGCGGTGACTATACCCCTTGTAAAAATGTCTATATTGACAACACATACTTTGGGAAGTCAACCACAGCAGGGTCAAAACCAATTGGTCGAGGAATCGGGTCTCACTCAGGGGCTATCAACCGTTTCCATGAAAATATCCATGTTACAAATTGTACTTTCGATAGCACTGTAGAGTGGGCTTGTCGTGCTTATGCTTATCAAGATTTCTTCTTTACTAACAACAAAATTGTTAACTGTGGTCGTGGTGTCAACGTGAGGTCTAATATCTCATCGGATGACAAAGATACAATCAATGCTGATACAGGGGCGCAAACAGGCAAATCTCAAAATATCTCTAGGTTTGTTATCTCAAACAACACCTTTTCTGGTAGGATGGATGCAGGTCGAGCAATTGAGATATATGGGGAGGAAACTGGACGCATTTACATGACGACTATCACAGGTAATGTTGTCAACTTGTCATCCTTCGCAGGCTCAGCAAATGAGGTTATCTATCTCAACTATGTTCGATATGCAAATATCTCAGGTAACAATGTTGGCGGTGCTAATGTAGGTACTTGTATTGGTCTCAATGGAAATACTACAGAGGTTACTATCGGAAACAACATTTGTGCGTTTGGTGATCGTGGTATCGCCGTTTATGGTGCCAATGTGGTCATGCAGAACATCAGCATCTTAGGTAACACAATAAGAGGGATGCAAAGAAGCGGAATCCACCTTGATAGTATTGACGGGTTTTCTTGCGTTGGTAACACCGTATTTGATTGCAACAAGGCTGGCGGAGATGAAAATCATATAAGGGTTGTTGTCGGTAATAAAAATGGTACAGTCAGCGGCAATCTTTGTACAGTAGCTTGCCCCACTAGTATTTATGTTTCAAACACAAATAGTCGTATTAACGTCACAGGGAATGTATTATCTGGTGGTCTGACAAATAATTCATCGGGCGGAGCTTCTTCAAACAACATCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
1523d6d0a953c2be5b9985050a256824cb23d947c0dba041a7fb94af8bde0ae6
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8587
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50