Genbank accession
YP_007517417.1 [GenBank]
Protein name
tail spike protein
RBP type
TSP
Evidence GenBank
Probability 1,00
TSP
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,90
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MGNATRSGNYGLWGWESQDKLSETIPQISSNFQALDEILSRAWIDIKSYGYGAKGNYNLSTGSGTDDAPAFQKALDIAQNKTSSVTIFVPDGTYRLGSELRIYSNTAIILSPNATLVRDHAKYLLFNGVRAADGGSPVSGYNGDGNIVIKGGTLTGQGNKQTAKASIVHFGHAQNITFDNVTLKDCSNSHHIEFNACKDVYVKNCNFLGWFGDTDTYNEAIQLDLATPELTTVAGGDYTPCKNVYIDSTYFGKSSTAGSKPIGRGIGSHSGAINRFHENIHVTNCTFDSTVEWACRAYAYRDFFFTNNNIINCGRGINVRSNISTDDKDTIDANTGQQTGKSQNINRFVISNNSFTGTMNNGRAIEIYGEATGRIYMTTITGNIINLSSFSGSRNEVIYLNYVRYATVSGNNVGGANVGTCIGLNGDTTEVSIANNICAFGDRGIAVYGANVTMQNISILGNTIRGMQRSGIHLDSIDGFACSGNTIFDCNKAGGDENHIRVVVGNKNGTVSGNLCTVACPTSIYVSNTNSRVNVSGNVLSGGLTNNSSGGASSNNI
Physico‐chemical
properties
protein length:557 AA
molecular weight: 59390,90310 Da
isoelectric point:6,36582
aromaticity:0,08618
hydropathy:-0,27702

Domains

Domains [InterPro]
IPR012334
STR
44–440
IPR024535
ENZ
58–317
YP_007517417.1
1 557
Architecture
ATT
STR
RBD
ATT 1-45 | STR 46-495 | RBD 496-557
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_007517417.1
1 557
Domain Start End Length (AA) Confidence
N-terminal 1 52 52 0,9694
Central domain 53 546 495 0,9941
C-terminal 547 557 10 0,3611
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-52
Central
53-546
C-terminal
547-557

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage Eoghan
[NCBI]
2880542 Uroviricota > Caudoviricetes > Ehrlichviridae > Andromedavirus eoghan >
Host Bacillus pumilus BL8
[NCBI]
1189615 Bacillota > Bacilli > Caryophanales > Bacillaceae > Bacillus > Bacillus pumilus

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_007517417.1 [NCBI]
Genbank nucleotide accession
NC_020477.1 [NCBI]
CDS location
range 18889 -> 20562
strand +
CDS
ATGGGTAACGCTACAAGATCGGGCAATTATGGACTATGGGGCTGGGAATCACAAGACAAGCTGTCAGAGACTATCCCTCAGATTAGCTCAAACTTTCAAGCACTAGATGAAATCCTCAGTCGTGCATGGATTGATATCAAATCGTACGGGTATGGAGCAAAAGGTAATTACAACTTGAGTACAGGAAGTGGAACAGATGACGCTCCAGCATTCCAAAAGGCTTTAGACATCGCACAAAACAAAACAAGTAGCGTAACCATATTTGTTCCTGACGGCACTTATAGATTGGGAAGCGAATTGAGGATTTACAGCAACACAGCAATTATCTTGAGCCCGAATGCCACATTAGTAAGAGACCACGCCAAGTATCTCTTATTCAACGGAGTAAGGGCGGCAGACGGTGGCTCCCCTGTTAGTGGATACAACGGAGATGGCAACATTGTTATCAAAGGAGGAACACTCACAGGGCAAGGAAACAAGCAGACAGCCAAAGCCAGTATTGTTCACTTTGGTCACGCTCAAAACATCACCTTTGATAATGTAACTCTCAAGGATTGTTCTAACTCGCATCACATTGAGTTCAACGCTTGTAAGGATGTATACGTTAAAAATTGCAACTTCCTTGGCTGGTTCGGGGATACTGATACATACAATGAAGCGATCCAGTTAGACTTGGCAACTCCAGAGCTAACCACTGTGGCAGGAGGAGACTATACTCCTTGTAAAAATGTGTACATTGATAGTACATACTTTGGTAAGTCATCCACAGCAGGATCAAAACCAATTGGTCGAGGAATCGGGTCTCACTCTGGGGCGATTAATAGATTCCATGAAAACATACATGTAACTAACTGCACTTTTGATAGCACTGTAGAGTGGGCTTGTCGTGCTTATGCTTATAGAGACTTCTTCTTTACAAACAATAATATTATCAACTGTGGTCGTGGTATAAACGTGAGATCGAATATCTCAACAGATGACAAAGACACAATAGACGCTAACACAGGACAACAGACAGGGAAATCTCAAAACATCAATCGCTTTGTCATCTCGAATAACTCGTTTACTGGAACCATGAACAATGGTCGAGCTATTGAGATTTACGGGGAAGCAACTGGCAGAATCTATATGACTACCATCACAGGGAACATCATCAATTTATCATCTTTTTCGGGGTCAAGAAACGAGGTTATTTATCTCAACTATGTCAGATATGCAACTGTCTCAGGAAACAATGTCGGTGGTGCAAATGTAGGTACTTGTATAGGTCTTAATGGAGACACTACGGAAGTATCAATTGCAAACAACATTTGTGCTTTTGGTGATCGAGGTATTGCTGTGTATGGTGCTAACGTTACTATGCAAAATATTAGCATCCTGGGAAATACAATTAGAGGCATGCAAAGAAGTGGCATACACCTCGACAGTATTGACGGTTTCGCCTGCTCTGGAAATACTATCTTTGACTGTAATAAGGCTGGCGGAGATGAAAACCACATTAGGGTTGTTGTCGGTAACAAAAATGGTACAGTCAGCGGAAACCTTTGTACAGTTGCTTGCCCTACGAGTATTTACGTTTCAAATACCAACAGCAGGGTAAACGTAAGCGGGAATGTACTATCTGGAGGACTCACCAATAACTCGTCAGGCGGAGCATCTTCAAATAATATCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
4326df8111d13ddb37496b1f704235f93998022a7ec75e5fd46d0e2778f37b8c
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8565
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50