Genbank accession
YP_003714746.1 [GenBank]
Protein name
tail spike protein
RBP type
TSP
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,90
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MDITAGRLESMNQRAWLMVTNYGADASGVVNSHSQIQLALNDARDRGGAWVLVPPGTYLLGATLRIYGNTRLTLMQGAEFQRNHGGTMLLNGDPGQAYGGYTGHSNIIVEGGLWNMRGTTAGMTSSALCMAFGHATNLSVTDLEIRDVPGYHGIEFNSTKHGTIRNCRFRGYVDPGGRDYSEAVQLDLAKSSAVFGGFGPYDNTPTEDVAVTGCYFGASGTAGTTAWPRGIGSHSATIQRWHRRIRISDCAFEGVLQYGVSAYNWEDVTITGNTFVKCGSGVRIRSVIKTDVNDTVNASGVQTNESQTMRNITVTGNTFRYGQAYDNAIIAQGEVNTGTILNLAIVGNTIDGTTGDQSGIRLNYASRVTVGDNVVANVAGTAISTENQDNTVLNGNVIWAAGAHGITMVSSDNSDILGNHIRDPANSGILVQGGSDIQIRDNFVEGANRVASTAYGIRVSTDPVAVAITNNKCRPGNSTTKAVRGLSISSGTGIQRFGNDCRGTWSGSGGTGVQDLSTSPSTVATDLG
Physico‐chemical
properties
protein length:528 AA
molecular weight: 55607,91910 Da
isoelectric point:6,06440
aromaticity:0,06818
hydropathy:-0,22045

Domains

Domains [InterPro]
IPR012334
STR
19–398
IPR012334
STR
20–398
IPR039448
ENZ
136–319
IPR011050
STR
309–504
YP_003714746.1
1 528
Architecture
STR
RBD
STR 15-504 | RBD 505-527 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_003714746.1
1 528
Domain Start End Length (AA) Confidence
N-terminal 1 25 25 0,9331
Central domain 26 517 493 0,9902
C-terminal 518 528 10 0,3960
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-25
Central
26-517
C-terminal
518-528

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptomyces phage phiSASD1
[NCBI]
747763 Uroviricota > Caudoviricetes > Sasdunavirus >
Host Streptomyces avermitilis
[NCBI]
33903 cellular organisms > Bacteria > Bacillati > Actinomycetota > Actinomycetes > Kitasatosporales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_003714746.1 [NCBI]
Genbank nucleotide accession
NC_014229.1 [NCBI]
CDS location
range 14866 -> 16452
strand +
CDS
ATGGACATCACTGCCGGCCGGCTGGAGTCCATGAATCAACGCGCCTGGCTGATGGTCACGAACTACGGCGCGGACGCCAGCGGTGTCGTGAACTCCCATTCTCAGATTCAGCTGGCGCTGAACGACGCGCGTGACAGGGGTGGAGCCTGGGTCCTGGTCCCTCCGGGGACGTACCTCCTGGGCGCCACCCTGCGCATCTACGGGAACACGCGGCTGACCCTCATGCAGGGCGCGGAGTTCCAGCGCAACCACGGCGGCACGATGCTTCTGAACGGCGACCCTGGCCAGGCGTACGGCGGCTACACAGGGCACTCGAACATCATCGTGGAGGGTGGCCTTTGGAACATGCGGGGCACGACGGCCGGCATGACCAGCTCCGCTTTGTGCATGGCCTTCGGGCACGCCACGAACCTGTCCGTGACGGACCTGGAGATCCGGGACGTGCCCGGCTATCACGGGATTGAGTTCAACTCCACCAAGCACGGCACCATCCGCAACTGTCGCTTCCGCGGCTACGTGGACCCAGGAGGCAGGGACTACAGTGAGGCCGTTCAGCTTGACCTGGCGAAGTCCTCCGCTGTGTTTGGCGGCTTCGGCCCGTACGACAACACGCCTACCGAAGACGTGGCCGTGACTGGTTGCTACTTCGGCGCTTCTGGCACGGCCGGCACGACTGCCTGGCCCCGCGGCATCGGCTCACACTCTGCCACGATCCAGCGCTGGCACCGCCGCATCCGCATCTCTGACTGTGCCTTTGAAGGCGTCCTCCAGTACGGCGTGAGCGCGTACAACTGGGAGGACGTCACGATCACGGGCAACACGTTCGTCAAGTGCGGCTCTGGCGTCCGCATCCGGTCCGTGATCAAGACGGACGTTAACGACACGGTCAACGCGTCCGGAGTCCAGACGAACGAGTCCCAAACGATGCGCAACATCACTGTCACTGGCAATACCTTCCGGTACGGCCAGGCCTACGACAACGCGATCATTGCCCAGGGCGAAGTCAACACGGGCACGATCCTGAACCTGGCCATCGTCGGCAACACCATCGACGGTACGACTGGCGACCAGTCGGGCATCCGCCTGAACTACGCCTCGCGCGTCACGGTCGGGGACAACGTCGTGGCGAACGTCGCCGGCACTGCCATCAGCACGGAGAACCAGGACAACACGGTGCTGAACGGGAACGTCATCTGGGCCGCTGGCGCGCACGGAATCACGATGGTGTCCAGCGACAACTCTGACATCCTCGGCAACCACATACGCGACCCTGCGAACTCCGGGATCTTGGTCCAGGGCGGATCGGATATCCAGATCCGGGACAACTTCGTGGAAGGCGCGAACCGTGTTGCCTCCACGGCGTACGGCATCCGCGTGTCTACGGACCCCGTCGCTGTCGCGATCACGAACAACAAGTGCCGGCCTGGCAACTCCACGACGAAGGCTGTCCGCGGCCTGTCCATCTCCAGCGGTACGGGTATCCAGCGCTTCGGCAACGACTGTCGCGGAACCTGGTCCGGGTCTGGCGGCACTGGCGTGCAGGACCTGTCCACGTCTCCGTCAACGGTCGCAACTGACCTGGGCTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
54b9b177a661c14a4b5924f5b4acb13d29058a775fdf1ffbda249594ca9c7686
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8693
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50