Protein
View in Explore- Genbank accession
- YP_008239656.1 [GenBank]
- Protein name
- tail spike protein
- RBP type
-
TFTSPTSPTSP
- Protein sequence
-
MSSGCGDVLSLADLQTAKKHQIFEAEVITGLQGGVAGGASIDYATNQVTGQVQKTMPAVLRDIGFYPASFDFSTGGTLTANDRNKVVYDLVSKTWYSWAGALPHVIAAGTNPVGAADWIPQTDPDLRSDLNTATAGKGADLVKFKTGTAVQRLVDSDGFNIIGKFAQVADLRSVASSAGMTVLVKEHTTGRGALGGGQFVAVAATIADDNGVYISSGTAGVTWVRKDAGSHVKIEWFGGKSEDSGIDHSPILANAQKTAGRSIEFQYGSYYFTPPCVIKPEMHFIGSGGAKTFWRNKNINADATVFFANTGTSSKWAENSIFERIHFSNDLATQATQSAFSMTNVGLFKFNNCGFYNSPIYASDLHFVTWNGCVFIKSPVTINEASVSPTFPINEMPSFIDCYMVESPIDITDVTDLHLNNTVMFYGPFGIKSTSHRPMQTGADSRGYPIMITNSTIDNIDGYCLDLNRVAIGTITNSLFSGGRVSNTAAIRLTEVMGLSFNSNVIHFAGQECMTLYDVQNLLMGNNQFSSCNGFAIKSQYARNVILNGNFFGNQKVTGGWNTCTGGVNFDTNDNLAWIITGNAFVGIPGVVGQTGNGRTVYTAVANAGLADN
- Physico‐chemical
properties -
protein length: 613 AA molecular weight: 65418,58570 Da isoelectric point: 5,66528 aromaticity: 0,10277 hydropathy: -0,03883
Domains
Domains [InterPro]
DC_1596
STR
1–389
STR
1–389
G3DSA:2.10.10.80
ATT
65–132
ATT
65–132
IPR040775
RBD
77–112
RBD
77–112
IPR011050
STR
369–595
STR
369–595
1
613
Architecture
STR 1-64 | ATT 65-132 | STR 133-595 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
613
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 248 | 248 | 0,9926 |
| Central domain | 249 | 602 | 355 | 0,9958 |
| C-terminal | 603 | 613 | 10 | 0,4790 |
Note: Constraints were applied during segmentation.
C-terminal too short, adjusted boundary
C-terminal too short, adjusted boundary
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-248
1-248
Central
249-602
249-602
C-terminal
603-613
603-613
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Salmonella phage FSL SP-031 [NCBI] |
1173749 | Uroviricota > Caudoviricetes > Sarkviridae > Cornellvirus > Cornellvirus SP31 |
| Host |
Salmonella enterica [NCBI] |
28901 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
| Host |
Salmonella enterica subsp. enterica serovar Cerro [NCBI] |
340188 | Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
YP_008239656.1
[NCBI]
Genbank nucleotide accession
NC_021775
[NCBI]
CDS location
range 40351 -> 42192
strand +
strand +
CDS
ATGTCAAGCGGATGCGGTGATGTTTTAAGTCTGGCGGATTTACAGACCGCCAAGAAGCACCAGATTTTTGAGGCCGAAGTTATCACCGGTTTGCAGGGTGGTGTTGCCGGTGGGGCGTCTATAGATTACGCGACGAATCAAGTTACAGGGCAGGTGCAGAAAACAATGCCTGCTGTTCTGCGCGATATCGGTTTTTATCCTGCTTCTTTCGATTTCTCTACAGGCGGCACACTGACGGCTAACGACCGTAATAAAGTGGTGTATGACCTGGTAAGTAAGACGTGGTACTCGTGGGCGGGGGCGTTGCCGCACGTCATCGCCGCGGGGACTAATCCGGTCGGTGCAGCGGACTGGATTCCGCAAACGGACCCCGATTTGAGGTCTGACTTAAACACCGCGACCGCGGGGAAAGGCGCAGACCTAGTTAAATTTAAGACGGGTACTGCGGTACAGCGCCTGGTAGACTCAGACGGATTTAACATCATTGGAAAATTCGCTCAGGTTGCCGATTTGCGCTCCGTTGCGTCTTCCGCCGGTATGACTGTCCTTGTCAAAGAACACACTACCGGCCGCGGCGCGCTCGGGGGCGGGCAGTTCGTTGCCGTAGCCGCCACAATAGCCGATGATAATGGTGTGTATATCTCTTCCGGTACGGCAGGAGTAACGTGGGTCCGCAAAGACGCAGGTTCTCATGTTAAAATAGAATGGTTTGGCGGGAAATCTGAAGACTCCGGTATCGACCATTCTCCTATCCTGGCTAATGCGCAGAAAACAGCAGGCCGCTCAATCGAGTTCCAGTACGGCAGCTATTATTTCACACCACCATGTGTCATCAAGCCAGAGATGCACTTCATTGGTAGTGGCGGGGCTAAAACGTTCTGGCGTAATAAAAACATAAATGCGGATGCCACTGTGTTTTTTGCTAACACTGGCACATCAAGCAAATGGGCTGAAAACTCTATTTTTGAAAGAATACACTTCAGTAATGATTTAGCTACTCAAGCAACCCAGTCAGCTTTCTCTATGACTAATGTTGGTCTTTTTAAATTTAATAATTGTGGTTTTTATAATTCCCCAATTTATGCATCAGACCTGCACTTTGTTACATGGAACGGATGCGTGTTCATAAAAAGTCCAGTAACAATTAATGAGGCTTCGGTTTCTCCAACATTTCCGATAAATGAAATGCCGTCATTCATAGATTGCTATATGGTTGAGTCACCTATAGATATAACTGATGTAACAGATCTGCATTTAAATAACACTGTTATGTTTTATGGCCCATTTGGTATAAAAAGTACATCCCACAGACCTATGCAGACTGGAGCGGATTCTCGCGGATATCCAATAATGATAACTAACAGTACCATTGATAACATTGATGGGTACTGCCTTGATTTAAACAGGGTAGCAATAGGGACTATTACAAACAGCTTGTTTAGCGGTGGAAGGGTAAGCAATACCGCAGCAATAAGACTTACAGAGGTTATGGGATTAAGTTTTAACTCTAACGTCATTCATTTTGCTGGGCAGGAATGCATGACGCTTTATGATGTTCAAAATTTACTAATGGGAAACAATCAGTTTAGTAGTTGCAATGGTTTCGCTATAAAATCACAATATGCAAGAAACGTAATATTAAATGGTAATTTCTTTGGAAATCAGAAGGTTACTGGAGGGTGGAATACCTGCACTGGAGGTGTAAACTTTGACACCAATGACAACTTGGCTTGGATAATAACTGGAAATGCATTTGTTGGTATACCAGGGGTTGTTGGGCAAACTGGAAATGGAAGAACAGTTTATACAGCGGTAGCTAACGCCGGACTCGCTGACAACTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
d7e138f00f29414ee32fe8338316028cc8f60584a6cf8a9ef68461787cccec20
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50