Genbank accession
UKM96676.1 [GenBank]
Protein name
tail spike protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TSP
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,96
TSP
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MSSGCGDVLSLNDLQIAKKHQIFEAEVITGKQGGVAGGADIDYATNQVTGQTQKTLPAVLRDAGFSPASFNFTTGGTLGINDANKAVLWPKEDGGDGNYYAWRGSLPKVIPAASTPLTTGGISDSAWVAFGDITFRAEADKKFKYSVKLSDFTTLQQLADAAVDSILIDRDYNFSNNETVNFGGKTLTIDCKAKFIGDGNLVFTQLGKGSIVIAPFMESATTPWVIKPWTDDNQWITDPAAIVATLKQSKTDGYQPTVNDYAKFPGIESLLPPEAKGQSISSTLEIRECTGVEVHRASGLMACFLFRGCHFCKMVDADNPSGGAHGVITFENLSGDWGKGNYVIGGRTSYGSVSSAQFLRNNGGFARDGGVIGFTSYRAGESGVKTWQGTVGSTTSRNYNLQFRDSAVLYPVWDGFDLGADTDMNPEDDRPGDFPISQYPVHMLPLNHLIDNLFVRGSLGVGFGMDGQGLYVSNITVEDCAGSGAYILAHETVFTNIAIIDTNTKNFPANQIYISGACRVNGLRLVGIRSTTEQGMTVDAPNSTVSGITGFVDPSRINVANLMEEGLGNSRINSFNNDSAALRFRIHKLSKTLDSGSVYSHINGGPGSGSAWTEITAIAGSLPDAVSLKINRGDYRAVEIPVATTVLPDNAVRDNGSISLYLEGDSLKALVKRADGSYTRLTLA
Physico‐chemical
properties
protein length:684 AA
molecular weight: 72924,69790 Da
isoelectric point:5,15668
aromaticity:0,08918
hydropathy:-0,16213

Domains

Domains [InterPro]
G3DSA:2.10.10.80
ATT
65–135
IPR040775
RBD
71–132
IPR012332
STR
136–684
IPR015331
RBD
138–684
UKM96676.1
1 684
Architecture
ATT
STR
ATT 65-135 | STR 136-684
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
UKM96676.1
1 684
Domain Start End Length (AA) Confidence
N-terminal 1 155 155 0,9919
Central domain 156 594 440 0,9877
C-terminal 595 684 89 0,8567
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-155
Central
156-594
C-terminal
595-684

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage PBSE191
[NCBI]
2914173 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Salmonella enterica
[NCBI]
28901 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UKM96676.1 [NCBI]
Genbank nucleotide accession
OM291373.1 [NCBI]
CDS location
range 2192 -> 4246
strand +
CDS
ATGTCCAGTGGTTGCGGTGATGTACTGTCACTTAACGATTTACAAATAGCCAAGAAACACCAGATTTTCGAAGCCGAGGTGATCACCGGCAAACAGGGCGGTGTAGCAGGCGGCGCAGATATCGACTACGCCACTAACCAGGTAACCGGGCAGACGCAGAAGACGCTGCCCGCGGTCTTGCGTGACGCCGGTTTCTCTCCGGCGTCTTTTAACTTCACAACCGGCGGAACCCTGGGAATTAACGACGCCAATAAAGCGGTTCTTTGGCCGAAAGAAGATGGCGGGGATGGTAACTATTACGCATGGCGTGGCTCCCTGCCGAAAGTTATCCCCGCGGCATCGACACCTCTTACGACAGGCGGCATTTCTGATTCGGCTTGGGTAGCTTTTGGGGACATCACCTTCCGTGCGGAAGCGGATAAGAAATTTAAGTACTCCGTTAAGCTGTCTGACTTTACTACGTTACAACAACTGGCGGATGCTGCCGTCGACAGTATTCTTATCGACCGTGACTACAATTTCAGCAATAACGAAACCGTTAATTTTGGCGGGAAGACCCTGACCATCGACTGTAAAGCGAAGTTTATCGGCGACGGAAACCTGGTATTTACGCAATTAGGTAAAGGTTCCATTGTAATAGCCCCCTTTATGGAGAGTGCTACAACGCCGTGGGTGATTAAACCGTGGACCGACGATAATCAGTGGATAACCGACCCCGCGGCAATCGTGGCCACACTTAAACAGTCTAAAACAGATGGATACCAGCCGACGGTAAACGATTACGCCAAGTTCCCTGGCATAGAATCCCTTCTCCCTCCGGAAGCTAAAGGGCAAAGCATATCTTCTACCCTGGAAATTCGGGAATGTACAGGCGTCGAGGTTCACCGGGCGAGTGGTCTTATGGCGTGTTTCCTGTTCCGTGGATGCCATTTCTGTAAGATGGTAGACGCTGACAACCCGAGCGGCGGTGCACACGGCGTAATCACCTTCGAAAACTTAAGCGGAGATTGGGGCAAAGGTAACTATGTTATCGGCGGGCGCACAAGTTACGGTTCGGTAAGTAGCGCGCAATTCTTACGCAACAATGGCGGTTTCGCGCGCGATGGCGGGGTCATCGGGTTTACCTCGTATCGTGCAGGGGAAAGTGGTGTTAAGACGTGGCAAGGTACGGTAGGTTCTACGACATCTCGTAACTACAACCTGCAATTCCGGGATTCGGCGGTACTGTACCCTGTATGGGACGGCTTCGATTTAGGCGCAGATACTGACATGAACCCCGAAGATGACCGCCCAGGGGATTTCCCCATTTCTCAGTACCCGGTACATATGCTCCCTTTAAACCATTTGATAGACAATCTATTTGTTAGAGGTTCGCTGGGGGTAGGATTCGGTATGGACGGGCAAGGTCTGTATGTCTCTAACATAACCGTCGAGGATTGCGCTGGTTCTGGGGCTTATATTCTTGCCCACGAAACAGTATTCACTAATATCGCAATAATCGACACCAATACTAAAAACTTCCCTGCGAACCAGATATATATCTCGGGGGCCTGTCGTGTAAACGGCCTTCGTTTGGTCGGCATCCGTTCAACTACCGAACAGGGCATGACGGTAGACGCACCTAACTCCACTGTAAGCGGAATAACGGGCTTCGTGGACCCCTCAAGGATTAACGTAGCCAATTTGATGGAGGAAGGTCTTGGTAACTCTCGCATAAACAGTTTCAATAATGATTCTGCGGCGCTTCGGTTTCGTATTCATAAACTGTCAAAAACCCTTGATAGTGGGTCCGTGTACTCCCACATTAACGGCGGGCCAGGTTCTGGCTCAGCATGGACCGAAATTACCGCTATTGCGGGGAGCTTGCCTGATGCCGTATCATTAAAAATAAATAGGGGCGATTATCGTGCTGTTGAGATACCGGTAGCGACGACCGTCCTACCAGACAACGCTGTCAGGGATAACGGGTCTATATCACTGTATCTGGAGGGCGATAGCCTTAAAGCGTTAGTTAAGCGGGCCGACGGAAGCTATACAAGATTAACTTTGGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
07c569d5e8cc592574166ff0d5e4d1c842375365692b9c6244c41a695bf0afdf
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6898
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50