Genbank accession
QZB87251.1 [GenBank]
Protein name
tail fibers protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MATTPTSLPIPSEDPRDLKFNAGKFDEVMTSDAHYYVDRFGVKRWTIAGFQYTAEEAIRAYGYITMDSFEDGATLTLPNQVLRYEATGEYYRWDGAFPKAVAAGSTPASTGGVGLGAWISVGDAAFRQEANKKFKYSVKLSDYSTLQEAATAAVDGLLIDINYNFTDGESVDFGGKILTINCKARFIGDGALIFNNMGPGSVINQPFMESKTTPWVIFPWDADGKWITDAALVAATLKQSKIEGYQPGVNDWVKFPGLEALLPQNVKDQHITATLDIRSASRVEIRNAGGLMAAYLFRSCHHCKVIDSDSIIGGKDGIITFENLSGDWGLGNYVIGGRVHYGSGSGVQFLRNNGGESHNGGVIGVTSWRAGESGFKTYQGSVGGGTARNYNLQFRDSVALSPVWDGFDLGSDPGMAPEPDRPGDLPVSEHPFHQLPNNHLVDNILVMNSLGVGLGMDGRGGYVSNVTVQDCAGAGMLANTYNRVFSNITVIDCNYLNFDSDQIIIIGDCIVNGIRAAGIKPQPSKGLVISAPNSTISGLVGNVPPDKILVGNLLDAVLGQSRVIGFNSDTAELALRINKLSATLDSGALRSHLNGYAGSGSAWTEITAIAGSLPDAVSLKINRGDYRAVEIPVAVTVLPDNAVRDNGAISLYLEGDSLKALVKRADGSYTRLTLA
Physico‐chemical
properties
protein length:675 AA
molecular weight: 72043,14330 Da
isoelectric point:5,15447
aromaticity:0,08889
hydropathy:-0,09200

Domains

Domains [InterPro]
G3DSA:2.10.10.80
ATT
63–119
G3DSA:2.10.10.80
ATT
63–125
IPR040775
RBD
64–123
IPR015331
RBD
129–675
QZB87251.1
1 675
Architecture
ATT
STR
ATT 63-125 | STR 126-675
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QZB87251.1
1 675
Domain Start End Length (AA) Confidence
N-terminal 1 145 145 0,9911
Central domain 146 582 438 0,9838
C-terminal 583 675 92 0,8667
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-145
Central
146-582
C-terminal
583-675

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage seszw
[NCBI]
2865759 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Salmonella typhimurium
[NCBI]
90371 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QZB87251.1 [NCBI]
Genbank nucleotide accession
MZ375315.1 [NCBI]
CDS location
range 19964 -> 21991
strand +
CDS
ATGGCTACCACACCGACTAGCTTACCAATCCCGTCAGAAGACCCGCGCGACCTGAAGTTTAACGCTGGTAAATTTGATGAAGTCATGACATCTGATGCACATTACTATGTGGACAGATTTGGCGTAAAACGCTGGACTATTGCTGGATTCCAGTACACTGCGGAAGAGGCCATTCGTGCTTATGGATATATCACAATGGATAGCTTTGAAGATGGCGCGACGTTGACGCTACCAAATCAGGTGCTACGTTACGAGGCAACCGGAGAATATTACCGATGGGATGGTGCATTTCCTAAGGCTGTAGCTGCTGGTTCAACTCCTGCATCAACTGGTGGCGTTGGTTTAGGCGCGTGGATTAGTGTTGGTGACGCAGCATTTAGACAGGAAGCCAACAAAAAATTCAAATATTCAGTAAAATTATCAGATTATTCTACCTTGCAGGAAGCAGCGACAGCAGCCGTAGATGGATTGCTTATTGATATCAATTACAACTTCACAGATGGTGAGTCTGTAGATTTTGGTGGTAAGATTTTAACCATTAACTGTAAGGCTAGGTTTATTGGAGATGGGGCTTTAATATTTAATAATATGGGGCCAGGCTCGGTAATTAATCAACCATTCATGGAGAGCAAGACTACTCCATGGGTCATTTTCCCGTGGGATGCTGATGGTAAATGGATTACAGATGCTGCCCTTGTTGCTGCAACGCTGAAGCAATCAAAGATTGAAGGCTATCAACCTGGGGTAAATGACTGGGTTAAATTCCCTGGATTAGAGGCATTACTCCCACAGAACGTTAAAGACCAACATATTACAGCCACTCTAGATATTCGCAGTGCCAGCCGAGTAGAAATAAGAAATGCTGGTGGTCTTATGGCTGCTTACCTTTTCCGTAGTTGTCATCACTGCAAGGTAATTGATTCAGATAGCATCATTGGTGGTAAAGATGGAATCATTACCTTTGAGAACCTTAGTGGTGATTGGGGATTAGGTAATTATGTTATTGGTGGACGTGTTCATTATGGTTCTGGTAGTGGTGTTCAGTTCCTGAGAAATAATGGTGGTGAATCCCACAATGGTGGAGTTATTGGTGTTACATCATGGCGAGCTGGTGAGTCTGGTTTCAAGACTTATCAGGGTTCCGTTGGTGGTGGTACTGCACGTAACTATAATCTACAGTTCAGGGATTCTGTTGCATTGTCTCCTGTTTGGGATGGTTTTGACTTGGGTTCTGACCCAGGTATGGCACCAGAACCGGATAGACCTGGGGATTTACCTGTATCTGAACATCCATTCCACCAACTGCCTAATAACCATTTGGTTGATAATATTCTTGTTATGAACTCACTTGGTGTTGGTTTAGGTATGGATGGTCGTGGTGGGTATGTTTCTAACGTTACCGTACAGGATTGTGCTGGTGCAGGTATGCTTGCAAATACTTACAACCGTGTATTTTCTAACATTACAGTTATTGATTGTAACTACCTTAATTTTGATTCTGACCAAATTATCATTATAGGTGATTGTATTGTTAATGGGATTAGGGCTGCTGGGATTAAACCACAACCATCAAAAGGTCTGGTTATCAGTGCACCAAACTCTACAATAAGTGGGTTGGTCGGTAATGTTCCTCCAGATAAAATTCTTGTTGGTAACTTACTTGACGCAGTATTAGGTCAGTCTAGAGTCATCGGGTTCAATAGTGATACTGCTGAGTTGGCTCTACGTATTAACAAGCTGTCAGCTACTCTGGATAGTGGTGCTTTACGTTCCCATCTGAACGGTTATGCTGGTTCTGGTTCAGCATGGACAGAAATTACCGCTATTGCGGGGTCCTTGCCTGATGCCGTGTCATTAAAAATAAACAGGGGCGATTATCGTGCTGTTGAGATACCGGTAGCGGTGACCGTCCTACCAGACAACGCTGTCAGGGATAACGGGGCTATATCACTGTATCTGGAAGGCGATAGCCTTAAGGCGTTAGTTAAGCGGGCCGATGGAAGCTATACAAGATTAACTTTGGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
2128cafa3752590b840a047963de6f8f469e732c722586ee689660528b374897
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6831
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Genome-scale top-down reduction of phage to generate viable minimal phage genome Yuan,S. and Ma,Y. 2021-07-13 GenBank