Genbank accession
QEG05462.1 [GenBank]
Protein name
tail sheath protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRVVRVLNTEKAKNSTTLAGNVAFEITNEGSNYKVGDVIKVKYKNSEIETGGKVTKVSTEGKVKGIFIPTGKLIAHAKAVGTYPEVNGYTVEITSGTGNSSASITISSIVTDSGLLLTDLETSRSNITNQTFLTKLKKYDMPAVSAIYAGEIGNSLEVEILSRSAFTGTAPNLTMYPYGGERTAARNLVAYGPQNDNQYAFIVRRDGVMVESFVLSTVKGDKDVYGNSIYMDDFFARGASQYIYATAQGWVDGFSGIISLAGGLSANEASTNDHQNDPFIGAMMQGWDLFAERETIHVNLLIAGACAGEGDNFSTVQKYAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIHWREGSNNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQA
Physico‐chemical
properties
protein length:665 AA
molecular weight: 72668,90560 Da
isoelectric point:5,45225
aromaticity:0,09624
hydropathy:-0,21624

Domains

Domains [InterPro]
IPR052042
Unmapped
4–652
G3DSA:2.40.10.380
ATT
98–187
QEG05462.1
1 665
Architecture
TAS
ATT
STR
TAS
TAS
TAS 24-83 | ATT 97-187 | STR 188-513 | TAS 514-545 | TAS 548-647 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QEG05462.1
1 665
Domain Start End Length (AA) Confidence
N-terminal 1 111 111 0,9062
Central domain 112 310 200 0,3671
C-terminal 311 665 354 0,2179
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-111
Central
112-310
C-terminal
311-665

Taxonomy

  Name Taxonomy ID Lineage
Phage Shigella phage JK32
[NCBI]
2591059 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QEG05462.1 [NCBI]
Genbank nucleotide accession
MK962753 [NCBI]
CDS location
range 93222 -> 95219
strand +
CDS
ATGACTCTATTATCACCGGGTTTCGAAACGAAAGAAACTACTCTCTCCACTACCATTGTGCAATCTGCGACTGGTCGCGCGGCACTGGTGGGTAAATTCCAATGGGGTCCAGCTTTCCAGATTGTTCAAGTAACTAATGAAGTTGAATTAGTTAATAAATTTGGTCAGCCGGATAATAACACCGCAGACTATTTCATGAGCGGGGCTAACTTCCTCCAGTATGGCAATGATCTGCGTGTTGTTCGTGTATTGAATACAGAAAAAGCGAAAAACTCCACCACTCTCGCGGGTAACGTTGCATTTGAAATTACCAACGAAGGTTCCAACTATAAAGTTGGTGACGTAATTAAAGTAAAATATAAAAACTCAGAAATTGAAACAGGCGGTAAAGTTACTAAAGTAAGCACCGAAGGTAAAGTTAAGGGAATCTTTATTCCGACTGGTAAGCTGATCGCACACGCTAAAGCTGTTGGTACTTACCCAGAAGTAAACGGATATACCGTAGAAATCACTTCCGGTACGGGTAACTCTTCGGCATCTATTACTATTTCCAGCATTGTGACAGATTCCGGCTTGCTGTTAACCGACCTAGAAACTTCTCGCTCTAATATCACTAATCAGACTTTCTTGACCAAGCTGAAAAAATATGATATGCCAGCGGTGAGCGCAATCTATGCAGGTGAGATCGGAAACTCTCTGGAAGTTGAAATCCTTTCCCGTAGTGCATTTACTGGAACTGCTCCGAATCTGACCATGTATCCTTACGGTGGCGAACGTACCGCTGCACGCAATCTGGTGGCATACGGTCCGCAGAATGATAACCAGTACGCATTTATTGTACGTCGTGATGGCGTTATGGTAGAATCCTTTGTACTATCCACCGTAAAAGGTGATAAGGATGTTTACGGCAATTCCATTTATATGGATGATTTCTTTGCTCGTGGCGCAAGTCAGTACATCTACGCAACCGCTCAAGGCTGGGTGGATGGGTTTAGTGGCATCATCTCTCTTGCGGGTGGTCTGTCTGCTAATGAAGCATCCACCAACGACCATCAAAATGACCCGTTTATCGGTGCGATGATGCAAGGTTGGGATTTGTTCGCTGAACGTGAAACTATCCACGTAAACCTGCTTATTGCGGGTGCTTGCGCTGGTGAAGGTGATAACTTCTCTACCGTACAGAAATATGCTGTTTCCATCGGTGATGAGCGTCAGGATTGCTTAGTGATGGTTTCTCCGCCACGTAGCACCGTTGTTAATATTCCGGTCACTACCGCAATTGATAACCTGATCCACTGGCGCGAAGGTAGCAACAACTACTCAGACAACAACATGAACATTAATACCACTTACGCGGTTATTGATGGTAACTACAAATATCAGTATGACAAATATAACGATGTAAACCGTTGGGTTCCGTTGGCTGCTGATATTGCTGGCTTGTGTGCTCGTACTGATGCTGTATCCCAGCCGTGGATGAGTCCGGCAGGTTATAACCGTGGTCAGATCATGAACGTGGTTAAACTGGCAATTGAGCCTCGCAAGGCGCACCGTGACCGTCTGTATCAGGCCGCAATTAACCCGGTAATCGGTGCTGGTGGTGAAGGTTTTATCCTGATGGGTGATAAAACCGCTACGACTGTTCCTAGCCCGTTTGACCGAATTAACGTTCGTCGTCTGTTTAACATGCTGAAAAAGAATATCGGTGATTCAAGCAAATACAAACTGTTTGAAAACAATGATAACTTTACTCGCGCTTCCTTCCGTATGGAGGTTTCGCAATATCTCAGCACGATTCGCTCTCTGGGTGGCATTTATGACTTCCGAGTACAATGCGACACCACAAATAACACGCCAGATGTAATTGATCGTAACGAATTTGTTGCTAGCATGTTCATCAAACCAGCAAAATCGATCAACTATATCATGCTCAATTTCACTGCTGTTGCAACTGGTGCTGATTTTGACGAAATTATCGGTCCGGCTAACCAGGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
1d113ae36f8e5fcfb5efe18db2937123a879dc2b63cdd5b15643843aff7ad482
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7745
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
A quest of great importance - developing a broad spectrum Shigella ssp. and Escherichia coli phage collection Kaczorowska,J., Casey,E., Neve,H., Noben,J.-P., Lugli,G.A., Ventura,M., van Sinderen,D. and Mahony,J. 2019-09-26 GenBank