GenBank accession
QEP52571.1 [GenBank]
Protein name
tail fiber protein
RBP type Evidence Probability
TF GenBank 1.00
TF Phold 1.00
TSP DepoScope 1.00
TF RBPdetect 0.90
TF RBPdetect2 0.85
Protein sequence
MALKTKIIVQQILNIDDTTTTASKYPKYTVVLGTSISSITASELTGAVEASAASAAAAKDSEIAAKESETNAKDSENLAAIYANSSETSATQSAASATEAERQAGLSKDSADASATSAEESKGFRDSAELAAQNAEQSRLLAEQAKKDAEAAKTAAATSEQNAATSATESTNQAIAAAGSATEAGEYATTAKDSEIAAKTSELNAKNSENESAISAEASEASASQSAISASQSAASATKAAESSAAAKISETTAIESSAAAKTSEINAKTSEINAKTSETNAAAYAAAAKTSETNAADSAASASDSKGFRDEAEAFAAQASTSALAAKNSETNTKTSEINSKASEDAAKLAQQSASGSANTATQAMTTTKGYRDEAEVFKNTATTAATTATDKALEAAGSATIAGEKATNATSAADRAETAAASAEQVMQASLKKDQNLNDLANKDLAREALKVEAVNSVKDQYAGAYNSFRNPAWTYELRIANNGEWRVARNDNNSTSALSIGAGGTGAENVEGAKINFGIDRLKQTETETMMYAPGSNSPYRITIRPDAAWGVWTDETGRWIPLSIDAGGTGSNTEVGARKNLNTPVGGQAIIIPNNSNILGFMSTYAESGYYSSGELVTYQPPEASGWWMYELHVHGKNANGHVEYGNIVATAMNGNKWGIICSAGSWGGWYRIARSDRQLMLLSPDAQSALGDYSIAIGDHDSGLKWDRDGHISAFADSARIFAWTPSGINTYRVISSYVDDNARGMYVNGVRRGYPNALIAGQVEGGSFADWRSRASGLLVEHTGFDSAVAIFKSVYWGKDWIAGMDVVPWTSGGAETHLYVKGAEFIFDSAGNGSASNWVSRSDIRLKAHLKEIETASDKIDYLTGYTYYKRNNLIEDENSVYSIEAGLIAQDVERVLPEAVHSLNNDGQLDPKGEAIKGINYNGVVALLVNAFKEQKAKIDNQQEEINVLRNELYELKNLVKSMLNGNAPTITELP
Physico-chemical properties
protein length: 983 AA
molecular weight: 103269.66310 Da
isoelectric point: 4.84259
aromaticity: 0.06511
hydropathy: -0.42208
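The page does not state how these values were derived. As a minimal sketch, assuming that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY), the two dimensionless properties could be recomputed as shown here. The function names are illustrative, and the demo fragment is only the first 20 residues of the protein sequence above, so its values will not match the full-length figures.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of QEP52571.1, used only as a short demonstration input.
fragment = "MALKTKIIVQQILNIDDTTT"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Passing the full 983-residue sequence instead of the fragment would allow a comparison with the tabulated aromaticity and hydropathy, provided the assumptions above match the tool the page used.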

Domains

Domains [InterPro]
Coil Unmapped 132–162
IPR030392 CHP 849–909
QEP52571.1, residues 1–983
Architecture
ATT 2-192 | STR 250-499 | STR 524-696 | RBD 705-983
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
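The architecture string lends itself to simple parsing. The sketch below assumes the coordinates are 1-based and inclusive (the page does not say so explicitly) and lists the stretches of the 983-residue protein that no architecture segment covers. `parse_architecture` and `unannotated_gaps` are hypothetical helper names, not part of any tool referenced on this page.

```python
import re

ARCHITECTURE = "ATT 2-192 | STR 250-499 | STR 524-696 | RBD 705-983"
PROTEIN_LENGTH = 983


def parse_architecture(text):
    """Return (label, start, end) tuples from 'LABEL start-end | ...'."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)-(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        label, start, end = match.group(1), int(match.group(2)), int(match.group(3))
        segments.append((label, start, end))
    return segments


def unannotated_gaps(segments, length):
    """Return 1-based inclusive ranges not covered by any segment."""
    gaps = []
    cursor = 1  # next residue that still needs to be covered
    for _, start, end in sorted(segments, key=lambda seg: seg[1]):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= length:
        gaps.append((cursor, length))
    return gaps


segments = parse_architecture(ARCHITECTURE)
for gap_start, gap_end in unannotated_gaps(segments, PROTEIN_LENGTH):
    print(f"unannotated: {gap_start}-{gap_end}")
```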

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 366 366 0.7504
Central domain 367 565 200 0.5132
C-terminal 566 983 417 0.7314
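The segmentation coordinates can be sanity-checked with a few lines of code. The sketch below assumes a 1-based inclusive convention, which the page does not state, so a domain length is `end - start + 1`. It also checks that the three domains tile residues 1 to 983 without gaps or overlaps. Under that assumption the central and C-terminal lengths each come out one residue away from the tabulated values, which may simply reflect a different boundary convention in the source. The helper names are illustrative.

```python
# Domain boundaries as listed in the segmentation table.
DOMAINS = [
    ("N-terminal", 1, 366),
    ("Central domain", 367, 565),
    ("C-terminal", 566, 983),
]
PROTEIN_LENGTH = 983


def inclusive_length(start, end):
    """Number of residues in a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def tiles_protein(domains, length):
    """True if consecutive domains cover 1..length with no gap or overlap."""
    expected_start = 1
    for _, start, end in domains:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


lengths = {name: inclusive_length(start, end) for name, start, end in DOMAINS}
print(lengths)
print("tiles protein:", tiles_protein(DOMAINS, PROTEIN_LENGTH))
```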
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-366
Central 367-565
C-terminal 566-983

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage 9-29 [NCBI] 2601629 Uroviricota > Caudoviricetes > Demerecviridae > Epseptimavirus > Epseptimavirus ev119
Host Salmonella enterica subsp. enterica [NCBI] 59201 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella

Coding sequence (CDS)

GenBank protein accession
QEP52571.1 [NCBI]
GenBank nucleotide accession
MN218190.1 [NCBI]
CDS location
range 60747 -> 63698
strand +
CDS
ATGGCACTTAAAACTAAAATTATTGTACAGCAGATTCTGAACATAGATGACACTACAACTACTGCTAGTAAATACCCTAAGTATACGGTAGTTTTAGGTACTTCTATTAGTTCTATTACTGCTAGTGAACTAACAGGGGCTGTTGAGGCCTCTGCTGCTTCTGCTGCGGCAGCAAAAGATTCTGAAATTGCAGCAAAAGAATCTGAAACAAATGCTAAGGACTCGGAGAACCTAGCTGCAATTTACGCTAACTCTTCAGAAACTTCTGCAACTCAATCTGCTGCTTCTGCTACTGAAGCGGAGAGACAAGCTGGTTTATCTAAAGATAGTGCTGATGCCTCTGCTACGTCAGCTGAGGAATCCAAAGGATTCCGTGATTCTGCTGAACTTGCTGCACAAAATGCTGAACAGAGTCGTCTATTAGCTGAACAAGCTAAGAAGGACGCTGAGGCTGCTAAGACTGCTGCTGCTACTTCTGAGCAAAATGCTGCTACATCTGCTACTGAGTCTACCAATCAGGCTATTGCTGCTGCTGGTTCTGCTACAGAGGCGGGAGAATACGCTACTACTGCAAAAGACTCCGAGATAGCTGCTAAGACTTCAGAACTTAATGCTAAGAATTCTGAGAATGAATCTGCTATTTCTGCTGAAGCTTCTGAAGCTTCTGCTTCTCAGTCTGCTATTTCTGCTTCTCAATCTGCTGCATCCGCTACTAAAGCTGCAGAATCATCAGCTGCAGCAAAAATTAGTGAAACTACTGCTATAGAATCATCAGCTGCAGCAAAAACTAGTGAGATTAATGCAAAAACTAGTGAGATTAATGCAAAAACTAGTGAGACTAATGCAGCAGCATATGCAGCAGCAGCAAAAACTAGTGAGACTAATGCTGCTGATTCCGCTGCCTCTGCTTCTGACTCCAAAGGATTCAGGGATGAAGCAGAAGCATTCGCTGCACAAGCCTCCACATCAGCATTAGCAGCAAAAAACTCAGAAACTAATACAAAGACTAGTGAAATTAACTCAAAAGCTAGTGAAGACGCTGCTAAGCTAGCTCAGCAAAGTGCATCAGGTAGTGCGAATACAGCTACGCAAGCGATGACCACAACCAAAGGCTACAGAGACGAAGCAGAGGTATTTAAAAATACCGCCACTACTGCTGCAACGACAGCAACAGACAAGGCCCTGGAGGCCGCTGGTAGCGCTACAATAGCAGGAGAGAAAGCTACTAATGCCACCAGTGCAGCAGATAGAGCGGAGACAGCGGCTGCATCAGCAGAACAGGTTATGCAAGCGTCGTTAAAGAAGGATCAGAACCTTAATGATCTTGCAAACAAAGACCTTGCAAGGGAGGCGCTAAAAGTTGAGGCTGTTAATTCTGTAAAAGATCAATATGCTGGCGCTTATAATTCTTTTCGTAATCCTGCATGGACTTATGAATTACGGATCGCTAACAATGGGGAATGGCGCGTTGCGCGTAATGACAATAACAGCACATCTGCGCTTTCTATTGGTGCTGGCGGTACTGGTGCTGAAAACGTTGAGGGCGCAAAAATAAACTTTGGCATTGATCGTTTAAAGCAAACAGAAACAGAAACTATGATGTATGCACCAGGGAGTAACTCACCTTATCGAATCACAATTAGACCTGATGCGGCGTGGGGTGTTTGGACTGATGAAACTGGAAGATGGATTCCTCTTTCAATTGATGCTGGCGGCACTGGATCTAATACTGAAGTTGGAGCAAGAAAGAACTTAAATACTCCTGTTGGTGGTCAAGCAATTATTATTCCTAATAATTCAAACATTCTTGGGTTTATGTCAACATACGCAGAAAGCGGTTATTACTCAAGCGGAGAACTTGTTACATATCAACCACCTGAAGCCTCTGGTTGGTGGATGTATGAATTGCATGTTCACGGTAAAAATGCAAACGGTCATGTTGAGTATGGAAATATTGTTGCAACCGCAATGAACGGTAATAAATGGGGTATCATTTGCAG
TGCTGGCTCTTGGGGTGGATGGTACAGGATAGCAAGATCTGATAGGCAATTGATGTTATTAAGTCCGGATGCTCAATCGGCTCTTGGTGATTACTCTATAGCAATTGGGGATCATGATTCAGGTCTTAAATGGGATCGTGATGGTCACATAAGCGCGTTTGCTGATTCAGCGAGAATATTTGCATGGACTCCGTCAGGAATAAACACTTACAGGGTCATATCATCTTATGTTGATGATAACGCAAGGGGTATGTATGTAAATGGAGTTAGGCGCGGTTATCCAAACGCTCTTATTGCTGGTCAAGTTGAGGGTGGTTCATTTGCCGACTGGCGTAGTCGTGCGTCTGGATTGCTTGTTGAGCACACTGGATTTGATTCTGCGGTTGCCATATTTAAATCTGTTTATTGGGGGAAAGATTGGATAGCTGGCATGGATGTTGTTCCGTGGACTTCTGGCGGCGCTGAAACACATCTTTATGTTAAGGGCGCTGAATTTATTTTTGATAGTGCTGGAAATGGTAGCGCATCAAATTGGGTTAGCAGGTCTGACATTAGATTAAAAGCACATCTAAAAGAGATCGAAACGGCATCCGACAAAATTGATTATCTAACTGGTTATACTTACTACAAGCGCAACAATCTAATTGAAGATGAAAACAGCGTTTATAGTATTGAGGCTGGATTGATCGCACAAGATGTTGAAAGGGTTTTGCCGGAAGCGGTTCATTCTTTGAATAACGATGGTCAGCTAGACCCAAAAGGCGAGGCAATCAAAGGCATTAACTATAATGGTGTTGTCGCACTTCTTGTTAACGCATTCAAAGAGCAAAAAGCAAAGATTGATAATCAACAGGAGGAAATTAACGTATTACGCAATGAGTTATATGAACTGAAAAACCTTGTAAAATCAATGCTTAACGGAAATGCTCCAACAATTACAGAACTACCGTAA
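The CDS coordinates, the nucleotide sequence and the protein length can be cross-checked against one another. The sketch below assumes the range 60747 -> 63698 is 1-based and inclusive, and that the amino acid assignments of the standard genetic code apply (the bacterial table 11 shares them). `cds_length` and `translate` are illustrative helpers. Only the first and last few codons of the CDS are translated here to keep the example short.

```python
# Standard genetic code, laid out in the usual NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_length(start, end):
    """Length in nucleotides of a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def translate(dna):
    """Translate complete codons of a plus-strand sequence; '*' marks a stop."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


n_nt = cds_length(60747, 63698)
n_codons = n_nt // 3
print(n_nt, "nt,", n_codons, "codons,", n_codons - 1, "residues plus stop")
print(translate("ATGGCACTTAAAACTAAA"))  # first six codons of the CDS
print(translate("GAACTACCGTAA"))        # last four codons of the CDS
```

Under these assumptions the coordinate span corresponds to 984 codons, that is, 983 residues followed by a stop codon, which agrees with the reported protein length.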

Tertiary structure

PDB ID
377649dfc6de8e6f1b3af518790b47dac648f5c7ba2f87b9fb6ea208aedb900a
Source ESMFold
Method ESMFold
Resolution 0.5878
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
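The confidence bands above map directly onto a small classifier. The legend uses strict inequalities and so leaves the exact boundary values (90, 70, 50) unassigned. In this sketch each boundary value falls into the lower band, which is an assumption rather than something the page specifies. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: a score of exactly 50 is grouped with "Very low".
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```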