Genbank accession
QIO01690.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,90
Protein sequence
MALKTKIIVQQILNIDDTTTTASKYPKYTVVLGTSISSITASELTAAVEASAASAAAAKDSEIAAKESEINAKDSENLSANYANSSEASATQSATSATEAERQAGLSKDSADASAISASQSAASATKAAESSAAAKTSETNSLESSAAAKTSETNAKTSETNAKTSETNAAAYAAAAKTSETNAAYSAASASDSKGFRDEAEAFAAQASTSALAAKNSETNTKTSEINSKASEDAAKLAQQGASDSANTATQAMTTIQGLKSDVEQLKADTQTIKEGAAIEIGAAKTEAVGAIDTAKVNAIAKITPLKQAAEDAATLARQKAVTATEQATAAAGSATTAGEQAVAASSSATRAETAANKAEQTLSISLLKDQNLADLSDKVQARINLSVDRLKQSADSSRIYDPTNRYNLVVMDTGSWGVYDDTNNVFKPLGISAGGTGAWDAEGARNNIGALSKGGDTATAMIATRHSYPAGSSGQILGWSWRSIVEGYGIGTATADFYVNHTVGSVTYACIKPTRVAGESWEYLFSDFGEMSNIRSLIISHNTGAPGEASGNLSLSYGAGRITQYAARVDFYDGTARSVLDTYDDTRLTTLLPSAGVICRRGIGGNYQANSYSFSWENPGVDVWIDSTRIGRVTLDPTSDIDYKEQVEPWDGKSALNNINQLELVTFIFKDDIKRRVRRGIIAQQAATIDPEYTHSSEDKEGNTILSLDTNVLLLDALAAIQVLSARVSKLESLLEDKPTTLPEDPAPNQDLP
Physico‐chemical
properties
protein length:755 AA
molecular weight: 78689,36060 Da
isoelectric point:4,77603
aromaticity:0,05166
hydropathy:-0,33483

Domains

Domains [InterPro]
DC_0608
ATT
2–207
Coil
Unmapped
250–270
IPR030392
CHP
641–737
QIO01690.1
1 755
Architecture
ATT
STR
ATT 2-207 | STR 217-737 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QIO01690.1
1 755
Domain Start End Length (AA) Confidence
N-terminal 1 416 416 0,9071
Central domain 417 615 200 0,2463
C-terminal 616 755 139 0,9820
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-416
Central
417-615
C-terminal
616-755

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage atrejo
[NCBI]
2713277 Uroviricota > Caudoviricetes > Demerecviridae > Epseptimavirus > Epseptimavirus atrejo
Host Salmonella enteritidis
[NCBI]
149539 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QIO01690.1 [NCBI]
Genbank nucleotide accession
MT074466.1 [NCBI]
CDS location
range 34469 -> 36736
strand +
CDS
ATGGCACTTAAAACTAAAATTATTGTACAGCAGATTCTGAACATAGATGACACTACAACTACTGCTAGTAAATATCCTAAATACACAGTAGTTTTAGGTACTTCTATTAGTTCTATTACTGCTAGCGAACTAACAGCGGCTGTTGAGGCCTCTGCTGCTTCTGCTGCGGCAGCAAAAGATTCTGAAATTGCAGCAAAAGAATCTGAAATAAATGCTAAGGACTCTGAGAACCTATCTGCAAATTATGCTAACTCTTCAGAAGCTTCTGCAACTCAATCTGCTACTTCTGCTACTGAAGCGGAGAGACAAGCTGGTTTATCTAAAGATAGTGCCGATGCCTCTGCTATTTCTGCTTCTCAATCTGCTGCGTCCGCTACTAAAGCTGCAGAATCATCAGCTGCAGCAAAAACTAGTGAAACTAACTCTCTAGAATCATCAGCTGCAGCAAAAACTAGTGAGACTAATGCAAAAACTAGTGAGACTAACGCAAAAACTAGCGAGACTAATGCAGCAGCATATGCAGCAGCAGCAAAAACTAGCGAGACTAATGCTGCTTATTCCGCTGCCTCTGCTTCTGACTCCAAAGGATTCAGGGATGAAGCAGAAGCATTCGCTGCACAAGCCTCCACATCAGCATTAGCAGCAAAAAACTCAGAAACTAATACAAAGACTAGCGAAATTAACTCAAAAGCTAGTGAAGACGCTGCTAAGCTAGCTCAGCAAGGTGCATCAGATAGCGCGAACACAGCTACGCAAGCGATGACCACAATACAGGGTCTTAAGTCCGATGTTGAACAGCTTAAAGCTGACACTCAGACCATTAAAGAAGGTGCGGCAATAGAGATTGGAGCAGCTAAGACGGAGGCAGTAGGAGCAATTGACACAGCTAAGGTAAACGCAATTGCTAAAATCACCCCGTTAAAACAAGCTGCGGAAGACGCTGCCACCTTAGCTAGACAAAAGGCAGTAACAGCTACAGAACAAGCTACAGCCGCGGCTGGAAGCGCTACAACCGCAGGAGAACAAGCCGTAGCAGCATCCAGTTCCGCAACTCGAGCTGAGACCGCAGCAAACAAAGCTGAACAAACTTTGAGCATATCTTTGTTAAAGGATCAGAACCTTGCAGACTTAAGTGACAAGGTACAGGCTCGTATAAACCTAAGTGTAGATCGTCTTAAACAGAGCGCTGATAGTTCCCGCATTTATGACCCGACAAACCGCTACAACTTAGTTGTAATGGATACAGGGTCTTGGGGTGTGTATGACGACACGAACAACGTGTTCAAACCTTTAGGTATATCCGCTGGAGGTACAGGAGCTTGGGACGCAGAGGGTGCTCGCAACAACATTGGGGCGCTGTCCAAAGGCGGTGACACTGCAACCGCCATGATTGCTACTCGACATTCTTATCCTGCTGGCTCATCAGGTCAAATTTTGGGTTGGTCGTGGCGATCAATTGTAGAAGGCTATGGTATCGGAACGGCGACTGCTGATTTTTACGTAAACCACACAGTAGGAAGCGTCACATATGCCTGCATTAAGCCTACTCGAGTGGCCGGTGAAAGCTGGGAGTATCTTTTTAGCGACTTTGGCGAAATGTCTAATATTAGAAGCCTGATTATTAGTCATAACACAGGAGCGCCAGGGGAAGCATCTGGAAATCTAAGTTTGAGTTACGGGGCCGGGCGCATTACACAGTATGCCGCTCGTGTAGATTTTTACGATGGAACTGCACGATCTGTTTTAGATACCTACGATGATACTCGCTTAACAACTCTCTTACCCTCGGCAGGTGTCATTTGCCGTAGAGGTATTGGTGGTAATTACCAAGCAAATAGCTATTCTTTCTCGTGGGAGAATCCGGGAGTCGATGTATGGATTGATAGTACTCGTATTGGGCGAGTAACGCTGGATCCTACAAGCGACATTGATTATAAAGAACAGGTTGAGCCATGGGACGGTAAGAGTGCGTTAAACAACATCAATCAGTTAGAGCTCGTGACGTTTATCTTTAAAGATGATATAAAGCGTCGGGTACGTCGTGGAATCATTGCACAGCAAGCGGCAACCATAGACCCTGAATACACACACTCAAGCGAGGATAAGGAGGGAAATACAATTCTTTCGCTTGATACTAACGTGCTGTTACTGGATGCGCTTGCTGCTATCCAGGTGCTAAGCGCCCGTGTCAGTAAGTTAGAGTCTTTGTTAGAAGATAAGCCAACCACTCTACCTGAAGACCCCGCTCCAAATCAAGACCTTCCTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
f28f028c7675464759025411d9c9c403374982a7828bc24d5d06e796a7783147
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6340
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50