Genbank accession: XUL01394.1 [GenBank]
Protein name: tail fiber protein
RBP type: TF (Evidence: GenBank; Probability: 1.00)
RBP type: TF (Evidence: RBPdetect; Probability: 0.90)
Protein sequence
MSDILAGALDTAQGYYDDAKNRLFGDGTTTLRFRNCGGVVLDAVTSEDHESELTIADNTLETGAKASDHAALEPKVISVTGTVVGYDNTSVQEDFISDITGLRSTDFLDDLGLPSAMSTVIDKTRDMVVNKLATFIDWGALDAAISSSLVPWLPDFSIAEELKTSDNMRIEQMYRNFLDFQKNVIFCTVDTGIFQYENMLLQSVRVRQQKDGSAEFTLRFREVIEVPITVTNAVASKSTGRPGNNGTKKSGRAASQGDATKNKNINNATSVKDKKVNDNRGVGVTLGDMLGVNLRGFFG
Physico-chemical properties
protein length: 299 AA
molecular weight: 32461.94710 Da
isoelectric point: 4.81099
aromaticity: 0.07023
hydropathy: -0.26622
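
The record does not say how aromaticity and hydropathy were computed. The sketch below assumes the common definitions: aromaticity as the fraction of F, W, and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The function names and the choice of scale are illustrative assumptions, not the database's documented method.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the record does not name one).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W, or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of XUL01394.1, used only as a short demonstration input.
demo = "MSDILAGALDTAQGYYDDAK"
print(len(demo), round(aromaticity(demo), 5), round(gravy(demo), 5))
```

Under these assumptions, the listed aromaticity would correspond to 21 aromatic residues out of 299, since 21/299 rounds to the reported value. Running the functions on the full protein sequence from this record is the way to confirm that.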

Domains

Domains [InterPro]
DC_0601 | STR | 1–294
IPR048494 | ATT | 41–231

Architecture (XUL01394.1, residues 1–299)
STR 1-40 | ATT 41-231 | STR 232-294
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
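
As a rough illustration of how the architecture string can be read, the sketch below parses the three-letter segment labels and residue ranges, then reports each segment's length and any residues not covered by a segment. The helper names are hypothetical, and the protein length of 299 is taken from this record.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'STR 1-40 | ATT 41-231 | ...' into (label, start, end) tuples."""
    pattern = r"([A-Z]{3})\s+(\d+)\s*[-–]\s*(\d+)"
    return [(label, int(start), int(end))
            for label, start, end in re.findall(pattern, text)]


def unmapped_residues(segments, protein_length: int) -> list[int]:
    """Return 1-based residue positions not covered by any segment."""
    covered = set()
    for _, start, end in segments:
        covered.update(range(start, end + 1))
    return [pos for pos in range(1, protein_length + 1) if pos not in covered]


architecture = "STR 1-40 | ATT 41-231 | STR 232-294 |"
segments = parse_architecture(architecture)
lengths = [(label, end - start + 1) for label, start, end in segments]
tail = unmapped_residues(segments, 299)

print(lengths)  # segment lengths in residues
print(tail)     # positions outside every segment
```

For this record the three segments span 40, 191, and 63 residues, which leaves residues 295 to 299 outside any annotated segment.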

Taxonomy

Phage
Name: Salmonella phage vB_SE1_KM [NCBI]
Taxonomy ID: 3425682
Lineage: Viruses >
Host
Name: Salmonella enterica subsp. enterica [NCBI]
Taxonomy ID: 59201
Lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella

Coding sequence (CDS)

Genbank protein accession: XUL01394.1 [NCBI]
Genbank nucleotide accession: PV660647.1 [NCBI]
CDS location: range 18372 -> 19271, strand +
CDS
ATGAGTGATATTCTTGCTGGTGCACTCGATACTGCTCAAGGTTATTACGATGATGCAAAGAACCGTCTCTTCGGGGACGGTACAACCACTCTGCGCTTCCGTAACTGTGGAGGTGTTGTGCTGGATGCTGTTACCAGTGAAGACCACGAGTCTGAGCTGACTATCGCAGACAACACGCTAGAGACTGGTGCAAAGGCTTCTGACCATGCAGCACTCGAGCCAAAAGTAATTAGCGTAACCGGCACTGTAGTCGGTTATGATAATACGTCCGTACAAGAGGACTTTATCTCGGATATTACAGGTCTGAGAAGTACAGACTTCCTTGACGACCTTGGACTGCCATCTGCGATGTCTACTGTAATAGATAAGACTCGAGATATGGTAGTTAATAAGTTAGCTACCTTTATAGACTGGGGCGCACTAGACGCTGCTATATCTTCGTCCTTAGTTCCCTGGTTACCGGACTTTAGTATCGCCGAGGAATTGAAGACCTCGGATAACATGCGTATTGAACAGATGTACCGTAACTTCCTCGACTTCCAAAAGAATGTTATATTCTGTACGGTCGATACAGGCATCTTCCAGTATGAAAACATGTTGCTTCAGTCTGTACGTGTGAGACAACAGAAAGACGGTTCTGCTGAGTTTACTTTGCGTTTCCGCGAAGTTATTGAAGTACCAATCACAGTTACCAATGCTGTTGCTTCCAAGTCCACCGGTCGCCCGGGTAATAATGGTACAAAGAAAAGCGGTCGTGCTGCTAGTCAGGGCGATGCAACAAAGAACAAGAACATCAACAACGCAACATCCGTTAAAGACAAGAAAGTCAATGACAACCGCGGTGTAGGTGTTACACTTGGTGATATGCTGGGTGTTAACTTGAGAGGTTTCTTCGGATGA
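
The CDS coordinates and the protein length can be cross-checked with a little arithmetic: an inclusive range of 18372 to 19271 covers 900 nucleotides, or 300 codons, which is 299 amino acids plus one stop codon. The sketch below shows that check and a minimal translation routine. It assumes the standard genetic code for elongation codons, and the function names are illustrative.

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumed to apply to this CDS).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_nt = cds_length(18372, 19271)
n_aa = n_nt // 3 - 1  # subtract the terminal stop codon

print(n_nt, n_aa)
print(translate("ATGAGTGATATTCTTGCTGGTGCA"))  # first 8 codons of the CDS above
```

The first eight codons translate to MSDILAGA, matching the start of the protein sequence listed in this record.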

Genome Context

Tertiary structure

PDB ID: e840642f4a3c5ed3f18704b75c2e0edbbe3feab9ffd71e1066e16aeddbf0380f
Source: ESMFold
Method: ESMFold
Resolution: 0.6637
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
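
The confidence bands above can be expressed as a small lookup. This is a minimal sketch with a hypothetical function name. The legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so assigning them to the lower band is an assumption made here.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0 to 100) to the legend's confidence band.

    Boundary values (exactly 90, 70, 50) fall into the lower band; the legend
    leaves this unspecified, so that choice is an assumption.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```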