GenBank accession
AMR59805.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1.00
Protein sequence
MPYTPDLVATVIRSLNRKNAVAKAVDPANVRLVSVAHRDEKTLDVTLTGGPGYSSYGQITAQVTKQNIADYFAKVKLDAIPTCASIHELLPHLDTLLGITFLTDDFEDLPITWTNNAARITFQAKQTSLIWYGSQTLDVINGDVDINTVYTVTNIILTLKAANKQQWINFYSNAAKRIVDLDKVDVSAPMSPEDIGQTSAYNSVVALTARPGSGMFGTKYVFFSRNKLETYAGKTTYVPDDFTGSLHDALAIETLGIRSVFDVNEIAPVESLAGEWPRQVQLAPVEGSYKVMGSATLNVARYQDNEQTLNLNYTTTSQLSDSYWATTLTNHAACGRLFVNVTVDAGAFVYRDSNLKPTFVIDELPQIGEVVWKFVNRGTILGRGGSGVVYSMPANTSADGADAIHISDTFRGKVTIENYGTIAGGGGGGDYYAFSTTKIGGAGGAPLGPGAAVPSGTNNRNGLAGSRDKGGATQLVVAQFTGYQSRYYGGPGGDWGQPGRYTAFEQWNTPATGSPVYVYGREDGMSWTRPGMAGEAVGGNKSIVNWVSVGTVKPNVTDEYQEFLDYLAVLDANRLASADGIDKVTDPTKPVVVEFLGTIISGTLFKKQNIPGGLVGTPYGDAVANLPSTTLFGIMAHSVSEFDIVHDIWRYNRAVMFTLKPAPTGTINLAVERNRVKSATSSAPVEKFYVDRFVYYDAYLGKMMECNPYTKSAPKPWVRPKISLS
Physico-chemical properties
protein length: 725 AA
molecular weight: 78271.19100 Da
isoelectric point: 5.86598
aromaticity: 0.10069
hydropathy: -0.15269
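The aromaticity and hydropathy values can be sanity-checked from the protein sequence. The record does not say which tool or scale produced them, so the sketch below assumes aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). Under those assumptions the reported figures are at least arithmetically plausible: 73/725 ≈ 0.10069 and -110.7/725 ≈ -0.15269. The function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale. ASSUMPTION: the record does not name its scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Usage on the first 25 residues of the record's sequence (a prefix, not the
# full 725-residue protein, so the numbers will differ from the record).
prefix = "MPYTPDLVATVIRSLNRKNAVAKAV"
print(round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

To reproduce the record's values, pass the full protein sequence listed above instead of the prefix.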

Domains

Domains [InterPro]
IPR057701 STR 11–140 (AMR59805.1, 1–725)
Architecture
STR 11-154 | STR 156-309 | RBD 311-725
Legend (domain classes): ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
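The architecture string is compact enough to parse directly. The sketch below assumes the `CLASS start-end | CLASS start-end` layout seen in this record and 1-based inclusive residue numbering. `Segment`, `parse_architecture` and `uncovered` are hypothetical helper names, not part of any database API.

```python
from typing import List, NamedTuple


class Segment(NamedTuple):
    kind: str   # domain class code, e.g. "STR" or "RBD"
    start: int  # first residue, 1-based inclusive (assumed)
    end: int    # last residue, 1-based inclusive (assumed)


def parse_architecture(text: str) -> List[Segment]:
    """Split 'STR 11-154 | RBD 311-725' into typed segments."""
    segments = []
    for part in text.split("|"):
        kind, span = part.split()
        start, end = span.split("-")
        segments.append(Segment(kind, int(start), int(end)))
    return segments


def uncovered(segments: List[Segment], length: int) -> List[int]:
    """Residue positions not assigned to any segment."""
    covered = set()
    for seg in segments:
        covered.update(range(seg.start, seg.end + 1))
    return [pos for pos in range(1, length + 1) if pos not in covered]


segments = parse_architecture("STR 11-154 | STR 156-309 | RBD 311-725")
print(uncovered(segments, 725))
```

For this record the unassigned positions are residues 1 to 10 plus the single-residue gaps at 155 and 310.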

Taxonomy

Phage
Name: Enterobacteria phage SEGD1 [NCBI]
Taxonomy ID: 1805456
Lineage: Uroviricota > Caudoviricetes > Chimalliviridae > Seoulvirus SPN3US >
Host
Name: Salmonella enterica [NCBI]
Taxonomy ID: 28901
Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
AMR59805.1 [NCBI]
GenBank nucleotide accession
KU726251.1 [NCBI]
CDS location
range 131927 -> 134104
strand +
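The CDS coordinates can be cross-checked against the protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS includes its terminal stop codon. The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def encoded_residues(start: int, end: int) -> int:
    """Residue count, assuming the range ends with one stop codon."""
    length = cds_length(start, end)
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length // 3 - 1  # drop the stop codon


print(cds_length(131927, 134104))        # 2178 nt
print(encoded_residues(131927, 134104))  # 725 residues
```

The result, 2178 nt or 726 codons including the stop, agrees with the 725 AA protein length reported for this entry.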
CDS
ATGCCATACACTCCCGATTTAGTGGCGACCGTGATTCGGTCGCTCAACCGTAAAAACGCGGTCGCCAAAGCAGTAGACCCCGCTAACGTACGCCTGGTGTCCGTGGCGCATCGTGACGAAAAAACCCTAGACGTCACACTGACAGGGGGTCCGGGGTATTCTAGTTACGGTCAAATTACGGCCCAAGTCACGAAGCAAAATATTGCTGATTATTTCGCCAAGGTTAAGTTGGATGCCATTCCCACTTGTGCGTCCATCCACGAACTGTTGCCGCATTTAGACACTCTACTAGGGATTACCTTTTTAACCGACGACTTTGAAGACCTGCCCATCACTTGGACAAACAATGCTGCGCGTATCACGTTCCAGGCGAAACAGACAAGTCTTATTTGGTACGGCAGCCAAACGCTAGATGTGATAAACGGTGACGTGGATATCAACACCGTCTACACTGTTACCAATATCATTCTCACGCTAAAGGCTGCGAATAAACAGCAGTGGATTAACTTCTACTCTAATGCAGCCAAACGGATCGTGGATCTTGATAAAGTCGATGTCAGCGCCCCGATGTCCCCGGAGGATATAGGCCAAACCTCTGCGTATAACAGCGTCGTAGCTTTGACCGCTAGACCGGGGAGCGGGATGTTTGGAACGAAGTATGTGTTCTTCTCACGTAACAAACTCGAGACATATGCAGGCAAAACCACATACGTCCCGGATGATTTCACCGGTAGCCTGCACGATGCTCTGGCCATCGAGACACTCGGCATTCGGAGCGTTTTTGATGTAAACGAAATTGCTCCTGTAGAGAGCCTGGCAGGCGAGTGGCCGAGACAAGTGCAATTGGCACCCGTAGAGGGTTCGTACAAAGTCATGGGCAGTGCCACGTTGAATGTAGCACGCTATCAAGATAACGAGCAAACCCTTAACCTGAACTACACAACTACGTCGCAACTTTCGGACAGCTACTGGGCCACAACGTTGACCAACCACGCGGCGTGCGGGCGTTTGTTTGTTAACGTGACCGTAGATGCGGGGGCTTTCGTTTATCGCGACAGCAACCTGAAACCGACGTTCGTTATCGACGAATTGCCGCAAATTGGCGAGGTGGTTTGGAAGTTCGTTAACCGGGGAACCATTCTTGGTCGTGGTGGTTCGGGTGTGGTCTACTCTATGCCTGCGAACACATCAGCGGACGGAGCTGACGCAATCCACATTAGCGATACCTTCCGTGGCAAAGTCACCATCGAAAACTACGGCACCATCGCCGGAGGTGGGGGCGGGGGCGACTACTATGCGTTTAGCACCACCAAGATCGGTGGAGCTGGTGGTGCGCCACTCGGGCCAGGCGCGGCGGTACCCAGCGGCACCAATAACCGCAACGGCTTAGCTGGGTCCCGTGACAAAGGCGGTGCTACCCAGTTAGTCGTCGCGCAATTTACCGGCTACCAGAGTCGCTACTATGGCGGTCCGGGTGGGGATTGGGGCCAGCCGGGACGATATACAGCGTTCGAGCAATGGAACACCCCGGCTACCGGCTCGCCAGTCTATGTCTACGGTCGTGAAGACGGGATGTCATGGACTCGCCCTGGGATGGCAGGCGAAGCTGTCGGTGGGAATAAAAGTATCGTCAACTGGGTATCCGTCGGAACGGTGAAACCGAATGTTACCGACGAGTACCAAGAGTTCCTCGACTACCTCGCAGTTTTAGATGCCAATCGACTGGCCAGTGCGGATGGGATAGATAAAGTCACCGACCCAACCAAGCCTGTAGTTGTGGAGTTCTTAGGTACGATTATTTCAGGTACGCTGTTCAAAAAGCAAAACATACCCGGCGGTTTGGTGGGAACACCATACGGTGATGCTGTAGCCAATTTACCGAGCACCACATTGTTTGGTATCATGGCACACAGTGTATCGGAGTTTGATATTGTGCACGATATTTGGCGTTACAACCGTGCGGTGATGTTCACGCTAAAACCCGCGCCTACCGGGACGAT
TAACCTCGCTGTGGAACGCAACCGGGTGAAATCGGCTACCAGTAGTGCACCGGTGGAGAAGTTCTATGTCGATCGCTTCGTGTATTACGATGCCTACCTCGGTAAGATGATGGAATGTAATCCGTATACCAAATCCGCCCCTAAACCGTGGGTGCGGCCAAAGATATCGTTATCATAA
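The nucleotide sequence can be translated to confirm that it encodes the protein listed for this entry. This is a minimal sketch using the standard genetic code, whose sense codons are the same in the bacterial and phage table (NCBI table 11). It strips whitespace first, because the sequence above is wrapped across lines. `translate` is a hypothetical helper, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(cds.split()).upper()  # remove line wraps and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS above.
print(translate("ATGCCATACACTCCCGATTTAGTG"))  # MPYTPDLV
```

The first eight codons give MPYTPDLV and the last four (TCG TTA TCA TAA) give SLS followed by a stop, matching the start and end of the protein sequence in this record.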

Genome Context

Tertiary structure

PDB ID
9bd077b1bdb8dbff0e6db9a939944bb23b5a0ecd9442fcaf8fd0b5884f2e6eb9
Source ESMFold
Method ESMFold
Resolution 0.6628
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
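The confidence bands amount to a simple threshold lookup on a per-residue pLDDT score. The legend uses strict inequalities and does not say where the exact values 90, 70 and 50 belong. The sketch below assigns each boundary value to the lower band, which is an assumption. `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"  # ASSUMPTION: exactly 50 falls in the lowest band


for value in (95.0, 82.5, 61.0, 42.0):
    print(value, plddt_band(value))
```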

Literature

Title: Complete genome sequence of a polyvalent bacteriophage, SEGD1, simultaneously inhibiting both Salmonella enterica and Escherichia coli O157:H7
Authors: Fan, J. and Ma, J.
Date: 2018-03-02
PMID: not listed
Source: GenBank