Genbank accession
WMM35397.1 [GenBank]
Protein name
putative central straight fiber
RBP type
TSP
Evidence RBPdetect
Probability 0,84
Protein sequence
MTGYTMAYIQHSIYTDYDVIGRSFWLKLGEEVDRRDFTGIDTFFVMINNLTPSTIYQVQGAFYDSIIDSELLNAKIGINLSNETNFKTKEKPIIVAARSESEPVDVGVGAPIVVVETTGEASYCTIELKSTATEDSPWTKYYIGALGSTIKFGGVPIGDYKIRISGQVTMPDGVTVDSSGYYEFPNILTVAYNFVPPTAPIDIVFKAARIADGKERYDVRIEWDWERGAGANVREFLVTYINSEEYAKTGWAKAQKINVGAARAATIISFPWRVEHTFKVSSIAWGPNKQDITESAPVTFILNEDTPLDNSFVNETGIDVNYAFIKGSMKDGEIWRQTFLIDAATGAINIGLLDEEGKAPISFDPINRVVNVDGKVITRDINAANFIMTNLSGKDNPAIYTQGKSWGDNNSGIWMGMDNTSAKAKLDIGNATQWIRYDGTTLRISSGVVIGTPNGDVDIGTGLQGKQTVFVYKLATSLPAKPLEQDYPPPGWSKTPPNRTDMTQNIYATTGTLDPVTNKLLEGTSWSDVVQWSGTEGTIGHDGQRGPGMYSMGIPGLGGWDDGQANAFFQNNFGKPPVKYDVLTQFNSNAPQTAFTRQWNGVGWINPAMVLHGNMIVNGTVTADKIVAGNAFLSQIGVNIIYDRNAALSGNPEAYYKMKIDLNSGYIHIR
Physico‐chemical
properties
protein length:670 AA
molecular weight: 73380,64770 Da
isoelectric point:4,92836
aromaticity:0,10597
hydropathy:-0,23194

Domains

Domains [InterPro]
IPR057550
STR
2–82
WMM35397.1
1 670
Architecture
STR
STR
STR
STR
STR 1-82 | STR 88-182 | STR 196-320 | STR 335-649 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WMM35397.1
1 670
Domain Start End Length (AA) Confidence
N-terminal 1 393 393 0,8848
Central domain 394 623 231 0,1688
C-terminal 624 670 46 0,7247
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-393
Central
394-623
C-terminal
624-670

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage EH7
[NCBI]
2986511 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Salmonella typhimurium
[NCBI]
90371 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WMM35397.1 [NCBI]
Genbank nucleotide accession
OR413347 [NCBI]
CDS location
range 85516 -> 87528
strand -
CDS
ATGACTGGGTATACGATGGCGTATATCCAGCATTCTATCTATACCGACTACGATGTTATCGGTAGATCATTCTGGCTTAAACTTGGCGAAGAAGTGGATAGAAGAGATTTCACTGGAATCGACACTTTCTTTGTTATGATCAATAATTTAACTCCCTCAACTATTTATCAGGTTCAGGGAGCTTTTTATGATTCAATTATTGACTCAGAACTTTTAAATGCAAAAATTGGTATCAACCTCTCTAATGAAACTAACTTTAAAACAAAAGAGAAGCCAATAATTGTTGCAGCAAGATCTGAGTCAGAACCTGTGGATGTTGGGGTGGGCGCACCAATAGTTGTTGTGGAAACAACTGGTGAAGCAAGCTACTGTACTATTGAGTTAAAAAGTACAGCCACTGAAGACAGTCCATGGACTAAATATTATATCGGGGCTTTAGGTTCTACTATTAAATTTGGCGGAGTTCCTATCGGAGATTATAAGATCAGAATATCTGGTCAAGTAACTATGCCTGATGGTGTTACAGTTGATTCTTCTGGTTACTATGAGTTCCCTAATATTCTAACTGTAGCTTATAATTTTGTTCCTCCTACTGCACCTATCGATATTGTTTTTAAAGCTGCACGAATTGCTGATGGTAAAGAACGATATGATGTTAGAATTGAGTGGGATTGGGAACGCGGTGCTGGTGCTAACGTTCGTGAGTTCTTGGTTACTTATATAAATTCCGAGGAATACGCTAAGACTGGCTGGGCTAAAGCTCAAAAGATAAACGTGGGTGCTGCTAGAGCTGCAACAATTATATCATTCCCATGGAGAGTTGAACATACGTTTAAGGTATCATCAATTGCCTGGGGACCAAATAAACAAGATATAACCGAGTCAGCTCCTGTAACATTTATTCTGAATGAAGATACTCCTCTAGATAATAGTTTTGTCAATGAGACGGGTATTGATGTTAACTATGCCTTTATTAAAGGCAGCATGAAAGATGGGGAAATCTGGAGACAAACATTCCTAATTGATGCAGCTACTGGTGCTATTAACATTGGTCTGCTCGATGAAGAAGGTAAAGCACCTATTTCTTTCGACCCTATAAACCGTGTTGTTAACGTTGATGGTAAAGTAATTACTAGAGATATTAATGCTGCGAACTTTATCATGACTAACTTGTCTGGTAAAGATAACCCAGCAATTTATACTCAGGGTAAATCCTGGGGGGATAATAACTCTGGTATTTGGATGGGTATGGATAATACCTCTGCCAAAGCTAAATTAGACATTGGTAATGCTACACAATGGATACGTTATGATGGCACTACTCTGCGCATCTCTAGTGGTGTAGTAATTGGAACGCCAAATGGTGACGTAGATATTGGAACTGGTTTACAAGGTAAGCAAACAGTATTTGTTTATAAGTTAGCAACATCTTTACCGGCCAAACCGCTAGAGCAAGATTATCCGCCTCCTGGTTGGTCAAAAACTCCACCTAACCGTACAGATATGACACAAAATATCTATGCGACTACAGGTACACTTGATCCAGTTACTAACAAACTTCTTGAAGGTACTAGCTGGTCTGATGTAGTTCAGTGGAGCGGTACTGAAGGTACTATAGGACATGATGGACAGCGTGGACCGGGGATGTACTCCATGGGTATTCCTGGGTTAGGTGGTTGGGATGATGGACAAGCTAACGCATTCTTCCAAAATAACTTTGGAAAACCTCCGGTTAAGTATGATGTTCTAACACAATTTAACAGTAATGCTCCGCAAACAGCATTTACCCGTCAATGGAATGGAGTTGGATGGATTAACCCTGCAATGGTTCTTCATGGCAATATGATTGTTAATGGAACGGTTACTGCGGATAAGATTGTGGCAGGAAATGCCTTCTTATCACAAATCGGTGTTAATATAATCTACGATAGAAATGCTGCGTTATCAGGGAACCCTGAAGCATACTACAAGATGAAGATAGACCTAAATAGTGGGTATATCCATATAAGGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
954ec24b097217fa7ca636c20515e1e83bf24a8e55d96f73a948f70cca2b504e
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7860
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Isolation and characterization of Salmonella and Escherichia coli-specific bacteriophages collected from Minnesota Wastewater Treatment Plant Cortes Ortega,E., Hansen,E.G., Farmer,M.L., Martinez-Villalobos,J.M. and Bowden,S.D. 2024-11-27 GenBank