GenBank accession
WDR21247.1 [GenBank]
Protein name
tail fibers protein
RBP type Evidence Probability
TSP DepoScope 1.00
TF GenBank 1.00
TF Phold 1.00
TSP RBPdetect 0.91
TSP RBPdetect2 0.95
TSP UniProt/TrEMBL 1.00
Protein sequence
MNPQFSQPKGAVSKETNKDSIARKFGCKKSEVLYAKTGAGLTGYKVIYDKVSQRAYSLPSNIGAVTVTSLVDGILTHSGGTVDLNELAMTRREFITLQGDFVSGVTVNAKHEVIAYGNSRFRWDGDLPKVVSPNSEPVIPDWVNVDANPALSELAKSVGSSLVGYKPTGTTTTTGTVKGKLDSFIDFANDYCGGPITGQPDLTSQLQQAIIDAFNSGIADIKIVGDFYISGMILWYPGVRLIGGRYDSTLIRVSNTFPVGGTMFRSYRPPLWAAGCHGLSLENVYLVGRAAKDVYGIDINDASYFNLEGVRLDLFDKAISFNRWIDETRVDTSGTITYPNAHSAEMGGQSYFGTITRCYAGNCVTCVDFNGVVNRCTFISNTWTTSDLAYNFSNPRGVYETNTFITCNIEGVKSAFEWFFSINSPYHNVWINTSIDNGNPNITSLAKDGGRQTFIGLAIFPYGNPSLVNWYGINPNGHRSTVLGTDLGENLPEDQLKTQVREELHTLTGIANKQWAGQQVTETIPAGGYKAINISIPGLKGNSAVIASLSAIYAGVSVSAASNNTGIVAVIINNHSTSPVEINAYLSVTGIAKSFL
Physico-chemical properties
protein length: 596 AA
molecular weight: 64284.51990 Da
isoelectric point: 6.10447
aromaticity: 0.10067
hydropathy: -0.08993
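
The aromaticity and hydropathy values listed above can be approximated directly from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the record does not state which definitions it used, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of WDR21247.1, used here only as a short demo input.
prefix = "MNPQFSQPKG"
print(len(prefix), aromaticity(prefix), round(gravy(prefix), 3))
```

Passing the full 596-residue sequence from this record instead of the prefix gives whole-protein values to compare against the table.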

Domains

Domains [InterPro]
IPR040775 RBD 91–137
Architecture
ATT 1-155 | STR 197-595

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (WDR21247.1, residues 1-596)
Domain Start End Length (AA) Confidence
N-terminal 1 203 203 0.9931
Central domain 204 487 285 0.9922
C-terminal 488 596 108 0.9685
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring: N-terminal 1-203, Central 204-487, C-terminal 488-596

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage vB_SenM_UTK0004 [NCBI] 3028902 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

GenBank protein accession
WDR21247.1 [NCBI]
GenBank nucleotide accession
OQ359883.1 [NCBI]
CDS location
range 122900 -> 124690
strand -
CDS
ATGAATCCGCAATTTTCACAGCCGAAAGGCGCTGTCTCCAAAGAGACAAACAAGGACTCTATCGCACGCAAATTCGGTTGCAAGAAGTCCGAGGTTCTGTATGCCAAGACCGGAGCAGGGCTGACAGGATACAAGGTAATCTATGATAAGGTGTCTCAACGTGCCTATTCTCTTCCTTCTAACATTGGTGCTGTAACGGTCACCAGTTTAGTTGACGGCATCCTTACGCACTCTGGAGGCACCGTAGATTTAAATGAATTGGCAATGACACGCCGCGAGTTTATTACTTTGCAAGGAGACTTTGTTTCCGGTGTTACCGTCAATGCTAAGCATGAAGTGATCGCATACGGAAATAGCCGATTCCGTTGGGATGGGGATCTGCCGAAAGTTGTTTCCCCTAATTCAGAACCTGTCATACCAGATTGGGTTAACGTTGATGCCAACCCAGCACTGTCTGAATTAGCGAAATCTGTTGGCTCGTCTCTGGTGGGATATAAACCAACGGGGACAACCACTACGACTGGTACGGTCAAAGGAAAATTGGATAGTTTTATTGATTTTGCAAATGACTATTGCGGTGGCCCAATTACAGGACAACCTGATCTCACTTCTCAATTACAACAAGCAATTATTGATGCTTTTAATTCTGGTATTGCCGACATTAAGATTGTTGGGGATTTTTATATCTCTGGCATGATATTATGGTATCCAGGTGTTCGATTGATTGGTGGACGTTATGACTCTACATTGATACGCGTTTCCAATACTTTCCCAGTGGGTGGGACAATGTTTCGTTCTTACCGTCCTCCACTGTGGGCGGCGGGTTGTCATGGTCTATCTCTTGAAAATGTGTATTTGGTAGGCCGCGCCGCCAAGGACGTTTATGGCATAGATATCAATGATGCATCTTATTTTAATCTTGAAGGTGTGCGTCTGGATCTGTTTGATAAAGCAATTTCTTTTAATAGATGGATTGATGAAACTCGTGTAGACACGAGCGGTACAATTACCTATCCCAACGCGCATAGTGCCGAAATGGGTGGCCAATCATATTTTGGTACAATAACTCGTTGCTACGCAGGCAATTGCGTCACGTGCGTTGATTTTAATGGTGTGGTAAACCGTTGTACATTTATTAGCAACACATGGACAACTAGTGATCTGGCATATAATTTCAGCAACCCGCGTGGAGTTTATGAAACTAACACATTCATCACTTGTAATATTGAAGGTGTGAAGAGTGCATTTGAATGGTTCTTTTCTATAAATTCACCATATCACAATGTGTGGATTAATACGAGCATCGATAACGGGAACCCAAATATCACGAGTTTAGCGAAAGATGGTGGTCGACAGACATTTATTGGTTTAGCGATTTTCCCTTATGGAAACCCATCGTTGGTTAATTGGTATGGAATTAACCCCAATGGTCATCGGAGTACTGTCTTGGGTACTGATCTTGGGGAAAATTTACCAGAAGATCAGTTAAAAACCCAAGTGCGCGAGGAATTACACACATTAACGGGTATTGCTAATAAACAATGGGCAGGACAGCAAGTTACTGAAACTATCCCAGCAGGTGGATATAAAGCTATCAATATCAGTATTCCTGGTTTAAAAGGAAATTCCGCTGTGATCGCTTCTTTATCAGCAATATACGCTGGCGTTTCTGTGAGCGCAGCCAGTAATAATACTGGGATTGTGGCTGTTATTATAAACAACCATTCAACATCTCCTGTTGAAATAAACGCATATCTCAGTGTCACAGGCATAGCCAAAAGTTTCTTGTGA
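
The CDS coordinates and sequence above can be cross-checked against the protein with a few lines of Python. This is a minimal sketch, assuming the range is 1-based and inclusive and that the standard codon assignments apply; `cds_length` and `translate` are illustrative helper names, not part of any tool named in this record. The CDS as listed already reads in the coding direction (it starts with ATG and ends with TGA), so no reverse complement is applied here.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_nt = cds_length(122900, 124690)
print(n_nt, n_nt // 3 - 1)  # nucleotides, residues excluding the stop codon
print(translate("ATGAATCCGCAATTTTCACAGCCGAAAGGC"))  # first 30 nt of the CDS
```

With the values in this record the range spans 1791 nt, that is 597 codons including the stop, which matches the 596-residue protein; the first ten codons translate to MNPQFSQPKG, the start of the listed protein sequence.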

Tertiary structure

PDB ID ec3646bf86aca1226a414143fa3d51e5ee923ab6a22e395b00c045e7df1677ae
Source ESMFold
Method ESMFold
Resolution 0.7373
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
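
The confidence bands above amount to a simple threshold lookup. A minimal sketch, assuming per-residue scores on the 0-100 scale; the legend does not say where the boundary values belong, so treating exactly 90 and 70 as "High" and exactly 50 as "Low" is an assumption, and `plddt_band` is a hypothetical name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if score > 90:
        return "Very high"
    if score >= 70:  # assumption: 90 and 70 themselves count as "High"
        return "High"
    if score >= 50:  # assumption: 50 itself counts as "Low"
        return "Low"
    return "Very low"


for value in (95.2, 81.0, 63.5, 41.7):
    print(value, plddt_band(value))
```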

Literature

Title Authors Date PMID Source
Characterization of a Diverse Collection of Salmonella Phages Isolated from Tennessee Wastewater Bryan,D.W., Hudson,L.K., Wang,J. and Denes,T.G. GenBank