Protein
View in Explore- Genbank accession
- WAX16154.1 [GenBank]
- Protein name
- hypothetical protein
- RBP type
-
TSP
- Protein sequence
-
MPCVYKPNYVPYDFMNLSPAYQWGIGRVAGYTHFDQAEFQTLEGESHVTWDINFKNKIAGDLTSNPHKLTVENKNHIYHNNILRGSRYLANPVLLFNGAKYVETAQDNDSAVALSEKTEQGLMFYWDDFTGEYGVMKKGDTLTISVEVRTNNKDVDLVEAVRLKATPNFEPYTKITTQEVNDGFTRIVFQTKVLNTEWLPDDDWETIWGVQNPEVPERVGRTWNKDRLIGIVQNIPDVQLEYARPMVSREGSTTYQEAPIDLTPTNVFNWKGYNKGYRLLNDGRYVKSSFQYEWANPNFFKIPDSWTWQKIRTFMVMKFTKIPFKTPAGEDPNNVYGKVSWYSDSTEGSAIQSNIFIQEDGLYGGKLSMIEVPEGAKYYRIHVTGYSPQAVDFRMGFNDTSYVNNSSATIELTQEMYDGASVMLDGKVARAYADNFNERVKIDYRIDLLAEMRKAFPKIFEGLSIEESVARVRTIAYMMDLTATARGGDDDNIIHWHMKNKDDVAGEIHGFSKDELTTYTHRSTWQDWIQNDGTLWGYFYTESPRPTNGTQAWIELDYFNIRMTIVAENSEIDAWGFRSDDSRYDKTSVDIPMMRTEENMQNSSVQVTHGYDIYGIVNAKYPEFFGDCYTYEDCINKLNERIESFTFEVSSVIPDPDVGGEGYVNFVVEGDDYIRDSTTRLDKNTSDFIVVHNGLSRFVQENGYLYISGERPPEDRNIELGITTNVTFQLKFDKEYDNNKRIFRYNKKKQPWFLFVRDIQRSVLPPKVNNLVDMNRGRNRYSYGQTEEARSITLQCFIKAPSEQELAPLLEELADYLDVGETTLQLYDNPERVYKVVLDGSTDISQTLHMGELSLNFILLDNYAVGKEVVVRETFNTDSSIPFLQLKNEGTAPTYPTYLLDFKESAQFVDLIGQKESANISVGRRTKGGAPDNSKNLRPRVFYSKFTNNDGAGWVAMNDTHMPQYIETKKPTLGGSIQRSNGMINLNGWKYGDNKVDGLHGNGVVGTLQKNVNNFVCEITPAIYGNNPKNSLNAIYVIFYDAVNQPICHAKIGTRPEDGNVDMYIIEEGNNTNWGRFHQSNGNKWDDFKGKFVIERKNNKWRLTAGQYKNRKFDPAPESNFNMGTNMLKDVKSTGWVTLPPETWNREVSRVGLYLATWKDRPLAKHLSARRCIVWEDLSDESTLPDEKPILINAGDQVAIDSSKGQVYLNGVVTPSLVDPMTDWFPIERGENLVGVNNFIGDLTVVYNERFK
- Physico‐chemical
properties -
protein length: 1252 AA molecular weight: 143339,70530 Da isoelectric point: 5,14520 aromaticity: 0,11901 hydropathy: -0,57915
Domains
Domains [InterPro]
DC_0142
STR
1–742
STR
1–742
DC_0077
STR
613–1252
STR
613–1252
G3DSA:2.40.30.200
ATT
734–866
ATT
734–866
IPR008841
STR
783–858
STR
783–858
1
1252
Architecture
STR 1-733 | ATT 734-866 | STR 867-1252
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1252
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 191 | 191 | 0,3341 |
| Central domain | 192 | 508 | 318 | 0,6023 |
| C-terminal | 509 | 1252 | 743 | 0,2576 |
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-191
1-191
Central
192-508
192-508
C-terminal
509-1252
509-1252
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Enterococcus phage EH802P2 [NCBI] |
2968651 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
WAX16154.1
[NCBI]
Genbank nucleotide accession
OP172807
[NCBI]
CDS location
range 32268 -> 36026
strand -
strand -
CDS
ATGCCTTGTGTATATAAACCTAACTACGTGCCTTATGACTTCATGAACCTCTCCCCTGCGTATCAGTGGGGGATTGGTCGTGTGGCAGGGTACACTCACTTTGACCAAGCAGAGTTCCAAACTCTTGAAGGGGAGTCTCACGTCACATGGGATATTAACTTTAAGAATAAGATTGCCGGAGACCTCACTTCAAATCCTCATAAATTGACAGTGGAAAATAAGAACCACATCTACCACAACAACATACTTCGTGGATCACGCTACCTAGCAAATCCTGTACTACTCTTCAACGGAGCTAAGTATGTGGAGACAGCTCAGGACAACGACTCTGCTGTAGCTCTTTCTGAAAAGACTGAACAAGGATTGATGTTCTACTGGGATGACTTTACTGGGGAATACGGAGTTATGAAAAAAGGAGACACCTTGACAATCTCTGTTGAAGTACGTACCAATAACAAGGACGTAGACCTAGTGGAAGCTGTTCGCTTGAAAGCTACTCCTAACTTCGAGCCATACACTAAAATCACCACACAGGAAGTTAACGACGGGTTTACTAGGATTGTCTTCCAAACTAAGGTGCTGAATACTGAATGGCTTCCTGATGATGATTGGGAAACTATTTGGGGAGTACAGAACCCAGAGGTTCCTGAACGTGTTGGTCGTACTTGGAATAAAGACCGCTTGATTGGTATTGTTCAAAATATTCCAGATGTACAATTAGAATATGCTAGACCAATGGTAAGCCGAGAGGGGTCAACTACTTATCAGGAAGCTCCTATCGACTTAACTCCTACAAACGTGTTTAATTGGAAAGGGTACAACAAAGGCTACCGACTTTTAAATGATGGTCGCTATGTTAAGTCTAGCTTCCAGTATGAATGGGCTAACCCTAACTTCTTTAAAATTCCTGATAGCTGGACGTGGCAAAAGATTCGTACATTTATGGTTATGAAGTTTACTAAAATACCGTTTAAAACTCCTGCCGGTGAAGACCCTAACAATGTTTATGGTAAGGTTTCTTGGTATTCCGATTCTACTGAGGGGAGTGCTATTCAGTCTAACATATTCATTCAAGAAGATGGGTTGTACGGTGGTAAACTCTCGATGATAGAAGTCCCAGAAGGAGCTAAGTACTACCGTATCCACGTTACTGGGTACTCTCCTCAAGCAGTAGATTTCCGAATGGGATTTAACGACACTTCATACGTAAACAATTCATCAGCAACCATTGAGCTTACTCAAGAAATGTACGATGGTGCTTCGGTTATGCTTGATGGAAAAGTAGCTAGAGCTTATGCGGACAACTTCAATGAACGAGTTAAGATCGACTACCGAATTGACTTGCTAGCTGAAATGCGGAAAGCCTTTCCTAAGATATTCGAAGGTCTATCTATTGAAGAGTCCGTGGCAAGGGTTAGGACTATTGCTTACATGATGGATTTAACAGCTACTGCTCGTGGTGGTGATGATGACAATATCATCCACTGGCACATGAAGAATAAGGATGACGTCGCTGGAGAAATTCACGGTTTCAGCAAAGACGAACTTACAACGTATACACACCGAAGCACTTGGCAAGATTGGATACAGAATGATGGTACTCTTTGGGGTTACTTCTATACTGAATCACCAAGACCTACCAATGGTACTCAAGCTTGGATTGAATTAGACTACTTCAACATCCGCATGACAATCGTGGCTGAGAATAGTGAGATTGATGCTTGGGGATTCCGCTCTGATGACTCCCGTTACGACAAGACTTCGGTAGACATTCCAATGATGCGTACTGAGGAGAATATGCAAAACTCTTCTGTACAGGTTACCCATGGTTATGACATCTATGGCATCGTAAATGCTAAGTACCCAGAGTTCTTCGGTGATTGCTACACCTATGAAGACTGCATCAATAAACTTAATGAGCGTATTGAGTCATTCACTTTTGAAGTGTCTTCTGTCATTCCTGATCCGGACGTTGGTGGTGAAGGATATGTGAACTTCGTGGTAGAGGGGGATGACTACATTAGAGACTCCACTACACGGCTTGATAAAAACACCTCAGATTTCATCGTGGTACACAATGGCTTAAGTAGATTTGTTCAGGAGAATGGTTACCTATACATTAGTGGAGAACGACCTCCAGAAGACCGTAACATAGAACTAGGAATTACTACTAATGTCACTTTCCAATTAAAGTTTGATAAAGAGTATGACAACAACAAACGTATCTTCCGTTACAATAAGAAGAAACAGCCTTGGTTCCTTTTCGTTAGAGACATCCAACGATCTGTTCTACCGCCTAAAGTGAACAACTTGGTGGACATGAATAGAGGACGTAATCGCTACTCGTATGGTCAGACGGAAGAGGCTCGTTCAATAACACTTCAATGTTTTATTAAAGCTCCTTCTGAGCAAGAACTTGCCCCATTGTTAGAGGAATTAGCCGACTATTTAGATGTTGGAGAAACCACCCTACAACTGTATGACAACCCAGAACGTGTCTATAAAGTAGTCCTTGATGGATCGACTGACATTTCCCAAACACTTCATATGGGTGAGTTGTCATTAAACTTTATTTTATTGGACAATTATGCGGTTGGTAAGGAAGTTGTAGTTAGAGAGACATTCAATACAGACTCTTCAATCCCATTCTTGCAGTTGAAAAATGAGGGTACAGCACCTACTTACCCAACCTACCTACTGGACTTCAAAGAGAGTGCTCAATTCGTAGATTTGATTGGTCAGAAAGAATCAGCCAACATATCCGTCGGAAGACGTACCAAGGGTGGTGCTCCTGACAACTCTAAGAACTTAAGACCTAGAGTGTTCTACTCTAAGTTTACTAATAATGATGGAGCTGGCTGGGTAGCAATGAACGACACTCATATGCCTCAATACATTGAAACTAAGAAACCAACTTTGGGAGGTAGCATTCAACGATCTAACGGAATGATTAACCTAAATGGTTGGAAGTATGGGGACAACAAAGTTGATGGTCTTCACGGTAACGGAGTTGTGGGCACCTTGCAAAAGAATGTTAACAACTTTGTGTGTGAGATTACTCCAGCTATCTATGGTAACAATCCTAAGAACTCCCTGAATGCTATCTACGTAATCTTCTATGATGCTGTTAACCAGCCAATCTGCCATGCTAAAATCGGCACTCGACCAGAAGACGGAAACGTTGATATGTATATCATTGAGGAGGGTAACAATACCAACTGGGGAAGATTCCATCAATCCAATGGTAACAAGTGGGACGACTTCAAAGGAAAATTTGTAATTGAACGTAAGAACAACAAGTGGAGATTAACTGCTGGTCAGTATAAAAATCGCAAGTTTGATCCTGCTCCAGAGTCTAACTTTAACATGGGTACTAACATGCTTAAGGATGTAAAATCCACTGGATGGGTAACTCTCCCTCCTGAAACTTGGAATAGAGAAGTTTCTAGAGTAGGCTTGTACTTAGCTACTTGGAAAGACCGTCCTCTTGCTAAACACTTATCTGCAAGACGTTGTATTGTCTGGGAAGACTTATCAGACGAGTCTACATTGCCAGATGAAAAACCCATCTTAATCAATGCAGGGGATCAAGTTGCAATTGACTCTAGCAAAGGGCAGGTGTACTTGAACGGAGTTGTTACTCCAAGTCTCGTAGACCCTATGACAGACTGGTTCCCAATTGAACGAGGAGAGAACTTGGTAGGTGTCAACAACTTCATTGGAGATTTGACAGTAGTATATAATGAACGATTTAAATAG
Genome Context
Genome Context
Tertiary structure
PDB ID
9a7581321006f83a20d52adc7aaffc90992bc5f41139dd1600466ec76df92056
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50