Protein
View in Explore- Genbank accession
- WVX92833.1 [GenBank]
- Protein name
- hypothetical protein
- RBP type
-
TSPTSPTF
- Protein sequence
-
MATLKQIQFKRSKTAGARPAASVLAEGELAINLKDRVLFTKDDQGNIIDLGFAKGGSIDGNVIHIGNYNQTGDYTLNGTFTQTGNFNLTGIARVTRDIIAAGQIMTEGGELITKSSGTSHVRFFDGNSRERGIIYAPANDGLTTQVLNIRVQDYAAGSESTYAFSGSGLFTSPEVSAWKSISSPQILTDKVITDGKKTGDYDISSLANNTPLAESETAINHLRVMRNAVGSGIFHEVKDNDGITWYAGDGLDAYLWSFTWSGGLKAGHSISVGTPGGSKGYSELGTASIALGDNDTGLKWHQDGYYFSVNNGTKTFLFSPSETTSLRKFVAGYSTNGTDLTTPPTENYALATVVTYHDNNAYGDGQTLLGYYQGGNYHHYFRGKGTTNINTHGGLLVTPGNIDVIGGSVNIDGRNNASTLMFRGNTTGSSSVDNMTISVWGNTFTNPSVGNRKNVMEISDATSWMSYIQRLTTGEVEMNVNGSFESSGVTAGNRGVHTTGEISSGAVNALRIWNADYGAIFRRSEGSLHIIPTAYGEGKYGDIGPLRPFSMALDTGKVTIPDLQSSYNTFAANGYIKFTGHGAGAGGYDIQYVQAAPIFQEIDDDAISKYYPIVKQKFLNGKAVWSLGTEINSGTFVIHHLKEDGSQGHTSRFNQDGTVNFPDNVQVGGGEATIARNGNIFSDIWKLFSSAGDITNLHDAIASRVAKEGDTMTGKLIVKRGSDAINIAADENDSGYLLGTSGGANSWYIGKGGADDTASFYNFKTTAGITLNSVGDIDFNVKNQATAASLNFYRLYLNGRQWTATQGHGYSNQWQTEAPFFVDFGESVPKDSYMPIIKGRSQIINEGYATKADFGIIRLGGDATWGNAVIRVGSAESGDSSHPNAIFVFQANGDFKAPAGLRAGVNLGVGTIPVWGGASIAIGDNDTGLVHGGDGRINMFANGQHIASWGVFHQEHPGLWSVGAALWTEVDKAIISHGHLIQANDNYSTFVRDVYVRSDIRVKKDLVKFENASEKLSKINGYTYMQKRGLDEEGNQKWEPNAGLIAQEVQAILPELVEGDPDGEALLRLNYNGVIGLNTAAINEHTAEIAELKSEIEELKALVKSLLK
- Physico‐chemical
properties -
protein length: 1108 AA molecular weight: 118771,09410 Da isoelectric point: 5,48584 aromaticity: 0,09567 hydropathy: -0,30316
Domains
Domains [InterPro]
DC_0538
STR
1–733
STR
1–733
IPR048390
ATT
450–549
ATT
450–549
G3DSA:6.20.80.10
STR
722–780
STR
722–780
1
1108
Architecture
STR 1-449 | ATT 450-549 | STR 550-1108
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1108
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 214 | 214 | 0,2327 |
| Central domain | 215 | 413 | 200 | 0,3027 |
| C-terminal | 414 | 1108 | 694 | 0,7828 |
Note: Constraints were applied during segmentation.
Fixed 164 C-terminal predictions appearing before Central domain
Fixed 164 C-terminal predictions appearing before Central domain
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-214
1-214
Central
215-413
215-413
C-terminal
414-1108
414-1108
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage vB_EcoM_HZ_ZJUN4 [NCBI] |
3119840 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host |
Escherichia coli NDM-1 [NCBI] |
1411081 | Pseudomonadota > Gammaproteobacteria > Enterobacterales > Enterobacteriaceae > Escherichia > Escherichia coli |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
WVX92833.1
[NCBI]
Genbank nucleotide accession
PP216085
[NCBI]
CDS location
range 157898 -> 161224
strand -
strand -
CDS
ATGGCTACTTTAAAACAAATACAATTTAAAAGAAGCAAAACTGCAGGAGCACGTCCTGCCGCTTCAGTATTAGCCGAAGGTGAATTGGCTATAAACTTAAAAGACCGCGTACTTTTTACTAAAGATGACCAAGGAAATATCATTGATCTGGGTTTTGCTAAGGGCGGTAGTATTGACGGGAATGTTATTCATATAGGAAATTATAATCAAACTGGTGATTATACTTTAAATGGCACCTTCACTCAGACAGGTAATTTTAATTTAACTGGTATTGCTCGAGTAACTCGCGATATTATTGCCGCCGGGCAAATTATGACTGAGGGCGGAGAACTTATTACAAAAAGTTCAGGTACATCACATGTTCGTTTTTTCGATGGCAATAGCCGCGAACGCGGAATCATTTATGCCCCGGCTAATGATGGATTAACTACACAAGTACTTAATATCAGGGTTCAAGATTATGCTGCAGGAAGCGAAAGCACCTATGCATTTTCAGGCAGTGGACTATTTACTTCACCTGAAGTATCAGCGTGGAAATCTATTTCGTCTCCACAAATTCTGACCGATAAAGTTATTACAGATGGGAAGAAGACGGGCGATTATGATATATCTTCATTAGCAAATAACACTCCATTGGCAGAAAGCGAAACGGCTATTAACCACCTCCGTGTTATGCGAAATGCCGTAGGATCTGGTATATTCCATGAAGTTAAAGATAACGACGGGATAACCTGGTACGCCGGTGACGGGTTAGATGCCTATCTTTGGTCGTTTACCTGGTCCGGTGGATTGAAAGCAGGCCATTCTATTTCTGTTGGTACTCCTGGTGGCTCTAAAGGATACTCTGAACTAGGGACTGCTTCAATTGCTCTTGGAGATAATGATACCGGGCTAAAATGGCATCAGGACGGATATTATTTCAGCGTTAATAATGGAACGAAAACATTTTTATTTAGTCCTAGCGAAACAACTAGCCTAAGAAAATTTGTAGCTGGATATTCTACTAATGGAACCGATTTAACGACTCCTCCAACTGAAAACTATGCATTAGCCACTGTTGTTACTTACCATGATAATAACGCGTATGGTGACGGTCAGACTCTTTTAGGATATTACCAAGGTGGTAATTATCATCATTATTTCCGCGGTAAGGGTACCACAAACATTAATACTCACGGCGGTTTGTTAGTCACTCCAGGTAATATTGACGTTATTGGTGGTTCTGTTAATATTGATGGTCGTAATAATGCTTCTACGCTGATGTTTAGAGGTAACACAACTGGTAGCAGTTCAGTTGATAATATGACAATTTCTGTATGGGGTAATACGTTTACTAATCCTAGCGTAGGTAATCGTAAAAATGTCATGGAAATTTCTGACGCAACTAGTTGGATGAGCTATATTCAAAGACTTACTACCGGCGAAGTAGAAATGAACGTCAATGGTTCATTTGAATCATCCGGTGTTACTGCTGGAAATAGAGGAGTTCACACAACAGGCGAAATTTCATCTGGAGCAGTGAATGCTCTTCGTATTTGGAACGCAGATTATGGAGCCATTTTTAGACGTTCAGAAGGAAGTCTTCATATTATTCCAACTGCTTACGGTGAAGGTAAATATGGTGATATCGGTCCACTTCGCCCGTTTAGTATGGCTTTAGATACTGGTAAAGTTACTATTCCAGATTTACAATCAAGTTACAATACGTTCGCAGCAAACGGCTATATTAAATTTACTGGTCATGGCGCAGGCGCTGGTGGTTATGACATTCAGTATGTTCAAGCAGCTCCTATTTTCCAGGAAATTGATGATGATGCTATAAGCAAATATTATCCTATTGTTAAACAGAAGTTTTTAAACGGCAAAGCTGTTTGGTCTTTAGGTACTGAAATTAATTCGGGTACATTCGTTATTCATCATCTGAAAGAAGATGGTTCACAAGGCCATACGTCTCGTTTTAATCAGGACGGTACAGTTAACTTCCCGGATAACGTACAGGTCGGTGGCGGCGAAGCTACTATTGCTCGAAATGGTAATATCTTCTCTGATATTTGGAAATTATTTAGCTCTGCTGGTGATATAACCAACCTTCATGATGCTATTGCCTCCCGTGTTGCTAAAGAAGGCGATACGATGACCGGCAAATTAATCGTTAAAAGAGGCTCTGACGCTATTAACATTGCTGCCGATGAAAATGATTCTGGTTATTTACTTGGAACATCAGGTGGAGCGAATTCATGGTACATCGGTAAAGGCGGGGCAGATGACACTGCTTCATTTTATAATTTTAAGACTACGGCAGGAATTACTCTTAATAGTGTAGGCGATATTGACTTTAATGTTAAAAATCAAGCTACTGCAGCTTCATTAAATTTTTATCGTTTATATTTAAACGGAAGACAATGGACAGCTACCCAAGGCCACGGATATAGTAATCAATGGCAAACAGAAGCCCCATTCTTCGTTGACTTTGGTGAATCTGTTCCGAAAGATAGTTATATGCCTATAATTAAAGGAAGAAGCCAAATCATTAACGAAGGATATGCAACAAAGGCAGATTTTGGTATTATTAGATTGGGCGGAGATGCTACTTGGGGAAATGCAGTAATTCGTGTTGGTTCTGCGGAAAGTGGAGATAGCAGTCATCCTAATGCAATATTTGTGTTTCAAGCTAATGGCGATTTTAAAGCTCCGGCTGGTCTTCGCGCTGGTGTTAACTTGGGTGTCGGTACAATTCCGGTATGGGGCGGAGCATCTATCGCCATCGGTGATAATGATACTGGTTTAGTCCATGGCGGTGATGGTCGTATTAACATGTTTGCTAACGGACAACATATTGCGTCATGGGGGGTTTTCCATCAAGAACACCCAGGATTGTGGTCTGTTGGTGCTGCTTTATGGACTGAAGTTGACAAAGCTATTATTTCACATGGTCATTTGATTCAGGCGAATGACAACTATTCAACATTTGTTCGTGACGTTTATGTCCGCTCTGATATTCGTGTTAAAAAAGACCTTGTTAAATTTGAAAATGCTTCTGAGAAGCTTTCTAAAATTAACGGTTACACTTATATGCAGAAGCGAGGCCTAGATGAAGAAGGCAATCAGAAATGGGAACCTAACGCCGGTTTGATAGCTCAAGAAGTTCAAGCTATTTTACCAGAATTAGTTGAAGGTGACCCTGATGGTGAAGCTTTACTTCGTTTGAACTATAACGGCGTAATTGGTTTAAATACAGCTGCAATCAATGAGCATACTGCAGAAATTGCGGAACTTAAATCAGAAATTGAAGAACTTAAAGCATTAGTTAAATCATTGTTAAAATAA
Genome Context
Genome Context
Tertiary structure
PDB ID
b413ce1b739a92567e4c7bba17c4791b252951bd09c33aab2d04766ad0b91442
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50