Protein
View in Explore- Genbank accession
- QBQ80950.1 [GenBank]
- Protein name
- hypothetical protein
- RBP type
-
TSPTFTF
- Protein sequence
-
MSNSEQFALQASESAANAKKSELNAETSKNSAATDAASALLSKQESAASASSALQSKNAAAGSATSAQQALTAVQGLKAEVQQLKTDTQHIKEEGVAEVTALKNAATTSANNAKASETASANNATLADTKAKEASVSATTANQAKVAAEAAKAASEASAVESATSAGKSQTAANAAKASETVATQKAASASTSETNAAASAQTATNKAGESAASATAAKASETNAKTSETNAGTSAGRSESAALRSEAAAQRAEDIADAIGLEDATLTVKGIVRLSNDTNSTAENLAATPKAVKTVMDVANTKAPSNSPVLTGVPTAPTPAPEVNNNQIATTQFVHQIISALIGDAPDALNTLKELADALGDDPNFATTITTLINSKLAKDQNGADIPNKQLFIDNVGLRETVNLALGALKKNQNGADIPDKGVFLNNINAASKTDMAAKKGMRYTVVNAPAGVEAGKFYPVAIRRSSGFSSELASRVTISTGSRTGNHRLNNCEFNGFVMTGGWTDRGRYAYGMFHAYTANERAIHSILMSNKDDDLCSVFYVEGEAFPISVYVEEGLSVVAPSADYVVGQTTYKWGATDPGTECVAAETVLNFSNGRGFYSSHALLTNADISGNKIYANGEIVARGENQVRMIGGDYGALWRNDGAKTYLLFTAKGDQYGGWNDLRPFMIDNATGEFTIGTKLNAGQGVNGNASSATKLQTARKIGGASFDGTADVNLPGVNIQGNQNTTGNAATATKLQTARRIANVPFDGSGDISIPARNVNAFALGLTQIVQDPEDGNLSNGVPWNAITGVYRQQGIGASNIVAHFCDGGGSTPSLQIRAMYRNGGLWYRTSRDGYGFEENWDQIYTKKNPPPAGETLPVGVPLPWPTDTPPSGYIVMQGQPFDKAAYPKLAAAYPSGVLPDMRGQTIKGKPASGRDVLSTEADGVKSHTHTASTASVDLGSKTTSSFDYGTKGTSSFDYGTKTSNNTGAHTHGISGTANSAGGHQHHSSGPYANSSDSDLFPNGYTQVSVTNKPVVPRQGGAGMTRVSGKTSSDGAHTHSLSGSAASAGAHAHTVGIGAHTHSVGIGAHSHTVALGSHSHGVTVNATGNTENTVKNVAFNYIVRAA
- Physico‐chemical
properties -
protein length: 1112 AA molecular weight: 114402,35140 Da isoelectric point: 6,40902 aromaticity: 0,05576 hydropathy: -0,36178
Domains
Domains [InterPro]
DC_0465
STR
1–649
STR
1–649
Coil
Unmapped
67–94
Unmapped
67–94
IPR048390
ATT
610–672
ATT
610–672
DC_0163
STR
644–1034
STR
644–1034
IPR011083
ATT
866–913
ATT
866–913
1
1112
Architecture
STR 1-609 | ATT 610-672 | STR 673-861 | ATT 862-914 | STR 915-1112
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1112
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 432 | 432 | 0,9220 |
| Central domain | 433 | 631 | 200 | 0,8500 |
| C-terminal | 632 | 1112 | 480 | 0,7725 |
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-432
1-432
Central
433-631
433-631
C-terminal
632-1112
632-1112
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage vB_EcoS_HdK1 [NCBI] |
2508190 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
QBQ80950.1
[NCBI]
Genbank nucleotide accession
MK373794
[NCBI]
CDS location
range 28213 -> 31551
strand +
strand +
CDS
GTGTCTAACTCTGAGCAATTTGCATTGCAGGCGTCTGAATCCGCTGCTAATGCTAAGAAGAGTGAGCTTAACGCAGAGACAAGTAAAAACTCTGCTGCAACGGATGCAGCGTCAGCTCTGCTGTCAAAACAGGAATCTGCTGCTAGTGCGTCTTCCGCATTGCAATCCAAGAACGCTGCGGCTGGTTCTGCTACATCAGCTCAACAAGCTCTTACTGCTGTGCAAGGACTCAAGGCTGAAGTCCAGCAATTAAAAACCGACACGCAGCATATTAAAGAAGAGGGTGTCGCTGAAGTAACAGCTCTCAAGAATGCTGCAACTACTTCGGCCAACAATGCTAAAGCATCAGAAACTGCATCTGCTAATAATGCGACTTTGGCTGATACTAAAGCAAAGGAAGCGTCTGTAAGTGCAACCACCGCTAATCAAGCGAAAGTAGCTGCTGAGGCAGCAAAGGCCGCATCCGAAGCAAGTGCAGTTGAGTCCGCTACTTCTGCCGGTAAATCGCAGACCGCAGCTAATGCAGCTAAGGCAAGTGAGACGGTTGCAACACAGAAGGCAGCTTCGGCGTCTACGTCTGAAACTAATGCGGCAGCTTCCGCTCAGACTGCGACTAACAAAGCTGGTGAGTCCGCCGCTAGTGCTACCGCAGCTAAGGCAAGCGAAACTAATGCTAAAACTTCAGAAACTAATGCTGGTACATCAGCTGGAAGATCTGAGTCTGCTGCTTTACGCTCAGAAGCTGCTGCCCAACGTGCGGAAGACATTGCTGATGCAATTGGTCTTGAAGATGCAACTTTAACTGTTAAAGGTATCGTTCGTCTTAGCAATGATACTAACAGCACTGCTGAAAACCTGGCAGCTACGCCAAAGGCTGTTAAAACAGTCATGGACGTTGCTAATACTAAAGCTCCGTCAAATAGTCCTGTTCTAACTGGCGTACCAACGGCACCAACTCCAGCACCTGAAGTTAATAACAATCAGATTGCCACAACTCAGTTTGTACACCAAATTATAAGCGCACTGATTGGTGATGCACCAGACGCGTTAAACACTCTGAAAGAGTTAGCTGACGCTCTTGGTGATGATCCGAATTTTGCAACGACGATAACCACACTTATTAACAGTAAGTTGGCTAAAGATCAAAATGGTGCTGACATTCCTAATAAGCAACTGTTCATTGATAACGTTGGTCTACGTGAGACTGTCAACTTAGCTCTCGGTGCATTGAAGAAGAACCAAAACGGTGCAGATATACCGGATAAAGGTGTCTTCCTTAATAACATCAATGCTGCCAGCAAGACGGATATGGCCGCCAAGAAAGGTATGAGATATACTGTGGTCAATGCCCCAGCGGGAGTTGAAGCAGGTAAGTTTTATCCTGTTGCGATTCGTCGTTCTTCTGGATTCAGCAGTGAATTAGCATCCAGGGTGACAATAAGTACAGGTTCAAGAACGGGAAACCATAGGTTGAACAACTGCGAGTTCAATGGTTTTGTTATGACTGGTGGTTGGACTGACAGGGGACGTTATGCTTATGGAATGTTCCACGCATATACTGCAAATGAACGTGCTATTCATTCCATTCTGATGAGTAATAAAGATGATGATCTGTGCTCTGTGTTCTATGTCGAAGGTGAAGCGTTCCCTATCTCTGTATATGTAGAAGAAGGTTTATCTGTTGTTGCACCATCTGCCGACTATGTTGTCGGACAAACGACTTACAAATGGGGTGCTACAGACCCTGGAACAGAATGCGTGGCCGCTGAAACTGTACTTAATTTCTCCAATGGTAGAGGATTCTATAGTTCGCACGCACTCCTCACAAATGCGGATATTAGTGGCAATAAAATTTACGCCAACGGTGAAATCGTAGCTCGCGGTGAGAATCAAGTTCGCATGATTGGTGGGGACTACGGTGCGTTATGGCGTAACGACGGTGCCAAGACCTATCTCTTATTTACCGCTAAAGGCGATCAATACGGTGGTTGGAACGATTTAAGACCATTTATGATTGATAACGCTACTGGTGAGTTTACCATCGGTACTAAATTGAACGCCGGTCAAGGTGTGAATGGTAACGCATCCAGTGCTACTAAGTTACAAACAGCGCGTAAAATTGGTGGCGCCTCATTTGATGGTACTGCTGACGTCAATCTTCCAGGCGTTAACATTCAGGGTAATCAGAACACAACTGGTAATGCGGCAACTGCCACCAAACTACAGACAGCACGTAGGATTGCCAATGTGCCTTTTGATGGTAGTGGCGATATATCGATTCCAGCTAGAAACGTGAATGCGTTTGCGTTAGGTTTGACTCAAATAGTACAAGATCCAGAAGATGGAAATCTATCTAATGGTGTACCTTGGAATGCTATAACCGGTGTTTACAGACAACAAGGTATAGGAGCCTCTAATATTGTTGCTCATTTTTGTGATGGTGGTGGTTCAACACCTTCTTTGCAGATAAGAGCTATGTATAGGAATGGTGGTCTTTGGTATCGTACATCACGTGATGGTTATGGTTTTGAAGAAAACTGGGATCAAATTTACACTAAAAAGAACCCGCCTCCGGCTGGCGAAACCTTACCTGTAGGTGTTCCATTACCGTGGCCCACGGATACACCTCCGTCCGGTTATATTGTAATGCAGGGTCAACCCTTCGATAAAGCTGCATACCCGAAACTTGCTGCTGCATACCCGTCGGGTGTACTTCCAGACATGCGTGGTCAAACTATCAAAGGTAAACCAGCAAGCGGTCGTGACGTACTTTCTACAGAAGCTGATGGTGTGAAATCACATACTCACACTGCGTCTACTGCATCTGTCGATCTTGGAAGCAAGACAACTTCTAGCTTTGATTATGGTACTAAAGGGACAAGTAGTTTCGACTACGGTACTAAGACGAGTAATAATACTGGTGCGCATACACATGGTATTAGCGGAACAGCAAATAGTGCAGGTGGGCACCAACACCATAGTTCTGGACCATATGCGAATTCTAGTGATAGTGATCTCTTTCCTAACGGTTATACCCAGGTTTCAGTCACAAACAAACCAGTTGTACCACGGCAAGGTGGCGCAGGTATGACCCGTGTATCAGGGAAGACATCATCAGATGGAGCACATACCCACTCGTTATCCGGTAGTGCCGCAAGCGCAGGTGCTCACGCACACACTGTTGGTATTGGTGCTCACACACACTCAGTCGGTATTGGTGCTCATAGTCATACAGTCGCTTTAGGTTCACACAGCCACGGTGTTACAGTTAACGCAACTGGTAACACGGAAAACACCGTTAAGAACGTTGCATTCAACTATATTGTGAGGGCTGCTTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
69d9e087e6f4f439e6ab9578e664dadaca7d3afa808607a212cb4f4923fc7749
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Still something new to discover - new insights into E. coli phage diversity and taxonomy | Korf,I.H.E., Adriaennsens,E., Dreiseikelmann,B., Kropinski,A., Nimtz,M., Meier-Kolthoff,J.P., Rohde,M., van Raaij,M. and Wittmann,J. | 2019-05-17 | — | GenBank |