GenBank accession
QGH79679.1 [GenBank]
Protein name
hypothetical protein
RBP type Evidence Probability
TSP DepoScope 1,00
TF RBPdetect2 0,93
Protein sequence
MRLRGFPSAGPPASYVSKPQGSIVGSSSSKITKIVAVPGNPGKDGDPGEVTLAQLSAAVDPKLNKLGSGQTTQVYARNSAGVDTGVGYSGSAVANTIPIRDANGRLVAADPSSSGHVSTKNYVDTTTPKIADGAVISGPNNLSGAIGTWAPEASGFVHLPHLFNDLAYNRIRGGSFTMLKNGVDAGVSQGTLDRIFEPNTTAASIALTDRATDVFVLEVNCCVPFRYGTQVGIAMPSGFRGKHVVIEGWYNDTWNTLTTRTAVETGLVIQAISVPTAATAGMTKIRYTIRDFHSTASFRVSSIFALAYNSPLLEAGFVSRGGGEIYGPITYGIDPVGANDLARKSYVDTKVAKQTGNTRVYVKDTSGNDSAVAYSSGPTAQTMVYRTADGVTSVGEPTAASHAATKNYVDNKVAGIVNSAPATLDTLDELAQALGDDPNFATTVATQIGTKVDKLTGAYKVYATDGGGAQTSASWSFDPNPITVAVRTDQGRLTAAAPAADNDVVNKKHLDDRLPWTGTQAQYDAIPTKDPNRLYVVVP
Physico-chemical properties
protein length: 539 AA
molecular weight: 56194,02620 Da
isoelectric point: 7,05539
aromaticity: 0,07236
hydropathy: -0,17291
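
The page does not say how the aromaticity and hydropathy values were computed. A minimal sketch, assuming that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY), could look like this. The function names are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale. Using this scale is an assumption;
# the page does not name the scale behind its "hydropathy" value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # Short N-terminal fragment of the sequence above, for illustration only.
    fragment = "MRLRGFPSAGPPASYVSKPQ"
    print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Passing the full protein sequence to both functions allows a comparison with the values listed above, provided the assumptions about the definitions hold.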

Domains

Domains [InterPro]
Domain Type Range
DC_0646 STR 18–342
DC_1251 STR 320–539
Architecture
STR 18–539
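
The two STR hits (18–342 and 320–539) overlap, and the architecture line reports one STR region spanning 18–539. One way to get from the first to the second is to merge overlapping ranges of the same type. The sketch below illustrates that idea; it is an assumption about how the architecture could be derived, not the database's documented procedure, and `merge_ranges` is a hypothetical helper.

```python
def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps (or touches) the previous range: extend it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


if __name__ == "__main__":
    str_hits = [(18, 342), (320, 539)]  # DC_0646 and DC_1251
    print(merge_ranges(str_hits))  # [(18, 539)]
```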

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 156 156 0,8562
Central domain 157 360 204 0,4269
C-terminal 361 539 179 0,1894
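
With 1-based inclusive coordinates, a segment's length is `End - Start + 1`, and the three segments should cover residues 1 to 539 without gaps or overlaps. The sketch below recomputes the lengths from the Start and End columns and checks the tiling. The helper names are illustrative.

```python
PROTEIN_LENGTH = 539

# (name, start, end) taken from the Start and End columns of the table.
SEGMENTS = [
    ("N-terminal", 1, 156),
    ("Central domain", 157, 360),
    ("C-terminal", 361, 539),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based inclusive range."""
    return end - start + 1


def tiles_protein(segments, protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gap or overlap."""
    expected_start = 1
    for _, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    for name, start, end in SEGMENTS:
        print(name, inclusive_length(start, end))
    print("tiles protein:", tiles_protein(SEGMENTS, PROTEIN_LENGTH))
```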
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-156
Central 157-360
C-terminal 361-539

Taxonomy

  Name Taxonomy ID Lineage
Phage Gordonia phage Anon [NCBI] 2653744 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

GenBank protein accession
QGH79679.1 [NCBI]
GenBank nucleotide accession
MN369756 [NCBI]
CDS location
range 3320 -> 4939
strand +
CDS
ATGAGGCTTCGCGGATTCCCCTCAGCCGGCCCGCCGGCTTCGTACGTCTCCAAGCCCCAAGGCTCGATCGTCGGATCGTCAAGCTCGAAGATCACCAAGATCGTCGCGGTCCCCGGTAACCCCGGCAAGGACGGCGATCCCGGCGAGGTCACCCTAGCCCAGCTCAGTGCAGCAGTAGACCCCAAGCTGAACAAGCTGGGCAGCGGCCAGACCACCCAGGTCTACGCTCGAAACTCAGCAGGGGTGGACACCGGAGTCGGGTATTCCGGCAGTGCGGTGGCCAATACCATCCCCATCCGAGATGCCAACGGGCGTCTGGTCGCTGCCGACCCCAGTTCATCTGGCCACGTGTCAACGAAGAACTACGTCGACACCACGACCCCGAAGATCGCTGACGGAGCGGTCATCTCAGGCCCGAACAACCTGTCCGGGGCTATCGGCACCTGGGCACCGGAGGCTTCCGGATTCGTCCACCTACCTCACCTGTTCAACGATCTGGCGTACAACCGGATCCGAGGCGGGTCGTTCACGATGCTCAAGAACGGCGTAGACGCCGGAGTGTCGCAAGGCACCCTCGACCGGATATTCGAACCTAACACCACCGCAGCCTCGATCGCGCTCACAGACCGAGCCACCGACGTCTTCGTCCTCGAGGTCAACTGCTGCGTCCCGTTCAGATACGGCACCCAGGTCGGCATCGCGATGCCCTCCGGATTCCGAGGCAAGCACGTAGTCATCGAAGGTTGGTACAACGACACCTGGAACACGCTCACGACCCGCACAGCGGTCGAGACCGGGCTAGTGATTCAGGCGATCAGCGTTCCCACCGCAGCCACTGCGGGGATGACCAAGATCCGTTACACGATCAGGGATTTCCACTCAACGGCATCGTTCCGAGTGTCGAGCATCTTCGCTCTGGCGTACAACTCACCTCTCCTCGAGGCGGGATTCGTCAGCCGGGGCGGCGGCGAGATCTACGGCCCGATCACCTACGGGATCGACCCAGTAGGAGCCAACGACCTGGCCCGGAAGTCGTACGTTGACACCAAGGTTGCCAAGCAGACCGGGAACACCCGCGTATACGTCAAGGACACCAGTGGCAACGACAGCGCCGTGGCCTACTCGTCAGGACCAACAGCTCAGACGATGGTGTACCGGACCGCCGACGGGGTCACCTCAGTGGGAGAGCCGACAGCAGCCTCCCACGCTGCCACGAAGAACTACGTCGACAACAAGGTCGCAGGGATCGTCAACTCGGCTCCAGCTACTCTCGACACTCTCGACGAGCTGGCCCAGGCGCTCGGCGATGATCCGAACTTCGCTACCACGGTAGCTACTCAGATCGGAACCAAGGTCGACAAGCTCACAGGAGCCTACAAGGTCTACGCCACCGACGGAGGCGGCGCACAGACCTCGGCTTCGTGGTCGTTCGACCCCAACCCGATCACGGTAGCGGTTCGCACCGACCAGGGGCGTCTCACGGCTGCAGCTCCCGCTGCTGACAACGACGTCGTCAATAAGAAGCACCTGGACGACCGACTCCCCTGGACCGGCACACAGGCTCAGTACGACGCCATCCCCACGAAGGACCCGAACCGTCTCTACGTGGTGGTGCCATGA
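
The CDS coordinates and the protein length can be cross-checked: the range 3320 to 4939 spans 4939 - 3320 + 1 = 1620 nucleotides, which is 540 codons, or 539 amino acids plus a stop codon. The sketch below does this arithmetic and translates a plus-strand CDS. It assumes the standard codon assignments, and `cds_length` and `translate` are illustrative helpers, not part of any database tooling.

```python
from itertools import product

# Standard codon table built in TCAG order (an assumption: the page
# does not state which translation table applies to this record).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Nucleotide count of a 1-based inclusive range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n_nt = cds_length(3320, 4939)
    print(n_nt, n_nt // 3 - 1)           # 1620 539
    print(translate("ATGAGGCTTCGCGGA"))  # MRLRG (first five codons of the CDS)
```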

Genome Context

Tertiary structure

PDB ID
fd8a9173b9d7752af75fa458e6cb9d46b46011385de0716b6a41466b43d372eb
Source ESMFold
Method ESMFold
Resolution 0,6634
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
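
The confidence bands can be expressed as a small lookup. The sketch below assumes pLDDT is on a 0 to 100 scale. The legend uses strict inequalities and leaves the exact values 90, 70 and 50 unassigned, so placing them in the lower band here is an assumption. `plddt_band` is an illustrative name.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.4, 12.7):
        print(score, plddt_band(score))
```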