UniProt accession: G8E063 [UniProt]
Protein name: Receptor-recognizing protein gp38
RBP type: TF
Evidence: RBPdetect
Probability: 0.84
Protein sequence
MAVVGVPGWIGSSAVNETGQRWMSQAAGQLRLGVPCWMSQFAGRSREIIHTLGADHNFNGQWFRDRCFEAGSAPIVFNITGNLVSYSRDVPLFFMYGDTPNEYVQLNIGGGVHMWGRGGQGGWTHSGGDGNGQQGGHCIQNDIGGRLRINNGGVICGGGGGGGGIAYRPHSGAKWQDIGGGGGRPFGPGGGGGYSGGAASYDGPGGGYNYGNAHSGQGGDAGANGQNAWHDGGKVLKVGAGGAAGYAVIGSAPTWQNVGVIYGPRV
Physico-chemical properties
protein length: 266 AA
molecular weight: 26982.34940 Da
isoelectric point: 8.37614
aromaticity: 0.10150
hydropathy: -0.33008
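As a rough cross-check of the length, aromaticity, and hydropathy fields, the sketch below recomputes them from the protein sequence of this record. It assumes that aromaticity is the fraction of F, W, and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the record does not state which definitions were used, and the function names are illustrative only. Molecular weight and isoelectric point are left out because they need additional parameter tables.

```python
# Sequence copied from the "Protein sequence" field of this record.
SEQ = (
    "MAVVGVPGWIGSSAVNETGQRWMSQAAGQLRLGVPCWMSQFAGRSREIIHTLGADHNFNG"
    "QWFRDRCFEAGSAPIVFNITGNLVSYSRDVPLFFMYGDTPNEYVQLNIGGGVHMWGRGGQ"
    "GGWTHSGGDGNGQQGGHCIQNDIGGRLRINNGGVICGGGGGGGGIAYRPHSGAKWQDIGG"
    "GGGRPFGPGGGGGYSGGAASYDGPGGGYNYGNAHSGQGGDAGANGQNAWHDGGKVLKVGA"
    "GGAAGYAVIGSAPTWQNVGVIYGPRV"
)

# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


print(f"length:      {len(SEQ)} AA")
print(f"aromaticity: {aromaticity(SEQ):.5f}")
print(f"hydropathy:  {gravy(SEQ):.5f}")
```

Under these assumptions the printed values can be compared directly against the fields listed above.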

Domains

Domains [InterPro]
IPR048291 ATT 1–42
DC_2135 STR 1–141
G8E063 1–266
Architecture
ATT 1–42 | STR 43–141 | RBD 142–266
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
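
The architecture line can be read as a tiling of the 266-residue protein by 1-based, inclusive segments. A minimal sketch of that reading follows; the tuple layout and the helper names `check_tiling` and `segment_lengths` are hypothetical and not part of the database.

```python
# Architecture segments as (class, start, end), 1-based and inclusive,
# copied from the architecture line of this record.
SEGMENTS = [("ATT", 1, 42), ("STR", 43, 141), ("RBD", 142, 266)]
PROTEIN_LENGTH = 266


def check_tiling(segments, length):
    """Return True if the segments cover 1..length with no gaps or overlaps."""
    expected_start = 1
    for _name, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


def segment_lengths(segments):
    """Map each segment class to its length in residues."""
    return {name: end - start + 1 for name, start, end in segments}


print(check_tiling(SEGMENTS, PROTEIN_LENGTH))
print(segment_lengths(SEGMENTS))
```

To extract a segment from a Python string, which is 0-based, slice with `seq[start - 1:end]`.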

Taxonomy

  Name Taxonomy ID Lineage
Phage Enterobacteria phage RB33 [NCBI] 134822 Uroviricota > Caudoviricetes > Pantevenvirales > Tevenvirinae > Tequatrovirus
Host Escherichia coli B [NCBI] 37762 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

GenBank protein accession: AET36517.1 [NCBI]
GenBank nucleotide accession: JF491362 [NCBI]
CDS location: range 1 -> 801, strand +
CDS
ATGGCAGTAGTTGGAGTTCCTGGCTGGATTGGAAGTTCAGCCGTAAATGAAACGGGTCAGCGCTGGATGAGTCAAGCAGCTGGTCAATTAAGATTGGGTGTTCCTTGCTGGATGAGTCAATTTGCAGGTCGCTCAAGAGAAATTATTCATACACTTGGAGCGGACCATAACTTTAATGGTCAATGGTTCCGAGATAGATGTTTTGAGGCAGGTAGTGCACCTATAGTGTTTAATATTACCGGAAATTTGGTATCATATTCTAGAGACGTTCCTTTATTCTTCATGTACGGAGATACACCTAATGAATATGTTCAGTTGAATATTGGTGGCGGCGTCCATATGTGGGGTAGAGGTGGACAAGGTGGATGGACTCATTCCGGCGGAGATGGTAATGGTCAACAAGGTGGTCATTGTATTCAAAATGACATCGGTGGACGATTAAGAATTAATAACGGTGGAGTAATTTGTGGTGGAGGTGGAGGTGGAGGTGGTATCGCTTATCGTCCTCACTCTGGTGCAAAATGGCAAGATATTGGTGGTGGTGGAGGAAGACCATTTGGCCCAGGAGGAGGCGGAGGTTATTCCGGCGGCGCAGCTTCTTATGATGGCCCAGGAGGTGGTTATAATTATGGTAATGCTCACTCTGGCCAGGGTGGTGATGCTGGCGCAAATGGACAAAATGCCTGGCATGATGGCGGTAAAGTTCTAAAAGTTGGAGCTGGTGGCGCTGCCGGATATGCTGTTATAGGATCAGCTCCAACATGGCAAAATGTTGGAGTAATATATGGTCCAAGGGTATAA
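
The CDS range of 801 nt is consistent with the 266 AA protein: 266 codons plus one stop codon give 267 x 3 = 801. The sketch below shows one way to translate the CDS and compare it with the protein sequence. It assumes the standard codon assignments (NCBI translation table 11, used for bacteria and phages, has the same sense and stop codons as the standard table); `translate` is an illustrative helper, and only the first and last codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, built in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS shown above (both in frame).
CDS_START = "ATGGCAGTAGTTGGAGTTCCTGGCTGGATT"
CDS_END = "GGTCCAAGGGTATAA"

print(translate(CDS_START))  # first ten residues of the protein
print(translate(CDS_END))    # last four residues, before the TAA stop
```

Passing the full CDS string to `translate` should reproduce the protein sequence of this record if the two fields are consistent.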

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0098024 virus tail, fiber Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0098671 adhesion receptor-mediated virion attachment to host cell Biological Process IEA:UniProtKB-ARBA (UniProt)
GO:0046718 symbiont entry into host cell Biological Process IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID: b0061667f8c53f78276f7ccae527c39c283f7c0c605bfed91eccfacc9a8e58a7
Source: ESMFold
Method: ESMFold
Resolution: 0.8416
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
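
The confidence bands above can be applied to per-residue pLDDT values with a small helper, sketched below. The legend uses strict inequalities, so the exact boundary values 90, 70, and 50 are not assigned to any band; this sketch assumes each boundary falls into the lower band. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band of the legend.

    Assumption: boundary values (90, 70, 50) are placed in the lower band,
    because the legend itself leaves them unassigned.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.0, 80.0, 60.0, 40.0):
    print(score, plddt_band(score))
```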