Genbank accession
QIG74916.1 [GenBank]
Protein name
tail spike protein with colonic acid degradation activity
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,88
TSP
Evidence RBPdetect2
Probability 0,92
Protein sequence
MTYVSVKQLGGVISVATLETGDVVSTPAEIDNTRAIVFNQLEDNNAVCTLAAPQTSATPKLIHLTNNGDFPVTVQGVTLQPGPTYGIWWNGSAYSQIAVEGVGIVSNTQDSPSVHIGGDGSIIDPLTADVKLSAVLNNGLEIRSDGLYFQNVQIAPDRFVVVDTAAGSDTYAGTLTAPFKTIGKAIQTVLNQGLILVVGGTYTEALQLPAGKRFVVEGIGGANDSNVVSINGIIRNDAGAAGIRFKNLTLNSPAGQGPAVEFVDAAGGMMFDNVSITSDLTNDQASVVHFSGGNSSWYVFNEGSIKGGLKVEGDNNAPVISLLHGNEKTALYVDDVDATVQVSYQSYLGDITHLNGNLFLKSIATIGNIASSANSPNLLGIHDSSLFDLDTGGWKTLTKTGTAPYTLDLKRAATNNSFSGTPLFGSNASDIHGNYSPVAYTATNDSVAGHLEGIDDALASVGGGLSAVTSTDSATVDFSGAGTGAQPLTAAVKVSSNAGNVLQAESDGLFVAAADVGLTAVSHSNTDTTDITGDGTPSDPLQVEVSVSATAHNLLIKETDGLFVIEPSPLFETVINIPYMWGAGEMIYSYKASREYTLAQGLTGWQRDVVWLDDPDSPDPENPDPLVYPYTITLKKKNSLNVVTTIGTIDWNTGVTFPTAIQFTIGDVLYLEVDNEARFKSFALTILAQRPYTN
Physico‐chemical
properties
protein length:694 AA
molecular weight: 72594,81590 Da
isoelectric point:4,33769
aromaticity:0,07493
hydropathy:0,02118

Domains

Domains [InterPro]
DC_0915
ATT
2–340
IPR011050
STR
152–389
IPR012334
STR
161–396
IPR011459
Unmapped
166–206
QIG74916.1
1 694
Architecture
ATT
ATT 2-694
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QIG74916.1
1 694
Domain Start End Length (AA) Confidence
N-terminal 1 211 211 0,9547
Central domain 212 418 208 0,8759
C-terminal 419 694 275 0,5686
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-211
Central
212-418
C-terminal
419-694

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhizobium phage RHph_I42
[NCBI]
2509736 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QIG74916.1 [NCBI]
Genbank nucleotide accession
MN988540.1 [NCBI]
CDS location
range 184381 -> 186465
strand +
CDS
ATGACTTACGTCTCTGTAAAACAGCTTGGCGGTGTGATCAGTGTCGCAACTCTCGAAACCGGAGACGTGGTTTCCACGCCCGCCGAGATCGACAACACCCGCGCAATTGTCTTTAACCAACTTGAAGACAACAATGCAGTCTGCACATTGGCCGCTCCGCAGACGAGTGCTACGCCGAAGCTCATCCATCTTACCAACAATGGCGACTTCCCGGTAACTGTGCAGGGCGTCACGCTCCAACCGGGACCGACGTACGGCATTTGGTGGAATGGCTCTGCCTATTCGCAGATCGCAGTCGAAGGCGTTGGTATTGTTTCCAACACTCAAGACAGCCCATCCGTTCACATCGGCGGCGATGGTTCGATCATCGATCCTCTTACCGCCGATGTGAAGCTTTCGGCAGTGCTGAATAACGGTCTGGAAATCCGTTCCGATGGTTTGTATTTCCAGAACGTCCAGATCGCGCCTGATCGTTTCGTTGTGGTTGACACCGCTGCCGGTTCTGACACGTATGCCGGTACTCTTACCGCGCCGTTCAAGACCATCGGCAAGGCCATTCAAACGGTTCTCAACCAAGGCTTGATCTTGGTAGTAGGCGGCACGTACACCGAAGCGCTTCAACTTCCGGCTGGTAAGCGCTTCGTCGTGGAAGGCATCGGCGGCGCGAACGATAGCAACGTCGTTTCGATCAACGGCATCATCCGCAACGATGCGGGCGCTGCTGGTATCCGCTTCAAGAACCTGACTCTCAATTCGCCTGCCGGTCAAGGTCCAGCGGTCGAGTTTGTCGATGCTGCCGGTGGCATGATGTTCGACAACGTATCGATCACTTCCGATCTTACCAACGATCAGGCAAGTGTTGTGCACTTTTCCGGCGGCAACTCGTCGTGGTATGTGTTCAACGAAGGCTCGATCAAGGGTGGGCTCAAGGTAGAAGGCGACAACAACGCGCCGGTAATCAGCCTGTTGCACGGCAACGAAAAGACGGCGCTGTACGTCGATGACGTTGACGCCACGGTGCAGGTTTCCTACCAGTCGTATCTCGGTGATATCACGCACTTGAACGGCAATCTCTTCCTCAAGAGCATTGCTACGATTGGTAATATCGCATCGAGCGCCAACTCGCCGAACCTGCTCGGCATTCACGACTCGTCGCTGTTCGATCTCGATACGGGCGGTTGGAAGACGCTGACGAAAACCGGCACGGCTCCCTACACGCTCGATCTGAAACGCGCAGCTACCAACAACTCGTTTAGCGGTACGCCGCTGTTCGGCTCGAATGCGTCCGATATCCACGGCAATTATTCGCCGGTTGCTTACACAGCTACCAACGATTCAGTCGCCGGTCATTTGGAAGGCATTGACGATGCTCTGGCGAGCGTTGGTGGCGGTCTATCGGCTGTTACCTCGACAGATAGCGCCACGGTTGACTTCTCCGGTGCCGGTACGGGCGCACAGCCTCTTACCGCCGCCGTCAAGGTGTCGTCCAATGCAGGCAACGTTCTGCAAGCGGAAAGTGATGGTCTGTTCGTTGCGGCTGCTGACGTAGGTCTTACCGCCGTCTCGCATTCAAACACCGACACGACTGACATTACCGGCGACGGTACGCCGTCTGATCCGCTGCAAGTTGAAGTGTCGGTTTCCGCGACGGCGCATAACCTTCTTATCAAAGAAACGGATGGTCTGTTCGTTATCGAACCGTCGCCGTTGTTTGAGACGGTCATCAACATCCCCTACATGTGGGGTGCTGGCGAAATGATCTACTCCTACAAGGCTTCGCGAGAATATACCCTTGCGCAAGGTTTGACCGGTTGGCAAAGAGACGTTGTTTGGTTGGATGATCCGGATTCTCCGGACCCTGAAAACCCCGATCCGCTGGTCTACCCGTACACCATCACCCTCAAGAAAAAGAACTCGCTCAACGTTGTTACTACCATCGGAACCATCGACTGGAACACTGGCGTTACCTTCCCTACCGCTATCCAGTTCACGATTGGCGACGTGCTGTATCTGGAAGTCGACAATGAAGCGCGGTTCAAGTCCTTCGCTCTTACCATTCTGGCGCAGCGACCGTACACCAACTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
8fb18247263b72fe20f732cb178e630e273c7ef20fb653a49cbf94e4a03a6384
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6770
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Patterns of diversity and host range of bacteriophage communities associated with bean-nodulatin bacteria Vann Cauwenberghe,J., Santamaria,R.I., Bustos,P., Juarez,S. and Gonzalez,V. 1998-01 GenBank