Genbank accession
WNA14189.1 [GenBank]
Protein name
baseplate upper protein
RBP type
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MIYNDAKQSFEITASTKRKITTGIQFSTQDIDTARLIFTLTKDREPLPLSAVSGKLVMFMADGSRFIKNVEIVDPVGGVAQYVLTSDEIKHYGTVNAELNLYYANNQALSVHKFSFNIDRALVDTDIAPIAEYYIDDFEALIAKVNELYDEAIATMEELRQKFSDLENIETKTGAQEKADAAEANAKAYTDVHANNKTIHITADERTTWNAKETTTGSQSKADKALGDAKAYTDTHANRTDNPHEVTKAQIGLANVEDIKQASFTDFQAHNYNQVRHISADERTAWNAKETTDGAQAKADKALTDAKAYTDTKVGQLTRTWTTIPLINGAVTDTTVPLRYRTKNGGDELQINGGFKSAFGTIIATGLPKVKYPTEFLVATVGTYGYLRMDYRVNGDLYLAGGTVNSETGISKISVNITIPLT
Physico‐chemical
properties
protein length:422 AA
molecular weight: 46403,24570 Da
isoelectric point:5,32351
aromaticity:0,08294
hydropathy:-0,40829

Domains

Domains [InterPro]
DC_1215
STR
1–257
G3DSA:2.60.40.3350
ATT
8–124
IPR018913
ATT
10–147
WNA14189.1
1 422
Architecture
ATT
STR
ATT 1-147 | STR 148-422
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage phi18-2
[NCBI]
3062017 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Bacillus subtilis
[NCBI]
1423 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WNA14189.1 [NCBI]
Genbank nucleotide accession
OR208547 [NCBI]
CDS location
range 19595 -> 20863
strand +
CDS
ATGATTTATAACGATGCAAAACAGTCGTTTGAAATTACCGCCTCAACCAAACGAAAAATAACTACCGGAATCCAATTCAGCACGCAAGACATTGACACGGCGCGACTCATATTTACGTTAACGAAAGACAGGGAGCCTTTGCCCCTGTCGGCAGTTAGCGGAAAGCTCGTTATGTTCATGGCGGACGGAAGCCGGTTTATCAAAAATGTAGAAATTGTAGATCCGGTAGGTGGCGTAGCGCAATACGTATTAACTTCGGACGAAATTAAGCATTACGGAACGGTCAACGCAGAGCTTAATTTGTACTACGCGAATAATCAGGCGCTTTCCGTCCACAAGTTTTCATTCAACATAGATCGAGCGCTAGTCGATACTGATATCGCTCCTATAGCGGAATATTATATCGATGATTTCGAGGCTTTAATCGCAAAGGTCAACGAACTATATGACGAAGCTATTGCGACGATGGAAGAACTACGGCAGAAGTTTTCGGACCTTGAAAATATCGAAACAAAAACCGGAGCCCAAGAGAAAGCGGACGCAGCCGAAGCTAACGCTAAAGCCTATACGGACGTGCACGCGAATAATAAAACGATCCACATTACGGCTGACGAACGAACGACCTGGAACGCAAAGGAAACGACAACGGGATCGCAAAGTAAAGCGGATAAAGCGTTAGGCGATGCAAAAGCGTACACTGATACGCACGCCAACCGAACGGATAATCCACATGAAGTCACTAAGGCGCAAATCGGTTTGGCTAACGTAGAAGACATTAAACAAGCGTCCTTTACGGACTTTCAAGCGCATAACTACAATCAGGTCCGGCATATCTCGGCAGATGAACGTACGGCATGGAACGCGAAAGAAACGACTGACGGGGCGCAAGCTAAGGCGGATAAGGCGCTCACAGACGCAAAGGCTTACACGGATACAAAGGTCGGTCAGCTTACGAGAACATGGACGACTATTCCGTTGATAAACGGTGCGGTTACAGACACCACAGTTCCATTGCGTTATCGTACTAAAAACGGTGGCGACGAACTCCAAATTAATGGCGGCTTCAAGTCGGCCTTCGGAACGATCATCGCGACGGGATTGCCGAAAGTTAAGTATCCTACCGAATTTCTTGTGGCAACGGTAGGAACCTACGGATATTTGCGAATGGATTATCGAGTTAACGGCGATCTGTATTTGGCTGGAGGAACTGTAAACAGCGAGACAGGGATTAGTAAAATCTCTGTGAATATAACGATTCCGTTAACGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
eb20e7b00df03b8172477ab516817aae3c6b9e2a3b47030e3390ac722b672133
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8468
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50