Genbank accession
YP_009206320.1 [GenBank]
Protein name
minor head protein
RBP type
TF (Evidence: RBPdetect, Probability: 0.61)
TF (Evidence: RBPdetect2, Probability: 0.93)
Protein sequence
MQNTELHIIDFKTQSIVATFQDQDYWDDMREWELKNNVDILEFKVFDGTRQAITLQQQNLVLRQDRQGNVIPYTIEDEVEKIAKDRSITVRAVGSWTGLRKAGFIRPQKLEGLTAHQYVSLATAGTKWQPGNIAYASFRTMTLDEFTDPLTLLKKTATLFNLELNYRVEVDGNRITGWFVDLVEKVGRVTRKEIELGKDLINVTRIEHSKNICTALLGFARGEDNEVITIEKINNGSPYIVDNEAFQRWSENGQHKYGFYQPETEGEIDAKRLMTLMKTEMEKRKNSSVSYEVDAVDIAEVFGLRHELIMKGDTIGIKDTAFTPALYLEARAIGGKESNTNPDRNKYTFGEYHEIVDYDAEMRRMYNRVQGLLNNKADKPLLDELEKKIKEQEKKMQKSVEENKVVKDIAERLKEKINNNMVDIIESENKPTKGLIDGKTLWRDISNGKPGILKLWTNGEWEPVVPDVESVKKETLDQVNKDIQLTKEELNKKVEKAQKETTGQFNEVNENLQEFSRTIKNVQNSQGEINKTVSEMKQTNKGFTKSIEELAKKDGEISSKLNTVEQTVESTKKTISDVQQTTNDLKKTTTDIEEKAGKISEKLTSVETKVNNINDDVTNLLVDSGTFEGAQRVSAVFPPRWYLKGGDARLSTDTFQGNSVYEVQANWSGIAYNFKDLIDRGVVKAGDKVTYSIYSRVKGLADGQTRDQTFFFYTGATGITLPTVTNQWKRVNATFTVTTAMMALTGTTIESFMRVEPSVGGSGSIWYQQSTPQLTVGDKVYTWRPAPEDLITNGEFNKKTTEIEKSVGGIKESIKTVEKTQVDFNERVNTVEKNAEGTTASVKKLQETQTEQGKTLTQATTTIQQHSEALKLTMKKKDVEDYVGGLGTVNVLRDTGFRFDRKYWYWNTDAGAMIKVDKNLQFKGLNTLSVTVSGQTQNRWWGLTGQYIPANAGEDWVASGYFNHDGKPPAFDNGTGAFVEIEWFDAAKKRISTNRAKANIINHTWVRVEITGKAPTNTKFVRYRVYAERNGRFWLGAPMLQMGKKASEFWENPKDQTDVDKMMDDIADRIATEQFNKRISEIQREIRVSAEGIEAAAKKRDIYVDTNFAKNSYVRELEGRLKVTEDNVSISVKEDNIIAKINVTKENILLNAQRIDLRGYVTAQHIRGQVLEGVTLKTAPSSEKRWVEMNKQDIRLYDAGEARTFLGFYNQKNGDLQPTFILGSNASSEIPGSFVVTTVIPKTANGYDYNRSVATIGMAGWYNATENIFRRNSEIVFYKENGGMFLNAYGPMNLKTRMDMFFETAWDNGGRNIQFEATRMFYVMAHGQINMKTNTSYFLESVEGKWMFQKTNSGGNTTLISDNGNDVDIRFAYTMLRSSHVPGYQGKIQVKNVNGNEFRDLEVRNIEHNGRIIQRSTQKLKEGVRDVDFSPLEKIMELKLKSYYLKTEMARLYEMRMHRKEGDELPTLKDLDVSYGFIAEQTDKVFLSPPGDGIDMYSTTAIHIAATQEIYEELQETQKENEKLKEHTKELIDRLSRLEGLVEELLLPKES
Physico‐chemical properties
protein length: 1551 AA
molecular weight: 176539.98120 Da
isoelectric point: 6.01984
aromaticity: 0.08704
hydropathy: -0.63972
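
The aromaticity and hydropathy values above are sequence-derived summary statistics. The record does not state how they were computed, so the following is only a minimal sketch under two assumptions: that aromaticity is the fraction of F, W and Y residues, and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The function names are illustrative and are not part of the database.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[res] for res in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence in this record.
    peptide = "MQNTELHIID"
    print(f"aromaticity: {aromaticity(peptide):.5f}")
    print(f"hydropathy:  {gravy(peptide):.5f}")
```

Passing the full protein sequence instead of the ten-residue peptide would allow a comparison with the reported values, provided the database uses the same definitions.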

Domains

Domains [InterPro]
DC_1274 | ATT | 1–642
Coil | Unmapped | 473–525
SSF57997 | STR | 474–616
PTHR46349 | Unmapped | 521–878
Sequence track: YP_009206320.1, residues 1–1551
Architecture
ATT 1-642 | STR 855-1217 | RBD 1239-1550
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
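
The architecture line lists labelled residue ranges separated by `|`. As an illustration only, the sketch below parses such a line and lists the residue ranges not covered by any segment, assuming 1-based inclusive coordinates and the protein length of 1551 AA given in this record. The helper names are hypothetical.

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'LABEL start-end | LABEL start-end' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        part = part.strip()
        if not part:  # tolerate a trailing separator
            continue
        label, span = part.split()
        start, end = (int(x) for x in span.split("-"))
        segments.append((label, start, end))
    return segments


def uncovered(segments: list[tuple[str, int, int]], length: int) -> list[tuple[int, int]]:
    """Return 1-based inclusive ranges of the protein not covered by any segment."""
    gaps, pos = [], 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > pos:
            gaps.append((pos, start - 1))
        pos = max(pos, end + 1)
    if pos <= length:
        gaps.append((pos, length))
    return gaps


if __name__ == "__main__":
    arch = parse_architecture("ATT 1-642 | STR 855-1217 | RBD 1239-1550")
    for label, start, end in arch:
        print(label, start, end, end - start + 1)
    print(uncovered(arch, 1551))
```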

Taxonomy

 | Name | Taxonomy ID | Lineage
Phage | Bacillus phage phi4B1 [NCBI] | 1643324 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host | Bacillus thuringiensis [NCBI] | 1428 | cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

Genbank protein accession
YP_009206320.1 [NCBI]
Genbank nucleotide accession
NC_028886 [NCBI]
CDS location
range 15212 -> 19867
strand +
CDS
ATGCAAAATACCGAACTTCACATAATAGATTTTAAAACACAATCAATCGTGGCAACATTCCAGGATCAAGATTATTGGGACGATATGCGCGAGTGGGAATTAAAAAATAATGTTGACATTTTAGAATTCAAAGTCTTTGACGGAACGCGCCAGGCTATAACATTACAGCAACAAAACCTAGTATTGCGTCAAGATCGACAAGGAAATGTAATTCCGTACACTATCGAAGATGAAGTTGAAAAAATAGCAAAAGATCGCTCTATTACAGTTAGGGCAGTCGGATCATGGACAGGATTAAGAAAAGCTGGCTTTATTCGTCCGCAAAAGTTAGAAGGATTAACGGCCCATCAATATGTTAGCCTTGCAACTGCTGGTACTAAGTGGCAACCAGGAAACATTGCATACGCGTCTTTCAGAACAATGACGCTAGACGAATTCACTGATCCTCTAACTTTATTAAAGAAAACTGCAACTCTATTTAACTTAGAATTGAATTACCGTGTAGAAGTAGACGGAAATAGAATAACAGGTTGGTTTGTTGATCTGGTTGAAAAGGTTGGACGCGTTACACGAAAAGAGATTGAGTTAGGGAAAGACTTGATTAACGTAACACGTATTGAACACTCAAAAAATATTTGTACGGCCCTGCTCGGTTTCGCGCGTGGTGAAGATAATGAAGTTATCACAATAGAAAAGATCAACAACGGATCGCCTTACATTGTTGATAATGAAGCATTTCAGCGCTGGAGTGAGAACGGCCAACATAAATATGGTTTCTATCAACCAGAAACAGAAGGAGAAATTGACGCAAAACGTTTAATGACTCTTATGAAAACTGAAATGGAGAAAAGAAAGAATTCTTCGGTTAGTTATGAAGTAGACGCGGTTGACATTGCTGAAGTGTTTGGACTGCGACACGAATTGATTATGAAGGGTGATACGATCGGAATTAAAGATACGGCGTTTACACCTGCTTTGTATTTAGAAGCGCGAGCGATCGGCGGGAAAGAGTCTAACACTAATCCAGATAGAAATAAATATACGTTTGGCGAATATCACGAGATCGTTGATTATGACGCGGAAATGAGAAGAATGTATAACCGCGTTCAAGGGTTACTTAATAATAAAGCCGATAAGCCCCTTTTAGACGAATTAGAGAAGAAAATCAAAGAGCAAGAAAAGAAGATGCAAAAATCTGTAGAGGAAAACAAGGTTGTCAAAGACATTGCTGAACGATTGAAAGAGAAAATAAATAACAATATGGTGGACATCATTGAATCGGAAAATAAGCCAACAAAAGGTTTAATTGATGGCAAAACCCTTTGGCGAGATATTTCAAACGGTAAACCCGGTATCTTAAAGTTATGGACGAATGGAGAATGGGAACCAGTAGTGCCGGATGTAGAGTCAGTAAAGAAAGAAACGCTAGATCAGGTTAATAAGGATATTCAGTTAACCAAAGAAGAATTAAATAAGAAAGTGGAAAAGGCACAAAAAGAAACCACTGGCCAATTTAATGAAGTAAACGAAAATCTTCAAGAGTTTTCTCGAACGATTAAAAACGTACAAAACTCTCAAGGTGAAATAAATAAAACTGTCTCTGAAATGAAACAAACGAACAAAGGCTTTACTAAATCTATTGAAGAATTAGCGAAAAAAGACGGTGAAATTAGCAGTAAATTAAATACAGTCGAACAAACTGTAGAAAGCACAAAGAAGACGATTTCCGATGTACAACAAACAACTAATGACTTAAAGAAAACAACTACTGATATAGAAGAGAAAGCTGGAAAAATCAGCGAAAAGTTAACAAGTGTAGAGACAAAAGTAAACAACATCAATGATGATGTAACTAACTTGCTTGTTGATTCTGGCACTTTTGAAGGAGCGCAACGTGTAAGTGCAGTGTTTCCTCCGAGGTGGTATCTAAAAGGCGGAGATGCCAGACTATCTACTGATACGTTTCAAGGAAACTCTGTATATGAAGTTCAAGCCAACTG
GTCAGGAATAGCTTATAATTTTAAGGATTTAATTGACAGAGGTGTTGTCAAAGCTGGGGATAAAGTTACTTACTCTATATATTCAAGAGTAAAAGGATTAGCAGATGGACAAACAAGAGATCAAACATTCTTTTTTTATACAGGAGCAACTGGCATAACACTCCCTACGGTTACTAATCAGTGGAAACGAGTTAACGCTACTTTTACAGTAACAACTGCAATGATGGCTCTAACAGGAACAACTATAGAGAGTTTTATGAGAGTTGAGCCTTCTGTTGGCGGGTCTGGGTCTATTTGGTACCAACAAAGCACACCGCAACTAACAGTTGGTGATAAAGTTTATACGTGGAGACCTGCACCAGAAGACTTAATAACAAATGGAGAATTCAACAAGAAAACAACCGAGATTGAAAAAAGTGTGGGTGGTATCAAAGAAAGTATTAAAACAGTAGAAAAAACACAAGTCGATTTTAATGAACGTGTTAACACTGTAGAAAAGAATGCTGAAGGAACAACTGCAAGTGTTAAGAAATTACAAGAAACACAAACTGAGCAAGGAAAAACATTAACTCAGGCTACTACAACGATACAGCAACATTCTGAAGCATTGAAATTAACAATGAAAAAGAAAGACGTTGAGGATTATGTAGGCGGTTTAGGCACTGTCAACGTGTTGAGAGATACTGGATTCCGTTTTGATAGAAAGTATTGGTATTGGAATACTGATGCTGGAGCAATGATCAAGGTCGATAAGAATCTACAATTCAAAGGACTAAATACATTAAGTGTGACAGTGAGTGGACAAACTCAAAATAGATGGTGGGGATTAACTGGACAATATATTCCTGCCAATGCTGGTGAGGATTGGGTTGCATCAGGATATTTTAATCACGATGGGAAACCACCTGCCTTTGATAATGGAACAGGTGCTTTTGTTGAAATCGAGTGGTTTGATGCAGCGAAAAAGAGAATTTCCACCAACCGAGCAAAGGCCAATATTATTAATCATACATGGGTGCGTGTTGAGATTACTGGAAAAGCGCCAACCAATACTAAATTCGTACGTTACAGAGTTTATGCTGAGCGAAATGGTCGTTTTTGGTTGGGGGCACCGATGTTGCAAATGGGTAAGAAAGCCTCTGAGTTTTGGGAAAACCCCAAAGATCAAACTGACGTTGATAAGATGATGGATGATATTGCTGATAGAATTGCCACGGAGCAATTTAATAAACGTATTTCTGAAATTCAACGTGAAATCAGAGTGAGTGCAGAGGGAATTGAAGCAGCAGCCAAAAAACGAGACATTTATGTAGATACTAATTTTGCTAAAAACTCTTATGTGCGTGAATTAGAAGGGCGTTTAAAAGTTACAGAAGATAATGTTTCTATTTCTGTTAAAGAAGACAACATTATCGCTAAGATTAATGTAACGAAAGAGAATATTCTACTTAATGCACAGCGTATTGACCTTAGAGGATATGTAACAGCTCAACATATTAGGGGACAAGTTCTTGAAGGTGTAACATTGAAAACTGCACCTTCTAGTGAAAAACGTTGGGTAGAGATGAATAAACAAGATATTCGTCTTTATGATGCGGGAGAGGCACGAACATTCTTAGGTTTCTACAATCAGAAGAATGGAGATCTTCAGCCGACATTCATTTTAGGAAGTAACGCAAGTTCTGAAATCCCTGGATCGTTCGTTGTTACAACTGTAATTCCTAAAACCGCAAACGGTTATGACTATAACAGGTCAGTAGCAACAATTGGTATGGCGGGATGGTACAACGCTACTGAAAATATCTTCAGAAGAAATTCAGAGATTGTTTTCTATAAAGAAAACGGCGGTATGTTCTTAAATGCATATGGTCCAATGAATTTAAAAACAAGAATGGATATGTTTTTTGAAACAGCATGGGATAATGGCGGGCGCAATATCCAGTTTGAAGCAACTAGAATGTTTTATGTCATGGCTCATGGTCAAATCAATATGAAGACGA
ACACAAGCTACTTCTTAGAGAGTGTTGAAGGGAAATGGATGTTCCAAAAAACGAATAGTGGAGGAAACACAACTCTGATTAGTGACAATGGAAATGACGTTGACATTCGTTTTGCCTATACCATGTTAAGGTCTTCTCATGTTCCTGGTTATCAAGGCAAGATTCAGGTGAAGAACGTTAACGGAAACGAATTTCGAGATCTCGAAGTTAGAAATATTGAACATAATGGACGTATCATCCAGAGGTCGACACAGAAGCTTAAAGAGGGCGTGAGGGATGTCGATTTTTCACCACTAGAAAAAATCATGGAACTAAAATTAAAAAGCTATTATTTGAAAACAGAGATGGCTAGATTGTATGAAATGAGGATGCACCGTAAAGAAGGAGATGAATTACCGACCCTTAAAGATCTCGATGTGAGTTATGGTTTCATAGCTGAACAAACCGATAAGGTGTTTTTGTCGCCACCAGGTGATGGAATTGATATGTATTCCACAACAGCGATACACATTGCTGCCACGCAAGAAATCTACGAAGAATTACAAGAAACCCAAAAAGAGAATGAGAAATTAAAAGAACATACAAAGGAATTAATAGATAGACTTTCTCGATTAGAAGGATTGGTTGAGGAATTATTACTTCCGAAGGAAAGTTGA
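
The CDS coordinates and the reported protein length can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the GenBank convention) and that the CDS includes its terminal stop codon. The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a CDS given 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


def protein_length(nt_length: int, includes_stop: bool = True) -> int:
    """Number of encoded residues; the stop codon, if present, encodes none."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    n = cds_length(15212, 19867)
    print(n, protein_length(n))
```

For the range 15212 -> 19867 this gives 4656 nt, that is 1552 codons including the stop, which matches the stated protein length of 1551 AA.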

Genome Context

Tertiary structure

PDB ID
8135d981bc54a8af47cef28143fee03a0e6145d43f68187278ab9554a9191f25
Source ColabFold
Method ColabFold
Resolution 0.7535
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
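
The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. This is a sketch only: the legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so assigning them to the lower band is an assumption made here, and the function name is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"   # assumption: exactly 90 falls here
    if score > 50:
        return "Low"    # assumption: exactly 70 falls here
    return "Very low"   # assumption: exactly 50 falls here


if __name__ == "__main__":
    for s in (95.2, 81.0, 63.5, 42.1):
        print(s, plddt_band(s))
```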

Literature

Title | Authors | Date | PMID | Source
Whole Genome Sequencing of Bacillus ACT Group Temperature Bacteriophages | Fouts,D.E., Rasko,D.A., Cer,R.R., Jiang,L., Fedorova,N.B., Shvartsbeyn,A., Read,T.D., Gill,S.R., Klumpp,J. and Calendar,R. | 2017-11-14 | | GenBank