Genbank accession
QBX20481.1 [GenBank]
Protein name
endopeptidase
RBP type
TF
Evidence RBPdetect2
Probability 0,77
Protein sequence
MRTQDIALKPFVNGASVGAESALEIWKENLVGDDTFDVKSDILTLGSFNWEIDKIGNARGALGGVAGSILDVYGGEYEFDNRTIILRKQMGRKAPTVLEYGRNIVSVEEERLLDGNYTSIYPYVRYTPQPKPQEEAPGKPHVGEHKQPEEQLVTLPEFILDGQYLSLYAQRRIQMVDLSSHFNDDKNKKEPTIEEIRKLAQKYLKDNNVGAPKVSIEVDYIDLSQTLDYQDFRVMEEVELCDIVPLYYPKFGITTESEKVVEIVYDVYTDSNHTIKLGTIGQSISKSLTGGVSERINALENNQKVITNNQKQFELNLPKYLNDINGKRVWYEKPDDNIEHKIGDYWFEKNGKYQRTWIWDGHQWVKVLDTEDLNPNQRAFDEAMAEIEKAKKAQEEINQRTDKELEEFRATLKNLALPEEAIKKITEAIKVDDIPSIKQSFDDLKNRVSETSEESRLTAEILGNNGKTRYNKNLLVGDPNRVKKIDEDYIEVEANDGGFKRGETYTISFIQTCELLKKVAVTLTQANNKGVKLVLTPTKAKMEPETFTLTKDTEVINVYPLSYKGVLTGDWYKSKQIDLTVSEAQELALEMAYKEVVDGNNADLVLDWAENPDIIFDGNGGI
Physico‐chemical
properties
protein length:622 AA
molecular weight: 70787,76790 Da
isoelectric point:4,85686
aromaticity:0,08521
hydropathy:-0,60498

Domains

Domains [InterPro]
DC_1571
ATT
10–280
IPR007119
Unmapped
34–278
IPR010572
ENZ
68–289
DC_1613
STR
219–594
QBX20481.1
1 622
Architecture
ATT
STR
ATT 10-280 | STR 281-594 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage Javan523
[NCBI]
2548236 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Streptococcus pyogenes MGAS8232
[NCBI]
186103 Bacillota > Bacilli > Lactobacillales > Streptococcaceae > Streptococcus > Streptococcus pyogenes

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QBX20481.1 [NCBI]
Genbank nucleotide accession
MK448793 [NCBI]
CDS location
range 31177 -> 33045
strand +
CDS
ATGCGCACACAGGATATTGCTTTAAAACCGTTTGTAAACGGTGCGAGCGTAGGAGCCGAATCAGCTTTAGAAATCTGGAAAGAAAACCTTGTTGGTGATGATACTTTTGACGTTAAAAGCGACATCTTAACGCTTGGTAGCTTTAACTGGGAAATTGATAAAATCGGCAATGCCCGTGGTGCTCTAGGAGGTGTCGCTGGCTCTATCCTAGATGTTTACGGTGGTGAGTACGAGTTTGACAACCGTACAATCATCTTACGCAAGCAAATGGGGCGTAAAGCTCCCACGGTATTGGAGTATGGCCGTAATATCGTCAGCGTAGAGGAGGAGCGATTGCTAGATGGCAATTACACCTCTATCTATCCTTACGTAAGATATACGCCACAACCAAAACCGCAAGAGGAAGCCCCTGGTAAGCCGCATGTAGGCGAGCATAAACAACCCGAAGAACAGCTAGTGACATTGCCTGAATTTATCCTAGATGGTCAGTATCTCAGCTTATATGCTCAGCGCAGAATCCAAATGGTTGATTTATCAAGTCATTTTAACGATGACAAAAATAAAAAAGAGCCAACGATCGAAGAAATCCGAAAGCTGGCTCAGAAATACCTTAAGGATAATAACGTTGGTGCACCAAAAGTCAGCATTGAGGTTGATTATATTGACTTGTCACAAACGCTTGACTATCAAGATTTTAGAGTCATGGAAGAGGTTGAGCTTTGCGACATTGTACCACTTTATTATCCAAAGTTTGGCATCACAACTGAGTCTGAAAAAGTCGTTGAGATTGTCTATGACGTCTATACAGATAGCAATCACACAATCAAATTAGGTACGATTGGTCAATCAATCTCTAAAAGTTTGACTGGTGGTGTTTCTGAACGTATTAATGCGTTGGAAAATAATCAAAAGGTAATTACTAACAACCAAAAACAATTTGAACTCAATCTGCCTAAATACCTCAATGACATCAATGGTAAACGCGTTTGGTACGAAAAACCAGATGACAATATTGAGCACAAGATAGGTGATTACTGGTTTGAAAAAAATGGTAAGTATCAGCGCACTTGGATTTGGGATGGCCATCAATGGGTCAAGGTACTAGATACAGAGGATTTAAACCCTAACCAACGGGCCTTTGACGAGGCAATGGCTGAAATCGAAAAAGCCAAAAAAGCGCAAGAAGAAATCAACCAGCGCACCGATAAAGAGCTTGAAGAATTTAGAGCTACCCTCAAAAACCTAGCGTTACCAGAGGAAGCGATTAAAAAAATCACAGAGGCTATCAAAGTTGATGACATCCCGTCTATTAAACAAAGCTTTGATGACCTCAAAAATAGAGTGAGTGAGACAAGCGAAGAATCTCGTTTAACTGCCGAAATTTTAGGAAATAACGGTAAGACCCGCTATAACAAAAATTTATTGGTTGGCGACCCTAATCGTGTTAAAAAAATTGATGAGGATTACATCGAGGTAGAAGCCAACGACGGTGGTTTTAAGCGTGGCGAGACCTACACGATTAGCTTTATCCAGACTTGTGAGCTACTCAAAAAAGTGGCTGTCACGCTGACACAGGCTAACAACAAGGGAGTTAAACTGGTACTGACACCTACTAAGGCAAAAATGGAGCCTGAGACTTTTACTCTAACTAAGGACACAGAGGTCATCAACGTCTATCCTTTGAGCTATAAAGGCGTTTTAACAGGCGACTGGTATAAATCTAAGCAAATAGATTTAACCGTGTCAGAGGCGCAGGAATTAGCTCTTGAGATGGCTTATAAAGAGGTTGTGGACGGCAATAATGCTGATTTAGTTTTGGATTGGGCGGAAAACCCAGATATTATTTTTGACGGAAACGGAGGTATTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
067324b8e8d37babae3054203af87b0495ae4dd89517f9a647f4f5eddae1dd33
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6501
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50