Genbank accession
YP_008083077.1 [GenBank]
Protein name
hypothetical protein
RBP type
TF
Evidence RBPdetect
Probability 0,88
Protein sequence
MATQIQVNRSGTFEDIDYSDFSIELGTSKAQIAPTASVTSQARESIQAGQELRIIIDGTTRFEGVTQSAGTKSRNGQRRVEVEHSGVKLMEEPVTLSLASPSVSDVLNAAVDAADSGGSWTVDTSDLPALTLSNDYNVESRKVKRIFRDMTDRAEAVWWIEPTGETIHVANGGDGGLWQSFDTQTDRIRVDEFDEGDVKTVRNDVEVIGTGDVAVSATETNSTSISTYGRRAGESPYKVNYVTTEAEAGALAQALLQPDPLAQGKITVSAASGAVESPQVNKTVDLTDPSKDIDETDLTINKQVIRQSRAELHIGQGATDAIARLNRNSKSEGDVTEPGSVYDTDRIDDLAITTEKLVDTSVIEGKIADLSISETKVQDESISTPKLQAEAVVAAKIEAGTITAVEIAAGTITANEIDTLDLDTQQFTVGADTSFLSFTTEQGATGEALVMEPDGTDGFAFFGNPFGSADVSVYSDAGNEMDAIRPFNGDNTGNVGNSSEAYADVYAHNFVTASPDPIESVDTESVTDVDWYDNPPEEIRRRARDIGDTDSEVPEGRDHTPVELGTMANWLLETCKAQQERISDLEERLSEIEEKV
Physico‐chemical
properties
protein length:596 AA
molecular weight: 64032,02700 Da
isoelectric point:4,22680
aromaticity:0,05201
hydropathy:-0,43960

Domains

Domains [InterPro]
DC_0591
STR
4–570
Coil
Unmapped
568–595
YP_008083077.1
1 596
Architecture
STR
STR 4-570 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Haloarcula sinaiiensis tailed virus 1
[NCBI]
1262530 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Haloarcula sinaiiensis
[NCBI]
35742 Archaea > Euryarchaeota > Halobacteria > Halobacteriales > Halobacteriaceae > Haloarcula

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_008083077.1 [NCBI]
Genbank nucleotide accession
NC_021471 [NCBI]
CDS location
range 16908 -> 18698
strand +
CDS
ATGGCAACGCAGATTCAGGTCAACCGCTCGGGGACCTTCGAGGACATCGACTACAGTGACTTTTCCATCGAACTCGGTACGTCGAAGGCACAGATAGCCCCGACGGCGAGTGTTACGTCACAGGCGCGGGAGTCCATACAGGCTGGGCAGGAACTCCGCATCATCATCGACGGGACGACGCGCTTCGAGGGCGTGACACAGTCGGCCGGCACCAAGAGCCGCAACGGACAGCGTCGGGTCGAAGTCGAACACTCCGGGGTGAAGCTCATGGAGGAACCCGTGACGCTGTCACTGGCTTCGCCGTCGGTGTCGGATGTGCTCAATGCCGCGGTGGATGCGGCCGATAGTGGGGGTTCGTGGACTGTTGACACCAGCGACCTCCCGGCGCTGACCCTCTCGAATGACTACAACGTTGAATCGCGGAAGGTAAAGCGCATCTTCCGGGACATGACCGACCGCGCCGAAGCGGTGTGGTGGATTGAACCGACCGGCGAGACCATCCACGTCGCCAACGGCGGCGACGGTGGCCTGTGGCAGTCCTTCGATACACAGACGGACAGGATTCGCGTCGATGAGTTTGATGAGGGCGACGTGAAGACGGTTCGCAACGACGTAGAGGTTATCGGGACGGGCGACGTGGCGGTGTCGGCCACCGAGACGAACAGCACGAGCATCAGCACCTACGGCCGGCGGGCCGGAGAGAGTCCGTATAAGGTCAACTACGTGACGACGGAAGCCGAAGCCGGCGCGTTGGCACAAGCGTTGCTCCAACCCGACCCGCTGGCGCAGGGGAAGATAACGGTCTCGGCGGCGTCGGGCGCGGTGGAATCCCCGCAGGTCAACAAGACCGTCGACCTTACCGACCCGTCGAAGGACATTGATGAAACGGACCTCACCATCAACAAGCAAGTGATACGGCAGTCCCGCGCGGAACTGCATATCGGGCAGGGAGCCACGGACGCTATCGCGCGGCTAAATCGCAACTCCAAGAGCGAGGGGGACGTGACAGAACCGGGGTCGGTTTACGATACCGACAGGATAGACGACTTGGCAATCACGACTGAGAAGTTGGTCGACACGAGTGTCATCGAAGGGAAAATAGCGGACCTGTCCATCTCCGAGACGAAGGTGCAGGACGAAAGTATATCCACGCCGAAGCTACAGGCCGAGGCCGTCGTCGCGGCGAAGATAGAAGCGGGAACTATCACCGCCGTCGAAATTGCCGCCGGGACCATCACAGCCAACGAGATAGACACGCTCGACTTGGATACGCAACAGTTCACCGTCGGGGCCGACACGTCCTTCTTGTCATTCACCACCGAGCAGGGGGCGACTGGTGAAGCCCTCGTCATGGAACCCGACGGGACAGACGGTTTCGCCTTTTTCGGCAACCCGTTCGGGAGTGCCGACGTGTCTGTGTACTCCGACGCCGGCAACGAGATGGACGCGATACGCCCGTTTAACGGCGACAACACGGGCAACGTCGGCAACTCCAGCGAAGCGTATGCGGATGTATACGCACACAACTTCGTGACCGCATCACCGGACCCCATCGAATCCGTCGACACCGAGTCCGTAACGGATGTTGACTGGTACGACAACCCGCCAGAGGAGATTAGGCGGCGGGCGCGGGACATCGGCGACACCGACTCCGAAGTTCCCGAGGGGCGGGACCACACGCCGGTCGAACTCGGGACTATGGCGAACTGGCTGCTGGAGACGTGCAAGGCACAGCAAGAGCGCATCAGCGACTTGGAAGAACGCTTATCAGAAATAGAGGAGAAGGTGTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
218cfc3fd2e7bd6218cec3714da1fb5e87733efb010385dd40b0242181da343a
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7590
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50