Protein

Genbank accession
AGC34303.1 [GenBank]
Protein name
hypothetical protein
RBP type
TF
Evidence RBPdetect
Probability 0,82
Protein sequence
MAYQSKVKTYGAPADERAERPDSWRWKQNEPPVAEYLNDLNYNVIEDIKHLVALTNAIDPDNDGMVANADKLDGKHAEELGGFKYVQTETPDTPAVGNSWFKDTNGLILVGDGEKYQPQPEVGYSETNDFTEAGYSVLHETVPRTKIDELGSIRLINELVVENFDGGKVGPGLSTWAWTDSSGLTAQNTTVISGTHSGEYSVAGTLDAITLKREAPIIQDFETTFQIGSDTGNISDYSELIVKAQDGTLIGGVRFNDGNGSLVVLDDARSPIENIQSAWSVGQNYSFEWDWDFGNSQYDLYMDGALVGTYSLPAGVSDFGEFTVRQDNSSSGATRSVFIDDLHTGAREYGEAVITLPEPDQRIQSWDIIRIMRTSANESVVVDVEDSTGTTLLADVHSEDDLSAFVSASENFQFRAKFSRTNTANNPSLDGVYRRWTMRPGDTGLSKKVRERQEAERNRARYIHTRLATRGQ
Physico‐chemical
properties
protein length:472 AA
molecular weight: 52136,44740 Da
isoelectric point:4,53094
aromaticity:0,09110
hydropathy:-0,55784

Domains

Domains [InterPro]
AGC34303.1
1 472
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Halorubrum sodomense tailed virus 2
[NCBI]
1262527 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Halorubrum sodomense
[NCBI]
35743 Archaea > Euryarchaeota > Halobacteria > Haloferacales > Haloferacaceae > Halorubrum

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AGC34303.1 [NCBI]
Genbank nucleotide accession
KC117376 [NCBI]
CDS location
range 24295 -> 25713
strand +
CDS
ATGGCATACCAAAGCAAAGTCAAAACCTACGGCGCTCCGGCGGACGAGCGGGCCGAGCGGCCCGATTCGTGGCGCTGGAAACAAAACGAGCCGCCTGTCGCTGAGTACCTGAACGACCTGAACTACAACGTCATCGAGGACATCAAGCACCTCGTGGCGCTGACGAACGCCATCGACCCGGACAACGACGGGATGGTGGCGAACGCCGACAAGTTGGACGGCAAGCACGCCGAGGAACTTGGCGGATTCAAGTACGTTCAAACCGAGACCCCGGACACACCCGCAGTGGGTAATTCGTGGTTCAAAGACACGAACGGCCTCATCCTCGTTGGTGACGGCGAAAAATACCAGCCTCAACCCGAAGTCGGCTACAGTGAAACTAACGACTTTACTGAAGCGGGCTACAGCGTTCTTCACGAAACTGTTCCAAGAACGAAAATAGATGAACTCGGTAGTATCCGACTCATCAACGAACTCGTTGTTGAGAATTTTGATGGGGGAAAAGTTGGTCCGGGCCTTAGCACTTGGGCGTGGACCGACTCATCGGGTCTGACCGCGCAGAACACCACGGTCATCTCGGGTACTCACAGCGGCGAGTATTCGGTCGCTGGCACGCTCGACGCAATTACGCTAAAGCGTGAAGCGCCCATCATCCAAGACTTCGAGACGACGTTCCAAATCGGCTCTGACACCGGGAACATCAGCGACTACTCGGAACTCATCGTCAAAGCACAGGACGGAACGCTCATCGGGGGCGTTCGGTTCAACGACGGTAACGGCTCGCTGGTCGTACTCGATGACGCCCGCTCGCCTATCGAAAACATCCAGTCCGCGTGGTCTGTCGGACAGAACTATTCGTTCGAGTGGGATTGGGACTTCGGGAACTCCCAGTACGACCTGTATATGGACGGCGCGTTGGTCGGGACGTACTCCCTTCCAGCGGGCGTGAGTGACTTCGGTGAGTTCACTGTCCGGCAAGACAACTCCTCGTCCGGTGCCACCCGGTCGGTCTTCATAGACGACCTCCACACCGGAGCGCGTGAATACGGCGAAGCAGTCATTACTCTACCCGAGCCGGACCAGCGTATTCAGTCGTGGGACATCATCCGCATTATGCGGACTTCGGCTAACGAGAGCGTCGTCGTAGACGTGGAAGACTCTACAGGCACGACGCTACTTGCAGACGTTCACAGCGAAGACGACCTTTCGGCTTTCGTCTCTGCTTCAGAAAACTTCCAATTCAGGGCAAAATTCAGTCGGACTAACACGGCAAATAACCCTTCTCTCGATGGCGTGTACCGGCGCTGGACGATGCGCCCCGGCGACACGGGTCTCAGCAAGAAGGTACGAGAACGACAAGAAGCAGAACGAAACAGAGCGCGATACATCCACACTCGGCTGGCAACGCGCGGTCAATAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
9a4546dcb27bccb8fa1a1f5b7b60e458d383e415c4b4439d74f74c9fd66fe135
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7479
Evidence 0,7479

Literature

No literature entries available.