Genbank accession
AQW89118.1 [GenBank]
Protein name
hypothetical protein
RBP type
TF
Evidence RBPdetect
Probability 0,89
Protein sequence
MTVSATATVFTILDGNGNDTFTFPFKAFVEKTTPSMQSIVVWERAADGTDSLQIQGSDYVVLNIQAETGQIKFSTKTIPTGNKVVIASDVPYGQKVNFPDNAAIVPSQLQLAVDRIAQQLKQLAFLDGLKVGLEFPPEIDSLKLFVDPNITADAPVFFSQDQNDPNKYLLKAGDFTIAELKTLRDEIDAAKTAAESAKVAAEQSATNAASSAQTAANAAAQTATNAAQVAQSLIDISKIVADGKADITKLVTDGTTSLNQIVTTGTTSLNKLVTDGTTSLNQIVTDATLVIDNKVVEAENAATDAQTAAGVAKTEADRAEAAAVSLPDYTVGSAGDVFITDGDGNPPVRQKYISGTDFSKLALGEFFTADGSGGAEKSSFVPIRNEDIAQTDSVVKFFNDSSGKYLLATSFGAVSSKDVLSVNSANEGGLDQRQINELFKTRLNALAPVGEAYPWGMSNTIADFIVNQDGYLISTTDSFIVLPNATKTPNVSNEIVGNRNQYTTDGNYSIPSHINGKDFFLNDFSALLCLFIDSNISSGDATIVANTQFNISKIGTDIVYTVGADSWTIPYTANNWVCYLLASDGIGTVSNISGEYIGGVLKITDNGDQTITKNATSEDFELTLEPGIRMSRLVMVDKRVVQSDLEKFLKFIFNIPDSASGMLPPAVSTNGYGIVTDASQYTQKPVIYSKSFPNDWTIDWAVDTSSPGLLDKKAVVFKADVPNKTLTVEAGDSGSGTENFPSIFITNPPSGATYVSNIDGTAMDDGKKFNLTVTKNGTVFTFDIEDSMPTKTNTVVRNTLATISSATPVKIADADDATLVRNVFVISDEDDVGGIYVGQLADLTADRTKGSPVYTGTSIALQQNRDYYAISISDAADFKGKVNVVDEVVV
Physico‐chemical
properties
protein length:890 AA
molecular weight: 94590,25150 Da
isoelectric point:4,36526
aromaticity:0,07865
hydropathy:-0,08719

Domains

Domains [InterPro]
DC_1672
STR
57–294
Coil
Unmapped
180–200
AQW89118.1
1 890
Architecture
STR
STR 57-294 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Xenohaliotis phage pCXc-HR2015
[NCBI]
1933104 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Candidatus Xenohaliotis californiensis
[NCBI]
84677 Pseudomonadota > Alphaproteobacteria > Rickettsiales > Anaplasmataceae > Candidatus Xenohaliotis >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AQW89118.1 [NCBI]
Genbank nucleotide accession
KY296501 [NCBI]
CDS location
range 12251 -> 14923
strand -
CDS
ATGACCGTTTCAGCAACAGCAACAGTATTTACCATTTTAGATGGTAATGGTAATGATACATTTACCTTTCCATTTAAGGCTTTTGTAGAAAAAACAACACCATCAATGCAGTCAATTGTAGTCTGGGAGCGTGCTGCTGATGGTACTGACTCTCTGCAAATTCAAGGAAGTGATTACGTTGTACTTAATATCCAGGCAGAAACTGGGCAGATTAAATTTTCAACTAAAACTATTCCTACTGGAAATAAAGTAGTAATAGCATCTGATGTTCCTTATGGGCAAAAGGTAAATTTTCCTGACAATGCTGCAATAGTTCCATCACAATTACAACTTGCAGTTGATAGAATTGCTCAACAATTGAAGCAGTTAGCATTTCTTGATGGTCTAAAGGTTGGATTGGAATTTCCGCCTGAGATTGATAGTTTAAAACTCTTTGTTGACCCAAATATTACTGCAGATGCTCCTGTCTTTTTCTCTCAAGATCAAAATGATCCAAATAAGTATCTTTTAAAGGCCGGTGACTTCACTATAGCTGAGCTAAAAACACTGCGTGATGAGATTGATGCAGCTAAAACAGCAGCCGAATCAGCCAAAGTTGCTGCTGAGCAGTCTGCTACTAACGCTGCTTCATCAGCCCAAACAGCCGCTAATGCTGCCGCCCAAACAGCCACTAATGCAGCACAGGTGGCGCAGTCATTGATTGATATTTCTAAGATTGTAGCTGATGGAAAAGCTGATATTACAAAATTAGTTACCGATGGAACTACATCGTTAAACCAAATTGTTACTACTGGAACTACATCATTGAATAAATTAGTTACTGATGGAACTACATCATTGAATCAAATTGTTACTGATGCTACGTTGGTTATTGATAATAAAGTAGTAGAGGCAGAAAATGCAGCAACAGATGCACAAACGGCAGCTGGGGTGGCTAAAACTGAGGCCGACAGAGCTGAAGCGGCAGCCGTTTCACTTCCGGATTATACAGTTGGCAGCGCTGGTGATGTGTTTATTACTGACGGTGATGGTAATCCTCCTGTTCGTCAAAAGTATATCTCTGGCACAGATTTCTCCAAGCTTGCTCTAGGAGAGTTTTTTACAGCAGATGGAAGTGGGGGCGCTGAGAAAAGTTCTTTTGTACCTATTAGGAATGAGGATATCGCTCAAACAGATTCAGTTGTAAAGTTTTTTAATGATAGTAGCGGCAAATATCTGCTTGCAACTAGCTTTGGCGCTGTATCCAGTAAGGATGTGCTCTCAGTTAACTCTGCGAATGAAGGTGGTTTGGATCAGAGGCAGATCAATGAGCTGTTTAAAACAAGATTAAATGCTCTAGCCCCTGTTGGAGAAGCCTATCCATGGGGAATGTCTAACACAATAGCAGATTTCATAGTTAACCAAGATGGCTACTTGATTTCAACTACAGATAGTTTTATCGTCTTACCTAATGCTACCAAGACACCAAATGTTTCTAATGAGATTGTTGGGAACAGAAATCAGTACACTACTGATGGAAACTACTCAATCCCATCTCATATCAATGGTAAGGATTTTTTTCTCAATGATTTTTCCGCTCTCCTGTGTCTTTTTATTGACAGTAACATTAGCTCTGGTGACGCAACTATAGTTGCCAATACCCAGTTTAATATTAGTAAAATTGGTACCGACATTGTCTATACCGTTGGTGCAGACTCTTGGACAATTCCTTATACTGCTAACAACTGGGTTTGTTATCTACTCGCCTCTGATGGTATTGGCACTGTGTCTAATATTTCCGGTGAATACATTGGCGGTGTGCTAAAGATTACTGACAATGGTGATCAGACAATCACAAAGAATGCAACCTCAGAGGATTTCGAACTGACGTTAGAGCCAGGAATTAGGATGTCTAGGCTAGTGATGGTAGATAAAAGGGTAGTGCAGTCAGACCTAGAGAAATTCTTGAAGTTTATCTTTAACATCCCTGACTCTGCTTCAGGAATGTTACCACCTGCTGTGTCCACCAATGGCTATGGGATAGTAACTGATGCTAGCCAGTACACCCAAAAACCAGTTATCTATAGTAAGTCATTTCCAAATGACTGGACTATTGATTGGGCGGTAGATACCTCATCTCCTGGTCTTCTGGATAAAAAAGCTGTTGTGTTTAAGGCTGACGTTCCTAATAAGACATTAACTGTTGAAGCAGGTGACTCTGGCTCTGGCACTGAGAACTTTCCTAGCATCTTCATCACCAACCCACCATCTGGGGCTACCTATGTCTCTAATATTGATGGCACAGCAATGGATGATGGTAAGAAATTCAACCTAACCGTCACAAAAAATGGAACAGTATTCACTTTTGATATTGAGGATAGTATGCCTACTAAAACTAACACCGTTGTCCGCAACACACTTGCAACCATATCATCTGCCACACCTGTAAAAATAGCTGATGCTGATGATGCCACACTAGTCAGAAACGTCTTTGTCATTAGCGATGAGGATGATGTTGGTGGTATTTACGTTGGCCAGTTGGCTGATTTAACTGCTGACAGAACTAAAGGTTCACCTGTTTATACAGGAACCAGTATAGCGCTACAGCAAAACAGAGATTACTATGCCATCTCAATATCTGATGCTGCTGACTTTAAAGGTAAGGTTAATGTGGTTGATGAGGTAGTGGTTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
5c89ecd865186cdd2a3d02f23295fe95b5762f3fd952d26d4c4033d7a1509fe8
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5597
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete genome sequence of the phage hyperparasite of Candidatus Xenohaliotis californiensis pathogen of Haliotis spp Cruz-Flores,R., Caceres-Martinez,J., Del Rio Portilla,M.A., Licea-Navarro,A.F., Gonzales-Sanchez,R., Guerrero,A. and Castro-Longoria,E. 2018-01-11 GenBank