Protein

GenBank accession: ATE85479.1 [GenBank]
Protein name: host specificity protein J
RBP type: TSP (evidence: RBPdetect; probability: 0.87)
RBP type: TF (evidence: RBPdetect2; probability: 0.95)
RBP type: TF (evidence: Phold; probability: 1.00)
Protein sequence
MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVEGPVDGLKSVLLNSTPVLDSEGNTNISGVTVVFRAGEQEQSPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVVGNLPPRPFNIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQCCDQSVPDGFGGTEPRITCNAWLTTQRKVWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPDNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISTGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVPDGVAGYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFMLRLTVAADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETTARYLGTALYWIAASINIKPGHNYYFYVRSVNTVGKSAFVEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLAEIRTSITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQIFMNEVFLKRLTAPTITSGGNPPVFSLTPDGRLTAKNADISGNVNANAGTLNNVTVNENCTIKGMLEATQVRGDFVKAVSKSFPKQAGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYSDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKVFHKGNQWAGNITDCTVIVTKKAASGISIR
Physico-chemical properties
protein length: 1165 AA
molecular weight: 127208.36420 Da
isoelectric point: 5.85165
aromaticity: 0.07897
hydropathy: -0.31991
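
The aromaticity and hydropathy values above can be recomputed from the protein sequence. The sketch below assumes the conventional definitions: aromaticity as the fraction of F, W, and Y residues, and hydropathy as the mean Kyte-Doolittle score (often called GRAVY). The record does not state which tool produced its values, so small differences are possible. The function names and the 20-residue fragment are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (Kyte & Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    seq = seq.upper()
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Assumption: only the first 20 residues of the record's sequence are used
# here for brevity; pass the full 1165-residue sequence to compare with the
# values listed in the record.
FRAGMENT = "MGKGSSKGHTPREAKDNLKS"

print(f"length:      {len(FRAGMENT)} AA")
print(f"aromaticity: {aromaticity(FRAGMENT):.5f}")
print(f"hydropathy:  {gravy(FRAGMENT):.5f}")
```

A negative hydropathy value, as listed in the record, indicates a protein that is hydrophilic on average under this scale.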

Domains

Domains [InterPro]

Taxonomy

Phage: Escherichia phage Ayreon [NCBI]
Taxonomy ID: 2040288
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: No host information

Coding sequence (CDS)

GenBank protein accession: ATE85479.1 [NCBI]
GenBank nucleotide accession: MF807953 [NCBI]
CDS location: range 14946 -> 18443, strand +
CDS
ATGGGTAAAGGCAGCAGTAAGGGGCATACCCCGCGCGAAGCGAAGGACAACCTGAAATCCACGCAGCTGCTGAGTGTGATTGATGCCATCAGCGAAGGGCCGGTTGAAGGTCCGGTGGATGGATTAAAAAGCGTGCTGCTGAACAGTACGCCGGTGCTGGACAGTGAGGGGAATACCAATATATCCGGCGTCACGGTGGTGTTCCGGGCAGGTGAGCAGGAGCAGTCACCGCCGGAGGGATTTGAATCCTCCGGCTCCGAGACGGTGCTGGGTACGGAAGTGAAATATGACACGCCGATCACCCGCACCATTACGTCGGCAAACATTGACCGTCTGCGCTTTACCTTCGGTGTACAGGCACTGGTGGAAACCACCTCAAAGGGTGACAGGAATCCGTCGGAAGTCCGTCTGCTGGTTCAGATACAGCGTAATGGTGGCTGGGTGACGGAAAAAGACATCACCATTAAGGGCAAAACCACCTCGCAGTATCTGGCCTCGGTGGTGGTGGGTAACCTGCCGCCGCGCCCGTTCAATATACGGATGCGCAGGATGACGCCGGACAGCACCACAGACCAGCTGCAGAACAAAACGCTCTGGTCGTCATACACCGAAATCATCGATGTGAAACAGTGCTACCCGAACACGGCACTGGTCGGCGTGCAGGTGGACTCGGAGCAGTTCGGCAGCCAGCAGGTGAGCCGTAATTATCATCTTCGCGGACGCATTCTGCAGGTGCCGTCGAACTATAACCCGCAGACGCGGCAATACAGCGGTATCTGGGACGGAACGTTTAAGCCAGCATACAGCAACAACATGGCCTGGTGTCTGTGGGATATGCTGACCCACCCGCGTTACGGCATGGGTAAACGTCTTGGTGCGGCAGATGTGGATAAATGGGCGCTGTATGTCATCGGCCAGTGTTGCGACCAGTCGGTGCCGGACGGTTTTGGCGGCACGGAGCCGCGCATCACCTGTAATGCCTGGCTGACCACACAGCGTAAGGTGTGGGATGTTCTCAGTGATTTCTGCTCGGCGATGCGCTGTATGCCGGTATGGAACGGGCAGACGCTGACGTTCGTGCAGGACCGGCCATCAGATAAGGTGTGGACCTATAACCGCAGTAATGTGGTGATGCCGGATGATGGCGCGCCGTTCCGCTACAGCTTCAGCGCCCTGAAGGACCGCCATAATGCCGTTGAGGTGAACTGGATTGACCCGGACAACGGCTGGGAGACGGCGACAGAGCTTGTGGAGGATACGCAGGCCATTGCCCGTTACGGTCGTAACGTCACGAAGATGGATGCCTTTGGCTGTACCAGCCGGGGGCAGGCACACCGCGCCGGGCTGTGGCTGATTAAAACGGAGCTGCTGGAAACGCAGACCGTGGACTTCAGCGTGGGTGCTGAAGGGCTTCGCCATGTACCGGGCGATGTCATTGAAATCTGCGATGATGACTATGCCGGTATCAGCACCGGCGGGCGCGTGCTGGCGGTGAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACGCTGCCATCCTCCGGTACCACGCTGATAAGCCTGGTTGACGGAAGTGGCAATCCGGTCAGCGTGGAGGTCCAGTCCGTCACCGACGGCGTGAAGGTAAAAGTGAGCCGTGTTCCTGACGGCGTTGCCGGATACAGCGTATGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGATGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTGAATGGTGTCACGCCGCCAGCAGTGCAGCATCTGACCGCCGAAGTCACCGCAGACAGCGGGGAATACCAGGTGCTGGCCCGCTGGGACACGCCGAAGGTGGTGAAGGGGGTGAGCTTTATGCTTCGCCTGACCGTGGCAGCGGATGACGGCAGTGAGCGGCT
GGTCAGCACGGCCCGGACGACGGAAACCACTTACCGCTTCACACAACTGGCTCTGGGGAACTACAGGCTGACAGTCCGGGCAGTAAATGCATGGGGACAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCGCGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACGCCGCATCTTGCGGTTTATGATCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAGCGGATTGCGGATATCAGGCAGGTTGAAACCACAGCACGCTATCTTGGTACGGCGTTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATAATTATTATTTTTACGTTCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCTGTTGGTCAGCCGAGTGATGACGCATCCGGCTATCTGGATTTTTTCAAAGGCGAGATAGGGAAAACCCATCTGGCTCAGGAGCTGTGGACGCAGATTGATAACGGTCAGCTTGCGCCTGACCTGGCTGAAATCAGGACATCCATTACGGATGTCAGCAATGAAATCACACAGACCGTCAATAAGAAACTGGAAGACCAGAGTGCGGCAATTCAGCAGATACAGAAGGTTCAGGTTGATACAAATAATAACCTGAACAGCATGTGGGCTGTGAAGCTGCAGCAGATGCAGGACGGACGCCTTTATATCGCGGGTATTGGTGCCGGTATTGAGAACACCCCTGACGGCATGCAGAGTCAGGTGCTGCTGGCGGCAGACAGGATTGCGATGATTAATCCTGCGAATGGCAACACAAAGCCGATGTTTGTTGGTCAGGGCGATCAGATATTCATGAACGAAGTGTTCCTGAAACGCCTGACGGCTCCCACCATTACCAGCGGCGGTAATCCTCCGGTATTTTCCCTGACACCGGACGGGCGGCTGACGGCGAAAAATGCCGATATCAGCGGTAACGTGAATGCGAACGCCGGGACGCTCAACAATGTCACGGTAAATGAAAACTGTACGATTAAGGGCATGCTGGAGGCGACTCAGGTCAGAGGTGACTTCGTTAAAGCTGTATCCAAATCATTCCCGAAACAGGCTGGTACGTGGGGTAACACGGAAACACCAAACGGGACGGTTACAGTCACCATCAGCGATGATCATAACTTTGACCGTCAAATCATTATTCCACCCATTATCTTTAACGGAATAGCGTATAGCGATCCGGGAAGTGGTAATAACCCGGGAGGTACAAGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGTGTATTAATCGCATCCAGAGAAACTAAAGGGGCCATTCCCGGTAGCTACAGTGCGGTTATTGATATGCCGAGTGGCAGGGGAAGCGTCACTCTGGAGTTTAAGGTTTTCCATAAAGGCAATCAGTGGGCAGGTAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCTGCTTCCGGCATCAGTATTCGTTGA
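
The CDS coordinates and the protein length can be cross-checked with simple arithmetic: an inclusive range 14946 -> 18443 spans 3498 nucleotides, or 1166 codons, which is 1165 amino acids plus one stop codon. The sketch below illustrates this check and translates the first and last few codons of the CDS with the standard genetic code. The function names are hypothetical, and the two short nucleotide snippets are copied from the start and end of the sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n // 3 - 1


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates taken from the record (strand +).
print(cds_length(14946, 18443))
print(expected_protein_length(14946, 18443))

# First 18 and last 9 nucleotides of the CDS listed above.
print(translate("ATGGGTAAAGGCAGCAGT"))
print(translate("ATTCGTTGA"))
```

The translated snippets can be compared with the first and last residues of the protein sequence given in the Protein section.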

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 309cf1689636a9c1b0360259b395732eb4f081b8b1a0ae0b957b8e6f7372bc8d [ESMFold]
Source: ESMFold
Method: ESMFold
Resolution: 0.8095
Evidence: 0.8095

Literature

Title: Complete Genome Sequence of the Escherichia coli Phage Ayreon
Authors: Vlot, M., Nobrega, F.L., Wong, C.F.A., Liu, Y. and Brouns, S.J.J.
Date: 2018
PMID: 29326205
Source: GenBank