Protein

Genbank accession
AGO49462.1 [GenBank]
Protein name
structural protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect2
Probability 0,92
TF
Evidence Phold
Probability 1,00
Protein sequence
MTKIKLDIDQIEDLPISKVIGLEDALVNTSSEATVFSELGGAPDYTQTIAQAYPSVTLTEVQVVNSSATLSDSASWFYLQSGVNKQNNVGFYAFKDNQHVKIPNGQYYMSRPLELGESAGATFDFTDTYLKPSTGYTGFLVSIINDSSNQAYIWIKNLKISGEWRAKGIDCMRFQQLSFEDLLIEKVVLGVEMRGVWYSNFLGTSTVRDCLNAIKFTSTQNQSFDEVNGTGFYGLSIRAASADNLIDVSKFNVASNYQTKAFEIETRTLGGKFNDMTIEGFVHAFYLTDLNNGGVDNKFRSNITGNYFEANDNIFYFSPTLSVSTVDMTFKGNLINNSVNAESYFTYKQGNVEFSGNMLADYNRYKITILDGPDNVFSNLVTDLLPSNVVNLDASFGTRVTYENQKISTTEWDRFTYGTTNFGTFNPIDRKVTANQMQDVEGFVAVAPESVPTLSSQNLHYKKAYQNSLKDISFIDKNKGVILKDRTTDKAYRLSIDNGEFFFEEEIIADKVYDVVGVKQPNFFIRHTDPVSISGLYYMQGMYTAPMVWDGTMWTSGGIRRIGTSAEKSTTGTYYDTTLGKFVRGAGETYVINGENKIEPISKSDLESSNALQWVDITGNKTIVKEELGSIQELNLEATYHITIPTQVNEEFEGNGVITFENQLATGKIIIEPETGIELPRLQLTGENAWGSIFRKAEDVWRYKGFGLYVNDSPEIYTVLNALSKSNNTDATTGLVTSSATVTSESEVVNGETIFVAQATSVDPPTQFSRFFGNSNVNAAGSGVAYEYSFWARAIAATIGSQSVTIGGGVATSARHILTADWAYYSGEAVTAGTNIISLDANTNRNATTGGLAGVAGDTVQVANYSLKRKEE
Physico‐chemical
properties
protein length:872 AA
molecular weight: 96041,66650 Da
isoelectric point:4,73863
aromaticity:0,11353
hydropathy:-0,25344

Domains

Domains [InterPro]

No domain annotations available.

Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Cellulophaga phage phi4:1
[NCBI]
1328029 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Cellulophaga baltica
[NCBI]
76594 Bacteria > Bacteroidetes > Flavobacteriia > Flavobacteriales > Flavobacteriaceae > Cellulophaga
Host Cellulophaga baltica 4
[NCBI]
1348582 Bacteria > Bacteroidetes > Flavobacteriia > Flavobacteriales > Flavobacteriaceae > Cellulophaga

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AGO49462.1 [NCBI]
Genbank nucleotide accession
KC821632 [NCBI]
CDS location
range 57609 -> 60227
strand -
CDS
ATGACAAAAATAAAATTAGATATAGACCAAATAGAAGACTTACCAATTTCAAAGGTAATAGGTCTTGAAGATGCATTAGTTAACACATCAAGTGAAGCTACAGTATTTAGTGAATTAGGCGGTGCTCCAGATTACACTCAAACTATAGCACAAGCTTATCCTTCCGTTACTTTAACAGAGGTGCAAGTAGTAAATAGTTCAGCTACTTTATCAGATAGTGCTTCATGGTTTTATTTACAAAGTGGAGTAAATAAACAAAATAACGTAGGATTCTATGCATTTAAAGATAATCAACATGTAAAAATACCTAATGGTCAATACTATATGTCTAGACCTCTAGAGCTAGGGGAAAGTGCAGGAGCTACTTTTGATTTTACTGATACATATTTAAAGCCTTCTACAGGATACACTGGATTCTTAGTGAGCATTATAAACGATTCTTCTAATCAAGCTTACATTTGGATTAAAAACCTGAAGATATCTGGCGAGTGGAGAGCTAAGGGAATAGATTGTATGAGATTTCAGCAATTAAGTTTCGAAGATCTACTTATAGAAAAAGTAGTTTTAGGTGTAGAAATGAGAGGAGTTTGGTATTCCAACTTCTTAGGTACATCTACAGTAAGAGATTGCTTAAATGCCATTAAATTTACTTCTACTCAAAATCAATCATTTGATGAAGTGAATGGCACTGGATTTTATGGATTAAGTATAAGAGCTGCTAGTGCAGATAACTTAATAGATGTGTCTAAATTTAATGTAGCTTCTAATTACCAAACTAAAGCTTTTGAAATTGAAACTAGGACTTTAGGTGGTAAATTTAATGATATGACTATCGAAGGTTTTGTTCATGCATTTTACTTAACTGATTTAAATAATGGAGGTGTAGATAACAAATTTAGAAGTAATATCACAGGTAACTATTTTGAAGCAAATGATAATATCTTTTACTTTAGCCCTACATTATCTGTATCTACAGTAGACATGACTTTTAAAGGGAATCTAATAAACAATAGTGTGAATGCTGAATCTTACTTTACTTATAAACAAGGTAATGTTGAATTTTCAGGTAATATGTTGGCTGACTATAACAGGTATAAAATTACTATTTTAGATGGCCCAGATAATGTTTTTTCAAACTTAGTAACAGATTTACTTCCAAGTAATGTTGTTAACTTAGACGCCTCTTTTGGCACTAGGGTTACTTATGAAAACCAAAAAATATCTACAACTGAATGGGATAGATTTACATATGGTACAACTAACTTTGGAACATTTAATCCAATAGACAGAAAGGTAACAGCTAATCAAATGCAAGATGTCGAAGGATTTGTTGCGGTAGCTCCAGAAAGTGTACCTACTCTAAGCAGTCAAAATTTACACTATAAAAAAGCTTATCAAAATTCTTTAAAAGATATTAGTTTTATAGATAAAAACAAAGGTGTAATATTAAAAGATAGAACTACAGATAAAGCTTATAGGTTAAGTATAGACAATGGAGAGTTTTTCTTTGAAGAGGAGATTATAGCTGATAAAGTTTATGATGTAGTTGGAGTTAAGCAACCTAACTTTTTTATAAGACACACTGATCCAGTATCTATTTCAGGGTTATACTACATGCAAGGTATGTACACAGCACCTATGGTATGGGATGGTACTATGTGGACTTCTGGTGGAATTAGGAGAATAGGGACTTCAGCTGAAAAATCAACAACTGGAACTTATTATGACACTACTTTAGGTAAATTCGTGAGAGGGGCAGGTGAAACTTATGTTATAAATGGGGAGAATAAAATAGAACCTATTTCAAAATCAGATCTAGAGTCTTCAAATGCTCTACAGTGGGTAGATATAACAGGGAATAAAACTATAGTTAAAGAAGAGTTAGGATCCATTCAAGAACTTAATTTAGAAGCTACTTATCACATTACAATACCAACTCAAGTAAATGAAGAGTTTGAAGGTAATGGTGTAATCACATTTGAAAATCAATTAGCAACTGGTAAAATTATTATAGAACCTGAAACAGGAATAGAGTTGCCTAGATTACAATTAACAGGTGAAAATGCTTGGGGATCTATATTTAGAAAAGCTGAAGATGTTTGGAGATATAAAGGATTCGGATTGTATGTTAATGATAGTCCTGAGATTTATACTGTATTAAACGCTTTATCTAAATCTAATAATACAGATGCTACAACAGGTTTAGTGACAAGTAGCGCTACAGTTACTTCTGAGTCTGAAGTGGTTAACGGGGAAACAATATTTGTAGCACAAGCAACAAGTGTAGACCCTCCTACTCAATTTTCAAGATTCTTTGGTAATTCTAATGTTAATGCAGCAGGTTCTGGAGTCGCTTATGAGTATTCATTCTGGGCAAGAGCTATAGCTGCTACTATAGGTTCTCAATCAGTAACAATTGGTGGTGGTGTTGCTACTAGCGCTAGACATATTTTAACAGCTGATTGGGCTTATTATAGCGGAGAGGCAGTTACAGCAGGTACTAACATTATTAGTTTGGATGCTAATACTAATAGAAATGCAACAACTGGAGGATTAGCTGGTGTAGCAGGTGATACTGTTCAGGTAGCTAATTATTCACTTAAAAGAAAAGAAGAATAG

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
3eb0b79766fd699a7e063eb65a7ae11130e1d7ffc8b1a0afa8636930cd9168dc
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5449
Evidence 0,5449

Literature

No literature entries available.