Protein

Genbank accession
AGH07730.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,85
TF
Evidence RBPdetect2
Probability 0,68
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
Protein sequence
MAEFRLGRVKFNWTGDWTTSKAYLIDDIAKFGGNTYVAIENHTSTASVSDFYANDLSKWNIHIEGLEQKGQWATGVYYRINDLVKFGNVVYRVTTAHTSEGTFIDKTKVTEYVKGFQNEGEWDGNVEYQSGDVVNYNGSSYVALTTSLAGFQPPQYLGVSTDPNAKWTILSDGLAGAASTYIEGTYYRGDLIQYGGNIYRHKLGITTNVSPLQVGLGSIQPQSFTGTEVWDLLVKGFDFKGGFSTTFNYHPGHVARYGSDSYISIGSSHKNVIPTAGIGTFWEVLASGDSSAALNTKGDLLSYNSGNVRIGIGSTGYALAVQSNGMPGYEIVGNQTRIYYVDSEDGLDVNNGLAPNLAFKTIKKACEAARPKTNITNMVYTASTGVATVTAPGHGLLNTGTFVQLQDIQFECLSGGNVFNVLGMTYNKAVGLATITAIGLGAAPEVGIGATVRIRNLSVQYQGSAQFAHTFQTATEDAIQSGGDYVHTFNSCAPNGVTVVGGSSITPTSATYDPSNGNFTMTITGHSLSTSDSVTIADNAFTFTCTMNNNASQKTYPRPGKDPAQGQQLAITGTTTDTITVNVGPSPIVNHQPTAVSYNQATGDMVCTIGPHTLTVGTSVKLATDGMVFRCSMDGYTTDHSYPRQVAGDGNPDPAYNTALNITAVTTNTITINVGTASDDQTITGKFPAVHTQGTYEFLVQAVPDSNSITLNVGVSTTDYLYVSGGTAFVGLTTTKYPDKVAKSYYEVLEVTDVDNFKCNVGISTIDHTYVEGGLVTDLTPAILKLSASQFYEQLPVTVPPFTSIVGNALRGSQVLPKAGTSDDSSTPNNRSHMFKMSDATTIQAISMKGMEGFYYDPNAPLELDNSNLRTGIGTTAAGVFISLNPDSPIDNKSPYVKDCTCFSDPATESGRFGGGGVGVFIDGGVHDTGAKSMVFDAFTHVASDGAGYILDKGAIAEIVSCFTYYAKWGYYSGGGSRIRGVGGNNSYGDYGVISSGFSTDEVPRTAKVFGDMMTVQGTTKGGTVSIGATMFGQTSKATAWMLNDQISADKIYFKYQSGYGNAGIGTTGFVDGEIVWFGNGAENSSGVGSITVGAAASSTTGQKGTILEVDQTSGTLLIGDAIGFQTTAYGADDRFYIINTITNVAAAATYYEWQAGNGASQQVVFNNRATLTISPEKSRGTWDTRNSGSTSDIEIRTLFSQARLTGHDFLAVGTGNKTQTGYPNVNLANVIQGQETNVFGPGKVFFVSTDQGGNFRVGDFFSVDQLTGRATLDASAFNLSGLTELRLGAIGGQVGEAINEFSSDETLAGNSNTACPTEFAVKGFLTRGSMGIKAMTPPVGTTAQRPGGVDDEFNTGALRFNTTLGALEYYNGTTWIQPGVKTYSTVSSSFSATSGVTYFVNTGGGQVTATLPASPDLGAEITFMDVAKTFDSNNLVVSRNGRPIQGDNANLTVSTEGAAFTLVYSGSTYGWRIFSI
Physico‐chemical
properties
protein length:1477 AA
molecular weight: 156014,87700 Da
isoelectric point:4,94871
aromaticity:0,10156
hydropathy:-0,15342

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Cyanophage S-SSM6a
[NCBI]
682650 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AGH07730.1 [NCBI]
Genbank nucleotide accession
HQ317391 [NCBI]
CDS location
range 214827 -> 219260
strand +
CDS
ATGGCTGAGTTTAGACTTGGAAGAGTAAAATTCAACTGGACAGGTGACTGGACAACATCCAAAGCCTATTTGATTGACGATATCGCCAAGTTTGGTGGTAACACTTATGTGGCAATCGAGAACCACACGTCAACAGCAAGTGTCAGTGACTTTTACGCTAACGACCTTTCCAAATGGAACATCCATATTGAAGGTCTAGAACAAAAAGGACAATGGGCTACTGGAGTCTACTATCGTATTAACGACTTAGTAAAATTCGGTAACGTTGTTTATAGAGTTACAACTGCTCACACATCAGAAGGAACTTTCATCGATAAGACGAAAGTTACTGAGTATGTAAAAGGATTTCAAAACGAAGGTGAGTGGGATGGCAATGTAGAGTATCAATCAGGTGACGTTGTAAATTACAACGGTTCATCTTATGTTGCGTTAACCACATCTCTTGCTGGATTCCAACCTCCTCAGTATTTGGGTGTTTCCACAGATCCAAATGCTAAGTGGACTATCTTATCTGATGGTCTTGCTGGTGCGGCTTCAACATACATTGAAGGTACTTACTACAGAGGTGACCTTATTCAGTATGGTGGTAACATCTATCGTCATAAATTAGGTATCACAACCAACGTTTCTCCTTTACAGGTTGGTTTAGGATCTATTCAACCACAATCTTTCACTGGTACAGAAGTTTGGGATCTTCTGGTTAAGGGATTTGATTTCAAAGGCGGTTTCTCCACGACCTTTAACTATCATCCAGGCCACGTTGCAAGATACGGTTCAGACTCTTACATCTCTATTGGTTCTTCTCACAAGAATGTTATTCCTACCGCTGGTATCGGAACTTTCTGGGAAGTTCTTGCATCTGGAGATTCCTCTGCTGCTCTTAACACCAAGGGTGACTTACTCAGTTACAACTCTGGTAATGTAAGAATCGGTATTGGTTCTACAGGTTATGCACTTGCAGTCCAGTCAAATGGAATGCCAGGCTACGAGATTGTAGGTAACCAAACAAGAATTTACTACGTTGACTCCGAAGACGGATTAGACGTAAACAACGGTCTTGCACCTAACTTGGCGTTTAAGACTATTAAAAAGGCTTGCGAAGCTGCTCGTCCTAAAACAAACATCACCAATATGGTGTATACCGCTTCCACTGGTGTGGCAACGGTTACTGCGCCTGGTCACGGTCTGTTAAACACTGGTACGTTCGTTCAGTTACAAGATATTCAGTTTGAGTGTCTATCTGGAGGTAACGTATTCAACGTTCTGGGTATGACCTATAACAAGGCAGTTGGTCTTGCAACGATTACTGCTATTGGTCTTGGTGCTGCTCCCGAAGTTGGAATTGGTGCAACTGTTAGAATCAGAAACTTGAGTGTTCAGTATCAAGGTTCTGCACAGTTTGCTCATACATTCCAAACCGCAACAGAAGACGCTATCCAGTCTGGTGGTGACTATGTTCACACATTCAACAGTTGTGCTCCAAACGGAGTGACAGTTGTTGGTGGATCCAGTATCACTCCAACAAGTGCTACATATGATCCTTCAAATGGTAACTTCACTATGACTATCACAGGTCATAGTTTATCAACTTCTGACAGTGTTACCATCGCTGATAACGCATTTACGTTCACCTGTACGATGAACAACAATGCAAGTCAGAAGACATATCCTAGACCTGGCAAAGACCCTGCACAAGGACAACAGTTAGCAATTACAGGTACAACAACTGATACAATTACAGTTAACGTTGGACCTTCTCCAATCGTCAACCATCAACCAACTGCGGTTTCTTATAACCAAGCAACTGGTGACATGGTTTGTACAATCGGACCACACACATTAACAGTTGGTACATCTGTTAAGTTGGCTACTGATGGTATGGTATTCCGTTGTTCAATGGACGGATACACAACCGATCACTCTTATCCTCGTCAAGTTGCTGGAGATGGTAACCCTGACCCTGCATACAACACAGCTCTTAACATTACTGCTGTTACAACCAACACAATTACAATCAACGTTGGTACTGCGTCTGATGATCAGACAATCACTGGTAAGTTCCCTGCTGTACATACTCAAGGTACTTACGAATTCCTTGTACAGGCTGTACCCGACTCCAACTCTATTACTTTAAACGTTGGTGTTTCTACTACCGATTACCTCTATGTTTCTGGTGGTACTGCGTTCGTTGGTTTAACAACAACCAAGTATCCCGACAAGGTTGCTAAGTCTTACTACGAGGTTCTTGAAGTTACTGATGTTGACAACTTTAAGTGTAACGTTGGTATCTCCACAATCGATCACACATACGTTGAAGGTGGTTTAGTTACAGACTTGACCCCTGCTATCTTGAAACTGTCTGCGTCTCAGTTCTACGAACAGTTACCAGTTACAGTTCCTCCTTTCACTTCGATTGTTGGTAACGCACTTAGAGGTTCACAGGTTCTTCCTAAGGCTGGAACATCTGATGACTCTTCAACTCCTAACAACAGGAGTCACATGTTCAAGATGTCTGATGCAACAACTATTCAGGCAATCTCCATGAAAGGAATGGAAGGATTCTACTATGATCCTAACGCTCCTTTAGAGTTAGATAACTCTAACTTAAGAACTGGTATCGGTACAACCGCTGCTGGTGTGTTCATCTCCTTGAACCCAGATTCACCTATCGATAACAAGTCACCTTATGTTAAGGACTGTACTTGTTTCTCCGACCCTGCAACTGAGAGTGGCAGATTCGGTGGTGGTGGTGTTGGTGTATTCATCGATGGTGGTGTACACGACACAGGTGCAAAATCAATGGTGTTCGATGCGTTTACGCACGTTGCATCTGATGGTGCTGGTTACATTCTTGATAAGGGTGCAATCGCTGAAATCGTTTCCTGTTTCACATACTACGCTAAGTGGGGTTACTACTCAGGTGGTGGATCAAGAATCAGGGGTGTTGGCGGAAACAACTCTTACGGAGACTACGGTGTTATCTCATCTGGTTTCTCAACTGATGAAGTTCCCAGAACTGCAAAGGTCTTTGGTGACATGATGACAGTTCAAGGAACAACCAAAGGCGGTACAGTTTCTATCGGTGCTACCATGTTCGGTCAGACATCTAAGGCAACTGCATGGATGTTGAATGATCAGATCTCCGCTGATAAGATTTACTTCAAGTATCAAAGTGGATACGGTAATGCTGGTATTGGTACTACTGGATTCGTAGATGGAGAAATCGTCTGGTTCGGTAATGGTGCTGAGAATAGTTCTGGTGTGGGTTCAATCACAGTCGGAGCTGCCGCATCTTCCACAACTGGACAGAAAGGAACAATTCTAGAGGTTGACCAAACATCTGGAACACTTCTAATCGGTGATGCTATCGGATTCCAGACAACCGCTTACGGTGCAGACGATAGATTCTACATCATCAACACAATCACGAATGTTGCCGCCGCGGCCACATACTATGAGTGGCAGGCTGGAAACGGTGCAAGTCAACAAGTTGTCTTCAACAACCGTGCAACTCTGACAATCTCTCCTGAGAAATCAAGAGGAACATGGGATACTAGAAACTCTGGTTCCACATCTGACATTGAAATCAGAACACTGTTCTCACAGGCAAGACTAACAGGACACGACTTCCTCGCAGTTGGTACTGGTAACAAGACACAGACTGGATATCCAAACGTCAACTTGGCGAACGTTATCCAAGGTCAGGAAACTAACGTGTTCGGACCTGGTAAGGTGTTCTTCGTATCTACTGACCAAGGTGGTAACTTCCGAGTTGGAGACTTCTTCTCCGTTGACCAGTTGACTGGTCGTGCAACATTGGATGCTTCCGCGTTCAACTTGTCTGGTTTGACAGAATTGAGACTGGGTGCTATCGGTGGTCAGGTCGGTGAGGCAATCAACGAGTTCTCCTCTGATGAGACACTTGCTGGTAACTCAAACACTGCATGTCCTACAGAATTTGCAGTTAAGGGATTCTTAACTCGCGGTTCAATGGGTATCAAGGCAATGACACCTCCTGTTGGTACAACTGCTCAAAGACCTGGCGGCGTTGACGATGAGTTCAACACTGGTGCGTTAAGATTCAACACTACCTTGGGTGCTCTTGAGTACTACAATGGTACAACATGGATTCAGCCTGGTGTTAAGACATATAGTACTGTTTCTTCAAGTTTCAGTGCTACTTCTGGAGTCACATACTTTGTGAACACAGGTGGTGGTCAGGTAACTGCTACACTCCCTGCATCTCCTGACTTAGGTGCTGAGATTACATTCATGGACGTTGCTAAGACCTTCGACTCAAACAACTTGGTTGTTTCCAGAAACGGTAGACCAATCCAAGGTGACAATGCTAACCTAACTGTTTCCACAGAAGGTGCTGCATTTACCCTCGTTTACTCTGGTTCCACATACGGTTGGAGAATCTTCTCCATCTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
f713b6f72cdcdfcb52a5830c3f1d835829ff1249e590830a00dd8c998f860a5a
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5341
Evidence 0,5341

Literature

Title Authors Date PMID Source
The Genome Sequence of Cyanophage S-SSM6a Henn,M.R., Sullivan,M.S., Osburne,M.S., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C., Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L., Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Yu,Q., Coleman,M.L., Huang,K.H., Weigele,P.R., DeFrancesco,A.S., Kern,S.E., Thompson,L.R., Fu,R., Hombeck,B., Chisholm,S.W., Haas,B., Nusbaum,C. and Birren,B. 2011-09-23 GenBank