Protein

Genbank accession
AAX46828.1 [GenBank]
Protein name
fiber
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,87
TSP
Evidence RBPdetect2
Probability 0,82
TF
Evidence Phold
Probability 1,00
Protein sequence
MSLTRLKNIITSRTGRIIYVNPDDFDASDAIDNRGNSALRPFKSLQRAFLEVARFSYRVGLSNDEFDAFSIMLYPAEYVVDNRPGDVLYTNVAPIDANSNLDLTSPNNVLYKYNSVEGGIIVPRGCSVVGTDLRRTKIIPKYVPYPTTYPAQGINTEAQVPPRTAIFKVTGGTYFWQFSFFDGAEEGVYFKPDSVTTLAPKFSHHRLTCFEFADGLNPLSTLIANGTVPNADYSAVSNILARTDLEIYYQKVSKAFATIPDTSGDPSTDQIQARVEENRIVGPISDEYRVLQITRNGQTATAVTVDEFDDPRDHGFSVGVNINVSGVTGSTGSQSEADAGLYNGSFTVTSASGNVFTYQMQGEPTGNAVGSNIAVKTEIDTVDSASPYAFNLSLRSVWGMNGMHANGAKATGFKSMVVAQFTGLSLQKDDRAFVRYNASTGNYDVATAGDGAHLDGFAEYRKGWGHEHIKCSNDSFIQAVSVFAVGYQGHFTALSGGDMSITNSNSNFGNTALRSAGFKAKAFSKDKAGALTHIIPPKALNVISTTATGASGTNTITLANDGSVNGVIQGMTITGTNIGVGATVGNINVSTRVLTLTASNTGAVNGNVIFGEETSINWVNIDIQRTKTINASLAGQGGTPGTRLYLYGYTVEASPPTTRVQGFAVGARQDGTGASAIADKINCLLVAQGATDASVQSASISPYGPSVSGLAAGVVGSPLQYDANTYTISGVAGSVGGWYLSVSSVNNEIYTTLSTNTTYNTVNFTPTTFLKRIPDPRDLQDRTYRVRYVIDKDKTNPLPRDPLSGYVMQPLNSDTTSFNLQRCFYIYDIEVVQPFVRGTDDGIYYITLLCGSISPTTSNFDDRKFSQNVNEVYPTFDRDNPIADPAAATSVADNVTIGLVNATDGASPPVKDPKLSITKESTVFLLTDTGWTQPGTTPNYDSVNKRLSNVELTARAGDEEVRKINIRENNDGTVAPINVEFRRHSILRSGNHTFEYLGFGPGNYSTAFPQTQVETLTQEQIRFSQSIKEEGGVSFYSGLNSNGDLFIGNQVINPVTGQITNEDVAQLNVVGEESTTIETFSELVLTDKITVIGGASNQLESIFAGPVTFQGQSTFTNNISAKKLTYFNQDGTVIKQTLLAPENASGLPDFSNITGYDTPADGDIVYNINWTPGKSLGWIYYGGAFKEFGLTDTGDINISTTGNGHIGLGEAPDTTYRVRINGSVRIDGDVVGTGRGVVGSDKYITKSYTGDGNTLTFAVTTYGGGIKHSDDSLLVFLNGVAQIAGTNYTVDTNGANVVFSSGDAPLSTDTVHILELPI
Physico‐chemical
properties
protein length:1318 AA
molecular weight: 141001,52930 Da
isoelectric point:4,90324
aromaticity:0,09256
hydropathy:-0,21275

Domains

Domains [InterPro]

No domain annotations available.

Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Prochlorococcus phage P-SSM4
[NCBI]
268747 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Prochlorococcus
[NCBI]
1218 Bacteria > Cyanobacteria > Prochlorales > Prochlorococcaceae >
Host Prochlorococcus marinus str. NATL2A
[NCBI]
59920 Bacteria > Cyanobacteria > Prochlorales > Prochlorococcaceae > Prochlorococcus >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AAX46828.1 [NCBI]
Genbank nucleotide accession
AY940168 [NCBI]
CDS location
range 23611 -> 27567
strand +
CDS
ATGTCACTAACGAGACTCAAGAATATTATTACGTCCAGAACTGGACGTATCATATATGTCAACCCAGACGATTTCGATGCATCTGATGCGATTGACAACAGGGGTAACTCTGCATTGCGTCCGTTTAAAAGTTTACAGAGAGCATTTTTAGAAGTAGCAAGATTTTCATATAGAGTTGGTTTAAGTAATGACGAGTTTGATGCTTTTAGTATCATGCTCTACCCTGCTGAGTATGTCGTAGACAATAGACCAGGCGATGTTTTATATACAAACGTTGCACCTATTGATGCAAACTCAAACCTAGACTTAACTTCTCCTAACAATGTTCTCTACAAATATAACTCAGTCGAAGGTGGTATTATTGTACCTAGAGGTTGTTCTGTTGTTGGTACAGACCTCAGAAGAACTAAAATAATTCCAAAATACGTTCCATATCCCACTACATATCCAGCTCAAGGTATTAATACGGAAGCTCAGGTTCCACCAAGAACAGCAATCTTCAAAGTCACTGGTGGTACATACTTCTGGCAGTTCTCGTTCTTCGATGGAGCAGAGGAGGGAGTATATTTCAAACCTGATTCAGTCACAACTTTAGCACCTAAGTTTTCTCACCATAGACTTACATGTTTTGAGTTTGCTGATGGTCTTAATCCTTTATCAACTCTTATTGCAAACGGTACAGTTCCTAACGCTGACTACTCTGCAGTATCAAATATTCTAGCAAGAACTGACTTAGAGATATATTATCAGAAAGTATCTAAAGCATTTGCTACAATACCTGATACATCTGGTGATCCTTCAACTGACCAAATACAGGCAAGGGTTGAGGAAAACAGAATCGTTGGTCCTATATCTGATGAATACAGAGTATTACAGATCACACGTAATGGTCAGACTGCAACAGCAGTCACAGTTGATGAATTTGATGATCCTAGAGATCATGGCTTCTCTGTTGGTGTAAACATCAACGTTAGTGGTGTCACTGGTTCGACTGGATCACAGTCTGAAGCAGACGCAGGTTTATATAATGGTTCATTTACTGTCACATCTGCATCTGGAAACGTATTCACCTATCAAATGCAGGGAGAACCAACTGGTAATGCTGTAGGTTCAAACATTGCAGTTAAAACTGAGATTGATACTGTTGACTCTGCATCACCATACGCATTTAACCTATCACTAAGAAGTGTATGGGGTATGAATGGTATGCATGCAAACGGTGCAAAGGCAACTGGTTTCAAATCAATGGTTGTGGCACAGTTTACTGGATTATCACTACAGAAAGATGACAGAGCATTTGTAAGATATAATGCATCAACTGGAAACTATGATGTAGCAACAGCAGGAGATGGTGCACACTTAGATGGTTTCGCTGAGTATAGAAAGGGATGGGGTCATGAACACATCAAGTGTAGTAATGATTCATTCATACAGGCAGTTTCTGTGTTCGCTGTGGGATATCAAGGTCACTTTACTGCATTAAGTGGTGGTGATATGTCAATCACCAACTCTAACTCTAACTTTGGTAATACTGCATTGAGATCAGCAGGATTCAAAGCAAAAGCATTCTCTAAAGATAAGGCAGGTGCATTAACACATATAATACCACCTAAAGCTTTAAATGTTATCTCTACAACTGCTACTGGTGCTAGTGGAACAAATACAATTACGTTAGCAAATGATGGTAGTGTAAATGGTGTCATACAAGGTATGACTATCACAGGAACTAATATTGGAGTAGGAGCGACAGTCGGAAATATAAACGTAAGCACAAGAGTATTGACACTTACAGCTTCAAATACTGGTGCTGTAAATGGAAACGTAATCTTTGGTGAAGAGACATCAATTAACTGGGTTAACATTGACATCCAGAGAACTAAAACTATCAACGCATCACTTGCAGGACAGGGTGGTACACCAGGTACAAGGTTATATCTATATGGTTATACTGTAGAAGCATCTCCACCAACAACAAGAGTACAGGGTTTTGCAGTAGGAGCAAGACAAGATGGTACAGGTGCAAGTGCAATAGCAGATAAAATAAACTGTTTACTTGTAGCACAGGGTGCAACTGATGCAAGTGTACAATCTGCAAGCATATCACCTTATGGTCCTAGTGTTTCTGGTTTGGCAGCAGGTGTTGTTGGATCTCCATTACAATATGATGCTAATACATATACAATTAGCGGTGTAGCAGGATCAGTCGGTGGATGGTATCTATCAGTCAGTTCAGTAAATAATGAAATCTATACAACTTTATCTACTAACACAACATATAACACAGTTAACTTCACACCAACTACATTCCTCAAGAGAATACCTGACCCAAGAGACTTACAAGATAGAACATATCGTGTAAGATATGTAATTGATAAAGATAAGACCAATCCATTACCAAGAGATCCCCTCTCTGGTTATGTTATGCAACCATTGAATAGTGATACTACATCATTTAATTTACAAAGATGTTTCTACATCTATGACATTGAGGTTGTACAACCATTTGTAAGAGGTACTGACGATGGTATCTACTACATAACTCTCTTATGTGGATCTATATCACCTACAACTTCTAACTTTGATGATAGGAAGTTCTCTCAAAATGTCAATGAAGTTTATCCTACATTTGACAGAGATAATCCAATCGCTGATCCTGCTGCTGCAACATCAGTTGCAGATAATGTCACCATTGGTCTTGTTAATGCAACAGATGGTGCATCGCCACCTGTTAAAGATCCAAAGTTATCTATTACAAAGGAATCAACAGTATTCTTACTAACAGATACAGGATGGACACAACCAGGTACTACACCTAACTATGACTCAGTTAATAAAAGATTATCTAACGTAGAGTTAACTGCACGAGCAGGAGACGAAGAAGTAAGAAAGATTAATATACGAGAAAATAATGATGGTACAGTAGCACCAATTAACGTAGAGTTTAGACGACACTCTATCCTAAGATCAGGAAACCATACGTTTGAATACCTTGGTTTTGGTCCAGGTAACTACAGTACAGCGTTCCCTCAGACACAGGTAGAAACTCTAACTCAAGAGCAAATTAGATTCTCACAGTCAATTAAAGAAGAGGGAGGAGTTTCATTCTACTCAGGATTGAACTCAAATGGTGATCTATTCATTGGTAATCAGGTTATCAACCCAGTCACAGGTCAGATCACTAACGAAGATGTTGCACAGTTAAACGTTGTTGGTGAAGAGAGCACAACCATTGAAACATTCTCTGAGTTGGTTCTAACGGACAAGATAACTGTTATTGGTGGAGCATCAAACCAGTTAGAATCTATCTTTGCAGGTCCTGTCACATTCCAAGGACAGAGTACATTTACTAATAATATATCTGCTAAGAAACTTACATACTTTAACCAAGATGGTACTGTTATTAAACAGACTTTACTAGCACCAGAAAATGCAAGTGGACTACCAGATTTCTCTAATATCACAGGATATGATACACCTGCTGATGGAGATATAGTTTACAATATCAACTGGACACCAGGTAAATCACTAGGTTGGATATACTATGGTGGTGCATTTAAAGAGTTTGGACTGACAGACACAGGAGATATTAACATATCTACCACAGGTAATGGACATATAGGTTTAGGAGAAGCACCTGATACAACTTACAGAGTCAGAATTAATGGTTCAGTTAGAATTGACGGAGACGTTGTTGGTACTGGTAGAGGTGTTGTTGGATCAGATAAATATATTACTAAATCTTATACAGGGGATGGTAATACATTAACCTTCGCAGTCACAACATATGGTGGAGGCATCAAACACTCTGATGATTCACTCTTAGTATTCCTCAATGGTGTAGCACAGATTGCAGGAACTAACTACACAGTGGATACAAATGGTGCTAACGTTGTATTCTCATCTGGGGATGCACCTCTATCAACGGACACAGTTCACATCTTAGAATTACCTATCTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
c203b8aa3146d06eee42f0698aa6cf6b31695bab4e5f8c92a144d8e8f664790f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,3695
Evidence 0,3695

Literature

No literature entries available.