UniProt accession
G8EYD1 [UniProt]
Protein name
Collagen triple helix repeat protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
Protein sequence
MEIYHVKQRENPFQFHASEFLGDFPTLEHLYELDAKTGALAYVINTGYTYIQARPGDWKQLKAGKGTAGPRGPQGVPGPMGLPGRDGKDGLPGPIGPAGPPGIQGRPGKDGVDGRDGAPGRNGVDGKDGKDGKDGVDGRDGVDGKPGKQGPRGFTGTNGEDGCGWTRGEYDYETGVVTFYSDDGLGFSTGDLRGAPSPWAHMSLEQLANALKPYL
Physico‐chemical
properties
protein length:215 AA
molecular weight: 22587,63970 Da
isoelectric point:5,38501
aromaticity:0,08837
hydropathy:-0,79721

Domains

Domains [InterPro]
DC_0341
STR
20–209
PTHR24637
Unmapped
65–162
IPR008160
STR
65–121
G8EYD1
1 215
Architecture
STR
STR 20-209 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage S-CBP42
[NCBI]
461711 Uroviricota > Caudoviricetes > Autographivirales > Aegirvirus SCBP42 >
Host Synechococcus sp.
[NCBI]
1131 cellular organisms > Bacteria > Bacillati > Cyanobacteriota/Melainabacteria group > Cyanobacteriota > Cyanophyceae
Host Synechococcus sp. WH 7803
[NCBI]
32051 Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AET72511.1 [NCBI]
Genbank nucleotide accession
JF974300 [NCBI]
CDS location
range 9014 -> 9661
strand +
CDS
ATGGAAATCTATCACGTTAAACAGCGTGAGAACCCCTTCCAGTTCCACGCCTCTGAGTTCCTCGGTGACTTCCCCACACTTGAACATCTCTATGAACTTGACGCCAAGACCGGCGCACTTGCCTATGTGATTAACACGGGCTACACATACATCCAAGCCCGTCCAGGTGATTGGAAACAGCTGAAGGCTGGTAAAGGCACAGCTGGGCCTAGAGGTCCTCAAGGAGTTCCTGGCCCTATGGGATTACCTGGTAGGGATGGTAAGGATGGCTTACCCGGCCCCATTGGACCTGCTGGACCCCCTGGTATTCAAGGTAGACCTGGTAAGGATGGTGTAGATGGCCGTGACGGTGCTCCTGGAAGAAACGGAGTTGATGGTAAGGACGGAAAGGACGGAAAGGATGGCGTAGATGGTCGTGATGGTGTTGACGGGAAACCAGGCAAGCAAGGACCCCGAGGCTTCACAGGGACTAACGGTGAAGATGGTTGTGGGTGGACCCGTGGTGAATACGACTATGAGACCGGCGTAGTTACTTTCTACTCTGACGATGGTCTCGGCTTCTCCACAGGTGACCTACGAGGCGCTCCTTCACCGTGGGCACATATGAGCCTCGAACAATTAGCAAATGCTCTCAAGCCTTACTTATGA

Genbank protein accession
AGK86673.1 [NCBI]
Genbank nucleotide accession
KC310805 [NCBI]
CDS location
range 22226 -> 22873
strand +
CDS
ATGGAAATCTATCACGTTAAACAGCGTGAGAACCCCTTCCAGTTCCACGCCTCTGAGTTCCTCGGTGACTTCCCCACACTTGAACATCTCTATGAACTTGACGCCAAGACCGGCGCACTTGCCTATGTGATTAACACGGGCTACACATACATCCAAGCCCGTCCAGGTGATTGGAAACAGCTGAAGGCTGGTAAAGGCACAGCTGGGCCTAGAGGTCCTCAAGGAGTTCCTGGCCCTATGGGATTACCTGGTAGGGATGGTAAGGATGGCTTACCCGGCCCCATTGGACCTGCTGGACCCCCTGGTATTCAAGGTAGACCTGGTAAGGATGGTGTAGATGGCCGTGACGGTGCTCCTGGAAGAAACGGAGTTGATGGTAAGGACGGAAAGGACGGAAAGGATGGCGTAGATGGTCGTGATGGTGTTGACGGGAAACCAGGCAAGCAAGGACCCCGAGGCTTCACAGGGACTAACGGTGAAGATGGTTGTGGGTGGACCCGTGGTGAATACGACTATGAGACCGGCGTAGTTACTTTCTACTCTGACGATGGTCTCGGCTTCTCCACAGGTGACCTACGAGGCGCTCCTTCACCGTGGGCACATATGAGCCTCGAACAATTAGCAAATGCTCTCAAGCCTTACTTATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
0af6940dbc3855f737c16dd515ce54d89289802d5e5393f864359c0afa0de578
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7412
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50