Genbank accession
WMM95343.1 [GenBank]
Protein name
collagen-like protein
RBP type
TF (evidence: RBPdetect, probability 0.90)
TF (evidence: RBPdetect2, probability 0.89)
Protein sequence
MAQINIGRVRMGWKGTWNSTTTYVAQDAVYYDGETFVAKQDVPVGTATTNATYWQKVAQKGTNGIDGQDGAQGPTGPQGPQGIQGIQGETGAAGATGPQGPQGDIGETGPVGPQGPTGNTGSTGATGPQGPAGPTGPKGDTGNTGATGATGAVGPQGPQGSTGATGPSGPAPSHGWSGYSLRFQNPNGTWGAYTNLRGATGATGPQGPTGAQGPQGNTGATGATGPQGIQGPVGDQGPAGPAGPTGPQGATGPKGNTGNTGATGATGPQGATGPTPAHQWSGTSLRFYNGSSWGSYVNLKGSTGNTGATGPAGPTGATGATGPQGPQGPQGPAGTPSSTYGAVGSYVLGYRHMGRTQQGNAYSGSQIRIIGIYGSNIGYSWQMTTNQQVYAVHRFGDNTLAGTWRMMFDASQDTSSGMARGVLWVRIS
Physico‐chemical properties
protein length: 428 AA
molecular weight: 42041.92540 Da
isoelectric point: 9.34413
aromaticity: 0.07009
hydropathy: -0.62850
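The aromaticity and hydropathy values above can be approximated directly from the sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is a GRAVY score (mean Kyte-Doolittle value); the record does not state which definitions were used, so both are assumptions. Under the first assumption, the reported aromaticity of 0.07009 is consistent with 30 aromatic residues out of 428. The helper names `aromaticity` and `gravy` are illustrative, and only the first 20 residues of the sequence are used to keep the example short.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Residues counted as aromatic (assumed definition: Phe, Trp, Tyr).
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence in this record.
fragment = "MAQINIGRVRMGWKGTWNST"

print("length:", len(fragment))
print("aromaticity:", round(aromaticity(fragment), 5))
print("hydropathy (GRAVY):", round(gravy(fragment), 5))
```

Passing the full 428-residue sequence instead of `fragment` gives values that can be compared against the table; any difference would suggest that the database uses other definitions.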

Domains

Domains [InterPro]
DC_1819: STR, residues 2–122
IPR050149: Unmapped, residues 60–339
DC_1341: STR, residues 106–195
DC_1431: STR, residues 239–422
WMM95343.1: full length, residues 1–428
Architecture
STR 2–422
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

Phage: Roseobacter phage CRP-125 [NCBI]
Taxonomy ID: 3072844
Lineage: Uroviricota > Caudoviricetes > Autographivirales > Actaeavirus > Actaeavirus CRP125
Host: Rhodobacteraceae bacterium [NCBI]
Taxonomy ID: 1904441
Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Alphaproteobacteria > Rhodobacterales

Coding sequence (CDS)

Genbank protein accession
WMM95343.1 [NCBI]
Genbank nucleotide accession
OR420743 [NCBI]
CDS location
range 32014 -> 33300
strand +
CDS
ATGGCTCAGATTAACATTGGTCGTGTGCGTATGGGTTGGAAAGGTACTTGGAATAGTACAACAACCTATGTCGCTCAGGACGCAGTATATTATGACGGTGAGACATTCGTCGCTAAACAAGATGTCCCCGTGGGTACTGCAACCACAAATGCCACGTACTGGCAGAAGGTTGCTCAAAAAGGTACTAACGGTATCGATGGTCAAGATGGTGCCCAAGGCCCAACTGGCCCACAAGGCCCGCAGGGTATTCAAGGTATCCAAGGTGAAACAGGTGCTGCGGGTGCTACAGGTCCTCAAGGCCCTCAAGGTGACATTGGTGAGACAGGTCCCGTAGGTCCACAAGGTCCTACAGGTAACACTGGTTCTACTGGTGCTACAGGTCCTCAAGGTCCCGCAGGTCCAACAGGCCCTAAAGGTGATACAGGAAATACAGGTGCGACAGGTGCCACTGGTGCTGTAGGTCCACAAGGTCCTCAAGGCTCTACTGGTGCGACTGGCCCATCAGGCCCTGCCCCATCTCATGGGTGGTCAGGCTACAGCCTACGATTCCAGAACCCTAATGGAACATGGGGTGCCTACACAAACCTTCGTGGTGCTACAGGTGCTACAGGTCCACAGGGTCCTACAGGTGCCCAAGGCCCTCAAGGTAACACTGGTGCAACTGGTGCGACAGGACCGCAAGGTATTCAGGGTCCAGTAGGTGACCAAGGTCCAGCGGGACCAGCGGGTCCTACAGGTCCTCAAGGTGCTACAGGTCCGAAGGGTAACACTGGTAATACAGGTGCTACAGGGGCTACTGGTCCTCAAGGGGCTACTGGTCCAACCCCAGCGCACCAATGGTCAGGCACAAGCCTACGGTTCTACAATGGTTCCTCTTGGGGTTCTTATGTAAACTTAAAGGGTTCTACTGGTAACACTGGTGCGACTGGTCCAGCAGGTCCTACAGGGGCTACAGGGGCTACAGGTCCACAAGGGCCACAAGGGCCACAAGGTCCAGCTGGTACACCTTCAAGTACATACGGAGCCGTTGGTTCATATGTGTTGGGCTATCGTCATATGGGTCGTACACAACAGGGTAATGCGTACTCTGGCAGTCAAATTAGAATAATTGGTATATATGGAAGTAACATTGGTTATAGTTGGCAAATGACGACAAACCAACAAGTTTACGCAGTGCATAGGTTTGGTGATAACACATTAGCTGGCACATGGCGCATGATGTTTGACGCTTCACAGGACACCAGTTCAGGTATGGCTAGAGGTGTACTTTGGGTAAGGATTTCTTAA
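
The CDS coordinates and the protein length can be cross-checked with simple arithmetic: an inclusive range of 32014 to 33300 spans 1287 nucleotides, i.e. 429 codons, which is 428 residues plus one stop codon. The sketch below shows that check and translates the first and last codons of the CDS. It assumes 1-based inclusive coordinates and the standard codon table; the function names `cds_length` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_nt = cds_length(32014, 33300)
n_codons = n_nt // 3
print(n_nt, n_codons, n_codons - 1)  # nucleotides, codons, residues without stop

# First ten and last five codons of the CDS in this record.
print(translate("ATGGCTCAGATTAACATTGGTCGTGTGCGT"))
print(translate("GTAAGGATTTCTTAA"))
```

The two translated fragments should match the start (MAQINIGRVR) and end (VRIS) of the protein sequence listed in the record.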

Genome Context

Tertiary structure

PDB ID
a9abaa0ff7f0846cefabda4d25c5de1e2f6afbebd7ca409ee4f25f34f2e05bfc
Source ESMFold
Method ESMFold
Resolution 0.6569
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
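
The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. This is a minimal sketch: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, assigning the boundary values (exactly 90, 70 or 50) to the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Illustrative scores only; these are not values taken from the model.
for score in (95.2, 81.0, 63.5, 42.1):
    print(score, plddt_band(score))
```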