GenBank accession
UJQ44930.1 [GenBank]
Protein name
collagen-like protein
RBP type
TF
Evidence RBPdetect
Probability 0.90
TF
Evidence RBPdetect2
Probability 0.93
Protein sequence
MSAMSNLENLATAIGRDVKAIKEDSELKDREVKKRLGSLESRPRVNPETLVTKAELEEKGYLTSHQDLSTYAQKWELYNDIPIKARISALENRPSFDNLTSIQRESLKGENGHSLNASVRIEGSYKNGSTSQLNLFADVFYDGEAVTSGYTLDYYYRGFGNNNWGVLRNQTPDVNGKFGQWNASQRSGGWFEVRIEVNYRGLKASGFTHLDNVNDGERGPQGVQGERGQQGLQGIQGPRGLTGARGADGARGRDGAPGQNIINQNGGQALKYWVGTKAQYEAIRTKDPNTIYDVYEP
Physico-chemical properties
protein length: 297 AA
molecular weight: 32888.82110 Da
isoelectric point: 6.86901
aromaticity: 0.08754
hydropathy: -0.83266
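The aromaticity and hydropathy values are per-residue statistics of the protein sequence. The following is a minimal sketch of how such values could be computed. It assumes aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not state which tool produced its values, so the function names and the choice of scale are assumptions for illustration.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues; assumed definition."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Illustration on the first 10 residues of the protein sequence only;
# pass the full 297-residue sequence to compare with the listed values.
fragment = "MSAMSNLENL"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Under these assumptions the listed aromaticity of 0.08754 corresponds to 26 aromatic residues out of 297.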

Domains

Domains [InterPro]
DC_2298 STR 162–297
IPR008160 STR 215–259
G3DSA:1.20.5.320 STR 231–276
UJQ44930.1 1–297
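The three domain hits are nested and overlapping intervals on the 297-residue protein. The following is a small sketch for working with them, assuming the coordinates are 1-based and inclusive residue positions. The `Domain` class and `overlap` helper are hypothetical names introduced here, not part of any database API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    accession: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        """Number of residues covered by the hit."""
        return self.end - self.start + 1


def overlap(a: Domain, b: Domain) -> int:
    """Number of residues shared by two hits (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Hits copied from the domain table of this record.
domains = [
    Domain("DC_2298", 162, 297),
    Domain("IPR008160", 215, 259),
    Domain("G3DSA:1.20.5.320", 231, 276),
]

for d in domains:
    print(d.accession, d.length)
print("IPR008160 vs G3DSA overlap:", overlap(domains[1], domains[2]))
```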
Architecture
STR 162–297
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage MissF [NCBI] 2910752 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Streptococcus mitis [NCBI] 28037 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

GenBank protein accession
UJQ44930.1 [NCBI]
GenBank nucleotide accession
OL799250 [NCBI]
CDS location
range 1310 -> 2203
strand -
CDS
GTGAGTGCTATGAGTAACCTTGAAAATCTAGCAACAGCTATTGGTAGGGATGTCAAGGCAATCAAAGAAGATTCAGAGTTAAAGGATAGGGAGGTTAAGAAAAGACTGGGCTCTCTTGAGAGTAGACCGAGAGTCAATCCAGAAACCCTCGTTACAAAAGCGGAGCTGGAAGAGAAAGGCTACTTGACCTCCCACCAAGACTTGTCTACTTATGCCCAAAAATGGGAGTTGTACAATGATATCCCCATCAAGGCTAGGATTTCCGCCTTGGAAAATCGCCCGTCATTTGACAATCTGACCTCCATCCAGAGAGAAAGCTTAAAAGGGGAAAATGGGCATAGTTTGAATGCCAGCGTTCGTATTGAGGGCTCTTATAAGAATGGCTCTACTAGCCAATTGAATCTGTTTGCGGATGTATTCTATGATGGTGAAGCAGTCACGAGTGGCTATACTCTTGATTATTACTACCGTGGTTTTGGGAACAATAACTGGGGGGTATTGAGAAATCAGACGCCTGATGTAAATGGTAAATTTGGTCAGTGGAACGCTTCTCAGCGCTCTGGTGGTTGGTTCGAAGTACGCATTGAAGTGAATTACAGAGGTCTAAAGGCCTCTGGGTTCACTCATTTGGATAATGTCAACGATGGCGAACGTGGTCCGCAAGGTGTCCAAGGTGAACGAGGGCAACAAGGGCTGCAGGGCATTCAAGGACCTAGAGGTTTGACTGGCGCTAGGGGCGCTGATGGAGCTAGAGGACGTGACGGAGCGCCTGGCCAAAACATCATCAATCAAAACGGTGGGCAAGCCTTGAAATATTGGGTTGGTACAAAGGCTCAATATGAAGCTATCAGAACGAAAGACCCCAATACTATCTATGATGTGTATGAACCATAA
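The CDS location and strand determine how the coding sequence is read from the genome record. The following is a minimal sketch, assuming the range is given in 1-based inclusive coordinates and that a minus-strand feature is the reverse complement of that slice. The function names are illustrative, and the genome string must be supplied by the reader (it is not part of this record).

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Complement each base, then reverse the string."""
    return dna.translate(COMPLEMENT)[::-1]


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a feature out of a genome; start/end assumed 1-based inclusive."""
    segment = genome[start - 1:end]
    return reverse_complement(segment) if strand == "-" else segment


def cds_span(start: int, end: int) -> int:
    """Feature length in nucleotides for 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates from this record: range 1310 -> 2203, strand -.
span = cds_span(1310, 2203)
codons = span // 3
print(span, codons, codons - 1)  # nucleotides, codons, residues without the stop codon
```

Under these assumptions the range spans 894 nucleotides, or 298 codons, which agrees with a 297-residue protein followed by one stop codon.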

Genome Context

Tertiary structure

PDB ID
d5fb1fdae9332d5ebc9c1f945386aacfe15139b7ba2215f82be12c2c8e98a618
Source ESMFold
Method ESMFold
Resolution 0.7525
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
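The confidence legend maps a per-residue pLDDT score to one of four bands. The following is a minimal sketch of that mapping, assuming scores on a 0 to 100 scale. The legend uses strict inequalities and leaves the exact values 90, 70 and 50 unassigned, so placing them in the lower band is an assumption made here. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (assumed 0-100 scale) to the legend's confidence band.

    Boundary values (90, 70, 50) fall into the lower band; this is an
    assumption, since the legend does not assign them.
    """
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.0, 82.5, 61.0, 34.0):
    print(score, plddt_band(score))
```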