GenBank accession
WJZ48564.1 [GenBank]
Protein name
collagen-like protein
RBP type
TF
Evidence RBPdetect2
Probability 0.96
Protein sequence
MPWSNDVVTRTVTGTYLKTNGSGAKGRVTFTPTTVVLDPDDAVVIADAVVATLNTNGVFSVELPTTDNPLLSPAGWAYRVEVRLYGVAPQEFYVYIPEGDGSTIDITADIAVLTSGIADGTVPPAARGPVGPAGPTGPTGPAGSASSTGATGPTGATGPTGPAGATGADSTVTGPTGATGETGPTGATGAVGATGPQGAVGPTGAQGVQGNDGATGPTGPTGATGAASTVTGPTGPTGAQGATGPTGAQGATGATGPTGPQGVAGTSVTILGSYPTFLALYNDHPTGDPGDAYLVAGDLFVWNNFDGWENVGNIQGPTGATGATGPTGAASNVPGPTGATGATGPTGAVGATGATGPTGAQGVAGATGPTGAQGAQGAQGIQGPVGPTGATGAQGETGPTGAQGNVGATGATGPQGVQGDVGPTGPAGATGATGATGPTGAASNVTGPTGPQGDVGATGPTGPQGTQGVAGETGPTGPQGDVGATGPQGDVGATGPTGATGPQGETGPQGDVGATGPQGDVGATGPTGPQGATGPQGEVGATGATGATGPQGPQGEQGVTGPTGPQGEIGPTGATGAASNVTGPTGATGPQGEIGPTGATGPQGEVGATGPTGAQGEVGPTGPTGAASNVTGPTGPTGATGDVGPTGAQGEIGPTGSTGPTGPQGETGPTGATGPTGAASNVTGPTGPTGPKGEDGVGVSILGSYNSLAELQSAHPTGNPGDGYLVSGDLYVWSATSSQWENVGQIQGPTGPTGPTGAASDVTGPTGPTGPTGAASNVTGPTGATGETGPTGPTGPTGATGPTGSTGPTGSFSLSDSTPPTSPDPGDAWFNSNTGKVYVYYDGYWVEVGAAPIGPTGPTGPAGADTTATGPTGPTGAQGATGPTGPTGPQGLASQVTGPTGATGPTGPSVTGPTGPASDVTGPTGPTGPTGATGPIGLQGDPSNVTGPTGPTGPTGPAGSFLQVQWDTYVPVWSASITNPLIGNGSITGRFVQVGKAIFGEVRLIAGSTTLRGTGTYRISLPFTGNGANYQPVGQVVMRDSSAPSLFFGTAMFNNENYTRIELFIHSQTAIFDEGSGATHDQPFFFSEGDQILISFMYERT
Physico-chemical properties
protein length: 1101 AA
molecular weight: 102960.54210 Da
isoelectric point: 4.05003
aromaticity: 0.04632
hydropathy: -0.27230
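
The aromaticity and hydropathy entries can in principle be recomputed from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the record does not state which definitions were used, so treat both as assumptions. The function names are illustrative, and `N_TERM` is only the first 20 residues of the sequence above, so its values will differ from the full-length figures.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy (assumed definition of 'hydropathy').

    Raises KeyError if the sequence contains a non-standard residue code.
    """
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of WJZ48564.1, used here only as a short demonstration input.
N_TERM = "MPWSNDVVTRTVTGTYLKTN"

if __name__ == "__main__":
    print(f"length: {len(N_TERM)} AA")
    print(f"aromaticity: {aromaticity(N_TERM):.5f}")
    print(f"hydropathy: {gravy(N_TERM):.5f}")
```

Passing the full 1101-residue sequence instead of `N_TERM` gives values that can be compared with the entries listed above.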

Domains [InterPro]
DC_0620 | STR | 433–519
WJZ48564.1 | 1–1101
Architecture
ATT 1-284 | STR 285-648 | STR 679-1089
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
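
The architecture line does not cover the whole protein, so some residues fall in the "Unmapped" category of the legend. A minimal sketch of how those ranges could be derived, assuming 1-based inclusive residue numbering and the protein length of 1101 AA given in this record; `parse_architecture` and `unmapped_regions` are hypothetical helper names, not part of any database tooling.

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'ATT 1-284 | STR 285-648 | ...' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        part = part.strip()
        if not part:  # tolerate a trailing '|'
            continue
        label, span = part.split()
        start, end = span.split("-")
        segments.append((label, int(start), int(end)))
    return segments


def unmapped_regions(segments, protein_length: int) -> list[tuple[int, int]]:
    """Return residue ranges (1-based, inclusive) not covered by any segment."""
    gaps = []
    cursor = 1  # next residue that has not been covered yet
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= protein_length:
        gaps.append((cursor, protein_length))
    return gaps


ARCHITECTURE = "ATT 1-284 | STR 285-648 | STR 679-1089 |"

if __name__ == "__main__":
    segs = parse_architecture(ARCHITECTURE)
    for first, last in unmapped_regions(segs, 1101):
        print(f"Unmapped {first}-{last}")
```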

Taxonomy

 | Name | Taxonomy ID | Lineage
Phage | Actinomycetia phage DSL-LC01 [NCBI] | 3058956 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host | Actinomycetia bacterium [NCBI] | 1883427 | cellular organisms > Bacteria > Bacillati > Actinomycetota > Actinomycetes > unclassified Actinomycetes

Coding sequence (CDS)

GenBank protein accession
WJZ48564.1 [NCBI]
GenBank nucleotide accession
OQ999401 [NCBI]
CDS location
range 123085 -> 126390
strand -
CDS
GTGCCTTGGTCTAACGACGTTGTAACTCGTACCGTAACTGGCACCTACCTGAAGACCAACGGAAGTGGTGCTAAAGGAAGAGTTACATTCACTCCTACGACAGTCGTTCTTGATCCAGACGACGCCGTCGTAATTGCTGACGCTGTTGTCGCGACTCTCAACACTAATGGCGTATTTTCTGTCGAACTTCCAACGACTGATAACCCTCTGCTCTCTCCCGCTGGTTGGGCGTATCGTGTTGAAGTACGGCTATACGGCGTAGCTCCTCAAGAGTTTTACGTCTACATTCCTGAGGGCGACGGCTCTACAATCGACATCACCGCTGACATCGCGGTTCTAACTTCTGGAATTGCCGACGGCACCGTGCCACCTGCGGCGCGCGGCCCAGTGGGACCTGCAGGTCCAACAGGCCCCACTGGACCAGCAGGCTCGGCATCATCTACAGGTGCAACTGGACCAACTGGTGCTACAGGACCAACTGGACCAGCAGGTGCGACAGGAGCAGATTCAACAGTTACAGGACCAACAGGTGCAACTGGCGAAACAGGTCCTACAGGCGCGACTGGCGCTGTAGGTGCTACAGGTCCTCAAGGTGCTGTTGGCCCGACAGGCGCGCAAGGTGTACAAGGTAATGACGGCGCGACTGGCCCAACTGGCCCAACGGGTGCTACAGGAGCTGCCTCGACTGTCACCGGTCCAACAGGTCCAACAGGCGCACAAGGCGCGACCGGCCCAACAGGCGCGCAGGGTGCAACAGGTGCAACTGGACCAACTGGACCTCAAGGTGTCGCAGGCACATCAGTAACTATTCTCGGTTCGTATCCGACATTCCTCGCGCTGTACAACGATCACCCAACTGGCGATCCAGGCGATGCGTATCTTGTTGCTGGTGACTTGTTTGTTTGGAACAATTTTGACGGCTGGGAAAATGTAGGAAACATTCAAGGTCCGACTGGCGCAACTGGCGCAACAGGTCCGACTGGAGCGGCTTCTAACGTTCCTGGTCCAACAGGCGCGACTGGCGCGACTGGACCGACAGGTGCAGTAGGCGCTACAGGTGCAACTGGTCCAACTGGTGCACAAGGCGTTGCTGGCGCAACTGGTCCAACTGGTGCACAAGGCGCGCAAGGAGCGCAGGGTATTCAAGGACCAGTTGGTCCTACAGGCGCGACTGGCGCACAGGGCGAAACAGGACCAACAGGTGCACAAGGTAACGTCGGCGCAACAGGCGCGACTGGTCCTCAAGGTGTTCAAGGTGATGTTGGTCCGACCGGACCCGCCGGTGCAACAGGTGCAACAGGTGCAACTGGACCTACGGGTGCAGCGTCTAATGTCACAGGACCCACAGGTCCGCAAGGTGACGTCGGTGCAACTGGACCCACAGGTCCGCAAGGTACTCAAGGTGTCGCTGGCGAGACTGGACCCACAGGTCCGCAAGGTGATGTCGGTGCGACTGGTCCGCAAGGTGATGTCGGTGCAACTGGACCCACAGGCGCAACTGGTCCGCAAGGTGAGACAGGTCCGCAAGGTGATGTCGGTGCGACTGGTCCGCAAGGTGATGTCGGTGCAACTGGACCCACAGGTCCGCAAGGAGCAACTGGTCCGCAAGGTGAAGTAGGAGCGACTGGCGCAACTGGCGCTACTGGCCCGCAGGGACCGCAAGGCGAACAAGGCGTTACAGGGCCAACAGGCCCGCAGGGCGAGATCGGCCCGACAGGTGCTACAGGCGCAGCAAGCAACGTAACTGGTCCAACTGGCGCAACTGGTCCGCAAGGCGAGATTGGTCCAACTGGAGCAACTGGTCCGCAAGGTGAAGTAGGAGCGACTGGCCCTACCGGTGCACAAGGCGAAGTTGGACCTACAGGACCTACAGGAGCTGCAAGCAATGTTACAGGTCCTACAGGTCCTACAGGCGCAACCGGAGACGTCGGTCCCACCGGTGCACAAGGTGAAATCGGTCCTACAGGATCAACCGGTCCAACTGGACCTCAGGGCGAGAC
TGGCCCAACCGGCGCAACTGGACCTACTGGTGCAGCGAGCAACGTTACTGGCCCGACTGGTCCAACCGGACCCAAAGGTGAAGACGGAGTCGGTGTATCAATTCTTGGATCGTACAACTCTCTTGCAGAGTTGCAATCTGCACACCCGACTGGAAATCCTGGCGATGGATACTTGGTTTCTGGAGACCTGTATGTGTGGTCAGCAACTTCGTCCCAGTGGGAAAACGTCGGTCAAATTCAAGGACCGACCGGTCCCACGGGCCCAACAGGCGCGGCAAGCGACGTAACTGGACCAACCGGACCGACTGGCCCAACAGGAGCGGCGAGCAACGTAACTGGTCCAACCGGTGCGACTGGAGAAACTGGACCTACTGGACCTACTGGACCCACAGGCGCTACAGGCCCGACCGGTTCTACTGGTCCTACAGGATCGTTTTCTCTTTCTGACTCAACACCGCCGACTTCTCCAGACCCAGGCGATGCGTGGTTCAATTCAAACACAGGCAAGGTGTACGTTTACTACGACGGGTACTGGGTTGAAGTTGGCGCTGCGCCTATCGGCCCAACTGGACCTACTGGTCCTGCAGGTGCCGATACAACAGCGACTGGACCGACTGGACCTACAGGTGCACAAGGTGCTACAGGGCCAACTGGTCCAACCGGTCCGCAAGGTCTTGCTTCACAAGTAACTGGTCCAACTGGCGCAACTGGTCCGACAGGCCCGTCAGTTACTGGTCCTACAGGTCCTGCTTCAGATGTTACAGGACCAACAGGTCCAACAGGGCCTACTGGCGCAACTGGTCCGATAGGTCTTCAAGGCGATCCAAGCAATGTGACTGGACCTACAGGCCCGACCGGTCCGACTGGTCCTGCAGGCTCGTTCTTGCAAGTTCAATGGGACACATACGTTCCTGTGTGGTCTGCATCAATAACTAATCCACTCATCGGTAACGGCAGCATCACAGGTCGCTTTGTGCAAGTTGGTAAGGCGATCTTTGGAGAAGTTCGTCTTATTGCTGGAAGCACAACTCTTCGTGGAACAGGCACTTACCGCATTTCACTTCCATTCACAGGTAACGGAGCAAACTACCAGCCGGTCGGCCAAGTTGTGATGCGAGACTCATCTGCGCCGTCGCTATTCTTCGGTACAGCGATGTTCAACAATGAAAACTACACTCGCATCGAACTGTTCATTCACTCGCAAACTGCAATCTTCGATGAAGGCTCTGGCGCTACTCATGATCAGCCATTCTTCTTCAGCGAAGGCGACCAGATCTTGATCTCATTTATGTACGAGAGGACGTGA
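
The CDS coordinates can be cross-checked against the stated protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS includes the terminal stop codon, which matches the TGA at the end of the sequence above. Because the feature is on the minus strand, the coding sequence corresponds to the reverse complement of the genomic interval; `reverse_complement` shows that step for plain A/C/G/T input. All function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in a 1-based, inclusive interval."""
    return end - start + 1


def protein_length(start: int, end: int) -> int:
    """Residues encoded by the CDS, assuming one trailing stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    return n // 3 - 1  # subtract the stop codon


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of an uppercase A/C/G/T string (minus-strand features)."""
    return dna.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(cds_length(123085, 126390))      # nucleotides in the CDS interval
    print(protein_length(123085, 126390))  # residues, excluding the stop codon
```

With the coordinates in this record the interval spans 3306 nt, i.e. 1101 codons plus a stop, which agrees with the protein length of 1101 AA.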

Genome Context

Tertiary structure

PDB ID
f04e77b1e478f7b60b6f8a7f6ccc3e2746f5260ebfb12909d69d56baddf5b8f7
Source ESMFold
Method ESMFold
Resolution 0.6360
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
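
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch only: the legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption, and `plddt_class` is a hypothetical name.

```python
def plddt_class(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70 or 50) fall into the lower band;
    this choice is an assumption, the legend leaves it open.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.6, 42.0):
        print(value, plddt_class(value))
```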