Genbank accession
CAB4203768.1 [GenBank]
Protein name
C1q domain containing protein
RBP type
TSP
Evidence RBPdetect
Probability 0,90
Protein sequence
MANNNSLNNTSYGFTLPSLNLNFTSTAQRITGDMSNGTHANRLAFQSSTVNGNTSLFVLPNGSGVSSSVLCSNAGDPTNSAFAQIQIDNTSVKLISSVLGGGVQLPMEIRIGGTAAIDINTSRAVSIANTSAGVPLTITGTSSGGLTVNQSNSGAANISLIQSSATAEPPYVNFFKSRSGGNVNAADQLGNIRFFGFATANQVAAQITCTAESIGATRVSGSISVWTTNTAGSTSPRLNISADGAVTIPATVAGNVPLTITGTNAGGLVVNQATGGVASIYCVQSSADVQPVYINSYKNRAGGNINAGDILYQSQVFGFATAYQVAAQSRCYAESIGAARVSGAFDWFTTSTAGVSGQRLYISSDGALVINTPAAALTALTVNGFNTAGTYAQQINAVTPGAQFSAFRALNNTATAGSSCLIECSVTGQAATGGDPFLRFVNYGTNGVSLGLDTSANQFSMSKTFLGDGNTFLTFDNATNQINLPLQCSFMAEVSVAIPNVTGNGAAYNVIFNSERFDVSNSYNNATGIFTAPKTGKYLFSGSLRISGLTALMTYAQVVLVNSGGNLLFGINNIGLIRSVTTAADNCCIPFSAVVSMTAGDATFIQVIIFNGAGNTAGIAISTSFWSGQLLS
Physico‐chemical
properties
protein length:632 AA
molecular weight: 64546,11510 Da
isoelectric point:5,87018
aromaticity:0,08070
hydropathy:0,20538

Domains

Domains [InterPro]
IPR008983
RBD
470–632
IPR008983
RBD
481–632
IPR001073
RBD
483–632
IPR001073
RBD
505–611
IPR008983
RBD
507–631
CAB4203768.1
1 632
Architecture
RBD
RBD 470-632
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
CAB4203768.1
1 632
Domain Start End Length (AA) Confidence
N-terminal 1 258 258 0,0577
Central domain 259 457 200 0,0481
C-terminal 458 632 174 0,9949
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-258
Central
259-457
C-terminal
458-632

Taxonomy

  Name Taxonomy ID Lineage
Phage uncultured Caudovirales phage
[NCBI]
2100421 Uroviricota > Caudoviricetes > Peduoviridae > Maltschvirus maltsch >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CAB4203768.1 [NCBI]
Genbank nucleotide accession
LR797335 [NCBI]
CDS location
range 1420 -> 3318
strand +
CDS
ATGGCAAATAATAATAGTTTAAATAATACAAGTTATGGGTTTACTCTTCCAAGTTTAAATCTTAATTTCACTTCAACCGCTCAGCGTATTACTGGTGATATGTCTAATGGGACTCACGCCAACCGTCTCGCTTTTCAGTCCAGTACAGTTAATGGAAACACTTCTTTATTTGTTCTACCTAATGGATCAGGAGTTTCTTCTTCTGTCCTTTGTTCTAACGCTGGCGACCCTACTAATTCAGCCTTTGCTCAAATTCAAATAGATAATACTTCAGTTAAACTAATCTCTTCTGTTCTTGGTGGTGGTGTGCAGTTGCCAATGGAAATAAGAATAGGTGGTACTGCTGCTATTGATATAAATACATCTAGAGCTGTCTCCATAGCAAATACAAGTGCTGGCGTGCCTCTAACCATTACAGGAACAAGTTCAGGTGGATTAACAGTCAATCAATCTAATAGTGGTGCTGCAAATATTAGTTTAATTCAAAGTAGTGCAACTGCTGAACCACCTTATGTTAATTTTTTTAAAAGTAGGTCAGGCGGAAATGTAAACGCCGCAGATCAATTAGGTAATATACGCTTTTTTGGGTTTGCAACTGCTAACCAAGTTGCTGCTCAAATTACATGTACTGCGGAGAGTATTGGTGCGACGAGGGTATCAGGGTCAATTTCTGTATGGACAACTAATACTGCTGGCTCAACAAGTCCCCGTTTAAATATCTCTGCTGATGGTGCAGTAACGATACCAGCAACAGTCGCTGGAAACGTGCCCTTAACAATTACTGGGACTAATGCTGGCGGATTAGTAGTCAATCAAGCTACTGGTGGCGTCGCAAGTATTTATTGTGTTCAAAGTAGTGCCGATGTTCAACCAGTTTATATTAATAGTTATAAAAATAGAGCAGGCGGAAACATAAACGCTGGTGATATTTTATATCAAAGTCAGGTTTTTGGTTTTGCTACTGCTTATCAAGTTGCTGCTCAATCCAGATGTTACGCCGAGAGTATTGGAGCTGCACGAGTTTCAGGTGCTTTTGACTGGTTTACGACTAGCACTGCTGGAGTATCTGGTCAGCGTTTATATATTAGTTCCGATGGTGCTTTAGTAATAAATACTCCTGCTGCAGCTCTCACAGCCCTGACAGTAAATGGATTTAACACTGCTGGCACTTATGCTCAACAAATTAATGCGGTTACACCAGGTGCTCAATTTTCTGCGTTTAGAGCTTTAAATAATACTGCTACAGCTGGTAGTTCATGCTTGATTGAGTGTAGCGTTACTGGTCAAGCTGCTACAGGCGGAGACCCTTTTTTAAGATTTGTAAACTATGGTACTAATGGAGTAAGTCTCGGTCTAGACACTAGTGCTAACCAGTTTTCTATGTCTAAAACATTTTTAGGAGATGGAAATACCTTCTTGACTTTTGATAATGCTACAAACCAAATAAATCTGCCCTTGCAATGTAGTTTTATGGCTGAAGTATCTGTTGCTATTCCAAACGTAACTGGTAACGGTGCAGCTTATAATGTTATTTTTAATTCAGAAAGATTTGATGTAAGTAATTCATATAATAATGCAACTGGTATATTTACTGCACCGAAAACAGGAAAATATTTGTTTTCTGGGTCATTAAGAATTTCAGGGTTAACTGCATTAATGACATATGCACAAGTTGTACTTGTAAACTCAGGAGGAAATTTATTATTTGGTATAAATAATATTGGTTTAATCCGTTCTGTAACAACGGCTGCCGATAACTGTTGCATACCTTTTTCAGCAGTTGTTTCTATGACTGCTGGTGATGCAACATTTATTCAAGTTATCATTTTTAATGGAGCTGGCAACACCGCAGGTATAGCAATATCCACAAGTTTTTGGAGTGGTCAATTATTAAGTTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
e452b87bfec8334d87baeb8d66ee143f43ad4d4cc1d0a7719d6611777f7ab9f5
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6621
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50