Protein

Genbank accession
CAB4188256.1 [GenBank]
Protein name
Collagen triple helix repeat
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence RBPdetect
Probability 0,89
TF
Evidence RBPdetect2
Probability 0,89
TF
Evidence Phold
Probability 1,00
Protein sequence
MNPDKLAKIRKAFDTLKSNELLTSADLSTIGKQGERGERGTGFRWRGKALSTSIYQPMDVVHYKHSAYVCTKETSNLPPLDGWEALVIGSVGPQGLEGKAGPQGVKGESIVGATGPQGKQGPTGVQGRQGVAGVRWVGNYAAGREYEIGDLVMFGGTVFLAFERTSSEPTTLNGWAAFSARGERGPSGFRGEAGTISNTSGPIVTTDTTASTSSATGALIVAGGAGIAKDSFINGMIIGTRGTASCTAVGASALLSSTAASCTAIGQQAGYLNTGGNATAIGVNALLSNSGTSCTAMGVSAAYQNSGTFCTAVGVQAAYQNNQASVTAFGQNALFANTGANSTAVGANALQSNQGGSCTAIGQQAGYLNTGGSCVAVGVTAAQLNTGANVIAVGVNALRNNVAAGNTAIGNNALYPATALQPSGINNTAGGLNSLDANTSGTLNSAWGAESLGAVTTGASNVGIGSTAGSNLTTGSNNTIVGAAATVSAVGDDNSIVIGKSAVGIGSNTTVIGVTATTATKIFGVQATGEVAPTVASATTIAPTTQIVFVSGTTAIVTITAPTGIATTGGQITIIPTGIFTTTTAGNIALASTAVVSRALIMTYDATTTKWYPSY
Physico‐chemical
properties
protein length:615 AA
molecular weight: 60729,86520 Da
isoelectric point:8,38884
aromaticity:0,05366
hydropathy:0,12537

Domains

Domains [InterPro]
CAB4188256.1
1 615
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage uncultured Caudovirales phage
[NCBI]
2100421 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CAB4188256.1 [NCBI]
Genbank nucleotide accession
LR797126 [NCBI]
CDS location
range 7885 -> 9732
strand +
CDS
ATGAATCCTGACAAACTCGCAAAGATCCGCAAGGCGTTCGACACGCTGAAATCTAACGAACTCCTGACGAGTGCGGATCTGTCAACGATCGGCAAGCAGGGCGAGCGTGGCGAGCGTGGCACAGGATTTCGTTGGCGTGGCAAGGCGTTGTCGACCTCGATATATCAACCGATGGATGTCGTGCACTACAAGCACAGTGCGTACGTGTGCACGAAAGAAACAAGCAACCTGCCACCACTCGATGGATGGGAAGCACTGGTCATCGGATCAGTCGGACCACAAGGCCTCGAAGGCAAGGCAGGACCGCAAGGCGTGAAGGGCGAGTCGATCGTCGGCGCAACTGGTCCGCAAGGCAAGCAAGGACCAACGGGCGTTCAGGGTCGGCAAGGTGTGGCAGGCGTAAGGTGGGTCGGCAATTATGCGGCTGGACGTGAGTATGAGATCGGCGATCTCGTGATGTTTGGCGGCACGGTGTTCCTTGCGTTCGAGCGCACGAGTTCCGAGCCTACGACACTCAACGGGTGGGCTGCATTCAGCGCACGAGGCGAGCGTGGACCGAGCGGATTCCGTGGCGAGGCGGGGACGATTTCAAACACGAGCGGACCGATCGTCACGACGGACACGACTGCATCGACATCATCTGCAACGGGTGCGCTCATTGTTGCGGGCGGCGCAGGCATTGCGAAGGACTCGTTTATTAACGGGATGATCATCGGGACACGAGGCACGGCGAGTTGCACGGCGGTCGGAGCAAGTGCGCTTTTGTCGAGTACGGCTGCGAGTTGCACGGCAATTGGACAACAGGCGGGATATCTAAACACGGGCGGAAACGCTACGGCGATTGGCGTGAATGCGCTGCTATCAAACTCAGGAACAAGTTGCACGGCGATGGGAGTGAGTGCTGCATATCAAAACTCAGGGACATTTTGCACTGCGGTCGGAGTGCAGGCTGCGTATCAAAACAATCAGGCAAGCGTCACGGCGTTTGGGCAAAATGCGTTGTTTGCAAACACAGGCGCAAATTCCACAGCGGTCGGAGCAAATGCGCTGCAGTCAAATCAAGGCGGAAGTTGCACGGCAATTGGACAACAGGCGGGATATCTAAACACGGGCGGAAGTTGCGTTGCGGTTGGCGTGACTGCGGCACAATTAAACACAGGCGCAAATGTTATAGCGGTCGGCGTAAATGCGTTGCGTAACAATGTTGCTGCAGGCAACACAGCGATCGGAAACAATGCACTCTATCCCGCAACTGCTTTGCAACCAAGCGGCATCAACAACACGGCAGGCGGTCTCAACTCGCTCGACGCAAACACAAGCGGCACTTTGAACTCTGCGTGGGGAGCAGAATCGCTCGGCGCAGTCACGACGGGCGCAAGCAATGTCGGCATCGGATCGACTGCAGGCAGCAACCTCACAACAGGCAGCAACAACACCATCGTCGGCGCAGCCGCAACCGTGTCCGCAGTCGGTGACGACAACTCCATCGTCATCGGCAAGTCAGCAGTAGGTATCGGATCAAACACCACAGTCATCGGCGTGACGGCAACAACCGCTACGAAAATTTTCGGAGTGCAGGCAACAGGTGAAGTTGCGCCAACCGTTGCAAGTGCAACCACGATCGCCCCGACAACGCAGATCGTGTTCGTGTCAGGCACAACCGCAATCGTCACGATCACCGCACCAACGGGAATCGCAACCACAGGCGGACAGATCACAATCATCCCCACAGGCATCTTCACAACAACCACCGCAGGCAACATCGCACTCGCATCCACTGCAGTTGTCTCTCGTGCCCTCATCATGACCTACGACGCAACAACCACAAAGTGGTATCCATCCTATTGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
7dd8ddee76fabf47a2f5b3486d754702ca6fa33b8d4a6a725ac52e7f465fbfe0
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7946
Evidence 0,7946

Literature

No literature entries available.