GenBank accession
CAK8273894.1 [GenBank]
Protein name
tail protein
RBP type Evidence Probability
TSP DepoScope 1,00
TSP RBPdetect 0,91
TSP RBPdetect2 0,95
Protein sequence
MLLSALLNQHKRTKNMALYREGKAAMAADGTVTGTGTKWQSSLSLIRPGATIMFLSSPIQMAVVNKVVSDTEIKAITTNGAIVASSDYAILLSDSLTVDGLAQDVAETLRYYQSQETVIADAVEFFKNFDFDSLQDLANQINADSESAQSSAAAAAASENAAKTSENNAKSSEVAAENARDQVQQIINDAGDASTLVVLAQADGGDKIGFKLNSTYAPMRTVRKRLLDTINVIDFGAKGDGVTDDYPAFQYAAMYAESIGGAIIEIPTPAVEYKIGFPVYLFNNTHFKGSGINCRINFTDPLYARKSRSGFVIGSCREQNRDKAIQCLMNGTWATTGSVVDSTFVELSRGVYLRDNLSKVQSSNCCVSDVYLVATYPNGTTLKGGYGVSFANAIDCEAYNLWGEGWTEIINIGSDVPPATPSCHNCHAYNIICVEPNHYETYYSAGFIANSTACSIHDYKQLKPIADGSPHGSGASMNYTEDCLIYNFDIPSLGRTASSEGVLVNNSKGAVVHDINGGNAKSLVSEYYTTEVRIFYDAAKPNVFYNIHANNCDHAVALRSKYSVWKNVTQSNCTDHVYFGTSNAQYCDVRFVPDSIGYGSGLTPWHMLQSNYVAGWRVRTKYIRPINYLLNDKSSLQSWDTNRNMKAKAGTNLQVLYDIPVTMRAIAEVRCYLTFEVGALTAGSNVKMSIRRMVNYSGNSSEAPYIEATNTKTATADTVQDTTLVVAAGNTDGFIKCADTTNGLENSLDLLVEINNPTVNMNLKEIKFTYLGD
Physico-chemical properties
protein length: 773 AA
molecular weight: 83998,10180 Da
isoelectric point: 5,32271
aromaticity: 0,09185
hydropathy: -0,19185
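The aromaticity and hydropathy values above can be recomputed from the protein sequence. The sketch below assumes the common definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The record does not state which definitions it uses, so treat this as an illustration rather than the database's own method. The function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition).

    Raises KeyError on non-standard residue codes such as X.
    """
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence above, used only as a short demo.
demo = "MLLSALLNQHKRTKNMALYR"
print(len(demo), aromaticity(demo), gravy(demo))
```

Passing the full 773-residue sequence instead of `demo` gives values that can be compared with the ones listed in this section.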

Domains

Domains [InterPro]
DC_0313 STR 1–770
Coil Unmapped 162–182
IPR012334 STR 179–459
Architecture
STR 1–770
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 241 241 0,9909
Central domain 242 622 382 0,9926
C-terminal 623 773 150 0,9850
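The start and end coordinates in the table can be checked for consistency with a few lines of code. The sketch below assumes 1-based inclusive coordinates, which the record does not state explicitly. Under that assumption the three segments tile residues 1 to 773 without gaps, and the coordinate-derived lengths are 241, 381 and 151. The Length column lists 382 and 150 for the last two segments, so it may follow a different boundary convention. Function names are hypothetical.

```python
PROTEIN_LENGTH = 773

# (name, start, end) as listed in the segmentation table.
segments = [
    ("N-terminal", 1, 241),
    ("Central domain", 242, 622),
    ("C-terminal", 623, 773),
]


def inclusive_length(start: int, end: int) -> int:
    """Residue count for a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def tiles_protein(segs, total: int) -> bool:
    """True if the segments cover 1..total in order with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in segs:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total + 1


for name, start, end in segments:
    print(name, inclusive_length(start, end))
print("contiguous:", tiles_protein(segments, PROTEIN_LENGTH))
```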
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-241
Central 242-622
C-terminal 623-773

Taxonomy

  Name Taxonomy ID Lineage
Phage Webervirus KLPPOU149 [NCBI] 2845084 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)
GenBank protein accession
CAK8273894.1 [NCBI]
GenBank nucleotide accession
CAVLHI020000002 [NCBI]
CDS location
range 20721 -> 23042
strand +
CDS
ATGCTATTATCTGCGTTACTCAACCAACACAAGAGGACTAAAAACATGGCACTATACAGAGAAGGTAAGGCGGCTATGGCCGCAGACGGAACCGTTACCGGGACTGGCACAAAATGGCAATCATCGCTTTCACTGATACGCCCTGGCGCGACGATTATGTTTTTGTCGTCACCAATTCAAATGGCCGTCGTAAACAAGGTGGTTAGCGATACTGAAATTAAAGCCATCACCACAAACGGCGCTATCGTAGCGTCTAGCGATTACGCGATCCTTTTAAGTGACTCGCTTACCGTTGACGGTCTGGCGCAAGATGTTGCTGAAACTCTGCGCTACTATCAGTCACAGGAAACCGTGATCGCGGATGCAGTCGAGTTCTTCAAGAACTTTGATTTCGATTCCCTGCAAGATCTTGCTAACCAAATTAATGCAGACTCTGAATCTGCACAATCAAGCGCTGCGGCTGCTGCTGCGTCTGAAAATGCGGCCAAAACTTCAGAGAATAACGCCAAGTCTTCAGAGGTGGCTGCGGAGAATGCAAGAGACCAAGTTCAGCAGATCATTAATGACGCTGGAGACGCATCAACACTGGTTGTGCTGGCGCAGGCAGATGGCGGCGACAAAATAGGCTTTAAGCTGAATTCTACCTATGCGCCAATGCGCACTGTTAGAAAGCGCCTGCTTGACACCATCAATGTAATTGATTTTGGGGCCAAGGGTGATGGCGTAACCGATGATTACCCGGCGTTTCAATATGCCGCCATGTACGCTGAATCTATCGGTGGCGCAATTATTGAAATCCCGACGCCAGCTGTTGAATACAAGATCGGTTTTCCAGTATACCTGTTCAATAACACCCATTTTAAAGGTTCCGGAATTAACTGCCGTATCAACTTTACTGACCCACTATATGCTAGGAAATCACGCAGCGGCTTTGTCATTGGTAGTTGCCGCGAGCAAAACAGAGATAAGGCAATTCAATGCCTTATGAACGGCACATGGGCCACCACAGGTTCAGTGGTTGATTCAACATTTGTTGAGCTTTCGCGTGGGGTTTACCTGCGTGACAACCTTAGCAAAGTACAATCATCTAACTGCTGCGTCAGTGATGTCTACCTGGTAGCCACTTACCCTAATGGAACTACGCTAAAAGGCGGGTATGGCGTATCCTTTGCGAACGCTATTGATTGCGAAGCGTATAACCTTTGGGGTGAAGGCTGGACGGAAATAATCAATATTGGTTCGGACGTTCCGCCAGCAACACCAAGCTGTCATAACTGCCACGCTTACAACATCATTTGCGTTGAACCGAACCATTATGAAACATATTACAGCGCCGGTTTCATTGCTAACTCCACGGCATGTTCTATTCATGACTACAAGCAGTTAAAACCAATTGCCGATGGTTCGCCGCACGGTTCTGGCGCGTCGATGAACTACACGGAAGACTGTTTAATCTATAATTTCGATATCCCTAGCCTCGGTCGCACAGCTAGTTCGGAAGGTGTTCTGGTCAACAACTCCAAAGGCGCGGTAGTCCATGACATCAACGGCGGTAACGCCAAGTCGCTCGTGTCGGAATATTACACAACGGAAGTCAGGATTTTCTACGATGCTGCGAAACCGAATGTCTTCTACAATATTCACGCAAACAACTGTGACCATGCTGTTGCCCTGCGTTCTAAATACAGCGTATGGAAAAATGTTACGCAGTCTAACTGCACGGATCACGTTTACTTTGGAACCAGCAACGCGCAATATTGTGATGTCAGATTTGTTCCTGATTCAATTGGTTACGGTTCTGGATTAACGCCGTGGCACATGTTGCAAAGCAACTATGTTGCTGGCTGGCGAGTGCGGACTAAATACATTAGGCCGATAAACTATCTTCTTAATGACAAGTCATCTTTGCAGTCATGGGACACCAACCGCAACATGAAGGCGAAGGCCGGAACGAACTTGCAGGTTCTTTATGATATTCCTGTAACTATGAGAGCAATTGC
GGAAGTAAGATGTTACTTGACTTTTGAAGTTGGCGCACTGACAGCAGGTTCTAACGTGAAAATGTCTATTCGCCGTATGGTTAACTATTCTGGTAACTCATCAGAAGCGCCGTATATCGAAGCCACGAATACCAAAACGGCAACGGCTGACACAGTTCAAGATACCACTCTTGTGGTGGCAGCCGGTAACACCGATGGATTTATTAAGTGCGCTGATACTACGAACGGTCTTGAAAATTCTCTTGATTTACTGGTTGAGATCAATAACCCGACAGTAAATATGAACCTTAAAGAGATCAAATTTACCTATTTGGGGGACTAA
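The CDS coordinates and the protein length can be cross-checked arithmetically. Assuming the range 20721 -> 23042 is 1-based and inclusive, and that the CDS ends with a stop codon (the sequence above ends in TAA), the span is 2322 nt, or 774 codons, which leaves 773 residues once the stop codon is excluded. The helper names below are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide count for a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def protein_length_from_cds(nt_length: int, has_stop_codon: bool = True) -> int:
    """Number of translated residues for a CDS of the given length."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


nt = cds_length(20721, 23042)
print(nt, protein_length_from_cds(nt))
```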

Tertiary structure

PDB ID
0828dfa83ee84b242ef46055c30717e555687a77cc06cb04e04b41e95173a573
Source ESMFold
Method ESMFold
Resolution 0,6999
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
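The confidence bands above map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 fall, so assigning them to the lower band is an assumption made here. The function name is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Return the confidence label for a pLDDT score on the 0-100 scale.

    Boundary values (exactly 90, 70, 50) go to the lower band; this is an
    assumption, since the legend only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 81.0, 63.5, 42.0):
    print(value, plddt_band(value))
```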