GenBank accession
CAK6604371.1 [GenBank]
Protein name
tail spike protein
RBP type Evidence Probability
TF GenBank 1,00
TSP DepoScope 1,00
TSP RBPdetect 0,91
TSP RBPdetect2 0,94
Protein sequence
MDFFTQPKGSTIGVLRDGRTVQEAFDSLMYTEVTLKSGRENAEANRVSLQAAADSTKLISVPAGEYFVSAHITLRTGNVIVGRGANASSSGYTRLVCETTGEGVFWYTGDTSTGQKRMPQIYNMGLKGDYPIRFNDERTAVIADQAASNVPYGMVPVVQFCTIDPLTNGVGIGISASKMFDGVFSFNEIANFDTGVLLNGCDLMYVAHNRIRNAYKYMVLELGVGTFGSQNEIYHNDILHAGSADCIFIKSTARHVRIYDNYLEQASGTSGQALIGFIDASVVDAPVFNGNAAAVRASTIIKDNRIDGQHFAKYFVYKYQPLGQTYGVIEDVSTVGPNTGLGANHLVLVDASGVTIDRVPFLYNSVQPCSFRFSGPRFGKWNGYNSASDYALKMTGSNMSMWGTSLGGNNLKDYLSARGNSLILSSGFTASAVLQFPSGTLIRPNSQYAIKVTAKCSSGSEALTFAGVANGTGQTSVTMQLSTEPVSATAQFTSGSTQSGFSLGRSNNGADIEIIAIEFIKLYSVEYSVASDTGTVTVYRSTGDIVISAAGNNQFPAYTVLRYIGGSLKEVYKSVTDAAVIGIAWSVSGQTLTVNITGSDGGKRFSVTQEEV
Physico-chemical properties
protein length: 612 AA
molecular weight: 65343,49990 Da
isoelectric point: 5,88030
aromaticity: 0,09967
hydropathy: -0,03039
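
The aromaticity and hydropathy values above can in principle be recomputed from the protein sequence. The sketch below assumes the common definitions, which this record does not state: aromaticity as the fraction of F, W, and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence in this record.
    peptide = "MDFFTQPKGS"
    print(len(peptide), round(aromaticity(peptide), 5), round(gravy(peptide), 5))
```

Passing the full 612-residue sequence instead of the ten-residue prefix gives values that can be compared with the listed ones, provided the database uses the same definitions.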

Domains [InterPro]
DC_1231 ATT 1–148
IPR012334 STR 41–339
IPR011050 STR 49–358
Architecture (CAK6604371.1, residues 1-612)
ATT 1-148 | STR 149-358 | RBD 360-612
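
The architecture string can be parsed into labelled segments to check how much of the 612-residue protein it covers. This is a minimal sketch; the helper names `parse_architecture` and `uncovered` are hypothetical and not part of any database tooling.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'ATT 1-148 | STR 149-358' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)\s*-\s*(\d+)\s*", part)
        if match is None:
            raise ValueError(f"cannot parse segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def uncovered(segments: list[tuple[str, int, int]], length: int) -> list[int]:
    """Return 1-based positions not covered by any segment."""
    covered = set()
    for _, start, end in segments:
        covered.update(range(start, end + 1))
    return [pos for pos in range(1, length + 1) if pos not in covered]


segments = parse_architecture("ATT 1-148 | STR 149-358 | RBD 360-612")
print(segments)
print(uncovered(segments, 612))  # [359]
```

With the ranges given above, residue 359 is the only position that falls between segments.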

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Start End Length (AA) Confidence
N-terminal 1 42 42 0,9555
Central domain 43 387 346 0,9914
C-terminal 388 612 224 0,9304
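
The Start and End columns can be used to look up which domain a residue belongs to. The sketch below assumes 1-based, inclusive coordinates, which the record does not state explicitly; `DOMAINS`, `domain_of`, and `span_length` are illustrative names.

```python
# (name, start, end) taken from the segmentation table; coordinates are
# assumed to be 1-based and inclusive.
DOMAINS = [
    ("N-terminal", 1, 42),
    ("Central domain", 43, 387),
    ("C-terminal", 388, 612),
]


def domain_of(position: int) -> str:
    """Return the name of the domain containing a 1-based residue position."""
    for name, start, end in DOMAINS:
        if start <= position <= end:
            return name
    raise ValueError(f"position {position} is outside 1-612")


def span_length(start: int, end: int) -> int:
    """Number of residues in an inclusive [start, end] span."""
    return end - start + 1


print(domain_of(100))  # Central domain
print([span_length(start, end) for _, start, end in DOMAINS])  # [42, 345, 225]
```

Inclusive spans computed this way come to 42, 345, and 225 residues, so the Length column for the last two rows appears to follow a different boundary convention; the source does not say which.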
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).


Taxonomy

Phage name: Klebsiella phage vB_Kpn_K43PH164C1 [NCBI]
Taxonomy ID: 3071633
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: No host information

Coding sequence (CDS)

GenBank protein accession
CAK6604371.1 [NCBI]
GenBank nucleotide accession
OY979391.1 [NCBI]
CDS location
range 36729 -> 38567
strand +
CDS
ATGGATTTCTTCACTCAGCCGAAAGGCTCAACCATTGGTGTGCTCAGAGATGGGCGCACCGTCCAAGAGGCGTTCGACTCGCTGATGTACACGGAGGTTACATTGAAGTCTGGTAGAGAGAACGCCGAGGCCAATAGGGTATCCCTTCAGGCTGCTGCTGACTCCACGAAGCTAATCTCAGTTCCAGCTGGTGAATACTTTGTGTCGGCACACATTACCCTTCGCACAGGAAACGTAATAGTTGGGCGAGGGGCAAATGCCTCCTCTTCAGGGTACACGAGGTTAGTATGTGAGACCACTGGGGAAGGTGTGTTCTGGTACACAGGGGACACGTCCACTGGTCAGAAGCGTATGCCTCAGATTTACAACATGGGCCTCAAGGGTGACTACCCGATTCGCTTTAACGATGAGCGTACTGCCGTCATTGCGGACCAAGCAGCATCTAACGTGCCGTATGGTATGGTTCCTGTCGTGCAGTTCTGTACAATCGACCCACTGACTAACGGGGTGGGGATTGGCATATCGGCGTCCAAGATGTTCGACGGCGTGTTCTCTTTCAACGAAATCGCCAACTTCGACACTGGTGTCTTATTGAATGGGTGTGACCTAATGTATGTTGCGCACAACCGTATCCGCAATGCATACAAGTACATGGTGCTTGAGCTGGGCGTTGGAACGTTTGGTTCCCAGAACGAGATTTACCACAATGACATCCTTCACGCCGGGTCTGCCGATTGTATCTTCATCAAGTCTACAGCTCGCCATGTGCGCATCTACGATAACTACCTAGAGCAAGCCTCCGGGACTAGCGGTCAGGCCCTCATTGGCTTCATTGATGCCTCGGTGGTTGACGCTCCTGTGTTTAATGGGAACGCGGCGGCTGTGCGTGCTTCCACCATCATAAAGGACAACCGTATAGACGGTCAGCATTTTGCGAAATACTTCGTGTATAAGTATCAGCCGTTAGGTCAGACATATGGTGTAATCGAAGACGTGTCTACCGTGGGTCCCAACACTGGACTTGGTGCTAACCACTTGGTGCTTGTGGATGCTTCCGGGGTCACTATTGACCGGGTTCCGTTCCTGTATAACTCTGTGCAGCCATGTTCATTCCGCTTCTCTGGCCCACGCTTCGGTAAGTGGAATGGTTACAACTCTGCGAGTGACTACGCTCTGAAGATGACAGGCTCGAATATGTCAATGTGGGGTACATCCTTAGGTGGGAACAACCTGAAAGACTACCTGTCGGCTCGCGGAAACTCCTTGATTCTGTCTAGCGGGTTCACTGCGTCGGCTGTCTTACAGTTCCCGTCTGGTACCCTAATCAGGCCAAACTCGCAGTACGCCATTAAGGTTACCGCCAAGTGCTCTAGCGGTAGTGAGGCTCTCACATTCGCTGGGGTGGCTAATGGTACTGGGCAGACAAGCGTTACCATGCAACTCAGTACGGAACCAGTAAGTGCTACTGCGCAATTCACCTCAGGCTCCACCCAGAGTGGCTTCTCCTTGGGCCGTTCCAACAATGGGGCGGACATTGAGATTATCGCTATAGAGTTCATTAAGTTGTACTCAGTGGAATACTCGGTGGCTTCCGACACTGGTACGGTGACGGTGTATCGTAGTACTGGTGACATTGTTATCTCGGCGGCTGGTAACAACCAGTTCCCAGCTTACACCGTGCTGCGCTACATCGGCGGAAGCCTCAAGGAAGTGTATAAGAGCGTCACTGATGCCGCCGTTATTGGCATTGCGTGGTCCGTATCTGGCCAGACCCTTACCGTGAACATTACGGGGAGTGATGGTGGCAAGCGGTTCTCCGTCACACAGGAGGAGGTATAA
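
The CDS coordinates and the protein length can be cross-checked with simple arithmetic, and the start of the CDS can be translated to confirm it matches the protein sequence. This sketch assumes inclusive coordinates and the standard genetic code; `translate` is a hypothetical helper, not a database function.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


start, end = 36729, 38567          # CDS range on the + strand
cds_length = end - start + 1       # inclusive coordinates
codons = cds_length // 3
print(cds_length, codons, codons - 1)       # 1839 613 612
print(translate("ATGGATTTCTTCACTCAGCCG"))   # MDFFTQP
print(translate("GTATAA"))                  # V*
```

The 1839-nucleotide range corresponds to 613 codons, that is, 612 residues plus the terminal TAA stop codon, which agrees with the listed protein length.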

Genome Context

Tertiary structure

PDB ID
c5f4706eab7e93e740f210eb437ca68fa6caac080c78d8cdb6f1079e1b006825
Source ESMFold
Method ESMFold
Resolution 0,6802
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
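
The confidence bands above can be applied to per-residue pLDDT scores as in the sketch below. The record does not say which band the exact boundary values 90, 70, and 50 belong to; assigning them to the lower band is an assumption, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to its confidence band.

    Boundary values (exactly 90, 70, 50) are placed in the lower band;
    this is an assumption, since the legend uses strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT out of range: {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 80.0, 61.5, 32.0):
    print(score, plddt_band(score))
```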