GenBank accession: YP_008241368.1 [GenBank]
Protein name: tail fiber protein
RBP type
Type Evidence Probability
TF GenBank 1.00
TF Phold 1.00
TSP DepoScope 1.00
TSP RBPdetect 0.85
TSP RBPdetect2 0.94
Protein sequence
MHMGLIERSQVVRDETTQGANTAERVGGILVDLSTALEEGGAVGPAGPQGEQGIQGEQGIQGEQGVKGDTGDTGPQGIQGIQGEPGETGPQGIQGETGPQGIQGIQGEPGEDGAQGIQGIKGDTGDTGPQGLQGIQGAAGLQGEQGIQGEPGPSTPQTIDDGSIVEAKLSTELSDKINSPVEIINYFSPTSASDLSNASNANKTARIDNDIDLGGATVVMASGVIIEPNGGIISNGTLSTPKEITGSKIKVFDTDVSFSGLYLNGFFYPEWFGASADGVTIDDVSIKKACDNSETVKFTDSKKYRISTTIDITRSGKLTIIGGDSRPIITSNGGTLSSDFFKFNSTVTEANMYKLELDGGDIVANGIYTLTSGEIKDNYLHDFYNLTSSSVGIRSDIAKEVNINIEGNIIHTVKSENDGGIGGSGGASRSISVNWQTGGTGVATIKNNKLTYAFGDDGDLIQVANQSNDYTTDIQTNITGNRLIGGVRRCAKLTAHNILFEGNYVESIYSAHEETTGVVAAGMVSVGVYNDVATPSAYVTGVKVINNTFNNIGGWDGRVYISRTSGYVNKGNTYLNRASMVIYLKNKNLSIDRNDFDNGGISTQTPTFEGKNSISFNTANFELTNTTRTSFFAHDNSSNINDIIFDSNVVQTIESIVSGVFYPYYIGNGGTASNIHFRNNTVNKSGTNVRPEFIRSQINFPASCSFSNNWNIGHASTTGVLKFSGGSVTYPQVFINVNNYNGAGVLLEPSTN
Physico-chemical properties
protein length: 752 AA
molecular weight: 79198.97380 Da
isoelectric point: 4.64081
aromaticity: 0.07314
hydropathy: -0.33418
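
The page does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming aromaticity is the relative frequency of F, W and Y and hydropathy is the mean Kyte-Doolittle value (GRAVY); both choices are assumptions, and the function names are illustrative only:

```python
# Kyte-Doolittle hydropathy values per residue (assumed scale; the page does not name one).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence above, used only as a short demo input.
    fragment = "MHMGLIERSQ"
    print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Passing the full 752-residue sequence to the same functions gives values that can be compared with the ones listed above, provided the assumed definitions match those used by the page.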

Domains [InterPro]

Accession Type Range
DC_0656 STR 6–109
PTHR24637 Unmapped 32–157
DC_1284 STR 91–370
IPR008160 STR 95–153
Architecture
STR 6–370
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (YP_008241368.1, residues 1–752)
Domain Start End Length (AA) Confidence
N-terminal 1 192 192 0.9914
Central domain 193 721 530 0.9564
C-terminal 722 752 30 0.8304
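
The segment lengths can be checked against the boundaries. A minimal sketch, assuming the start and end positions are 1-based and inclusive (the page does not state the convention); the `Domain` type and helper names are illustrative only. Under that assumption the central and C-terminal spans come out as 529 and 31 residues rather than the tabulated 530 and 30, which may reflect a different boundary convention in the segmentation output.

```python
from typing import List, NamedTuple


class Domain(NamedTuple):
    name: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


def span_length(domain: Domain) -> int:
    """Number of residues in an inclusive [start, end] span."""
    return domain.end - domain.start + 1


def covers_protein(domains: List[Domain], protein_length: int) -> bool:
    """True if the domains tile residues 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for domain in sorted(domains, key=lambda d: d.start):
        if domain.start != expected_start:
            return False
        expected_start = domain.end + 1
    return expected_start == protein_length + 1


# Boundaries as listed in the segmentation table.
DOMAINS = [
    Domain("N-terminal", 1, 192),
    Domain("Central domain", 193, 721),
    Domain("C-terminal", 722, 752),
]

if __name__ == "__main__":
    for d in DOMAINS:
        print(d.name, span_length(d))
    print("tiles full protein:", covers_protein(DOMAINS, 752))
```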
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-192
Central: 193-721
C-terminal: 722-752

Taxonomy

Phage: Cellulophaga phage phi17:1 [NCBI]
Taxonomy ID: 1327980
Lineage: Uroviricota > Caudoviricetes > Helsingorvirus >
Host: Cellulophaga baltica [NCBI]
Taxonomy ID: 76594
Lineage: cellular organisms > Bacteria > Pseudomonadati > FCB group > Bacteroidota/Chlorobiota group > Bacteroidota

Coding sequence (CDS)

GenBank protein accession: YP_008241368.1 [NCBI]
GenBank nucleotide accession: NC_021795.1 [NCBI]
CDS location: range 26169 -> 28427, strand +
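
The CDS coordinates can be tied back to the protein length. A minimal sketch, assuming the range is 1-based and inclusive and that the CDS ends with a stop codon; the helper names are illustrative only:

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide count of an inclusive [start, end] range."""
    return end - start + 1


def protein_length_from_cds(start: int, end: int, has_stop: bool = True) -> int:
    """Residue count implied by a CDS range, optionally excluding the stop codon."""
    nucleotides = cds_length(start, end)
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nucleotides // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    # Coordinates as listed for YP_008241368.1 on NC_021795.1.
    print(cds_length(26169, 28427))
    print(protein_length_from_cds(26169, 28427))
```

With the listed range this gives 2259 nucleotides, that is 753 codons, or 752 residues once the stop codon is excluded, which agrees with the stated protein length.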
CDS
TTGCATATGGGACTTATAGAGAGATCGCAAGTAGTACGAGACGAAACAACACAAGGAGCAAACACCGCTGAGAGAGTTGGTGGGATTCTTGTGGATTTATCAACAGCTTTAGAGGAGGGAGGAGCCGTTGGTCCTGCAGGGCCTCAAGGAGAGCAGGGCATCCAAGGAGAGCAAGGTATCCAAGGAGAGCAGGGAGTCAAAGGGGATACTGGAGATACTGGTCCTCAAGGGATCCAAGGTATACAGGGTGAACCCGGAGAGACCGGACCTCAAGGTATACAAGGTGAGACTGGACCTCAAGGTATACAAGGTATTCAGGGTGAGCCTGGAGAGGATGGTGCTCAAGGTATACAAGGAATCAAAGGAGATACAGGAGATACCGGTCCTCAGGGGCTGCAAGGTATTCAAGGTGCAGCCGGATTGCAAGGTGAGCAAGGTATCCAAGGTGAGCCTGGACCTTCAACCCCTCAGACTATAGATGATGGATCTATCGTTGAGGCTAAACTTAGTACAGAGCTGTCTGACAAAATAAACTCTCCTGTAGAAATTATTAATTACTTCTCACCTACAAGCGCATCTGATTTATCAAATGCATCAAACGCAAACAAAACTGCTAGAATAGATAATGACATTGATTTAGGGGGGGCAACTGTAGTTATGGCTAGTGGTGTAATTATAGAGCCTAATGGAGGTATTATTAGTAATGGAACTTTATCAACACCAAAGGAAATAACAGGTAGCAAAATAAAGGTTTTTGATACGGATGTATCGTTTTCTGGATTGTACTTAAATGGATTCTTTTATCCTGAATGGTTTGGTGCTTCTGCTGATGGTGTAACTATTGATGATGTTTCTATAAAAAAAGCGTGTGATAATTCAGAGACGGTAAAATTTACAGATAGTAAAAAATATAGAATTTCAACAACAATAGATATAACCAGAAGCGGAAAATTAACTATTATAGGCGGAGACAGTAGACCAATAATAACATCTAACGGAGGCACATTGTCTAGTGATTTTTTTAAATTCAATAGTACTGTTACAGAAGCTAACATGTACAAGTTAGAATTAGACGGTGGTGACATCGTTGCAAATGGTATTTATACCTTAACATCTGGTGAAATAAAAGATAATTACTTACATGATTTCTATAATTTAACCAGTTCATCTGTAGGAATAAGATCAGATATAGCTAAAGAGGTAAATATTAATATAGAGGGAAATATAATTCATACCGTTAAGTCTGAAAATGATGGAGGTATTGGAGGTAGCGGAGGCGCTAGTCGTTCTATATCCGTAAACTGGCAGACAGGAGGCACGGGAGTTGCTACAATTAAAAATAATAAATTAACTTACGCTTTTGGTGATGATGGGGATTTAATTCAAGTAGCAAATCAATCGAACGACTACACTACAGATATACAAACCAATATAACTGGTAATAGATTAATTGGAGGCGTTAGAAGATGTGCTAAATTAACAGCTCATAACATATTATTTGAGGGCAATTACGTTGAAAGTATATATTCAGCACACGAAGAAACAACAGGAGTTGTGGCTGCTGGTATGGTTTCGGTTGGAGTCTACAACGATGTAGCTACACCTAGCGCATATGTTACAGGCGTAAAAGTTATCAATAATACATTTAATAACATTGGCGGTTGGGATGGTCGCGTATATATTTCCCGTACTTCTGGGTATGTAAATAAAGGAAACACTTATTTAAACCGTGCTTCTATGGTTATATATCTTAAAAACAAAAACTTATCAATAGATAGAAATGATTTTGATAATGGAGGTATAAGCACACAAACACCAACTTTTGAAGGTAAAAATAGCATATCATTTAACACCGCAAATTTTGAGTTAACAAACACCACAAGGACATCATTTTTTGCTCATGATAATTCAAGTAACATTAATGATATTATTTTTGATTCTAATGTGGTTCAAACAATAGAAAGTATTGTTAGCGGTGTGTTTTATCCTTATTACATCGG
AAATGGCGGCACAGCTTCTAATATTCACTTTAGGAACAACACGGTAAATAAGTCAGGTACAAATGTTAGACCTGAGTTTATAAGGTCACAAATTAACTTTCCTGCAAGTTGTTCATTTAGTAACAATTGGAATATAGGACACGCAAGTACTACAGGGGTTTTAAAGTTTTCTGGTGGTTCGGTTACATATCCGCAAGTATTTATTAATGTGAACAACTATAACGGAGCAGGAGTTTTATTAGAGCCTTCAACAAACTAA
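
The CDS starts with TTG while the protein starts with M, which is consistent with an alternative start codon being read as methionine. A minimal translation sketch, assuming the standard codon assignments and a simplified start-codon set (ATG, GTG, TTG); the page does not state which genetic code table was used, so both are assumptions:

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments; '*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplified set of start codons translated as Met (an assumption, not taken from the page).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a CDS up to the first stop codon; a recognized start codon becomes M."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks inside the sequence
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons of the CDS above.
    print(translate("TTGCATATGGGACTTATA"))
```

The first six codons translate to MHMGLI, matching the start of the protein sequence listed above.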

Tertiary structure

PDB ID: f8bebf491410406e69c57c7b32dde16c21449baad77b5714d6521c0b1278b728
Source: ESMFold
Method: ESMFold
Resolution: 0.7820
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
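
The confidence bands above can be applied to per-residue pLDDT values. A minimal sketch; the legend does not say where values of exactly 90, 70 or 50 belong, so assigning them to the lower band is an assumption, and the function name is illustrative only:

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if plddt > 90:
        return "Very high"
    if plddt > 70:   # exactly 90 falls here (assumed)
        return "High"
    if plddt > 50:   # exactly 70 falls here (assumed)
        return "Low"
    return "Very low"  # exactly 50 falls here (assumed)


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 10.0):
        print(score, plddt_band(score))
```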