Genbank accession
QOE32401.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,88
TSP
Evidence RBPdetect2
Probability 0,93
Protein sequence
MAFKFKGSLSAVDASLKGGVIFDNEKLSDSKIMPKGKFTINEIDESTTDLASNTLVTNFGKQENIAGMFSFNTGDWKDTASAPVKLMMRVVNDVNDSDWVTIFDSSKTTVTWSSIAAAGQTKFTVPNLDFKKAIIYVNGVLQYPGLSYQTFGGTVTFNSKLSTGDTIYVVVGEDTTNTDNEEFTATATEDQTVIVLPFTKKTNFVFINGILQYPDSYTVSGSTITLGDKLYLNDDILVLGNETDLVSFTALASQNQTDFTLPGEFDDGLVYINGVLQYDNAYSVSGTTLSFISSLDENDNVFVTLNDPKFIDSVNASYTSTITDESNNKLNIPYFNFSELQVFINGVLQNPDSGAYILNGTEITLSSPLQLGDDIHVIVYNSPVQNSNLVTKADLTSYATAEELQELKNSLKNEGINLKWQPHLPSIEVAFGLPRRSLKIWTVGSTSTVNQYWLYPVDGTVWTGTGILGTVPGVPFYKLDSNKDVITWTYTATSDNVTRIFVPYNFGSITIFINGILQSVNLNHYTIDGQYINLNGALQTGDNFIAILGKLVFNTNPYVTNQDLISYAKTSKLDFYVEKSNLSSEDGLKYIGMAKSVSSLRTIEPEYDGQHIEVKSFYENLNVGGGIFVYESDNTNTDDGLINIVTYNGKVWKRKLENNIIYISMAGILGNGDDETAKLVNLISVCSSLSGYVIDGENKTITKSSSIEIDLVKVGLQNISLYDTTVNSYDPYYILILDGSSVSSRKSKGNICVFNNVEVYGAVNRNTSNVHGILIQPSNSLSSVSMNNIDIRYVNCGLVFAANSYLTIHNNWVIHNCNIALSTTQYTGTSSTTLNNAGENIRFNDCIISDSNMISRLTGFECGLTFNSCSFDYTGGSVENNYVQWSDFRQGSKIDFYGCHFESGNVNDNWTGNYFQVNTSVAINIIGGMMRHSSTYNSCPYWFYDESTNGQFVIEGTDIWGAGVRQWSNRGMSKFYPMINGLNSQVRGYLSDRSDLILENNFSNSTSTTTLDNWQVIDGTKSGALTSDVLSCTVTTFTDSNSITYPALKVKKLKTSAGATLRLYVKAPYSNFGPLGTVSIGSNSVIIPSGVCTTSVGFVKSKNMFDSYGQPLHTKITPVVSTTITSISSTISTFDARGQLTISDDYKGYDYLFLSVNLGGLSANDEIYITYYNLEVPKR
Physico‐chemical
properties
protein length:1179 AA
molecular weight: 129518,02050 Da
isoelectric point:4,67298
aromaticity:0,11026
hydropathy:-0,13944

Domains

Domains [InterPro]
DC_0116
ATT
174–255
QOE32401.1
1 1179
Architecture
ATT
STR
ATT
RBD
ATT 41-255 | STR 256-471 | ATT 562-696 | RBD 847-1174 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QOE32401.1
1 1179
Domain Start End Length (AA) Confidence
N-terminal 1 676 676 0,9901
Central domain 677 993 318 0,9910
C-terminal 994 1179 185 0,9088
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-676
Central
677-993
C-terminal
994-1179

Taxonomy

  Name Taxonomy ID Lineage
Phage Klebsiella phage Muenster
[NCBI]
2767572 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Klebsiella pneumoniae
[NCBI]
573 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QOE32401.1 [NCBI]
Genbank nucleotide accession
MT708547.1 [NCBI]
CDS location
range 107156 -> 110695
strand -
CDS
ATGGCATTTAAATTTAAAGGATCACTATCAGCAGTTGATGCATCCCTTAAAGGTGGAGTGATATTTGATAATGAAAAGTTATCAGATTCTAAAATTATGCCTAAAGGAAAATTCACAATAAATGAAATTGATGAAAGCACTACAGATTTAGCAAGTAATACTTTAGTAACAAATTTTGGTAAACAAGAAAATATCGCTGGTATGTTTTCTTTTAATACTGGAGATTGGAAAGACACCGCTTCGGCACCAGTGAAATTGATGATGAGAGTCGTTAACGATGTTAACGATTCGGATTGGGTTACTATCTTTGATTCATCCAAAACTACAGTAACATGGAGTTCTATCGCCGCCGCCGGGCAAACTAAATTCACAGTTCCTAATCTAGATTTCAAAAAAGCAATTATCTATGTTAACGGTGTACTACAATATCCAGGTTTATCTTACCAAACATTTGGTGGTACTGTAACATTTAACTCCAAATTAAGTACAGGTGATACCATTTATGTTGTTGTTGGTGAAGACACAACCAACACTGATAATGAAGAATTTACTGCAACTGCAACCGAAGATCAAACTGTTATTGTGTTACCATTTACCAAAAAAACTAATTTTGTTTTTATTAATGGTATTTTGCAATATCCAGATTCGTATACTGTTTCTGGAAGTACTATCACTCTTGGTGATAAATTGTATTTAAATGATGATATTCTTGTTCTTGGTAATGAGACTGACCTGGTTTCATTTACTGCTCTTGCGTCTCAAAATCAAACTGATTTTACTTTACCTGGTGAATTTGATGATGGTCTTGTTTATATCAATGGTGTTTTGCAATACGATAATGCATATAGTGTATCAGGCACAACTTTATCATTTATTTCGTCTTTGGATGAAAATGATAATGTTTTTGTTACATTGAATGATCCAAAATTCATTGATAGTGTTAATGCCAGTTATACCAGTACCATAACCGATGAATCGAATAATAAACTAAATATACCTTATTTTAATTTTAGTGAATTGCAAGTTTTTATCAATGGTGTTTTGCAAAATCCAGACAGCGGTGCTTATATATTAAACGGTACTGAAATTACGTTAAGTTCTCCTTTACAATTAGGAGACGATATTCATGTTATTGTTTATAACTCTCCAGTACAGAACAGTAATTTAGTAACCAAAGCAGATTTAACTTCCTATGCAACCGCAGAAGAACTACAGGAATTAAAAAATTCATTAAAAAATGAAGGTATAAATTTAAAATGGCAACCACATTTACCGAGTATTGAAGTAGCATTTGGTTTACCACGAAGATCATTGAAAATATGGACAGTTGGTAGTACATCGACCGTGAACCAGTACTGGTTATATCCAGTTGACGGTACTGTCTGGACTGGTACAGGAATTCTTGGTACAGTACCAGGTGTACCGTTTTATAAATTAGATTCAAACAAAGACGTTATTACATGGACTTATACTGCAACCTCTGATAATGTAACTAGAATCTTCGTTCCGTATAATTTTGGTAGTATTACTATTTTTATTAATGGTATTCTTCAAAGCGTAAATTTAAATCACTACACAATTGATGGACAATACATCAATTTAAATGGTGCATTACAAACTGGTGATAATTTTATTGCAATTCTTGGTAAATTGGTTTTCAATACAAATCCATATGTTACAAACCAAGATTTGATAAGTTATGCTAAAACTTCAAAATTGGATTTTTATGTAGAAAAATCCAATTTGTCATCTGAAGATGGTTTAAAATATATAGGAATGGCTAAAAGTGTTTCTTCACTTCGCACGATAGAACCTGAGTATGATGGACAACACATTGAAGTAAAATCATTTTATGAAAATTTAAATGTAGGTGGTGGGATATTCGTTTATGAAAGCGATAATACTAACACCGATGATGGTTTAATAAACATTGTAACATATAACGGTAAAGTATGGAAGAGAAAACTTGAAAACAATATTATATATATATCAATGGCTGGGATTTTAGGAAATGGCGACGACGAAACAGCTAAATTGGTAAATTTAATTTCTGTTTGTTCCTCATTATCTGGTTACGTAATAGACGGTGAAAATAAAACAATTACAAAATCATCATCCATAGAAATAGACTTGGTTAAAGTTGGATTACAAAACATATCATTATATGATACTACTGTTAACTCATATGACCCTTATTATATATTAATTTTAGATGGTTCTTCTGTTAGTAGCAGGAAATCGAAAGGTAACATATGTGTATTCAATAATGTTGAAGTTTACGGTGCAGTAAATCGAAATACTAGCAATGTTCATGGTATTTTAATTCAGCCATCAAATAGCTTGTCTAGTGTTTCTATGAATAATATAGACATTCGATATGTAAACTGTGGTTTAGTCTTTGCAGCTAATTCTTATTTAACTATTCATAATAATTGGGTGATACATAATTGTAATATAGCACTAAGTACTACTCAATATACTGGAACATCTTCCACTACACTTAATAATGCAGGTGAAAATATTAGATTTAATGATTGCATCATAAGTGATTCGAATATGATTTCAAGATTAACTGGATTTGAATGTGGGTTAACTTTTAATTCGTGTTCGTTTGATTATACAGGTGGTAGCGTAGAGAATAATTATGTTCAATGGAGTGATTTCAGACAAGGATCAAAAATTGATTTTTATGGATGCCACTTTGAATCTGGAAATGTCAATGATAATTGGACAGGAAATTATTTTCAAGTTAATACATCAGTAGCAATTAACATAATTGGTGGAATGATGAGACATAGTAGTACATATAATTCATGTCCATATTGGTTCTATGATGAAAGTACAAATGGTCAGTTTGTTATAGAAGGTACAGATATATGGGGTGCTGGAGTTAGACAATGGTCCAATAGAGGAATGTCTAAATTTTACCCAATGATCAATGGACTCAATTCACAAGTTCGTGGATATTTATCAGATAGGAGTGATTTAATTTTAGAAAATAACTTTTCCAACTCAACATCCACCACAACTTTAGACAATTGGCAAGTAATTGATGGTACAAAATCTGGAGCACTAACTAGTGATGTACTTTCATGCACAGTTACTACATTCACAGATTCAAACTCAATCACATATCCAGCATTAAAGGTGAAAAAATTGAAAACATCAGCCGGGGCAACTTTAAGGCTATACGTTAAAGCACCATATAGTAATTTCGGCCCGCTTGGAACTGTATCTATTGGGAGCAATTCAGTAATTATACCAAGTGGCGTATGTACTACCTCGGTAGGTTTTGTCAAAAGTAAAAATATGTTCGACTCATACGGTCAACCATTACATACAAAAATAACTCCTGTTGTTTCTACTACAATTACTTCAATATCATCAACAATATCCACATTTGATGCAAGAGGGCAATTGACTATTAGTGATGATTATAAAGGGTATGATTATTTGTTTTTATCTGTAAATTTAGGTGGGTTATCTGCTAACGACGAAATTTATATTACTTATTATAACTTAGAAGTACCTAAACGTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
aa612f9707e50d1c5713646931eca207490629e6c41537d605c4be9d70b05952
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5951
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete genome sequence of Klebsiella pneumoniae phage Muenster Martin,C.C., Lessor,L., Le,T., Gill,J., Young,R. and Liu,M. 2021-01-21 GenBank