Genbank accession
WPK27955.1 [GenBank]
Protein name
non-contractile tail tubular protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect2
Probability 0,62
Protein sequence
MEVQGSLGRQIQGISQQPPAVRLDGQCTAMINMIPDVVNGTQSRMGTTHIAKILDAGTDDMATHHYRRGDGDEEYFFTLKKGQVPEIFDKYGRKCNVTSQDAPMAYLAEVINPREDVQFMTIADVTFMLNRRKVVKASNRKSPKVGNKAIVFCAYGQYGTSYSIVINGANAASFKTPDGGSADHVEQIRTERITSELYSKLQQWSGVSDYEIQRDGTSIFIERRDGASFTITTTDGAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPYIIERTGIVDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDGSYSGVREFYTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVWKWPIGTKVRGMFYSGELLYLLLERGNGVYLEKMDMGDALTYGLNDRIRMDRQAELIFQHFEAEDEWVSEPLPWVPTNPELLDCILIEGWNSYIGGSFLFKYNPGDNTLSTTFDMHDDSHVKAKVIVGQIYPQEFEPTPVVIRDRQDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTVGYVEPREGVFRFPLRAKSTDAVYRIIVESPHTFQLRDIEWEGSYNPTKRRV
Physico‐chemical
properties
protein length:800 AA
molecular weight: 90013,84780 Da
isoelectric point:5,72922
aromaticity:0,10375
hydropathy:-0,34825

Domains

Domains [InterPro]
DC_0058
STR
1–800
IPR058003
TTP
3–799
WPK27955.1
1 800
Architecture
STR
STR 1-800
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WPK27955.1
1 800
Domain Start End Length (AA) Confidence
N-terminal 1 141 141 0,9642
Central domain 142 355 215 0,0983
C-terminal 356 800 444 0,2926
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-141
Central
142-355
C-terminal
356-800

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB-Eco-KMB47
[NCBI]
3054435 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli CFT073
[NCBI]
199310 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WPK27955.1 [NCBI]
Genbank nucleotide accession
OR424385 [NCBI]
CDS location
range 24887 -> 27289
strand +
CDS
ATGGAAGTACAAGGTTCATTAGGTAGACAAATCCAAGGTATTAGCCAGCAGCCGCCAGCGGTACGCTTGGATGGTCAGTGCACAGCTATGATTAATATGATACCTGATGTAGTGAATGGTACTCAATCACGCATGGGTACAACTCATATTGCAAAGATACTTGATGCGGGGACTGATGACATGGCTACTCATCATTATCGCAGAGGTGATGGTGATGAAGAGTATTTCTTCACGTTGAAGAAAGGACAAGTTCCTGAGATATTTGATAAGTATGGGCGCAAATGTAATGTGACTTCACAAGATGCACCTATGGCTTACTTGGCTGAGGTGATAAACCCAAGAGAAGATGTGCAATTTATGACGATAGCTGATGTTACTTTCATGCTTAACCGCAGGAAGGTAGTTAAAGCTAGTAACAGAAAGTCACCCAAAGTTGGAAACAAAGCCATTGTGTTTTGTGCATATGGTCAATACGGTACATCTTATTCCATTGTAATTAATGGGGCCAACGCTGCTAGTTTTAAAACACCGGATGGTGGAAGTGCAGACCATGTTGAACAAATTCGAACTGAACGTATCACTTCTGAATTGTACTCTAAGTTACAGCAATGGAGCGGTGTGAGTGACTATGAAATACAAAGAGACGGTACTAGTATATTTATCGAGAGACGGGATGGTGCTAGCTTTACAATAACAACCACCGATGGTGCAAAAGGTAAGGATTTAGTGGCTATCAAGAATAAAGTTAGCTCTACTGACCTACTCCCTTCTCGTGCGCCTGCGGGTTATAAAGTGCAAGTGTGGCCTACTGGCAGCAAACCTGAGTCTCGTTACTGGCTGCAAGCTGAGCCTAAAGAGGGAAACCTTGTATCTTGGAAAGAAACAATAGCTGCCGATGTATTACTTGGGTTTGATAAAGGCACAATGCCTTATATCATTGAACGTACAGGTATCGTAGACGGCATAGCTCAATTCAAGATAAGACAAGGCGATTGGGAAGATCGTAAAGTAGGAGATGACCTGACTAACCCTATGCCTTCTTTTATTGATGAAGAAGTACCTCAGACAATAGGTGGGATGTTCATGGTGCAGAACCGCCTATGCTTTACAGCAGGTGAAGCTGTTATTGCTTCTCGTACATCATACTTCTTCGATTTCTTTCGTTATACGGTTATCTCTGCATTGGCAACTGACCCATTTGATATTTTCTCAGATGCTAGTGAAGTCTACCAGCTAAAACATGCAGTGACCTTAGATGGCGCTACCGTGTTGTTCTCTGATAAGTCACAATTCATACTGCCAGGAGATAAGCCTTTAGAGAAGTCAAATGCATTGCTTAAGCCTGTTACAACATTTGAAGTGAACAATAAAGTGAAGCCAGTAGTAACTGGTGAATCGGTAATGTTTGCCACTAATGATGGTTCTTACTCTGGTGTACGAGAGTTCTATACAGACTCTTATAGTGACACTAAGAAGGCACAAGCAATCACAAGTCATGTGAATAAACTCATCGAAGGTAACATTACCAACATGGCAGCAAGCACCAATGTCAACAGGTTACTTGTCACTACCGATAAGTATCGTAACATAATCTACTGCTACGATTGGTTATGGCAAGGAACAGACCGTGTACAATCAGCATGGCATGTATGGAAGTGGCCTATAGGTACAAAGGTGCGAGGTATGTTTTATTCTGGCGAATTACTTTACCTGCTCCTTGAGAGAGGTAATGGTGTCTATCTGGAGAAGATGGATATGGGTGATGCATTAACCTATGGTTTAAATGATCGCATCCGAATGGATAGACAGGCAGAGTTAATCTTCCAGCATTTCGAAGCAGAAGATGAATGGGTATCTGAACCGCTACCTTGGGTTCCTACTAACCCAGAACTTTTAGATTGCATCTTAATCGAGGGTTGGAATTCATATATTGGTGGTTCTTTCCTATTCAAATACAACCCCGGTGATAACACCTTGTCTACAACCTTTGATATGCATGATGATAGCCACGTAAAAGCGAAGGTTATTGTTGGTCAGATTTACCCTCAAGAGTTTGAACCAACACCTGTAGTTATCAGAGATAGGCAAGACCGTGTATCCTATATTGATGTACCTGTTGTGGGATTAGTTCACCTTAATCTTGATATGTATCCTGATTTCTCCGTGGAGGTTAAGAATGTGAAGAGTGGTAAAGTACGCAGGGTGCTAGCGTCAAACCGGATAGGTGGTGCTCTCAACAATACAGTAGGTTATGTTGAACCAAGAGAAGGTGTCTTCAGATTCCCACTGAGAGCTAAGAGTACGGATGCTGTTTATCGTATTATTGTAGAATCACCTCACACATTCCAGCTTCGTGATATTGAGTGGGAAGGGAGCTACAATCCAACCAAAAGGAGGGTCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
bcb315f85e7cda53f9eaa785cbf1416648dffc1c24fe51c5ece7bc1c3f19cc24
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,3100
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50