Genbank accession
YP_007348357.1 [GenBank]
Protein name
tail protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MAKVGGSYDSVVLGVSQQTPQDRRSGQMWEQVNMVSDPVQGLTRRHGSVFEAKQDLKAGTVNTNWLREAAVKFKVRPFSIGGIDYDLIYSNEYVRGNITEVLPVYCYDKTNKKFLPVRGSGDVWGALVANGASAVVNIGSYLFLSAKGYVPQYTTTTKYTPDNERKSIAIWVRNGDYSRDYNFRFTTTAGTTFLAAVRTPASTYPGKLDTSGIPVPVINMAGIGDSGSAEDTMKLNKAIADFNSKMAQYNKQIADSTNGYNSAVTQWLGTASAAIQPEQIAIQLTDQIRSKAGLTAQQVQRDGSYIFITDAANVKTGECVAVSDAYLKAVVNDVAKPDDLIPKHFFGKTVKVRSQKATGKDAYYLVAEAKDGQSGLYGDVIWRETAGVQTTPTKVFCVGTIANGTLFIASDPASLESAAGITGVPRFVGSQVGDQISIPVPNFLKKGISYMGVFQDRLLIGTGSTVFASRPGDYFNWFRQSVLSVSDNDPVEMYALGSEDDTIYWDTTFDRNHVMFGRKYQYIISGRSLLTPNNPNIQIMSAVEDAVQAEPQASGNLVFYGKDIISKGSLHQMQVGATTDSAESYECSQQLDRYIKGKPCQILCNQSPYVVLLRTTEKYNGFYVYTYLDSMQGGQRMFDSWSTWEWDEKLGYCAGISKYQGEILCYTLRTHNNWMGMVCDRFTFDTELSDYPYLDSWRPMKDWQANNQDLVPSQFPKRLSVAYTVAHTYYFMGSPYENLDNNMPGWESDINSLIIGVNYPAYFTPTSPYLRDKNDKAILNGRLTISRLNVAVSDTGALDGQLDLGDRQIDLPGFSGRILTRLGNFVGRQPIVETSVIMPIYKEIREYKLKLMARDWLPLTVTGLEWVGQWFSRVRRV
Physico‐chemical
properties
protein length:877 AA
molecular weight: 97573,02070 Da
isoelectric point:6,64922
aromaticity:0,11288
hydropathy:-0,29818

Domains

Domains [InterPro]
IPR058003
TTP
1–55
DC_0058
STR
1–877
YP_007348357.1
1 877
Architecture
STR
STR 1-877
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_007348357.1
1 877
Domain Start End Length (AA) Confidence
N-terminal 1 159 159 0,9107
Central domain 160 358 200 0,4743
C-terminal 359 877 518 0,2125
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-159
Central
160-358
C-terminal
359-877

Taxonomy

  Name Taxonomy ID Lineage
Phage Cronobacter phage vB_CskP_GAP227
[NCBI]
1264737 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Cronobacter sakazakii
[NCBI]
28141 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_007348357.1 [NCBI]
Genbank nucleotide accession
NC_020078 [NCBI]
CDS location
range 24536 -> 27169
strand +
CDS
ATGGCTAAAGTTGGCGGAAGCTACGACTCAGTTGTGCTCGGCGTGAGCCAGCAGACCCCGCAAGACCGCCGCTCTGGTCAGATGTGGGAACAAGTTAACATGGTGTCTGACCCGGTACAGGGCTTGACTCGCCGACACGGTTCCGTGTTTGAAGCGAAGCAAGACCTCAAGGCAGGCACCGTCAACACTAACTGGCTGCGGGAAGCGGCGGTTAAATTCAAGGTGCGTCCGTTCTCTATCGGCGGCATCGACTACGACTTGATTTACTCAAACGAGTACGTCCGGGGCAACATCACTGAGGTTCTCCCGGTGTACTGCTATGACAAGACGAACAAGAAGTTCCTGCCTGTCCGAGGCTCTGGCGACGTGTGGGGTGCTCTGGTGGCGAACGGCGCTTCGGCGGTGGTCAACATCGGTTCATACCTGTTCCTGTCTGCGAAGGGCTATGTCCCGCAGTACACGACCACCACGAAGTATACGCCTGACAACGAGCGCAAGTCGATTGCCATCTGGGTTCGTAACGGCGACTACAGCCGCGACTACAACTTCCGATTCACCACTACAGCAGGCACCACCTTCCTGGCAGCGGTTCGCACACCAGCGAGCACCTACCCTGGCAAGCTGGACACCTCCGGCATCCCGGTTCCGGTCATTAACATGGCAGGCATCGGCGACAGCGGCAGCGCGGAAGACACCATGAAGTTGAACAAAGCTATCGCAGACTTCAACTCCAAGATGGCGCAGTACAACAAACAGATTGCCGACTCAACCAACGGGTACAACTCGGCGGTGACGCAGTGGCTCGGCACGGCATCGGCGGCTATCCAGCCTGAGCAGATTGCCATCCAGCTCACTGACCAGATTCGCTCAAAGGCGGGCCTGACGGCTCAGCAGGTTCAGCGCGATGGCTCCTACATCTTCATCACGGATGCAGCCAACGTTAAAACTGGTGAGTGTGTGGCGGTATCAGACGCCTACCTGAAAGCTGTGGTTAATGACGTTGCCAAGCCTGACGACCTAATCCCTAAGCACTTCTTCGGGAAGACGGTCAAGGTGCGCTCGCAGAAGGCCACGGGTAAGGACGCGTACTACCTGGTAGCGGAGGCGAAGGACGGGCAGAGCGGGCTGTACGGCGACGTTATCTGGCGCGAGACGGCGGGTGTGCAGACGACGCCAACGAAGGTGTTCTGCGTAGGCACCATCGCCAACGGCACGTTATTCATCGCATCTGACCCGGCATCGCTGGAGTCGGCGGCGGGCATCACTGGCGTACCGCGATTCGTGGGTAGCCAGGTCGGTGACCAGATTTCTATCCCGGTGCCGAACTTCCTCAAGAAGGGAATCAGCTACATGGGCGTGTTCCAGGACAGGCTCCTGATTGGCACAGGCTCCACGGTGTTCGCCAGCCGACCGGGGGATTACTTCAACTGGTTCCGACAGTCGGTGCTGAGCGTATCCGACAACGACCCGGTGGAAATGTACGCCCTCGGCTCCGAAGACGACACGATTTATTGGGACACCACGTTTGACCGTAACCACGTTATGTTCGGACGCAAGTACCAGTACATCATCAGTGGGCGCTCTCTGCTTACGCCTAACAACCCCAACATCCAGATTATGTCTGCGGTGGAGGATGCTGTTCAGGCTGAGCCGCAGGCCTCTGGTAACCTCGTCTTCTACGGCAAGGACATTATCAGCAAAGGCTCTCTGCACCAGATGCAGGTGGGCGCTACGACGGACTCTGCGGAATCGTATGAGTGCTCGCAGCAGCTCGACCGCTACATCAAGGGCAAGCCGTGCCAAATCCTGTGCAACCAGTCACCTTACGTCGTGCTTCTCCGCACCACTGAGAAGTATAACGGGTTCTACGTCTACACATACCTGGATTCAATGCAGGGTGGTCAGCGTATGTTCGATAGCTGGTCAACGTGGGAATGGGACGAGAAGCTGGGCTACTGTGCTGGCATTTCCAAGTACCAGGGCGAGATTCTCTGCTACACGCTGCGTACCCACAATAACTGGATGGGTATGGTCTGCGACCGCTTTACCTTCGACACGGAACTGAGCGATTACCCGTATCTGGATTCGTGGCGACCCATGAAAGACTGGCAAGCCAACAATCAAGACCTGGTGCCTTCGCAGTTCCCGAAACGGTTGAGCGTGGCGTACACTGTGGCGCACACCTACTACTTCATGGGTTCGCCATACGAGAACCTGGACAACAACATGCCGGGCTGGGAATCGGATATCAACAGCCTCATCATCGGGGTTAACTATCCGGCCTACTTCACGCCAACGTCGCCTTACCTTCGGGATAAGAACGACAAGGCAATCCTGAATGGACGCCTGACGATTAGCCGCCTCAACGTCGCGGTGAGTGACACCGGGGCGCTGGATGGCCAGCTCGACCTGGGCGACCGTCAGATTGACCTGCCGGGGTTCAGTGGTCGAATCTTGACCCGGTTGGGTAACTTCGTGGGCCGTCAGCCGATTGTGGAAACGTCGGTAATCATGCCGATTTACAAGGAGATTCGGGAATACAAACTTAAACTAATGGCGCGAGATTGGCTACCCCTGACTGTAACAGGTCTGGAGTGGGTAGGCCAATGGTTCAGCCGTGTACGGAGGGTTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
b3ff579524c925b8de994e21f23d72a73c1c9f4ccdca5b964f05d48669891775
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8445
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50