Genbank accession
YP_010844868.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,55
Protein sequence
MAIRPKLNRVWTSNNSVARRDPGDEKYLKGWVAEIPTYQVLNYLQYKIDTTMLAQAERGIFEWGSDVSYGVGSLAWDETNKTIYVCTVANPNKTLRPSANSAQWSPSSIQVSRANYDSIVAAINAHIADVTTNPHKVTAAQIGAYNKSELDAIVANYRAMVQAHVTDYNNPHKLTAVQVGAVPVAGGAYTGEVTFTTVFLNDSKTAQIVNDGGLYLRNGSYYLGINGATGNAEVGTASSKSPVVTDLTFPTLKVATEPEYAVPEPIVNMPLIGSINLRAGIGKVNSSANEPVYAPQFGNALLVRHGAAFGIRGTEEVLAGGRQATLAIDTIFSGTPIGGNWLWDFGLGWFTILVTPNRDVRCEIKGTGTVAVSNSVTLPMNEWVRIVATYNAATGRVCLYINGVKAFDLTPSNLPTTGFREGTIYNSVSASYPNISTYLRNFRVWSDELTDKQVSTL
Physico‐chemical
properties
protein length:457 AA
molecular weight: 49367,02700 Da
isoelectric point:6,96740
aromaticity:0,09409
hydropathy:-0,09606

Domains

Domains [InterPro]
DC_0782
STR
6–450
G3DSA:2.60.120.200
STR
324–457
IPR013320
STR
339–457
PF13385
LEC
346–455
YP_010844868.1
1 457
Architecture
STR
STR 6-457
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_010844868.1
1 457
Domain Start End Length (AA) Confidence
N-terminal 1 215 215 0,8803
Central domain 216 420 206 0,7162
C-terminal 421 457 36 0,5893
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-215
Central
216-420
C-terminal
421-457

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoP_Bp7
[NCBI]
2593331 No lineage information
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_010844868.1 [NCBI]
Genbank nucleotide accession
NC_079181.1 [NCBI]
CDS location
range 47503 -> 48876
strand +
CDS
ATGGCGATTCGTCCTAAACTGAACCGAGTCTGGACCTCAAACAACTCTGTCGCAAGACGTGACCCAGGGGATGAAAAATATCTCAAGGGTTGGGTTGCGGAGATTCCGACCTATCAGGTTCTTAACTACCTACAATACAAGATTGACACCACAATGCTTGCACAAGCAGAACGTGGAATCTTTGAGTGGGGCAGTGATGTAAGTTATGGCGTTGGTAGTTTGGCTTGGGACGAAACAAACAAAACGATTTATGTTTGTACTGTCGCCAACCCCAACAAAACCTTACGCCCAAGTGCAAACTCTGCACAGTGGAGTCCAAGCTCCATTCAAGTTTCTCGTGCGAATTACGACTCAATTGTTGCAGCAATCAATGCACACATTGCAGACGTAACCACAAACCCGCACAAGGTAACTGCTGCACAAATTGGTGCATACAACAAGTCTGAACTTGATGCGATTGTGGCCAACTATCGTGCAATGGTTCAAGCGCACGTAACGGATTACAACAATCCACACAAACTTACTGCGGTCCAGGTTGGTGCTGTCCCTGTGGCTGGTGGTGCATACACTGGTGAAGTTACCTTCACAACCGTGTTTCTGAATGATTCTAAGACTGCCCAGATTGTTAACGACGGTGGATTGTATCTTCGCAATGGAAGCTACTATCTTGGAATCAACGGGGCCACAGGAAATGCAGAAGTCGGAACAGCTTCAAGCAAGTCCCCTGTCGTAACTGACTTAACATTCCCGACACTCAAAGTTGCAACTGAACCTGAGTATGCAGTGCCGGAACCAATTGTTAACATGCCCCTTATTGGCAGCATCAACCTGCGAGCCGGGATTGGTAAGGTCAACAGCAGTGCCAACGAGCCAGTATATGCACCGCAGTTTGGTAATGCACTATTGGTAAGACATGGTGCTGCTTTTGGGATACGCGGCACTGAAGAGGTGCTGGCAGGTGGTAGACAAGCGACACTTGCCATTGACACCATATTCTCCGGTACTCCGATTGGAGGTAATTGGTTGTGGGACTTTGGTCTTGGTTGGTTTACAATTCTTGTAACACCCAACAGGGATGTCCGGTGTGAGATTAAAGGGACAGGAACTGTTGCAGTATCTAATAGTGTAACCTTGCCGATGAACGAGTGGGTTCGCATTGTTGCTACATACAATGCGGCAACAGGGAGGGTTTGTCTGTACATCAATGGCGTAAAAGCATTCGACCTTACCCCAAGTAACCTCCCAACAACAGGATTTAGGGAGGGGACTATCTATAATTCGGTATCAGCATCATATCCTAATATCTCCACATACTTGCGAAACTTCCGAGTATGGTCTGACGAGTTAACAGACAAACAAGTTTCAACCTTATAG

Genome Context

Genome Context

Tertiary structure

PDB ID
001a6f79b9764c0eb63f1578bebb0da5a5236048330b96568ac3220a53d53762
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7278
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50