GenBank accession
URN70745.1 [GenBank]
Protein name
tail fiber
RBP type
TF (evidence: GenBank, probability: 1.00)
TSP (evidence: RBPdetect, probability: 0.89)
Protein sequence
MYYSLMRESKVIVEYDGRAFHFDALSNYDIQTSYEEFKTLRRTVHRRTNYADSIINAQTPSSISLAVNFSNTLTEANFFEWLGFDRKGNTFLLPLYSNNIEPIMFNIYIVNKDNNCVYFENCYVSTVDFSLDKNIPILNVGIESGKFSEVSTYREAASIIQGEVMSYSPVIASTNGSILPGLISASLSFQQQCSWREDKSVFDINKIYNNKRAYVNEMNASATISLYYLKRFAGDMVYNIEPETDVPLNIRNNSISIDFPLARITKRLDFSDVYKVEWDIIPTASSDPVRIDFFGEIKND
Physico-chemical properties
protein length: 300 AA
molecular weight: 34444.34400 Da
isoelectric point: 4.91580
aromaticity: 0.13333
hydropathy: -0.23333
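
The record does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle value (GRAVY); both are assumptions, although the listed values are arithmetically consistent with them (40/300 and -70/300). The function names are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = seq.upper()
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over the sequence (assumed definition)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the record, for illustration only;
    # substitute the full protein sequence to compare with the listed values.
    peptide = "MYYSLMRESK"
    print(len(peptide), round(aromaticity(peptide), 5), round(gravy(peptide), 5))
```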

Domains

Domains [InterPro]
IPR056389 ATT 1–298
Architecture (URN70745.1, residues 1–300)
ATT 1–298
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (URN70745.1, residues 1-300)
Domain          Start  End  Length (AA)  Confidence
N-terminal      1      287  287          0.6168
Central domain  288    289  3            0.7948
C-terminal      290    300  10           0.7446
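
The segment coordinates can be sanity-checked against the protein length. The sketch below assumes 1-based inclusive coordinates, which the page does not state; the helper names are hypothetical. Under that assumption the three segments tile residues 1-300 without gaps, and the inclusive spans are 287, 2 and 11 residues, so the listed lengths of 3 and 10 for the central and C-terminal rows presumably follow a different counting convention.

```python
# Segment coordinates as listed in the record (name, start, end).
SEGMENTS = [
    ("N-terminal", 1, 287),
    ("Central domain", 288, 289),
    ("C-terminal", 290, 300),
]
PROTEIN_LENGTH = 300


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def tiles_protein(segments, protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gap or overlap."""
    expected_start = 1
    for _name, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


lengths = {name: inclusive_length(start, end) for name, start, end in SEGMENTS}

if __name__ == "__main__":
    print(lengths)
    print(tiles_protein(SEGMENTS, PROTEIN_LENGTH))
```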
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-287
Central: 288-289
C-terminal: 290-300

Taxonomy

       Name  Taxonomy ID  Lineage
Phage  Escherichia phage EC104 [NCBI]  2936939  Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host   Escherichia coli [NCBI]  562  cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
URN70745.1 [NCBI]
GenBank nucleotide accession
ON185581.1 [NCBI]
CDS location
range 94549 -> 95451
strand -
CDS
ATGTACTACTCTCTAATGAGAGAGTCAAAAGTTATAGTTGAGTATGATGGTAGGGCATTTCATTTTGATGCCCTATCAAACTATGATATACAGACTTCCTACGAGGAATTCAAGACTCTTCGTAGGACTGTTCATCGTAGAACTAACTATGCAGACTCTATTATAAATGCTCAAACCCCCTCTTCTATCTCTCTAGCAGTAAATTTCAGTAATACTCTTACGGAGGCTAACTTCTTTGAATGGTTAGGTTTTGATAGAAAAGGTAATACTTTCTTACTCCCACTATATAGTAATAATATTGAACCTATTATGTTTAATATCTATATAGTAAATAAAGATAATAACTGTGTATATTTTGAAAACTGCTATGTATCTACAGTAGATTTTTCTTTAGATAAGAACATACCAATTCTTAATGTTGGTATTGAGTCTGGGAAATTCTCAGAAGTATCTACCTATAGAGAAGCAGCTTCTATTATACAGGGTGAAGTAATGTCTTACAGCCCAGTAATAGCTTCTACTAATGGCAGCATCTTACCCGGTCTTATTTCTGCCTCTTTATCTTTCCAACAGCAGTGCTCCTGGAGAGAGGATAAGAGTGTTTTTGATATAAATAAAATTTATAATAATAAAAGAGCTTATGTAAATGAAATGAATGCTTCGGCAACCATTTCTCTATATTACTTAAAACGTTTTGCTGGAGATATGGTTTACAATATCGAACCAGAGACCGATGTACCTTTAAATATAAGAAATAATAGTATTTCTATAGATTTTCCTTTAGCACGTATTACAAAACGCCTAGATTTCTCAGATGTGTATAAAGTTGAGTGGGATATTATACCTACTGCTTCTTCAGACCCAGTGAGAATAGATTTCTTTGGAGAAATTAAAAATGATTAA
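
The CDS coordinates and the protein length can be cross-checked with a little arithmetic. This is a sketch assuming 1-based inclusive coordinates (the GenBank convention) and a CDS that includes its stop codon, as the listed sequence ending in TAA suggests; the function names are illustrative. For the range 94549 -> 95451 it gives 903 nt, 301 codons and 300 amino acids, matching the stated protein length.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based inclusive coordinate range."""
    return end - start + 1


def protein_length_from_cds(n_nucleotides: int, includes_stop: bool = True) -> int:
    """Amino-acid count encoded by a CDS of the given length.

    Raises ValueError if the length is not a whole number of codons.
    """
    if n_nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = n_nucleotides // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    n = cds_length(94549, 95451)
    print(n, protein_length_from_cds(n))
```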

Tertiary structure

PDB ID
83793476287a9702cc13ecd674bef382744a19253493f08107daebf88c87dc65
Source ESMFold
Method ESMFold
Resolution 0.8167
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
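
The confidence bands can be expressed as a small lookup. A minimal sketch: the legend leaves the boundary values (exactly 90, 70 and 50) unassigned, so placing each in the lower band is an assumption, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band.

    Boundary values (90, 70, 50) fall into the lower band; this is an
    assumption, since the legend uses strict inequalities on both sides.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 80.0, 60.0, 30.0):
        print(value, plddt_band(value))
```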

Literature

Title: Complete genome sequences of 17 Escherichia coli bacteriophages isolated from wastewater, pond water, cow manure and bird feces
Authors: Vitt, A.R., Ahern, S.J., Gambino, M., Holst Sorensen, M.C. and Brondsted, L.
Date: 2022-10-20
PMID: not listed
Source: GenBank