Genbank accession
URO83710.1 [GenBank]
Protein name
straight tail fiber
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence RBPdetect
Probability 0,89
Protein sequence
MSNNTYQHVSNESKYVKFDPVGSNFPDTVTTVQSALSKISNIGVNGIPDATMEVKGIAMIASEQEVLDGTNNSKIVTPATLATRLLYPNATETKYGLTRYSTNEETLEGSDNNSSITPQKLKYYTDDVFQNRYSSESSNGVIKISSTPAALAGVDDTTAMTPLKTQKLAIKLISQIAPSEDTATESVRGVVQLSTVAQTRQGTLREGYAISPYTFMNSVATQEYKGVIRLGTQAEINNNLGDVAVTGETLNGRGATGSMRGVVKLTTQAGIAPEGDSSGALAWNADVINTRGGQTINGSLNLDHLTANGIWSRGGMWKNGDQPVATERYASERVPVGTIMMFAGDSAPPGWIMCHGGTVSGDQYPDYRNTVGTRFGGDWNNPGIPDMRGLFVRGAGTGGHILNQRGQDGYGKDRLGVGCDGMHVGGVQAQQMSYHKHAGGWGEYNRSEGPFGASVYQGYLGTRKYADWDNASYFTNDGFELGGPRDALGTLNREGLIGYETRPWNISLNYIIKIHY
Physico‐chemical
properties
protein length:516 AA
molecular weight: 55420,81260 Da
isoelectric point:5,70222
aromaticity:0,08140
hydropathy:-0,45969

Domains

Domains [InterPro]
DC_0176
STR
1–436
G3DSA:2.10.280.10
Unmapped
246–286
IPR015173
STR
246–324
SSF69349
STR
246–346
IPR011083
ATT
337–392
URO83710.1
1 516
Architecture
STR
ATT
STR
STR 1-323 | ATT 324-392 | STR 393-514 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EC128
[NCBI]
2936909 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
URO83710.1 [NCBI]
Genbank nucleotide accession
ON210139.1 [NCBI]
CDS location
range 81269 -> 82819
strand -
CDS
ATGAGTAATAATACATATCAGCACGTATCAAATGAATCAAAATATGTTAAATTCGATCCAGTGGGATCGAATTTTCCTGACACTGTTACGACGGTTCAGTCTGCATTATCAAAAATAAGTAATATCGGCGTAAATGGTATTCCTGATGCAACTATGGAAGTTAAAGGAATAGCAATGATTGCATCAGAGCAAGAGGTTTTAGATGGAACAAATAATTCTAAAATCGTTACACCAGCTACATTAGCAACAAGATTATTGTATCCAAATGCTACTGAAACTAAATATGGTTTAACGCGTTATTCAACAAATGAAGAAACATTAGAAGGCTCAGATAATAATTCATCAATAACACCGCAAAAACTGAAATACTATACTGATGATGTGTTCCAAAATAGATATTCGTCTGAGTCATCAAACGGGGTTATTAAAATATCATCTACGCCTGCAGCTTTGGCTGGTGTCGATGATACTACGGCGATGACTCCGCTGAAAACCCAAAAACTTGCAATAAAATTAATTTCACAAATAGCTCCTTCAGAAGACACTGCAACAGAATCTGTGAGAGGAGTAGTTCAATTATCTACTGTTGCACAAACTCGTCAAGGAACTCTCCGCGAAGGATATGCAATTTCTCCGTATACCTTTATGAATTCTGTTGCAACACAAGAATATAAGGGTGTTATACGTCTAGGAACACAAGCAGAAATTAATAATAATTTGGGGGATGTTGCAGTAACAGGTGAAACACTAAATGGTCGAGGAGCTACCGGTTCTATGCGTGGGGTAGTAAAATTAACGACGCAAGCTGGTATTGCTCCTGAAGGTGATAGCTCTGGGGCATTAGCCTGGAACGCAGATGTAATTAATACTCGTGGTGGACAAACTATTAATGGTTCTTTAAATTTAGATCATCTCACAGCAAATGGAATTTGGTCGCGCGGCGGAATGTGGAAAAATGGCGATCAACCTGTTGCCACTGAAAGATATGCGTCCGAAAGGGTTCCAGTTGGAACTATTATGATGTTTGCTGGAGATTCAGCTCCTCCTGGTTGGATTATGTGTCATGGTGGAACCGTGTCAGGAGATCAATATCCTGATTATAGAAACACAGTTGGAACAAGATTTGGTGGTGATTGGAATAATCCTGGCATTCCTGATATGCGAGGTCTTTTTGTTAGAGGAGCTGGCACGGGCGGTCATATTTTAAATCAACGTGGACAAGATGGTTATGGGAAGGATAGACTTGGTGTAGGATGTGACGGAATGCATGTTGGTGGCGTTCAGGCACAACAAATGTCATACCATAAACATGCTGGTGGTTGGGGAGAATATAACAGAAGTGAAGGTCCATTTGGCGCGTCTGTTTATCAAGGATATCTTGGAACTAGAAAATATGCCGACTGGGATAACGCTTCATACTTCACCAATGATGGATTTGAATTAGGTGGACCGAGAGATGCCCTTGGTACACTTAATCGTGAAGGATTAATTGGTTATGAAACTAGACCATGGAATATATCATTAAACTATATTATTAAAATTCATTACTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
8a5919a81dcf97aa53a810a10508ed8827f657ef6d3a42764b3ac912441efad4
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6650
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete genome sequences of 17 Escherichia coli bacteriophages isolated from wastewater, pond water, cow manure and bird feces Vitt,A.R., Ahern,S.J., Gambino,M., Holst Sorensen,M.C. and Brondsted,L. 2022-10-20 GenBank