UniProt accession
A0A2Z3DNR8 [UniProt]
Protein name
Putative tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,89
TF
Evidence RBPdetect2
Probability 0,67
Protein sequence
MGAGGFRKNTGRNNTTLPYNVGFLNRVEDTETYNTAVYDELQKVSTATNQMFQAIDEIHDEIDVRIKALNAMNLQFDELENRITTEIETAIADIQTQMGNLSTDDIWDNSTNPPTKLNGTVAGIKTSIEGNDLKIQTVQGIVNEQGQEIALVQTELTTQGQKIVDVDGRVTTVQNSLSQYMKLTEYEATWGVNSSVNGRYAGVKLTNNGTNSAFQVTAQKFIVGDGSSGNTPFVFENGRARMEFADIKNVNITTAMIANARIQFAQIDNVWIQDGQIANLTANKITAGSMSGSNWRLTVGGDFVMGGTGGAQLWMNGNRIDFYDGSGALRIRIGSW
Physico‐chemical
properties
protein length:336 AA
molecular weight: 36707,38440 Da
isoelectric point:4,68702
aromaticity:0,07440
hydropathy:-0,36518

Domains

Domains [InterPro]
DC_0177
ATT
1–159
A0A2Z3DNR8
1 336
Architecture
ATT
RBD
ATT 1-159 | RBD 174-318 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EP335
[NCBI]
2070199 Uroviricota > Caudoviricetes > Mktvariviridae > Nieuwekanaalvirus > Nieuwekanaalvirus EP335
Host Escherichia sp.
[NCBI]
1884818 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AVZ45101.1 [NCBI]
Genbank nucleotide accession
MG748548 [NCBI]
CDS location
range 19729 -> 20739
strand +
CDS
ATGGGTGCTGGAGGTTTTCGTAAGAATACTGGACGTAACAACACCACACTTCCCTACAACGTTGGGTTTCTCAACAGGGTAGAGGATACAGAAACGTATAACACTGCTGTCTATGATGAATTACAGAAAGTCAGTACAGCGACTAACCAAATGTTCCAAGCAATCGACGAGATTCATGATGAGATTGATGTTCGTATCAAAGCTCTTAACGCAATGAATCTCCAGTTCGATGAGCTTGAGAATCGAATTACTACAGAGATTGAAACAGCTATTGCAGACATTCAAACTCAAATGGGCAATCTATCTACGGATGATATTTGGGATAACTCAACGAATCCTCCTACTAAATTGAACGGTACGGTAGCTGGTATTAAAACTAGCATTGAAGGTAATGATTTAAAGATTCAAACTGTTCAAGGCATTGTGAATGAGCAAGGACAAGAGATTGCTTTAGTTCAAACAGAACTTACAACTCAAGGTCAGAAGATTGTGGATGTTGATGGAAGGGTTACAACTGTACAGAACTCTCTTTCTCAATATATGAAGCTTACCGAATATGAAGCTACTTGGGGTGTGAACTCAAGTGTTAACGGTCGGTATGCAGGAGTTAAGTTAACTAACAACGGAACGAACAGTGCTTTCCAAGTAACTGCTCAGAAGTTTATTGTCGGTGATGGTAGCTCTGGTAACACCCCTTTCGTCTTTGAGAATGGTAGGGCTAGAATGGAGTTCGCTGATATAAAGAATGTCAACATCACAACTGCCATGATTGCTAATGCTCGTATCCAGTTTGCTCAGATTGATAATGTATGGATTCAAGATGGTCAGATTGCTAACCTAACAGCTAACAAGATTACTGCTGGTAGTATGTCCGGCTCTAACTGGAGACTTACAGTAGGTGGGGATTTTGTAATGGGCGGAACAGGTGGGGCACAGCTTTGGATGAATGGAAACAGAATTGATTTCTATGATGGGAGTGGTGCTTTGAGAATTAGAATAGGGAGTTGGTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
a90b53fab25500b2e6de0c48cf1fce20f4a3db2d8bb8d034c1f845f910db255c
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7189
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50