UniProt accession
A0A2Z3DUT9 [UniProt]
Protein name
Putative tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,90
TSP
Evidence RBPdetect2
Probability 0,73
Protein sequence
MALYPIKSLGAVGVIADQAPTDLAPNAFTNAMNARFVEQRVFKTGGNAPLSYVEEDKDLTPLSFVSMPFDYYSAGNSFLVVGTDKKLYKLTDESLTDISRKVATVTKKASAIIKIYPVVSRIVPKESTITMNFNQTKELEVQVFPEDANNANLTWEVSNPSYASIAVNPTDSKKATLTTLSTEGTLSITVSIEDESVTAQISVNIVDGDTGIFLSQDTITIRRGGTTTLTAISGKTPITWISSNGGALSVTPNANTLTAVLNAMGEGTFTVTADNGSKSATCTVNVIPQIDSISLSQTDVQMDRGTQYVLTATVNPADAPNKAITWTSSNPNIATVSGTSTEATITGLLAGFTEITAVTEEGSRSAVCTVRVNLAGRMLNTRSLAMAASAPLVEEFKEEEEPVVQNEEVVYFMSDSMGIDTSGMAEGNNFFDYSNVFDMEGFARAAENSRAAPLTNVTLDIVEASLDVGEEIVITATAAPEGDYSYQWVVDKSGYVSTTSTTGRSLKLTAVRKGEIKVTCTASQMTQRDYDAFDDYPWYHAVISNCAVATTHYETPQVKEFESEYFTDLPGWGEQTIVDGDGNPSVRKFNWKCERVRAFNNRLFALNMRESNASGVTTHYPLRLRWSNFANENKAPTLWDDYAYDRLTTSDLSANIVGQTEALENGYAGYIDLADSNGSLIDVLPLKDYLFVYTEFETYIGSPTNNTYQPLMFKKLFNDSGILAPECVVEVEGGHFVVTQNDVILHNGASKKSIASNRVKNMLINEVCLVNPLATRVHLHQDKKEVWVMYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERGPVWTDFQTITWDDPAIDKLVWRKDATNFRQRITIVGSFLRGFYQVDVGALDYFYDRANDKIIERPLEMRLERTGIDFDNVTNEWNQKHINRFRPQTTGSGTYIFEAGGSQFSNEYGHNHTTKSYTIGVDRHVAVRLNHPYLFYNVIDNDVNSNAAINGLTIEFNVGGRR
Physico‐chemical
properties
protein length:1006 AA
molecular weight: 110891,69280 Da
isoelectric point:4,80701
aromaticity:0,09642
hydropathy:-0,24503

Domains

Domains [InterPro]
DC_0191
STR
12–1006
G3DSA:2.60.40.1080
STR
118–206
IPR008964
RBD
290–372
IPR003343
STR
291–370
A0A2Z3DUT9
1 1006
Architecture
STR
STR 12-1006
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A2Z3DUT9
1 1006
Domain Start End Length (AA) Confidence
N-terminal 1 525 525 0,9362
Central domain 526 724 200 0,3256
C-terminal 725 1006 281 0,5442
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-525
Central
526-724
C-terminal
725-1006

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EP335
[NCBI]
2070199 Uroviricota > Caudoviricetes > Mktvariviridae > Nieuwekanaalvirus > Nieuwekanaalvirus EP335
Host Escherichia sp.
[NCBI]
1884818 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AVZ45100.1 [NCBI]
Genbank nucleotide accession
MG748548 [NCBI]
CDS location
range 16657 -> 19677
strand +
CDS
ATGGCGTTATACCCTATAAAATCACTTGGGGCTGTCGGTGTTATCGCTGATCAGGCACCAACTGATTTAGCACCTAACGCTTTCACCAACGCTATGAACGCTCGGTTTGTTGAGCAGAGAGTTTTTAAGACGGGGGGCAATGCCCCTCTTTCTTACGTGGAAGAAGACAAAGATCTGACTCCACTCTCTTTTGTCTCCATGCCTTTCGATTATTATAGCGCAGGAAATAGCTTCCTTGTAGTAGGTACAGATAAGAAGTTATATAAACTGACAGATGAAAGCTTAACTGATATCAGTCGTAAAGTTGCTACGGTAACTAAGAAAGCTTCTGCTATCATAAAGATTTATCCAGTGGTCTCAAGGATTGTTCCTAAAGAGAGTACTATCACAATGAACTTTAACCAGACAAAAGAGTTAGAAGTTCAGGTTTTTCCAGAGGATGCTAATAATGCTAATCTGACTTGGGAAGTAAGTAACCCTTCTTATGCCAGTATTGCAGTAAATCCTACAGATTCTAAAAAAGCCACCCTCACTACATTATCTACAGAAGGAACACTGTCCATTACTGTTTCCATTGAAGATGAATCTGTGACAGCTCAAATCTCCGTTAACATTGTTGATGGGGATACGGGTATCTTCTTGAGTCAAGACACAATCACAATTCGAAGAGGTGGTACAACAACTCTTACTGCTATCTCAGGTAAGACTCCTATCACTTGGATTAGTAGCAATGGTGGTGCTTTGTCTGTGACACCCAATGCCAATACACTAACTGCTGTTCTCAATGCCATGGGAGAAGGAACTTTTACTGTTACAGCTGATAATGGCTCTAAGTCTGCTACCTGTACAGTTAATGTGATACCTCAGATTGATTCTATTTCTCTGAGTCAGACAGATGTTCAGATGGATAGAGGGACTCAGTATGTTCTAACTGCAACAGTCAACCCTGCTGATGCTCCTAATAAAGCAATCACTTGGACTTCTTCCAATCCTAATATTGCTACAGTATCAGGGACAAGCACAGAGGCTACAATTACTGGACTTCTGGCTGGGTTTACAGAGATTACAGCAGTAACAGAGGAAGGTAGTCGTTCAGCTGTTTGTACTGTTCGTGTCAACCTAGCAGGTAGAATGCTAAATACCAGAAGCCTAGCGATGGCTGCTAGTGCACCTCTAGTGGAAGAGTTCAAAGAGGAAGAAGAACCAGTTGTGCAGAATGAAGAAGTTGTTTACTTCATGTCTGACTCTATGGGAATTGATACCTCTGGCATGGCTGAAGGCAATAACTTCTTTGACTACTCTAACGTATTTGATATGGAAGGTTTTGCTCGTGCTGCGGAGAACTCAAGAGCTGCTCCTCTGACAAATGTGACACTAGATATTGTTGAAGCTTCTCTAGATGTAGGTGAAGAAATTGTCATAACTGCTACAGCAGCTCCAGAAGGGGATTACTCCTATCAGTGGGTTGTTGACAAGAGTGGTTATGTTTCTACTACCTCAACAACTGGAAGATCTTTGAAACTTACAGCTGTTCGTAAAGGCGAGATTAAAGTTACATGTACGGCGAGTCAGATGACTCAAAGAGACTACGATGCTTTTGATGATTACCCTTGGTATCATGCAGTAATCTCTAACTGTGCAGTAGCGACAACTCACTATGAAACTCCTCAGGTTAAAGAATTCGAATCTGAATACTTTACAGACCTTCCGGGCTGGGGTGAACAAACAATTGTTGATGGTGATGGGAACCCTTCTGTTCGTAAGTTTAACTGGAAGTGCGAAAGGGTTAGAGCTTTTAACAACAGATTGTTTGCTCTGAATATGAGGGAATCTAATGCCTCTGGTGTTACCACTCACTATCCTTTACGTCTTCGCTGGTCTAACTTTGCGAACGAGAACAAGGCTCCTACTTTGTGGGATGATTATGCTTACGATCGACTGACAACTTCTGATCTTTCAGCGAACATTGTTGGGCAGACTGAAGCTCTTGAGAATGGTTATGCAGGGTATATTGATCTGGCTGACTCTAACGGTAGTTTGATTGATGTTCTCCCTTTGAAAGATTACTTATTTGTTTACACCGAGTTTGAAACCTACATCGGTTCTCCTACTAACAACACATACCAGCCTCTGATGTTTAAGAAGCTGTTTAACGATTCAGGTATTCTTGCTCCTGAGTGTGTGGTTGAAGTAGAGGGTGGTCACTTTGTTGTAACACAGAACGATGTGATTCTTCATAACGGTGCATCTAAGAAATCTATTGCATCTAACCGTGTCAAGAACATGCTTATTAATGAAGTGTGTTTGGTAAACCCTCTAGCTACTAGAGTTCACTTGCACCAAGATAAGAAAGAAGTTTGGGTCATGTATGTTGGGCCGGGAGAGCCGAAAGAAAGTTTTGCTTGTACGAAGGCTGCGGTCTGGAATTACGAGTTTGATACTTGGTCTTTCCGTACTATCCCGTATGCTCAATGTATTGGTCTTGTTGATCCTCCTGTTCTCGAAAGAGGTCCAGTGTGGACTGACTTCCAAACTATCACTTGGGACGACCCTGCTATTGATAAACTGGTGTGGAGAAAGGATGCAACTAACTTCCGTCAGAGAATTACTATCGTAGGCTCTTTCTTAAGGGGTTTCTATCAAGTAGATGTTGGTGCTTTGGATTATTTCTATGACAGAGCGAATGACAAAATAATAGAGCGCCCTCTGGAAATGAGGTTAGAGAGAACAGGGATTGATTTTGATAACGTCACTAACGAATGGAATCAAAAACACATCAACCGGTTCAGACCTCAGACTACAGGTTCTGGTACGTATATCTTTGAAGCTGGAGGTAGTCAATTCTCTAACGAGTATGGTCACAACCACACAACTAAGAGTTATACGATTGGAGTTGATAGGCACGTAGCTGTGAGACTGAACCATCCATACCTATTCTATAATGTTATAGATAATGATGTTAACAGTAACGCAGCCATAAATGGGCTGACAATAGAGTTTAATGTTGGCGGTCGAAGATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
4853aef1eb2ab4040ab135726b6911b36134690d30c9c9f8453570c4453dff78
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7689
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50