UniProt accession
A0A193GYN5 [UniProt]
Protein name
Tail fibers protein
RBP type Evidence Probability
TF GenBank 1,00
TF Phold 1,00
TSP DepoScope 1,00
TF RBPdetect 0,88
TF RBPdetect2 0,95
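The evidence rows above can be summarized by counting how many sources support each RBP type. The snippet below is a minimal sketch, not the database's own scoring method: the unweighted tally and the names `PREDICTIONS`, `parse_probability`, and `tally` are assumptions introduced here for illustration. It also shows how the decimal-comma probabilities on this page can be read as floats.

```python
from collections import Counter

# (source, predicted RBP type, probability as printed with a decimal comma)
PREDICTIONS = [
    ("GenBank", "TF", "1,00"),
    ("Phold", "TF", "1,00"),
    ("DepoScope", "TSP", "1,00"),
    ("RBPdetect", "TF", "0,88"),
    ("RBPdetect2", "TF", "0,95"),
]


def parse_probability(text: str) -> float:
    """Convert a decimal-comma string such as '0,88' into a float."""
    return float(text.replace(",", "."))


def tally(predictions):
    """Count how many sources support each RBP type (unweighted; an assumption)."""
    return Counter(rbp_type for _, rbp_type, _ in predictions)


counts = tally(PREDICTIONS)
for rbp_type, n in counts.most_common():
    print(f"{rbp_type}: {n} of {len(PREDICTIONS)} sources")
```

Four of the five sources call this protein a tail fiber (TF); only DepoScope labels it a tail spike (TSP).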
Protein sequence
MATNIKTVMTYPLDGSTTDFNIPFEYLARKFVRVTLIGVDRKELILNQDYRFATKTTISTTRALGPADGYNMVEIRRFTSATDRLVDFTDGSILRAYDLNISQVQTLHVAEEARDLTADTIGVNNDGNLDARGRRIINVADGVDLGDVITLGQVTRWNESALNSKNAAATSQAAAKTSETNAKTSETNAKTSETNAKTSETNAKTSETNAYQWAQRAEDSPVQGSEFSSYHYSRKSAKSAAAASVSEDNAKVSENNSKTSETNAKTSETNAAASAQTAKDEAAKLTNMNDFAAAIDSVNGNNVVMKGDFTIKGNGKVSILNHSGLYQDKTVNGVNYANRIYMDDAGAMHFENYRVSGTSSTLRAGFVITDTNRFDFTGPTQVNGDIIAYVNAPPNPAIGQYLNSVAVRSQLRGRGAENDPLGAYCGMYIQEHVGTDHAIILNLNGFSKDTNWQLFSNGKISTPLGDVMTSGSDVRIKDDITKPLDGAGERIDAIGIVEYTEIATGERKRGWLAQQLDTIDPLYTYLTTTAEGSLLNTNDRALLADVITELQALRQRVKELEG
Physico‐chemical properties
protein length: 562 AA
molecular weight: 60935,62540 Da
isoelectric point: 5,15447
aromaticity: 0,06762
hydropathy: -0,46441
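The page does not say how these values were computed. A common convention, assumed here, is that aromaticity is the fraction of F, W, and Y residues and that hydropathy is the GRAVY score (the mean Kyte-Doolittle value over all residues). The sketch below implements both under that assumption; the function names are illustrative, and the toy inputs should be replaced by the protein sequence above to compare against the listed values.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Toy inputs; substitute the full protein sequence from the record.
print(aromaticity("FWYA"))
print(gravy("AI"))
```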

Domains

Domains [InterPro]
Accession Type Range
IPR005604 ATT 1–131
DC_0474 STR 1–561
Architecture
ATT 1-131 | STR 132-562
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (A0A193GYN5, residues 1-562): N-terminal, Central, C-terminal
Domain Start End Length (AA) Confidence
N-terminal 1 177 177 0,9882
Central domain 178 376 200 0,5690
C-terminal 377 562 185 0,9165
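The Length column can be cross-checked against the Start and End columns. The sketch below assumes the coordinates are 1-based and inclusive, which the page does not state; the names `DOMAINS`, `span_length`, and `find_mismatches` are illustrative only.

```python
# (domain, start, end, listed length) copied from the table above.
DOMAINS = [
    ("N-terminal", 1, 177, 177),
    ("Central domain", 178, 376, 200),
    ("C-terminal", 377, 562, 185),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range (assumed convention)."""
    return end - start + 1


def find_mismatches(domains):
    """Return (domain, listed, computed) for rows whose lengths disagree."""
    return [
        (name, listed, span_length(start, end))
        for name, start, end, listed in domains
        if span_length(start, end) != listed
    ]


mismatches = find_mismatches(DOMAINS)
for name, listed, computed in mismatches:
    print(f"{name}: listed {listed}, computed {computed}")
```

Under the inclusive convention, the Central domain and C-terminal rows each differ from the listed length by one residue. The listed lengths do sum to 562, the full protein length, so the table may follow a different boundary convention for the Central/C-terminal junction.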
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-177
Central 178-376
C-terminal 377-562

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage ECA2 [NCBI] 1852630 Uroviricota > Caudoviricetes > Autographivirales > Studiervirinae > Teetrevirus
Host Escherichia coli [NCBI] 562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Genbank protein accession
ANN86232.1 [NCBI]
Genbank nucleotide accession
KX130726 [NCBI]
CDS location
range 10856 -> 12544
strand +
CDS
ATGGCTACAAATATTAAGACCGTGATGACTTACCCGCTGGATGGCTCCACTACGGACTTTAATATTCCGTTTGAGTATCTGGCTCGTAAGTTCGTCCGAGTGACCCTTATCGGTGTTGACCGAAAGGAACTCATCTTGAATCAAGACTATCGTTTTGCAACTAAGACCACAATCTCAACAACGAGAGCGTTGGGGCCAGCGGACGGTTATAACATGGTTGAAATCCGTCGATTTACCTCCGCTACCGATAGGCTGGTTGACTTCACCGATGGTTCAATCCTTCGGGCATACGACCTGAACATCTCTCAGGTTCAAACCCTTCACGTTGCAGAAGAGGCTCGTGACTTAACGGCTGATACCATTGGTGTGAACAACGATGGCAACCTTGATGCTCGTGGTCGTCGTATCATAAACGTAGCTGATGGTGTTGACCTCGGTGATGTTATCACTTTAGGTCAAGTCACTCGGTGGAATGAGTCTGCCCTGAACTCTAAGAACGCCGCTGCTACCTCTCAGGCCGCTGCCAAGACCTCAGAGACCAACGCTAAGACCTCAGAGACCAACGCTAAGACCTCAGAGACCAACGCTAAGACCTCAGAGACCAACGCTAAGACCTCAGAGACCAACGCGTACCAATGGGCGCAGCGTGCAGAGGACTCTCCGGTGCAGGGTAGCGAGTTCTCCTCATATCACTACTCACGCAAATCCGCTAAGAGTGCTGCCGCCGCGAGTGTTTCTGAAGATAATGCTAAGGTGTCCGAGAATAACTCGAAGACCTCTGAGACCAACGCTAAGACCTCTGAGACCAACGCTGCGGCCTCTGCTCAGACAGCTAAGGATGAGGCGGCTAAGCTGACCAACATGAATGACTTTGCTGCCGCTATTGATTCAGTCAACGGTAACAATGTTGTCATGAAGGGTGACTTCACCATTAAAGGAAACGGTAAAGTATCAATACTGAACCACAGTGGGCTATACCAAGATAAGACCGTGAATGGCGTGAACTATGCCAACAGGATTTATATGGATGATGCAGGGGCCATGCACTTCGAGAACTACCGTGTGTCTGGTACTTCATCCACCTTACGTGCTGGTTTTGTTATTACAGATACCAACAGGTTTGACTTTACAGGGCCAACGCAAGTTAATGGTGATATCATCGCATATGTGAATGCGCCCCCTAATCCAGCTATTGGTCAATACCTCAATTCTGTGGCAGTTCGCTCACAGCTTCGGGGCCGTGGTGCTGAGAATGATCCACTTGGTGCGTACTGTGGTATGTATATTCAGGAGCACGTTGGTACAGACCACGCGATAATCCTTAACCTCAATGGTTTTAGCAAGGATACTAACTGGCAGCTATTCTCCAATGGTAAAATATCTACACCACTTGGTGATGTGATGACTTCAGGTTCTGATGTTAGAATCAAGGATGATATCACTAAACCTCTTGATGGTGCTGGTGAGCGTATTGACGCAATTGGCATAGTGGAATACACAGAGATTGCCACCGGGGAACGTAAACGTGGGTGGCTGGCCCAACAGCTTGACACGATTGACCCGCTGTACACATACCTCACGACCACCGCAGAAGGTAGTCTTTTGAATACCAACGACAGGGCGTTGCTTGCTGACGTAATAACTGAGCTTCAGGCACTACGTCAACGCGTTAAAGAACTCGAAGGATAA
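The CDS location can be checked against the protein length. The sketch below assumes the range is 1-based and inclusive and that the last codon is a stop codon (the sequence above ends in TAA); the function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive CDS range (assumed convention)."""
    return end - start + 1


def protein_length(nt_length: int, has_stop: bool = True) -> int:
    """Number of encoded residues, optionally excluding a terminal stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    return codons - 1 if has_stop else codons


nt = cds_length(10856, 12544)
aa = protein_length(nt)
print(nt, aa)  # 1689 562
```

The range 10856 -> 12544 spans 1689 nucleotides, that is 563 codons, which matches the 562-residue protein plus one stop codon.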

Genome Context

Gene Ontology

GO ID Description Category Evidence (source)
GO:0098015 virus tail Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0098671 adhesion receptor-mediated virion attachment to host cell Biological Process IEA:UniProtKB-KW (UniProt)
GO:0046718 symbiont entry into host cell Biological Process IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
5970270a0ca225d92b34539f0682237c633b2f3ae673d68a9bb275987c21fc5f
Source ESMFold
Method ESMFold
Resolution 0,7648
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
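The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch under one assumption: the legend does not say which band the exact boundary values 90, 70, and 50 belong to, so they are assigned to the lower band here. The function name `plddt_class` is illustrative.

```python
def plddt_class(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower band; this is
    an assumption, since the legend leaves them unspecified.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 82.0, 61.5, 34.0):
    print(value, plddt_class(value))
```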