UniProt accession
C8XUS9 [UniProt]
Protein name
Conserved hypothetical phage protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MNPQFSQPKGTVSKETNKDSIARKFGCKKSEVLYAKPGSTLTGYKVIYDKVPQRSYALPSNLPAGATITSMADGILVHNTGTVDLGVLAVLRKEFVTLVENFDSGFTIRVRNEVVSDGTSLYRWGGTLPKSVNAGGTIANTGGLSSSNWVKLDPDALRDELYSGFYRLTVIDPSMSQSLIQTKLSQGGTFQFLPGNYTVTGQALAFYPNSEIYAHSGALFLAGENNVGVLTIDPSRSGGNAVRNCKLHNIRISLNGKTGCVAFRAKYWRNHGHLDMMWVDMGTAINNIGIEIGTLCYGLKIDGCEVIGGGAGSSRLIVQNGANAIVITGFDGYSGAPEVDMPDYGIIVRHSTDGGTSWDYTTTFPTEAVCFVNGFSQNTTKYGFLDQAKGTKVYGMYYENNSISDVRISGSQDATFRDCTHSSPSQGTLAHGYSITNSTNSLIDEPIWGTRAGGFFDIGGAQDAGVTGHIVNIARWGASRGIDDLGVVTACKLSRGKLPLQLTTDSVDINSGYGCYRRNVSSGSNIAFTGTPYDGMELTFLLRGSNIASLSIAGVPVDVTGANTATTKMNLVKMVYSKVTSTWVINAGQWNASA
Physico‐chemical
properties
protein length:594 AA
molecular weight: 63375,52420 Da
isoelectric point:7,96115
aromaticity:0,09091
hydropathy:-0,12980

Domains

Domains [InterPro]
G3DSA:3.30.2020.50
ATT
1–100
IPR040775
RBD
94–151
C8XUS9
1 594
Architecture
ATT
STR
ATT 1-164 | STR 165-594
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
C8XUS9
1 594
Domain Start End Length (AA) Confidence
N-terminal 1 178 178 0,9929
Central domain 179 494 317 0,9864
C-terminal 495 594 99 0,9862
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-178
Central
179-494
C-terminal
495-594

Taxonomy

  Name Taxonomy ID Lineage
Phage Shigella phage Ag3
[NCBI]
637730 Uroviricota > Caudoviricetes > Pantevenvirales > Aglimvirinae > Agtrevirus
Host Shigella boydii
[NCBI]
621 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Genome Context

Genome Context

Tertiary structure

PDB ID
60084ebe19898dc25fa092b21e63448d442f661dfb480a0ce7f1b50f063b54fb
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7763
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50