Genbank accession
QHZ59591.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence UniProt/TrEMBL
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MADLEVFSNLGMPSQPFFRNDRFAALANGKIYAGKADTDPTNPANQVQVYVENEDGTLTPIPQPVRTNVAGFPVWNGQVVKLITKIESSMVVADQNDVVIFTFSNLFKYDPVQMWNLLLSQNGAKYVGTPVGNVQEQLNGYVTPWNFVGKAPYATVTAALQAMFDYASANGLDVRADGWKGTTNGILTASNIRIFGGDWTLTGNDPRFVNCEVIGGTWRGRSMWCQNTTLRGVNGRNGHRMRHDGGDVRMFDCWYDGIPGGSNNSHINIQGVQSGGPQIWGTIEIDGCLFTNGYNGIIHQGGNAFMSRGIFRNLAFVNMQGDGIELNVVHQCFQDGCVIENILLDNIDGINPGSFASNWGLGIGVAGKGPYGWDLPDENYARNFTIRNVLVNACKQCIHVEMGRDFVIENVCVNPDVNKSVGTGITTAGIYTSGCKDFVIDGVTGEPVGNATTDVHDLRIIHMEWGPPRQSAPRNYTIRNVKTKTGRVYCPVAADPGDPTAGVNRAPHDNRVKLENIDCYKLTIFGVATQLDMFNVNFWELDAIGDDSAGGTTSSGQYIRTKSVLNMMNVNSLNPVTQAWSKCRYSNINMINCNVEARMYVNIDGSLGAMLGGVSKQFFPDITSHGGYGNFFPCGREFDRGDLVWTYDYGSNDPAGGPNGIVGLKPYLVVEAGAYFPTGNQAKILAASVGQKSITQWLTPNGTSTGSPWLYTTMLSPGTRIVIPGAGAGGADLVTTVVRPPYQTPPDDTGKPIVVDIADPIQTAVPAETQIRMYKQIVTRPRITSGP
Physico‐chemical
properties
protein length:787 AA
molecular weight: 85297,17810 Da
isoelectric point:5,57110
aromaticity:0,09403
hydropathy:-0,21017

Domains

Domains [InterPro]
IPR036730
ATT
4–100
IPR009093
ATT
9–112
IPR036730
ATT
10–109
QHZ59591.1
1 787
Architecture
ATT
ATT
STR
RBD
RBD
ATT 4-112 | ATT 140-175 | STR 176-414 | RBD 415-445 | RBD 463-784 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QHZ59591.1
1 787
Domain Start End Length (AA) Confidence
N-terminal 1 150 150 0,9914
Central domain 151 645 496 0,9936
C-terminal 646 787 141 0,9337
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-150
Central
151-645
C-terminal
646-787

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage sortkaff
[NCBI]
2696445 Uroviricota > Caudoviricetes > Sortsnevirus >
Host Escherichia coli K-12
[NCBI]
83333 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Genome Context

Genome Context

Tertiary structure

PDB ID
3581ded38801d0a5ab3dd2b32ff4b0426c1b57f11b59ab9062b8e88c28388faf
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7074
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50