Genbank accession
AIZ02037.1 [GenBank]
Protein name
long tail fiber distal subunit
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,78
TF
Evidence RBPdetect2
Probability 0,94
Protein sequence
MATLKQIQFKRSKVAGVRPTPAQLAEGELAINLKDRLLFTKDDSGAIIDLGFAKGGNIDGNVIHKGNYNQTGNYTLNGTFTQTGNYTTTGSVTANGDVTAKSRLMTDNGEVLVRGAGTAHVRFQDLADARERGIIYSQSRPGNTKQILNVRVQDYTNSTSNIFAFNGDGLFYAPSISGGTSIKSPVIYTNTVDTGNKSALDYDISSLANNNSNTDKNNLRVVRTDATAAMLHEICENNGISWYSGSTPTDYILSFAYSGGFQAGHSIAVGMESGPMTYSALGKGSIALGDNDTGLKWHQDGYFHTVNNGTRTFIYGPEETASLRKMVIGYSVNGTDLTTPPTENYALGTVVTYHDNNAFGDGQTLLGYYQGGAYHHYFRGKGTTNINTAGGLLVTPGNIDVIGGLINIDGRSNASTLLFSSYTSGQSSVDNMNIRVWGDTFATVGGTRKNVMEISDATSWMHYIQRTTAGKVESYLNGAMNIVEGLSVGQDASLKRNLYVSNEIKVRGSSGLRIWNDKYGVIFRNSEDQLHIIPTNINAGESGGLGPLRPLSITLDTGRVKIPNLEADQVYFRGNGALEFPNSNGASYANQNTTKALLYQTLDAATQAFYPITKQKNIDSNVTVTQGMDRATSEYRIVAQGDLLGDGDATGLKYWRFTKEGNFITQNHLYAGTAFLDTEGNISGSIWNKYSGATNLDAAVNTRVGKSGDTMTGKLTIEAPGDALVLRTTTGNSSHIRSDVNGTGNWYVGKGGDDNGIALYSYATNSGMYITNSGAFNVELSGSGLAMQMNYYRMYVNGRQWVASQGHGYGNQWQTEAPYFVDFGEAVPKDSYMPIIKGRSTLITDGYSTKADFGIIRSAGASTWGSAVIRVGSAESGDASHPNAIYVFGADGTLTAPNAVNAASRMGVGTSCSLPNASIAIGDNDTGLSTRGDGSLGAYANAQSVFFWEAGGITTEQNKFLNVNAGMYVRDNIDVNDVYIRSDIRCKSEIKLIENAQEKSKLLGGYTYLLKNSVTDEVKPSAGLIAQEVQEVLPELVSEDKETGLLRLNYNGIIGLNTATINEHTDEIKELKSEIAELKALIKSLL
Physico‐chemical
properties
protein length:1086 AA
molecular weight: 116589,01420 Da
isoelectric point:5,64249
aromaticity:0,08748
hydropathy:-0,33094

Domains

Domains [InterPro]
DC_0538
STR
1–742
G3DSA:6.20.80.10
STR
721–777
IPR030392
CHP
982–1075
AIZ02037.1
1 1086
Architecture
STR
RBD
STR 1-777 | RBD 786-1086
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
AIZ02037.1
1 1086
Domain Start End Length (AA) Confidence
N-terminal 1 609 609 0,0742
Central domain 610 808 200 0,2750
C-terminal 809 1086 277 0,8211
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-609
Central
610-808
C-terminal
809-1086

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vb_EcoM-VR5
[NCBI]
1567026 Uroviricota > Caudoviricetes > Pantevenvirales > Tevenvirinae > Dhakavirus
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Genome Context

Genome Context

Tertiary structure

PDB ID
2d9deb235b047ccc3b7d3ad4ab26e0a81930d0d9e7974cc991c1cbfff35efbc5
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5526
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50