GenBank accession
ADO19603.1 [GenBank]
Protein name
long tail fiber distal subunit
RBP type | Evidence | Probability
TF | UniProt/TrEMBL | 1.00
TSP | DepoScope | 1.00
TSP | RBPdetect | 0.71
TF | RBPdetect2 | 0.91
Protein sequence
MATLKSIQFKRSKTPGAKPTAAQLDEGELAINLRDRTIFTKSDQGQIIDLGFAKGGTIDGDVLQNGTFNLNGNQFVYAGKYIEFLPKTTGNGAWANQHLNKAPIFTDLSSTTSVSEYHPLIKQRYKDGTFSVGTLVSEGSFRVHYIDSTAPDNSKHWVFNRNGNFIVDSGNIEVRTGNISASGNINSANGIVSAPQVITKNIILDSKNFGQYDSQSLVNYVYPGTGETNGVNYLRKVRAKSGGTMWHELCTAQTGQADELSWWTGNTPTSKQYGIRNDGRMAGRNSLALGTFTTDFPSSDYGNFGIMGDKYLVLGDTVTGLKYIKQGVYDLVGGGYSVASITPDSFRSTRKGLFGRSEDQGETWIMPGTNKALLSVQTQADNNAAGDGQTHIGYNSGGKMNHYFRGKGKTNINTQKGMEVNPGILKLVTGSDNLQFYANGTISSIQPIKLDNEIFLTTSNNTAGLKFGAPSGVNETRAIQWNGGTREGQNKNYVIVKAWGNSFNAAGDKSRETVFQVSDGQGYYFYAHRKAPTGDETIGRIEAQFAGALNAKSINAIENFKVNGLSTLVGGVTMSNGLNLTGGANISGPVKIGGVTNALRIWDSRYGAIFRRSETSLHIIPTNENEGENGAISNLRPFSIELGTGTVIMGDKSTGGPLFTVDNVSKFVQTDCRFRVNMDSDGIVVNASSQAASNFIQGRKADVTKWYLGIGDGGNVVRMHNYTYSHGIALNSDTVDITKPLKVGNAQLGTDGNITGGSGNFGNLNTTIENMKADIVTSYPVGAPIPWPSDSVPDGFALMEGQTFDTGAYPKLAIAYPTGTIPDMRGQTIKGTPSGRAVLSAEADGVKSHNHSASASTTALTGTTNGTDLGTKTVSTVDIGRKYTNNAGAHTHTFSGTTSTNGDHNHPASLGNNANVQSGRFAASNSGQSAIAYTNNAGNHSHTFSGTTSAGPEHSHYVDIGSHNHTVAIGSHSHTFSIAAHGHTITVNNTGNTENTVKNIAFNYIVRLA
Physico-chemical properties
protein length: 1009 AA
molecular weight: 107350.38520 Da
isoelectric point: 8.64361
aromaticity: 0.08325
hydropathy: -0.41695
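
The page does not say how these properties were calculated. The sketch below assumes two common definitions: aromaticity as the fraction of F, W, and Y residues, and hydropathy as the GRAVY score, which is the mean Kyte-Doolittle value per residue. The function names are illustrative, and the example uses only the first ten residues of the sequence above, so its results are not the full-length values.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (GRAVY)."""
    return sum(KD[aa] for aa in seq) / len(seq)


# First ten residues of the protein sequence, used as a toy input.
toy = "MATLKSIQFK"
print(len(toy), round(aromaticity(toy), 5), round(gravy(toy), 5))
```

Passing the full 1009-residue sequence instead of `toy` gives values that can be compared with the ones listed, provided the same definitions were used.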

Domains [InterPro]
Accession | Type | Residues
DC_0538 | STR | 1–1007
G3DSA:6.20.80.10 | STR | 682–739
IPR048388 | ATT | 683–756
IPR051934 | Unmapped | 715–1009
IPR011083 | ATT | 782–829
Architecture (ADO19603.1, residues 1-1009)
STR 1-682 | ATT 683-756 | STR 757-776 | ATT 777-830 | STR 831-1008
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
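
An architecture string in this `LABEL start-end | ...` form can be parsed and checked against the protein length. The sketch below assumes 1-based inclusive coordinates. The helper names `parse_architecture` and `uncovered` are illustrative and not part of any tool referenced on this page.

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'STR 1-682 | ATT 683-756 | ...' into (label, start, end) tuples."""
    segments = []
    for chunk in text.split("|"):
        chunk = chunk.strip()
        if not chunk:  # tolerate a trailing separator
            continue
        label, span = chunk.split()
        start, end = (int(x) for x in span.split("-"))
        segments.append((label, start, end))
    return segments


def uncovered(segments: list[tuple[str, int, int]], length: int) -> list[tuple[int, int]]:
    """Return residue ranges (1-based, inclusive) not covered by any segment."""
    gaps = []
    expected = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > expected:
            gaps.append((expected, start - 1))
        expected = max(expected, end + 1)
    if expected <= length:
        gaps.append((expected, length))
    return gaps


arch = "STR 1-682 | ATT 683-756 | STR 757-776 | ATT 777-830 | STR 831-1008"
segments = parse_architecture(arch)
print(uncovered(segments, 1009))  # → [(1009, 1009)]
```

Under that convention the five segments are contiguous, and only the final residue, 1009, is not assigned to a segment.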

Tail Spike Domain Segmentation

This protein is segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout (ADO19603.1, residues 1-1009)
Domain | Start | End | Length (AA) | Confidence
N-terminal | 1 | 718 | 718 | 0.0686
Central domain | 719 | 917 | 200 | 0.0918
C-terminal | 918 | 1009 | 91 | 0.9941
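
The listed lengths can be compared with the lengths implied by the start and end coordinates. The sketch below assumes 1-based inclusive coordinates, so that length = end - start + 1. The page does not state its coordinate convention, so a mismatch may only mean that a boundary is counted differently.

```python
# (domain, start, end, listed length) as given in the segmentation table.
rows = [
    ("N-terminal", 1, 718, 718),
    ("Central domain", 719, 917, 200),
    ("C-terminal", 918, 1009, 91),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


# Collect rows whose listed length differs from the inclusive length.
mismatches = [
    (name, listed, inclusive_length(start, end))
    for name, start, end, listed in rows
    if listed != inclusive_length(start, end)
]

for name, listed, computed in mismatches:
    print(f"{name}: listed {listed}, inclusive range gives {computed}")

# Both the listed and the computed lengths sum to the protein length.
print(sum(r[3] for r in rows), sum(inclusive_length(r[1], r[2]) for r in rows))
```

The central and C-terminal rows each differ by one residue in opposite directions, while both sets of lengths sum to 1009. This is what a one-residue shift in the boundary between those two domains would produce.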
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue, 1-718), Central (green, 719-917), C-terminal (pink, 918-1009).

Taxonomy

Phage: Shigella phage SP18 (NCBI Taxonomy ID 645664)
Lineage: Uroviricota > Caudoviricetes > Pantevenvirales > Tevenvirinae > Gaprivervirus
Host: Shigella sonnei (NCBI Taxonomy ID 624)
Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

No CDS data available.

Genome Context

Tertiary structure

PDB ID 41b008d3e8632e9db9d9609f9a5aa78aa31ed01cb9d232765b683dd6df7cccf5
Source ESMFold
Method ESMFold
Resolution 0.5479
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
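
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. The legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong. This sketch assumes they fall into the lower band. The function name `plddt_band` is illustrative.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: boundary values (exactly 90, 70, 50) go to the lower band,
    because the legend only defines the bands with strict inequalities.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Hypothetical per-residue scores, for illustration only.
example_scores = [95.2, 83.0, 61.5, 42.7]
print([plddt_band(s) for s in example_scores])
# → ['Very high', 'High', 'Low', 'Very low']
```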