UniProt accession
G9IIM1 [UniProt]
Protein name
Putative tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MGYFQMTRNVEELFGGVITAPHQIPFTYKSNVGGETFLSLPFYPVTGVVTINGGMQVPLDNFEIEGNTLNLGRALSKGDVVYCLFDKILSPEDTAKGIRIYKFQAVGGETEFTPDFTSYGVQSLYIGGEYKTPEIEYSYNSTTGKVSLQTALTAGVWVVAEMSVKQPNISPAFDRSIQEIARSANVKDSEVIVSTDTISLLDGKKVVYDIATQTSYGLPTIPDGSVISSVSAGKLNYNPGDVQVDLLPLEDSFINVINTLGRNDGAKYIGECHSVADLRNTEPTMDGQRIILKQHTAGTLLGGGVFRALIDGTGKTDNNGTVIKTVGGAAWLRVNADRVNPFMFGALGGSNDDTIPVQSCVDSGKATQLTDAHYVSNIQLKYNTSSIYGSGLHYSRLHQLPSATGNCITIKDTCSLIVLDAFGVYGTGAQQGMPFTAGTTGIYVETPSGLSADYPFHTTADPRRDLCISKVHIAGFDEYGLNIDSGNFSVTTDSLLVNHINQVGVRCATTDWTWTNIQVNTCGKQCLVLDGCGNGSYLLGGKFIWANWQPYGTVGQFPGITINNSQNMVINGIEVQDCGGNGIEISDSYSISMNGLNTNRNGINANNTFYNIVFNKSDAVINGFVGLNYAANSGSGANSSAGNFQFLSNDCSVTINGVVETGYMGINFIGDNNIINPTNSDLSINGLVNYSKTGLQTMNETPTFDGVSTTPVYVSVPSSVGQVNGLRLSQANKDKLLYSRTAGPEGITMAAVVVPTISGAEVFNFMAIGSGFSDTSNSLHLQLVIDASGKQTIALLLGGDGTTQILSGDLPNDLKLQSGVPYHIAIGAKPGYFWWSILNIQTGKRIRRSFRGAYLAVPFNSIFGLTSSLTFFSDSNAGGDACSGVGAKVYVGMFSSENDYVSSRYYNLINPVDPTKLISYRILDSSI
Physico‐chemical
properties
protein length:927 AA
molecular weight: 98991,85770 Da
isoelectric point:4,95604
aromaticity:0,09277
hydropathy:-0,04391

Domains

Domains [InterPro]
DC_0041
STR
4–427
G3DSA:3.30.2020.50
ATT
161–250
G9IIM1
1 927
Architecture
STR
ATT
STR
STR 4-160 | ATT 161-250 | STR 251-927
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
G9IIM1
1 927
Domain Start End Length (AA) Confidence
N-terminal 1 352 352 0,9961
Central domain 353 776 425 0,9740
C-terminal 777 927 150 0,4520
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-352
Central
353-776
C-terminal
777-927

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage PhaxI
[NCBI]
926589 Uroviricota > Caudoviricetes > Pantevenvirales > Cvivirinae > Kuttervirus
Host Escherichia coli O157:H7
[NCBI]
83334 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AEW24390.1 [NCBI]
Genbank nucleotide accession
JN673056 [NCBI]
CDS location
range 122214 -> 124997
strand -
CDS
ATGGGGTATTTTCAAATGACCAGAAATGTAGAAGAATTATTCGGCGGCGTAATCACAGCTCCCCACCAGATTCCTTTCACGTATAAATCAAATGTTGGTGGAGAAACTTTCCTTTCCTTGCCGTTCTATCCTGTCACTGGCGTAGTCACAATCAACGGTGGTATGCAAGTTCCGTTAGACAACTTTGAAATCGAAGGAAATACGTTGAATCTCGGGCGCGCATTGTCCAAAGGCGATGTTGTGTATTGCTTATTCGATAAAATTCTTTCGCCAGAAGATACAGCCAAAGGTATCCGCATATACAAATTTCAGGCCGTGGGAGGTGAAACCGAGTTCACTCCTGATTTCACATCCTATGGTGTCCAATCTCTTTATATCGGTGGCGAGTACAAAACACCCGAAATTGAATATTCCTATAACAGCACGACAGGGAAAGTGTCTTTGCAAACTGCACTGACTGCAGGCGTTTGGGTAGTCGCTGAAATGTCTGTTAAACAACCGAATATCAGTCCGGCGTTTGACCGAAGTATTCAAGAAATCGCCCGTTCTGCTAATGTAAAAGACTCTGAAGTCATCGTTAGTACGGACACCATATCTTTGTTGGATGGGAAGAAAGTTGTTTATGATATAGCGACGCAAACCAGTTATGGTTTACCAACCATTCCTGATGGTTCTGTCATTTCTTCTGTATCTGCTGGGAAATTGAATTACAACCCAGGTGATGTGCAGGTTGATTTGTTGCCTTTAGAGGATTCATTTATTAATGTGATAAACACTCTGGGGCGCAATGATGGTGCCAAGTATATTGGAGAATGCCATTCTGTTGCTGATCTCAGGAATACTGAACCCACTATGGATGGACAACGCATTATTCTTAAGCAACACACTGCGGGTACTCTTCTTGGTGGAGGGGTATTCCGTGCGTTAATTGATGGTACAGGAAAGACTGATAATAACGGTACTGTGATCAAAACTGTTGGCGGCGCGGCATGGTTACGTGTTAATGCTGATAGAGTTAACCCATTCATGTTTGGTGCTTTGGGTGGTTCTAATGATGATACTATTCCAGTACAATCTTGTGTGGATAGTGGTAAGGCCACACAATTAACTGATGCACATTACGTTAGCAATATCCAGTTAAAATATAATACGTCGTCTATTTATGGGTCTGGATTACATTACTCAAGGTTGCATCAGTTGCCTTCTGCTACTGGGAATTGTATTACCATAAAAGATACATGCTCCCTTATTGTATTAGACGCCTTTGGGGTATATGGCACAGGTGCACAACAAGGCATGCCATTTACTGCGGGCACAACAGGTATCTATGTAGAAACTCCTTCAGGTCTCTCAGCCGATTATCCGTTCCACACTACCGCAGACCCAAGACGCGACTTGTGTATTTCTAAGGTCCATATAGCAGGTTTTGATGAATATGGGTTAAATATTGATAGTGGTAACTTTAGTGTTACTACAGATTCTCTTTTAGTCAACCACATCAATCAGGTGGGTGTCCGTTGTGCTACTACTGATTGGACTTGGACAAATATCCAGGTTAATACCTGCGGTAAACAATGTCTGGTTCTTGATGGTTGTGGTAATGGTTCGTATTTATTGGGCGGTAAATTCATTTGGGCTAACTGGCAACCTTATGGTACAGTAGGACAGTTCCCAGGCATTACTATTAATAACAGCCAGAATATGGTTATTAATGGTATTGAGGTACAAGATTGTGGCGGGAATGGCATTGAGATTAGCGATTCATATTCAATTTCCATGAACGGATTGAACACCAATCGTAACGGCATCAATGCTAACAACACTTTCTACAACATCGTATTTAACAAAAGCGATGCAGTTATCAACGGATTCGTAGGACTCAATTATGCCGCGAATAGTGGTTCAGGTGCTAACTCTAGTGCAGGCAATTTTCAGTTCCTGTCTAATGATTGTAGTGTCACCATTAATGGTGTGGTTGAGACTGGTTATATGGGCATTAACTTTATTGGTGATAACAATATTATCAACCCCACCAATTCCGACCTGAGCATTAACGGATTGGTTAATTATTCCAAGACTGGTTTGCAAACCATGAACGAGACCCCTACATTTGATGGTGTTAGCACTACACCTGTTTATGTAAGTGTCCCATCTTCTGTAGGGCAAGTAAATGGTCTGAGACTATCACAAGCCAACAAAGATAAATTACTGTATTCAAGAACAGCAGGTCCAGAAGGTATTACCATGGCTGCTGTTGTAGTACCTACCATATCTGGAGCTGAAGTATTTAACTTCATGGCCATTGGTTCAGGGTTTAGTGATACATCCAACAGTCTTCATCTTCAATTAGTTATAGACGCTTCTGGAAAACAAACAATTGCTTTGCTATTGGGGGGCGATGGTACAACCCAAATTTTATCTGGGGATTTACCTAACGACCTTAAACTACAAAGTGGTGTACCATATCATATAGCTATTGGTGCTAAACCTGGATATTTCTGGTGGAGTATTCTTAATATTCAGACGGGTAAGAGAATCAGACGGTCATTCCGAGGCGCTTATTTAGCCGTACCATTTAATTCTATATTCGGATTAACTTCCTCATTAACATTCTTCTCGGATAGCAATGCTGGTGGGGATGCCTGTTCTGGTGTAGGTGCTAAAGTGTATGTTGGTATGTTCTCTTCTGAGAATGATTATGTATCTTCACGGTACTACAACCTGATTAATCCTGTAGACCCTACTAAGTTAATTAGTTACCGTATATTGGATTCTTCTATTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
b1ec412459baade2ff5f40b427ec1596a81fc1d176ecdddc152fd91452aed733
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7138
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50