Genbank accession
YP_008530271.1 [GenBank]
Protein name
Ig-like domain-containing protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MVQATQYPMYILFDDGQLNITGYNRFGECRTGDLDTINYPNEAAWNVDHVWRADRAFVIRTFDNKFFYIGCTAGLIGSEAAGGNDVCVREWTPLPEQIVTGLHLDTHPERLVEVMGGVNNTVWVIAAPEADGMLHLYGSGNNTYGSLHVDKSQHATPVKIGETSESPETGPWKNPRINCEVHDNSVIFGGPNGFWIAGYDFLRNNSKNTLVYPAEHVTLDQFKGIPVGETWKGFMCGPNGAIIATERMHKPDDQQLVNVFYGQDVWGDDTWRSLNITYTHEVIMARGYGTSGIFFNNGTKQYRGFSRNLCNDIGAQSANNSPRANFIHYRALATAKFVEQTEPVSWTESVYFQGIHREGFLGTFCVVNGKLWWSGIPRGSFSGSNNLFGGRLNSQGFTEIPENWYKNVPVDSWGIEDIFDVNGVSNVSNIYIGDTVKMKLKPQPEGATFIIDKIELVNAAGTVVTDANYQFSTNWNHGGANEVVVTQYNRNINRRGLYSIKITYHDKHGTGRTYTTRTLNWNTIVPAYPSNGKWHTVGRNKQFHVNDTVYFGLNGTQPAVEGDTSYMVRLHRMDAGSVTYDVTQEIYDQRRATYDNIVKTQALWEFNPNGKGGKMLQVNEQNGTSLTVHEHPDPWPDPGKPHPGARTLKIVSHDAGYFGIRWEATVRYVDGTTNNIGITLGGTSEDNSLKIAYTPRGISIDNMDVVRNGYGDVNVKVTLGEHLGGERIIMYAFDHDPRTNGTYANQAWSYLINAPEPNTKEFYYGMKRDVCKKTGTHDWIAICVKDERTAWDEPVNRWFIGIPTRSDKYVAEYIVCMGGTNLNMCWNEDVNKYSDYDYMRDYSCNLWFNQTTGYPPRQAKVNPAIFTDTQVFLTKQANEVQTFKNKYDPNKWFYNCYGAFFWGPGELPGGGSCLNEATYASDYIMGQVKKYKIIPGPETLGNAVDPYIIMTALRSNMDGTSMSMQIPVTNGYKRVVMIIKCDLIGKQVLAENGTSTHPFEIALHYQFADSPFAGDKRISDTDAQRCKKVLLGAGWWWYEFDLTDKFTDTSKVVTGLRLDLGENMHKAVCDGTYGDPTIYLKYISFEHPEDVVYGPKLRLFGSWIAKDRVGMGKKVRGFLVDAGTEDMLVNAVWPELPGTQWNNSAKSINWFNIHRAMWTTNCYLWRELNDQAFGFSDGRRMAIICWTTLQRCYDHDYEIGGRAWKNIRDRIITNFADDNGGGAHNFGTSRLIHLNGSSAYKKEGYSGSMIEWGLVKDARVLMGQQLAAAIGPSAVQSVKPAWFDIPLWSPGTPGTAAINPTTGDLEISWEDLKQVGGWDKTGYQVQWWRADGSLAADEFVKDNFYTMSSAKAQQLFGQATPSTITMSMCCKDNRTGALGPRVAKVFSGIKWNLPVQSISWKQIGDNKLLVTPACQFNATLNVDPAVAANSAKASDFSVSNTAMADVRKIDTLNARITCKNTYGTFQIINNFTDADSKVVRTASQTLSLGTLAYAALITEQSATLQGGGVGKSIATPVWKPNEWVVFDLAVDFSSDNNWTWVRNCLPQLMGGPSSVSDSHDSTDPSVFQVGKTHPETGATLPDRKYALVCISYGKADVTFSGTHTYNGTYNFSRKYSLKAGNIIDEVGVLYNPGNGIGIVGGKLQMQEPSITPSNVAGIRKTWESSNTNIATVDATTGLVTFKATGNVTIKFVVTDDAGRKTSSTSFTVKQMAPQWRMWIGTATNGAYPNPAGTSGMKTFSTSKPMEYGSGPKVGQMVYFGAYIPEIIGLPRSQLQLLFGAGVDDLATFGYSDNIEAARSSGWVGFRMESGKEGRILGTASIGVMFPGDQQYRLEAYATFSR
Physico‐chemical
properties
protein length:1841 AA
molecular weight: 204422,21420 Da
isoelectric point:6,30114
aromaticity:0,11570
hydropathy:-0,38235

Domains

Domains [InterPro]
IPR009091
STR
2–161
G3DSA:2.60.40.1080
STR
1638–1709
IPR003343
STR
1646–1696
YP_008530271.1
1 1841
Architecture
STR
RBD
STR
RBD
STR 1-1627 | RBD 1628-1637 | STR 1638-1709 | RBD 1710-1841
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_008530271.1
1 1841
Domain Start End Length (AA) Confidence
N-terminal 1 10 10 0,0059
Central domain 11 556 547 0,9265
C-terminal 557 1841 1284 0,1689
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-10
Central
11-556
C-terminal
557-1841

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage JES2013
[NCBI]
1327956 Uroviricota > Caudoviricetes > Vequintavirinae > Vequintavirus JES2013 >
Host Escherichia coli O157:H7
[NCBI]
83334 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia
Host Escherichia coli str. K-12 substr. MG1655
[NCBI]
511145 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_008530271.1 [NCBI]
Genbank nucleotide accession
NC_022323 [NCBI]
CDS location
range 13498 -> 19023
strand -
CDS
ATGGTACAAGCAACACAGTATCCTATGTATATACTTTTTGATGATGGTCAGCTTAATATTACTGGCTATAATAGGTTCGGAGAGTGTCGGACTGGAGATCTTGATACAATAAACTATCCTAATGAGGCAGCTTGGAACGTAGACCATGTTTGGAGAGCTGACCGTGCATTTGTAATCCGCACATTCGATAATAAATTTTTCTATATTGGTTGCACAGCTGGTCTTATCGGATCGGAGGCAGCAGGTGGAAACGATGTTTGTGTCAGAGAGTGGACACCTTTGCCGGAACAGATCGTAACAGGGTTGCATTTGGACACCCATCCTGAGAGACTTGTTGAAGTAATGGGTGGCGTAAACAATACAGTGTGGGTAATTGCTGCACCAGAGGCTGATGGTATGCTTCACCTGTACGGCTCTGGCAATAATACTTACGGCTCTCTTCATGTTGACAAAAGTCAACACGCTACTCCAGTTAAGATCGGTGAAACATCTGAAAGTCCAGAGACAGGTCCTTGGAAAAATCCAAGAATTAATTGTGAAGTTCATGACAACTCGGTTATTTTTGGTGGTCCTAACGGATTCTGGATTGCAGGTTACGATTTCTTAAGAAATAACAGCAAAAACACACTAGTTTATCCAGCCGAGCATGTGACTCTTGACCAGTTTAAAGGCATCCCGGTGGGCGAGACATGGAAAGGGTTCATGTGTGGACCTAATGGTGCAATTATAGCCACAGAAAGGATGCACAAACCTGATGACCAACAGCTTGTTAATGTGTTTTATGGTCAGGATGTTTGGGGTGACGACACGTGGAGAAGCCTTAACATCACCTATACCCATGAAGTAATTATGGCTCGCGGGTACGGGACTAGTGGAATTTTCTTCAATAATGGTACAAAACAGTACCGTGGTTTCTCCAGAAACCTGTGTAACGATATTGGTGCTCAATCTGCTAATAATAGTCCTAGGGCCAATTTTATTCATTACAGAGCGCTGGCAACAGCTAAGTTTGTTGAACAGACAGAGCCAGTAAGCTGGACAGAAAGCGTTTATTTTCAAGGGATCCACAGGGAAGGTTTCTTAGGCACATTCTGTGTGGTAAATGGAAAGCTTTGGTGGTCAGGAATACCACGTGGTAGTTTTTCAGGATCTAATAACCTCTTTGGCGGCAGACTTAACAGCCAAGGGTTCACAGAGATTCCGGAGAATTGGTATAAAAATGTCCCGGTAGATAGCTGGGGTATCGAAGACATCTTTGATGTCAATGGTGTGAGCAATGTTAGCAACATTTACATCGGAGATACTGTCAAGATGAAATTGAAACCTCAGCCGGAGGGTGCTACGTTCATTATTGACAAGATTGAGTTGGTAAATGCTGCAGGCACTGTTGTTACAGATGCAAACTATCAGTTCTCCACTAACTGGAACCATGGTGGTGCTAACGAAGTTGTGGTCACCCAGTATAACCGTAACATCAACAGACGTGGTTTGTATTCTATCAAGATCACCTACCATGACAAGCATGGCACTGGAAGAACCTATACGACAAGGACTTTGAACTGGAACACTATAGTTCCTGCGTATCCGTCCAACGGCAAGTGGCATACCGTTGGTAGAAACAAGCAGTTCCATGTTAACGATACTGTATACTTTGGGCTGAATGGGACCCAGCCTGCAGTAGAAGGGGATACATCGTATATGGTCAGACTCCACAGAATGGATGCTGGATCAGTCACTTATGATGTGACCCAAGAGATCTACGATCAAAGACGCGCCACGTATGACAATATTGTGAAAACACAAGCCTTGTGGGAGTTCAATCCTAATGGCAAGGGTGGTAAAATGTTGCAGGTTAATGAGCAGAACGGAACGTCGTTGACTGTTCACGAACACCCAGACCCTTGGCCTGATCCTGGAAAACCTCACCCGGGAGCTCGTACACTGAAAATTGTCAGCCATGATGCAGGATATTTCGGAATACGCTGGGAAGCTACAGTAAGATATGTCGACGGGACAACAAACAACATAGGGATTACACTTGGCGGCACTAGTGAAGATAACTCGCTGAAAATTGCCTACACGCCCAGAGGTATCTCTATTGATAATATGGATGTTGTTCGCAATGGTTACGGCGATGTGAATGTCAAAGTTACTCTTGGCGAACATCTTGGTGGGGAAAGGATCATCATGTATGCCTTTGATCATGATCCACGCACCAATGGAACTTATGCAAATCAGGCATGGAGTTACCTGATAAATGCACCTGAGCCGAATACTAAAGAGTTCTACTATGGTATGAAGCGAGATGTGTGCAAGAAAACAGGAACACATGACTGGATTGCTATCTGCGTTAAAGATGAGCGCACAGCTTGGGATGAGCCTGTTAACAGATGGTTCATAGGTATTCCTACCAGAAGTGATAAGTATGTCGCAGAATATATTGTATGCATGGGCGGTACCAACCTAAACATGTGTTGGAACGAAGATGTAAACAAATACTCTGATTATGACTATATGAGGGATTACTCTTGTAATTTGTGGTTTAACCAAACTACAGGTTACCCTCCAAGACAGGCGAAAGTAAACCCTGCAATCTTCACAGACACGCAGGTTTTCTTGACAAAACAGGCCAACGAGGTTCAGACCTTCAAGAATAAATACGATCCTAACAAGTGGTTCTACAACTGTTACGGCGCATTTTTCTGGGGACCTGGTGAGTTACCTGGTGGAGGATCTTGCTTGAATGAGGCAACTTACGCCAGTGACTACATCATGGGGCAGGTCAAGAAGTATAAGATAATCCCCGGTCCTGAGACTCTGGGCAATGCCGTTGACCCTTATATCATCATGACTGCACTGAGATCAAACATGGACGGGACATCCATGTCTATGCAAATTCCGGTAACAAACGGGTACAAACGTGTTGTGATGATCATTAAGTGCGACTTAATAGGTAAACAAGTTCTTGCAGAAAACGGGACATCCACACATCCTTTTGAGATTGCTCTGCACTATCAGTTTGCTGACTCACCTTTTGCAGGTGATAAGAGAATATCTGATACAGATGCACAGCGTTGTAAGAAAGTATTGTTGGGTGCAGGTTGGTGGTGGTATGAGTTTGATTTAACAGACAAATTCACCGACACGTCTAAAGTTGTAACAGGACTTCGTCTAGACCTTGGCGAGAACATGCACAAAGCTGTTTGTGATGGCACCTATGGTGACCCTACAATATACCTAAAATATATTTCTTTTGAGCATCCGGAAGATGTTGTCTACGGTCCAAAACTGAGATTGTTTGGCAGTTGGATTGCAAAAGATAGGGTAGGCATGGGTAAGAAGGTGAGAGGGTTCTTGGTTGATGCAGGAACAGAAGACATGCTGGTTAATGCAGTATGGCCTGAATTACCTGGGACTCAGTGGAACAACTCAGCCAAATCTATAAACTGGTTTAACATTCATAGGGCCATGTGGACCACCAACTGCTACTTGTGGAGAGAGCTTAACGATCAGGCCTTTGGCTTCAGCGATGGTAGAAGGATGGCGATCATTTGCTGGACAACACTGCAAAGATGTTATGACCATGACTACGAGATTGGGGGCAGGGCATGGAAAAATATCAGGGATAGAATCATAACCAATTTTGCAGACGACAACGGTGGTGGAGCACATAACTTTGGCACAAGCCGACTCATCCATTTGAACGGATCCTCAGCCTACAAGAAAGAAGGTTACTCTGGGTCTATGATCGAGTGGGGTCTGGTGAAAGATGCTCGCGTATTGATGGGCCAGCAACTTGCTGCCGCAATAGGCCCCAGCGCAGTGCAGTCGGTTAAACCAGCGTGGTTCGATATTCCATTGTGGTCACCCGGAACACCGGGAACTGCTGCAATCAACCCCACCACTGGGGATCTAGAAATCTCCTGGGAGGACTTGAAGCAGGTCGGTGGTTGGGACAAAACAGGGTATCAGGTTCAGTGGTGGAGGGCTGATGGATCTTTAGCCGCTGATGAGTTTGTTAAGGACAATTTCTACACTATGTCCTCTGCAAAAGCACAGCAATTATTTGGTCAGGCAACTCCGTCAACGATCACCATGTCTATGTGCTGTAAAGACAACAGGACTGGAGCTTTGGGGCCAAGGGTTGCTAAAGTTTTCTCAGGTATTAAATGGAATCTACCTGTCCAAAGTATTTCATGGAAGCAAATAGGTGATAACAAGCTGCTGGTTACCCCTGCCTGTCAGTTCAACGCAACTCTTAACGTTGATCCTGCTGTTGCGGCAAACTCAGCTAAAGCTTCTGACTTCTCTGTGTCTAACACTGCCATGGCAGATGTGAGGAAGATTGACACGCTGAACGCCAGAATTACCTGTAAAAACACTTATGGCACATTCCAGATCATCAACAACTTCACAGATGCTGATTCTAAGGTAGTGAGGACAGCAAGCCAGACTTTGAGTTTAGGAACTCTGGCCTATGCGGCCCTGATCACTGAACAGTCGGCAACACTCCAAGGAGGTGGCGTAGGTAAATCTATTGCAACACCAGTATGGAAGCCAAATGAGTGGGTTGTATTTGATCTGGCTGTTGACTTCTCTAGCGATAATAACTGGACGTGGGTAAGGAATTGCTTACCTCAATTGATGGGTGGTCCAAGCTCTGTCAGCGATAGCCACGATTCTACTGACCCCAGCGTGTTCCAGGTTGGTAAAACTCACCCAGAGACGGGAGCAACGTTGCCTGACAGGAAGTATGCTTTGGTTTGCATCTCTTACGGAAAAGCAGATGTCACCTTCTCAGGGACACACACTTATAACGGAACTTACAACTTCTCAAGGAAGTATAGTCTCAAAGCAGGGAACATTATAGACGAGGTTGGTGTGCTGTATAATCCGGGCAACGGCATAGGGATTGTTGGTGGTAAACTGCAGATGCAGGAACCTTCTATTACCCCTTCCAACGTAGCCGGGATTAGAAAGACTTGGGAAAGTAGTAATACCAACATTGCAACAGTGGATGCCACCACAGGACTGGTAACATTTAAAGCTACTGGGAATGTCACCATAAAGTTTGTAGTTACTGATGATGCAGGGCGCAAAACGTCTTCAACATCTTTTACTGTCAAACAGATGGCACCACAGTGGAGAATGTGGATAGGTACAGCAACAAACGGGGCATACCCTAATCCGGCAGGTACTTCTGGTATGAAGACTTTCTCTACAAGCAAACCGATGGAGTACGGCAGCGGTCCTAAGGTAGGGCAGATGGTGTACTTTGGTGCGTATATTCCTGAAATTATAGGGCTTCCGAGAAGTCAGCTTCAGTTGCTGTTTGGGGCTGGTGTTGACGATCTTGCCACTTTCGGGTATAGCGACAACATCGAGGCTGCAAGGAGTTCAGGATGGGTAGGGTTCAGAATGGAGTCTGGAAAGGAAGGCAGGATTCTAGGGACAGCCTCTATAGGTGTTATGTTCCCTGGTGACCAGCAGTATCGTCTAGAAGCCTACGCAACCTTTTCTCGTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
9d69b9654361bcfedb531621d54c46f6ad5a727c85419e62984cf7bbaea00aae
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,4188
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50