Protein
- Genbank accession
- WCS67311.1 [GenBank]
- Protein name
- putative C-type lectin
- RBP type
- TSP
- Protein sequence
-
MKDIQQLIKKNSQEGRYEDIFPKTFLDAVIDKESGTTLTDILSSFNMYFLSYTGSRETTRLEVPMSLRKQGLWITYVLYDGNTITEWYGINAVDDTSWQNGENWRLGSNMLVGDISISSGGNWIINGVDTGIPARGEKGDSAMLRVNDSNKLQVSYTNGNVWIDLSDNPVYTQFRVNNNKLEQSIDLGETWSVVSDYIASWFRFTGTAGSSQADNVGKIQISRDNGVTWSDLSGEFTNSLHIKGYVATVGALPSTAVQGDIYGVGPTYDPSDTEQTNPIYQLYVKDSTGWVDNGRFTSIAAGVVQELGNSETAVISQKGVTDNFINMSQLGGVAKTPTLPSGGSAYIVDFTTDDGIIVGAKRHYKFSTLKNTTIRFQTRNASGPMVEVSVPINSFGVAEYAMVQGGTIVRIYGMASTETITLIAENTLLDIVGSMSDNNEAYKRLEPNTNLDAIKDAGIYLVKGSYQVGSYMMLVFDNGANSIGQLTYGFTSSYKCSIRIRTFNGTSWTAWDDVFDYVKATFLDETTNIDTITSNGLYLKKGGQYYSPYILMNFDGKQVAIGNGPTGLYLRKRTFINGVFSEWVDLWQNNMIGITTFNNGVSLSPDGTITSLAMKFCSDKLPITTEKIILNNSETIGYNGNVIAYQIAFYDSSSVVIGSRMNLPIGEFTNIPANAKFFKVAGGYNSSVDKAWSNEDSQNYLSFSYGTQKGKLDIMDDDINSHNQKIAVLENKIDKFCINNPLNQTAAENPCTLIDSKGRMYVSYISSPVGVGESFHSIDLSVFSMVDFTHIEHYNIATASEFGSQNILGNNICEVSDNIIRVFFRVGVYGEQQHYYKDFNKSTKQVGNAVKVKFKSSNSSSPVDFTLTNITNAYTGTSYTPPTGELTMTSNIVIYNDNLFTTVSSTLSNTMVVKSTDFGATWEKVGFIGNLSQYENELSVVGDIMYCFLRNGAPDSGVSGPARSTRNLWKSVDSGATWIDTTLDINMGNNRGASFTLNGKLYLAYSDNNQSSNWSITRPWRSSVGIYEYNEADNSLKQVKHFTDKYGFVEIDIVIHENSIFLLFSNGKIYQMYRDGGTFFYGYSQGKDALYITEVNNDFNFIGNYSR
- Physico-chemical properties
- protein length: 1107 AA; molecular weight: 122459.08020 Da; isoelectric point: 4.92711; aromaticity: 0.11743; hydropathy: -0.27326
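The page does not say how aromaticity and hydropathy are calculated. A minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the GRAVY score, the mean Kyte-Doolittle value per residue), shows how such figures could be recomputed from a sequence. The function names are illustrative, and whether the database uses exactly these definitions is an assumption.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (GRAVY); assumed definition."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence, used only as a short demo input.
fragment = "MKDIQQLIKKNSQEGRYEDI"
print(f"aromaticity: {aromaticity(fragment):.5f}")
print(f"hydropathy:  {gravy(fragment):.5f}")
```

Passing the full 1107-residue sequence instead of the fragment would give values comparable to the ones listed, provided the assumed definitions match.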
Domains
Domains [InterPro]
DC_0777: STR, 1–688
cd19958: STR, 448–511
cd15482: ENZ, 900–1013
Architecture
STR 1-688 | STR 745-1097 |
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 330 | 330 | 0.9617 |
| Central domain | 331 | 535 | 205 | 0.3029 |
| C-terminal | 536 | 1107 | 572 | 0.2835 |
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal: 1-330
Central: 331-535
C-terminal: 536-1107
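The segment boundaries can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the page does not state. It computes each segment's length and verifies that the three segments tile residues 1 to 1107 without gaps or overlaps. The helper names are hypothetical.

```python
PROTEIN_LENGTH = 1107

# (name, start, end) as listed in the segmentation; assumed 1-based, inclusive.
SEGMENTS = [
    ("N-terminal", 1, 330),
    ("Central domain", 331, 535),
    ("C-terminal", 536, 1107),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in the closed interval [start, end]."""
    return end - start + 1


def tiles_protein(segments, protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gap or overlap."""
    expected_start = 1
    for _name, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


for name, start, end in SEGMENTS:
    print(f"{name}: {start}-{end} ({inclusive_length(start, end)} AA)")
print("covers full protein:", tiles_protein(SEGMENTS, PROTEIN_LENGTH))
```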
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Bacteroides phage PhiCrAssBcn23 [NCBI] | 3023106 | No lineage information |
| Host | Bacteroides intestinalis [NCBI] | 329854 | cellular organisms > Bacteria > Pseudomonadati > FCB group > Bacteroidota/Chlorobiota group > Bacteroidota |
Coding sequence (CDS)
Genbank protein accession: WCS67311.1 [NCBI]
Genbank nucleotide accession: OQ221558 [NCBI]
CDS location
range 45005 -> 48328
strand -
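The CDS range can be reconciled with the protein length by simple arithmetic. The sketch below assumes the range is 1-based and inclusive and that the CDS ends with one stop codon that is not translated. Both are assumptions, and the function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotides in the closed range [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_nt: int, has_stop: bool = True) -> int:
    """Residue count implied by a CDS length, optionally excluding one stop codon."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_nt // 3
    return codons - 1 if has_stop else codons


nt = cds_length(45005, 48328)
print(nt)                           # 3324
print(expected_protein_length(nt))  # 1107
```

Under these assumptions, the 3324-nucleotide range corresponds to 1108 codons, that is 1107 residues plus a stop codon, which agrees with the stated protein length.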
CDS
ATGAAAGATATACAACAACTAATTAAAAAGAATAGTCAAGAGGGAAGATATGAAGACATCTTCCCTAAGACTTTCCTTGATGCAGTAATTGACAAGGAGAGTGGGACTACTCTAACAGACATTCTTTCCAGCTTTAATATGTACTTCCTCTCATACACAGGAAGTAGGGAAACTACAAGACTTGAAGTTCCTATGTCACTTAGGAAACAAGGATTGTGGATAACCTATGTACTATATGATGGAAATACTATTACAGAATGGTATGGAATCAATGCTGTGGATGATACTTCTTGGCAGAATGGTGAGAACTGGAGATTAGGTTCAAATATGTTGGTAGGTGATATTAGTATCTCTTCTGGTGGTAATTGGATTATTAATGGAGTTGATACTGGAATCCCTGCAAGAGGAGAGAAAGGTGATTCAGCTATGCTAAGAGTAAATGATTCTAACAAACTCCAAGTTAGTTATACTAATGGTAATGTTTGGATAGATTTGTCAGATAATCCTGTTTACACACAGTTTAGAGTGAATAACAATAAACTGGAACAATCTATAGACCTTGGTGAAACATGGTCTGTAGTATCTGACTATATTGCATCATGGTTTAGGTTTACTGGAACTGCTGGAAGTAGTCAAGCTGATAATGTAGGTAAGATACAGATTAGTAGAGATAATGGGGTTACATGGTCTGATTTAAGTGGAGAATTTACTAATAGTTTGCATATTAAAGGATATGTAGCTACTGTAGGTGCTCTCCCCTCTACTGCTGTTCAAGGAGATATTTATGGTGTTGGTCCTACTTATGACCCAAGTGATACTGAACAAACTAATCCTATCTATCAACTATATGTTAAAGATAGTACTGGATGGGTTGATAATGGTAGATTTACTTCAATAGCTGCTGGTGTAGTACAAGAGTTAGGTAATAGTGAGACTGCTGTAATAAGTCAAAAGGGAGTTACTGATAACTTTATTAATATGAGTCAATTAGGAGGAGTTGCTAAAACTCCTACACTGCCTTCTGGAGGCTCAGCCTATATTGTTGATTTTACAACAGATGATGGAATAATTGTTGGTGCAAAAAGACATTATAAGTTTAGTACCCTTAAAAATACTACAATAAGATTCCAGACAAGAAATGCTTCTGGTCCTATGGTAGAAGTTTCTGTACCTATTAATAGCTTTGGAGTAGCTGAGTATGCTATGGTTCAAGGTGGAACTATAGTAAGAATCTATGGAATGGCTTCCACTGAAACTATAACACTAATAGCTGAAAATACTCTATTAGATATTGTAGGAAGTATGAGTGATAACAATGAAGCCTATAAGAGATTAGAACCAAATACAAATCTTGATGCTATTAAAGATGCAGGTATATATTTGGTTAAAGGTAGTTATCAGGTAGGTTCCTACATGATGTTAGTTTTTGATAATGGAGCAAATAGTATAGGACAGCTAACTTATGGGTTCACCTCATCATATAAATGTAGCATCAGAATAAGAACCTTTAATGGTACTTCATGGACTGCATGGGATGATGTCTTTGATTATGTTAAAGCTACATTCCTTGATGAAACCACTAATATAGATACTATAACATCTAATGGATTGTATCTTAAAAAAGGTGGTCAGTATTATTCTCCTTACATTTTAATGAATTTTGATGGTAAACAAGTAGCTATTGGTAATGGTCCTACAGGTCTTTACCTTAGAAAAAGAACTTTTATTAATGGAGTATTTTCTGAATGGGTAGACCTATGGCAGAATAATATGATAGGAATAACCACATTTAATAATGGTGTGTCACTTTCCCCAGATGGAACTATAACTTCTCTTGCTATGAAGTTCTGTTCAGATAAGTTACCTATTACAACTGAAAAGATTATCCTAAATAACTCAGAAACAATAGGATATAATGGTAATGTTATTGCTTATCAAATAGCTTTCTATGATAGTTCAAGTGTAGTAATTGGAAGTAGAATGAATTTACCAATTGGTGA
GTTTACTAATATACCTGCTAATGCTAAGTTCTTTAAAGTTGCTGGAGGTTATAACTCAAGTGTAGATAAAGCATGGTCTAATGAAGATAGTCAGAACTACTTATCTTTCTCTTATGGAACTCAGAAAGGGAAATTAGATATAATGGATGATGATATTAATAGTCATAATCAGAAAATTGCAGTTTTGGAAAATAAGATTGATAAGTTTTGCATAAATAATCCTTTAAATCAAACTGCTGCTGAAAATCCATGCACTCTTATTGATTCAAAAGGTAGAATGTATGTTTCTTATATATCATCTCCTGTAGGTGTTGGAGAGAGTTTTCATTCAATAGATTTGAGTGTTTTCTCTATGGTAGATTTTACTCATATTGAACATTATAATATAGCTACAGCCTCAGAGTTTGGTAGTCAGAATATTCTTGGAAATAACATTTGTGAGGTATCTGATAACATTATCAGGGTATTTTTTAGAGTAGGTGTTTATGGGGAACAACAACATTATTATAAAGATTTTAATAAATCAACTAAGCAGGTAGGTAATGCAGTTAAAGTGAAATTTAAGAGTTCTAATAGTTCAAGCCCAGTTGATTTTACATTAACTAATATTACAAATGCTTATACAGGCACTTCTTATACTCCCCCAACTGGAGAGCTAACTATGACAAGTAATATAGTTATATATAATGATAACTTATTTACTACTGTATCATCAACACTATCAAATACAATGGTTGTGAAAAGTACAGATTTTGGTGCTACTTGGGAGAAGGTAGGATTTATAGGTAACTTGTCTCAGTATGAGAATGAGTTGTCTGTTGTAGGAGATATAATGTACTGTTTCTTAAGGAATGGTGCTCCTGATTCTGGTGTATCTGGTCCTGCAAGAAGCACAAGAAATCTATGGAAATCTGTTGACTCAGGTGCAACTTGGATAGATACTACTCTTGATATAAATATGGGTAATAATAGAGGAGCATCTTTTACTCTTAATGGAAAACTTTATCTTGCTTATTCTGACAATAATCAGTCAAGTAACTGGTCTATAACAAGACCTTGGAGAAGTTCTGTAGGTATCTATGAGTACAATGAGGCAGATAACTCTTTGAAACAAGTTAAACATTTTACTGATAAGTACGGGTTTGTAGAAATTGACATTGTTATACACGAAAATAGCATATTCTTACTCTTTAGTAATGGTAAGATTTATCAGATGTATAGAGATGGAGGTACATTCTTCTATGGATATTCTCAGGGTAAAGATGCTTTGTATATAACAGAAGTGAATAATGATTTTAACTTTATTGGTAATTACTCAAGATAG
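To relate the CDS to the protein sequence, the nucleotides can be translated codon by codon. This is a minimal sketch assuming the standard genetic code; the page does not say which translation table applies to this phage, so treat that as an assumption. The `translate` helper is hypothetical, and the two demo inputs are the first 18 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (first base varies slowest); '*' is a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons appear as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGAAAGATATACAACAA"))  # MKDIQQ
print(translate("TACTCAAGATAG"))        # YSR*
```

The two fragments translate to the first six and last three residues of the protein sequence, followed by a stop.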
Genome Context
Tertiary structure
PDB ID
11c8b5f6cf983ce201d0697713b10ec7a8e14a1a85691183dd02c7d240fb1003
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
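The confidence bands can be applied to per-residue pLDDT scores with a small helper. This is a sketch only: the legend uses strict inequalities and does not say where the exact values 90, 70 and 50 fall, so assigning them to the lower band is an assumption, as is the function name.

```python
def plddt_category(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) go to the lower band; the legend
    leaves this unspecified, so that choice is an assumption.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_category(score))
```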