Genbank accession
YP_009910766.1 [GenBank]
Protein name
structural protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
Protein sequence
MADRVIQRNDTAARWQSINPVLAQGELGIVSDGAKGYKIGDGVTAWNSLEYPANPASVVQELGNSETAVISQNAITNNFANTNSFLKNSIFITEIERTRLIIDGNSSNIELTSGYIKTDGNLGDIVNLTVNQSLNTRYAIIPCKEGDYFVISGNGGDLSRLFCFIDSSNRIILNAESRLVLSTYIKVPYNTSKLIVNFEQASDYIKKFNINNDSVRKYWVLNNSLNIYTNTFTPSPLYIQYDNNYIQSINPLDTTPITPSKNGFLVFNKKDSKYIFRLNSVDIDVINDIIVFVVRNNIFYNGDSLSYCLSYHVQNLVKYKTRNTYMLYSPNNDTSYTNNSVTTLGAWYLDNGTIHNHFILENGIVTPKTFVFTINSYLVYNKANYTARIVDSYKDIGDDDVIWLYYQDGKFVDGLFYSYYMASKMNSTSIESENSVETTIKKRAYQFATIPWVALKPIASTSSSTGIEIGNHVGLPYTSCMEVDKFVGYEVSLRTFMTCANNPYSLLYTEDLSRNISGYGFTYHNTGRGTIGGYMGIVCNIFGMNAISYKIPYDTGNWKFLRKKGYFKELQYQEAKYLNIGDIIVEPGHCNVITNIEKNDSGYSKIIYWGESVMDFPKINRYSEEQANNRIAERGGIIYRRDNLYKDSYYERSPFVAVEDEILDSSYVYNDDICTFAGDYAVFRENQKIVINYNLKNTNSDWNQIELYKNDELIGTYSLVDSHNYDLSSLNLKYGNYKARLKYNSTFSDYTYFQILDTEVSYTLNDSTIKIVFNSHNGEPLFIRLCKVDGGPICIVELTEDDKRKGFIEFDYNYYGNGQGYSIVNGGTYLKVYFESEYGRVTNEPILTTI
Physico‐chemical
properties
protein length:850 AA
molecular weight: 96901,32600 Da
isoelectric point:5,21602
aromaticity:0,13765
hydropathy:-0,33471

Domains

Domains [InterPro]
DC_0999
STR
1–504
SSF69349
STR
4–94
YP_009910766.1
1 850
Architecture
STR
STR 1-504 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009910766.1
1 850
Domain Start End Length (AA) Confidence
N-terminal 1 85 85 0,9864
Central domain 86 297 213 0,5064
C-terminal 298 850 552 0,1178
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-85
Central
86-297
C-terminal
298-850

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacteroides phage crAss001
[NCBI]
2301731 Uroviricota > Caudoviricetes > Crassvirales > Asinivirinae > Kehishuvirus
Host Bacteroides intestinalis
[NCBI]
329854 cellular organisms > Bacteria > Pseudomonadati > FCB group > Bacteroidota/Chlorobiota group > Bacteroidota

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009910766.1 [NCBI]
Genbank nucleotide accession
NC_049977 [NCBI]
CDS location
range 22456 -> 25008
strand +
CDS
ATGGCAGATAGAGTAATACAAAGAAATGACACTGCTGCAAGGTGGCAGTCAATTAACCCAGTTCTTGCACAAGGAGAACTGGGTATAGTCTCTGATGGGGCTAAAGGATATAAGATAGGTGATGGTGTTACTGCATGGAATAGTCTTGAATATCCTGCAAATCCAGCAAGTGTAGTACAAGAATTAGGTAATAGTGAAACTGCTGTTATTAGTCAGAATGCTATTACTAATAATTTTGCAAATACAAATTCCTTCTTAAAGAATTCTATATTTATTACAGAAATAGAAAGAACTAGATTAATTATTGATGGTAATTCTTCTAATATAGAATTAACATCTGGTTATATTAAAACTGATGGAAACCTTGGAGATATTGTTAATTTAACAGTTAATCAATCTTTAAATACAAGATATGCTATAATACCCTGTAAAGAAGGTGACTATTTTGTAATAAGTGGTAATGGAGGAGATTTATCAAGACTATTCTGTTTTATAGATTCTTCAAATAGAATTATATTAAATGCAGAGAGTAGATTAGTACTAAGTACTTATATTAAAGTTCCTTATAATACTTCTAAATTAATAGTTAACTTTGAACAAGCATCTGATTATATAAAAAAATTTAATATAAATAATGACTCAGTTAGAAAATACTGGGTACTAAATAATTCACTAAATATTTATACAAATACTTTTACTCCTTCTCCTTTATATATTCAGTATGATAATAACTATATACAATCTATAAATCCTTTAGATACAACACCTATTACACCATCAAAAAATGGATTTTTAGTATTTAATAAAAAGGACTCTAAATATATATTTAGACTAAATAGTGTTGATATTGATGTTATTAATGATATTATTGTATTTGTAGTAAGGAATAATATATTTTATAATGGAGACTCTTTATCCTATTGTCTTTCTTATCATGTACAGAATTTAGTTAAATATAAGACAAGGAATACTTATATGCTCTATTCTCCCAATAATGATACTTCATATACAAATAATTCTGTTACTACATTAGGAGCGTGGTATTTGGATAATGGTACTATACATAATCATTTCATATTAGAAAATGGAATTGTTACTCCTAAAACTTTTGTATTTACTATTAATTCTTATTTAGTATATAATAAAGCAAATTATACAGCAAGGATTGTAGATAGTTATAAGGATATTGGAGATGATGATGTAATCTGGTTATATTACCAAGATGGCAAGTTCGTAGATGGGCTTTTCTATAGTTATTATATGGCTTCTAAAATGAATAGTACTTCTATAGAGAGTGAAAATAGTGTTGAAACTACTATTAAGAAAAGGGCTTACCAATTTGCTACTATTCCTTGGGTAGCTTTAAAACCTATAGCAAGCACTTCCAGTTCTACTGGAATAGAAATAGGTAATCATGTAGGATTACCATATACTTCTTGTATGGAAGTTGATAAATTTGTAGGGTATGAAGTTTCTCTTAGAACATTTATGACTTGTGCTAATAACCCATATAGTTTACTATATACGGAAGATTTATCAAGAAATATATCTGGATATGGATTTACATATCATAATACAGGAAGAGGTACTATAGGAGGTTATATGGGAATTGTATGTAATATATTTGGTATGAATGCTATTTCTTATAAAATCCCATATGATACAGGAAACTGGAAATTTTTAAGAAAGAAAGGATATTTTAAAGAGTTACAGTATCAAGAAGCTAAATACCTAAACATTGGGGATATTATAGTTGAGCCAGGGCATTGTAATGTAATTACTAATATAGAAAAGAATGATTCAGGTTATTCTAAAATTATTTATTGGGGTGAATCAGTAATGGATTTTCCTAAAATAAATAGATATTCTGAGGAACAAGCTAATAATAGAATAGCAGAAAGAGGTGGAATAATCTATAGAAGAGATAACTTATATAAAGATAGTTATTATGAAAGGTCTCCTTTTGTAGCTGTAGAAGATGAAATATTGGATTCCTCTTATGTTTATAATGATGATATTTGTACATTTGCTGGAGACTATGCAGTATTTAGAGAAAATCAAAAGATAGTTATAAATTATAATTTAAAGAATACTAATAGTGATTGGAATCAAATTGAACTATATAAAAATGATGAATTAATTGGAACATATAGTTTAGTTGATTCTCATAATTATGATTTATCTTCATTAAATCTAAAATATGGTAATTATAAAGCTAGACTAAAATATAACTCTACTTTCTCTGACTATACATATTTCCAAATATTGGATACAGAAGTTTCATATACTTTAAATGATAGTACAATTAAGATAGTCTTTAATTCTCATAATGGAGAACCTTTATTTATTAGATTATGTAAAGTGGATGGAGGTCCTATATGTATAGTAGAACTTACAGAAGATGATAAGAGGAAAGGTTTTATAGAATTTGATTATAATTATTATGGCAATGGTCAAGGTTATTCTATAGTAAATGGGGGTACATATTTAAAGGTTTATTTTGAATCAGAATATGGAAGAGTTACTAATGAGCCTATACTTACTACAATATAG

Genome Context

Genome Context

Tertiary structure

PDB ID
a84612f9ee8e4e3cabc28b20790c9cffb90c22664f963dc95e28661b7c22e262
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5878
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
PhiCrAss001, a member of the most abundant bacteriophage family in the human gut, infects Bacteroides Shkoporov,A.N., Khokhlova,E.V., Fitzgerald,C.B., Stockdale,S.R., Draper,L.A., Ross,R.P. and Hill,C. 2018-06-26 GenBank