Genbank accession
WDS60876.1 [GenBank]
Protein name
baseplate wedge tail fiber connector
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MAREVPGSGAVIKPIFDEVFGVRAVELVSGGSGYDPADPPRLTIDGCGTPDQEALLYPIIDADSGKIVHVRVLERGKGYDPLRLQIIPSSETPSVLDSFDINRIWQRHPNSLTRGTFQTAGTPPVKNDRLRIESDNHPKPTWTSAEAVPGGGPLVDRSFDQVFIYRGGKDVPNPGTRTFQNNKSLGILANGGLLHTPEWGTVGNAPINYSIDTVKYDYVKDTNSDDAIVDGNVQYYQTKNVINEFDTTNGVFEWGKFEQFTWNIKVEFDNVMLTVNNIDETLAEVEVGRTVTEVGGTASGEIAKIVRNAQNVITRIYLRDVTGTFEDTDLLLGSTGFGMRVSADPRTFHSGIFYIDFGAEASEFGPFSPGVYYFAPENIRVKRNYLIVWNQTDASNQPTDLHAQGHPMQFSTTQDGLLNGGALYYNSTGASAAPSTDYENEFKPLFIMNADETNRIYYYCKVHRYMSGFEGDEGYMILDSTIEDEDIENNYYVENFYQSDANDPATIDRSRHVNGHSKVLGMSFDGYPIYGPYGYTTGRTVGRMTSSYRLRTTAELPGTREEVVTASTVTYNITVVNDKFYVDGQEEQLLTLKRGKSYVFNQDDSSNDSHYLFFSLSNDGWHSTGDPADIGSDTYLYNGEDSVVYVLDGTTVATRQLYVQGFNAATTREIRITIPVNAPRVAYVFSYLDSGHGLRLVNEGYILGDLTQDYIYDSSEGLLDEFNGKFGPTPEYPNGTYAYFMTEDASGNPQYPYAIGPKYYSVPLFEGDTVPDLVSSFPTEASGDIVLNTDGTISYIKMTKKGDSFFGSAKAVILGGEGTGAKATPVTQTVTGLSLLNQGRSYATPPNLIFEGGGGQGAEGAAEIDTLGKVTSISVVDPGEFYQEPPYILISGGGGIGAKAEAVISQGQITGINVIEPGEGYTTSPNVIFTKLVNLKRKTRARQAFNSSDIYLTGLTKTLGPQDSQIYVASTDAYAGSGQLIVNKETITYTAKSKGRFTGLTRGVNFKYDQRIILDTGQDVEGVSNYEFNVGDRVIRRVENANNKVAKVYDWDPSTRELLVTFEVDELAFIDAGIPSTEDAIVQFDAGVSNSSGTGVLPHTVIEEEGSTITTLTYPIGTIQDRTFEDDDEQDGAGDGIPDLINTGTTFANQISLDGGIYNSLYGIEETVGGENTTLFQVGDGIKDAAIPSKFATVIEAGGLSDGVEHVAQVRIKLGEGSGTFQVNEVVTGDVSGVRGTVVSWDLTEKILTLKDIVPYNLNNVALGVNGYLYEFSHNSTIVDFVITDNGTNYTAAPTITVENTGDIQATGTVNLTTAGDQVESITITNGGYGIPQTVDSSYALHPTITFTNDASDTTGSGASAQAVLGGERAVGNGGASFRIKSVEYLTLVRSDSA
Physico‐chemical
properties
protein length:1394 AA
molecular weight: 150987,77200 Da
isoelectric point:4,48598
aromaticity:0,09971
hydropathy:-0,30746

Domains

Domains [InterPro]
DC_1338
ATT
1–856
IPR025924
STR
513–560
IPR025924
STR
702–745
WDS60876.1
1 1394
Architecture
ATT
STR
ATT 1-856 | STR 857-1394
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WDS60876.1
1 1394
Domain Start End Length (AA) Confidence
N-terminal 1 121 121 0,9641
Central domain 122 969 849 0,3176
C-terminal 970 1394 424 0,0094
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-121
Central
122-969
C-terminal
970-1394

Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage S-BM1
[NCBI]
3021412 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Synechococcus sp. WH 7803
[NCBI]
32051 Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WDS60876.1 [NCBI]
Genbank nucleotide accession
OQ319120 [NCBI]
CDS location
range 7558 -> 11742
strand +
CDS
ATGGCAAGAGAAGTTCCTGGATCTGGCGCAGTCATCAAGCCAATCTTTGACGAAGTATTTGGTGTCCGTGCAGTTGAGTTAGTTAGTGGTGGTAGTGGGTATGATCCTGCAGACCCACCTCGACTAACTATTGATGGTTGTGGCACGCCAGACCAAGAAGCGTTGTTGTATCCTATTATCGACGCAGACTCTGGTAAGATCGTTCATGTTCGTGTTCTTGAAAGGGGCAAAGGATATGACCCATTGAGACTTCAGATTATTCCTAGCTCTGAAACTCCTAGCGTTTTAGATTCTTTTGATATCAATAGAATCTGGCAGAGACATCCAAACTCTCTAACTAGAGGAACTTTTCAGACTGCTGGCACTCCTCCTGTTAAAAACGATAGACTTCGTATTGAGTCTGATAATCATCCTAAACCCACCTGGACGAGCGCAGAAGCAGTGCCTGGTGGTGGTCCTCTTGTAGATCGTTCATTTGATCAAGTATTCATTTATAGAGGCGGTAAGGACGTTCCTAATCCTGGAACAAGAACATTCCAAAATAATAAATCGCTGGGCATTTTAGCAAACGGTGGTCTATTGCATACCCCTGAATGGGGGACCGTTGGTAATGCTCCAATCAACTATTCTATTGATACTGTAAAATATGATTATGTAAAAGATACCAATAGTGATGATGCTATCGTTGATGGCAATGTTCAGTATTACCAAACCAAAAATGTTATCAATGAGTTTGATACTACAAATGGTGTATTTGAGTGGGGTAAGTTTGAACAGTTCACTTGGAACATTAAGGTAGAATTTGATAATGTGATGTTGACCGTCAATAATATTGACGAGACATTGGCAGAAGTTGAAGTTGGTAGAACCGTAACTGAAGTTGGTGGAACCGCTAGTGGTGAGATTGCGAAGATTGTTAGGAATGCTCAAAATGTAATCACTAGAATATATCTTAGAGATGTTACAGGAACATTTGAAGATACTGACCTTCTTCTTGGTTCTACAGGATTTGGTATGCGAGTCAGTGCAGATCCAAGAACATTCCACAGTGGAATCTTCTACATTGACTTTGGTGCCGAAGCTAGCGAGTTCGGTCCTTTCTCTCCTGGTGTCTATTATTTTGCTCCAGAAAACATCAGAGTTAAGAGAAACTATCTGATTGTTTGGAATCAAACTGACGCATCTAATCAACCAACTGATTTACATGCTCAAGGGCATCCAATGCAGTTTAGTACGACACAAGATGGATTACTGAACGGTGGTGCATTGTATTACAACAGTACTGGTGCATCTGCCGCTCCTTCTACTGACTATGAGAACGAGTTCAAACCTCTCTTTATAATGAATGCGGATGAAACTAATCGCATCTATTATTATTGCAAAGTCCATCGTTACATGTCTGGATTTGAGGGTGACGAAGGTTATATGATTCTCGATTCAACTATTGAAGACGAGGATATTGAAAATAACTACTATGTTGAGAACTTCTATCAGTCAGATGCTAATGATCCAGCAACAATTGATCGTTCTAGGCATGTAAACGGTCACTCTAAAGTTCTGGGTATGTCCTTCGATGGATATCCTATTTACGGTCCATATGGATATACTACAGGCAGAACTGTAGGTAGAATGACAAGTTCTTATAGATTGAGAACTACAGCAGAACTCCCTGGTACTAGAGAAGAAGTTGTTACCGCAAGCACAGTTACTTACAATATCACTGTCGTTAATGACAAGTTTTATGTTGATGGTCAGGAAGAACAACTCCTGACTCTGAAAAGAGGAAAGAGTTATGTCTTCAATCAAGATGATTCTTCAAATGATAGTCACTATCTGTTCTTCTCTTTAAGTAATGATGGATGGCATAGCACAGGAGATCCTGCAGACATTGGTAGCGATACTTATTTGTACAATGGTGAAGATAGTGTAGTTTATGTTCTTGATGGAACAACTGTTGCAACTCGTCAGTTGTATGTACAAGGATTTAATGCCGCCACTACAAGAGAAATCAGAATCACGATTCCTGTAAATGCACCTCGTGTTGCGTATGTATTTTCATATCTCGATTCTGGTCATGGTCTTCGTCTTGTTAATGAAGGATATATTCTTGGTGATCTAACTCAAGATTACATTTATGACTCTTCTGAGGGTCTGTTGGATGAGTTCAACGGTAAGTTTGGACCTACTCCTGAATATCCTAATGGAACATATGCATACTTTATGACTGAAGATGCTTCTGGCAATCCTCAGTATCCATATGCGATTGGTCCCAAGTATTATAGTGTTCCTCTATTTGAGGGTGACACTGTACCTGATTTGGTATCATCGTTCCCAACGGAAGCATCTGGTGATATTGTTCTGAACACTGACGGAACTATTTCATATATTAAGATGACGAAGAAGGGTGATAGTTTCTTCGGTTCTGCAAAGGCAGTTATTCTTGGTGGTGAAGGAACAGGAGCAAAGGCAACCCCCGTAACTCAAACTGTTACTGGTTTGTCTCTACTTAATCAAGGCAGAAGTTACGCTACACCTCCCAACCTGATCTTTGAAGGTGGTGGTGGACAAGGCGCTGAGGGTGCTGCTGAGATTGATACTCTTGGTAAGGTCACTTCTATTAGTGTTGTAGATCCTGGTGAGTTCTATCAAGAACCCCCTTATATTCTCATCAGTGGTGGTGGCGGTATTGGTGCTAAGGCAGAAGCAGTAATCAGTCAAGGTCAAATCACTGGTATCAATGTCATTGAACCAGGTGAAGGTTATACTACCTCGCCAAATGTTATTTTCACCAAACTGGTTAATCTCAAGCGTAAAACGAGAGCACGTCAGGCATTTAACTCTTCGGATATCTATCTGACAGGTCTTACTAAAACTCTTGGACCTCAAGATTCGCAGATTTATGTTGCATCTACTGATGCATATGCTGGTTCTGGTCAACTTATTGTTAATAAGGAAACTATTACATATACTGCGAAGAGTAAAGGTCGATTTACTGGTTTAACTAGAGGTGTAAACTTCAAATATGATCAGCGAATCATTCTTGATACAGGACAGGATGTTGAAGGTGTTTCTAACTATGAGTTCAATGTTGGTGACCGAGTAATCCGTAGAGTTGAGAATGCTAATAATAAAGTTGCAAAAGTCTATGACTGGGATCCCTCAACCAGAGAACTTCTCGTTACATTTGAAGTTGATGAACTAGCATTTATTGATGCTGGTATTCCTTCTACTGAAGATGCTATTGTTCAGTTTGATGCTGGTGTTTCTAATAGCAGCGGCACAGGTGTTCTTCCTCATACAGTTATTGAAGAGGAAGGATCTACTATTACAACATTGACATATCCTATTGGAACTATTCAGGATAGAACATTTGAAGATGATGATGAACAAGATGGTGCAGGAGATGGTATTCCTGACCTGATTAACACAGGTACTACGTTTGCTAATCAAATCAGTCTTGATGGCGGAATCTATAACTCCTTGTATGGTATTGAAGAAACAGTAGGTGGAGAGAATACTACTCTATTCCAAGTTGGAGATGGTATCAAGGACGCAGCAATTCCATCCAAGTTTGCAACTGTTATTGAAGCAGGTGGATTAAGTGATGGTGTTGAACATGTTGCACAAGTTAGAATAAAACTTGGCGAAGGTTCTGGTACTTTCCAAGTGAATGAGGTTGTCACTGGAGATGTCTCTGGTGTCAGAGGAACTGTTGTTTCTTGGGATCTCACAGAAAAGATACTTACTCTAAAAGACATTGTTCCATACAATCTAAATAATGTTGCATTAGGTGTTAACGGATATCTGTATGAGTTCTCTCATAATAGCACTATCGTTGATTTCGTAATCACTGATAATGGAACAAACTACACTGCCGCTCCAACTATTACAGTTGAAAATACTGGAGATATTCAGGCAACAGGAACGGTAAATCTCACCACTGCAGGTGACCAGGTTGAATCTATTACGATCACAAATGGGGGATATGGTATTCCTCAAACTGTTGATTCGAGTTATGCACTTCATCCTACAATCACATTCACTAATGATGCATCCGATACAACTGGATCTGGTGCATCTGCTCAGGCAGTTCTCGGTGGAGAGCGTGCAGTTGGCAACGGTGGTGCTTCTTTCAGAATCAAGAGCGTTGAGTATCTGACATTGGTCCGCTCAGATTCCGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
2166892bebf4ce54ebe016de45a0a683439bfc4b4f011a60bb83c7c071a85a0e
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,4458
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50