GenBank accession
WZC35867.1 [GenBank]
Protein name
tail fiber protein S'
RBP type
TF
Evidence GenBank
Probability 1.00
Protein sequence
MKGYSFSELVKLAEAALPATGTAVAANKLATARTIGGVPFDGTANINLPGVNTTGNQNTTGSSASTTGNAATATRFQTPRKINGKVFDGTADVTLNNADVGAAPTNHSHTPAQVGLPNLSNNANGLLAVSGVDSTNSQAILFTGVNLNLVASAVRQGIYSVANTRWEWAFSSGVLEVGTVPWARLSGIPAYAQRWPTYQEVGLGNLTNERQQRAHLGTVGFNTDWNLLNLAGTYTVGDSGSTPNSVPGNAYKYGTAVRFGADGNANHVQVYYPHNATAASTPSRADLLPRFRSGFNTWESAAWQTLPTLEYLTAQGYYHSGNKPTMEDVGLIPRDIPRNSLSLGVVDLNNITGPGFYSQQSDANANAGSNYPVKLAGSLRVTTGAGIVQVYTLYGNNVETYERAMYNGNWSPWYRLYSTKKPPTRAEIGLDTIHDGSTTQWTNTTMNGKKGGYAGIYFSAAEASLMIGEGAQKGGWGIYDNVTNKWNMYWVNDVMQQGQVPWARLTGIPATASRWPTRAEIGLNQVYDGTENNWGSITVTGAKTDYAGIHFKDMNRNFMVQAANQGIFDPTSGQWDWRFNNGVLAAGYVPWGRVTGRPAMADRWPTAAEAGALDRFLAAQEFGGGSVITTDQFLTILEQKGAFNKGYWAGRGTWSYASNSYVNTGIGTIHLAGAVIEVWGGWREECTVKVTLPTTQGGQGTYTHREFTYVNNGSTYNPGWRADFNTSTGVPWDSVTAKPATATRWPTPAEIGLGNVVNKGWNYGVAADTYAVRDANGDLTARQMHATEFYGSGYNLTNLQWARLVGVPVTATRWPTWNEVSGKPADVADWVTPYNSAAGSATNSWPLNAAVLNHLRAGGMVEFRVWFNRSNAQRSERFMRHAAVDGADGWTGTQWDAFTDCGWPGVSSCIIRNGASLDVQYRNTITSQYSNITRVDYRLL
Physico-chemical properties
protein length: 940 AA
molecular weight: 101516.12600 Da
isoelectric point: 8.38626
aromaticity: 0.11170
hydropathy: -0.36851
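
Sequence-derived values such as these are commonly computed with simple per-residue formulas. The sketch below assumes aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not say which tool or scale produced its numbers, so these definitions and the function names are illustrative assumptions.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def protein_length(seq: str) -> int:
    """Number of residues in the sequence."""
    return len(seq)


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First residues of the protein sequence listed in this record.
    fragment = "MKGYSFSELVKLAEAALPATG"
    print(protein_length(fragment), aromaticity(fragment), gravy(fragment))
```

Passing the full 940-residue sequence instead of the fragment gives values that can be compared with the figures listed above, keeping in mind that a different scale or rounding would shift them.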

Domains

Domains [InterPro]
DC_1592 STR 20–53
cd19958 STR 343–417
WZC35867.1 1–940
Architecture
STR 20-53 | STR 290-473
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

Phage
name: Shewanella phage MSO-5 [NCBI]
taxonomy ID: 3138417
lineage: Viruses >

Host
name: Shewanella oneidensis MR-1 [NCBI]
taxonomy ID: 211586
lineage: Pseudomonadota > Gammaproteobacteria > Alteromonadales > Shewanellaceae > Shewanella > Shewanella oneidensis

Coding sequence (CDS)

GenBank protein accession
WZC35867.1 [NCBI]
GenBank nucleotide accession
PP551579.1 [NCBI]
CDS location
range 41403 -> 44225
strand -
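
The CDS range can be checked against the protein length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the CDS ends with a single stop codon; the function names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by a CDS range, excluding one stop codon."""
    length = cds_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3 - 1


if __name__ == "__main__":
    print(cds_length(41403, 44225))               # 2823 nucleotides
    print(expected_protein_length(41403, 44225))  # 940 residues
```

Under these assumptions, 2823 nucleotides correspond to 941 codons, that is, 940 residues plus the stop codon, which agrees with the protein length of 940 AA given in this record.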
CDS
ATGAAAGGATATTCTTTCTCTGAGTTGGTGAAGTTAGCGGAAGCTGCTTTGCCAGCTACAGGTACGGCAGTGGCAGCTAACAAGCTTGCTACCGCGCGTACAATTGGTGGAGTTCCCTTCGACGGTACCGCCAACATCAACCTACCAGGTGTCAACACGACAGGTAATCAGAACACCACTGGTAGTTCTGCGTCGACAACTGGTAACGCTGCAACAGCGACTCGGTTCCAAACCCCCCGCAAAATTAACGGGAAAGTGTTCGACGGTACTGCTGACGTAACACTCAACAACGCAGATGTAGGTGCAGCCCCAACTAACCACAGCCACACCCCTGCGCAAGTTGGCCTCCCGAACTTGTCAAATAACGCGAACGGACTACTCGCAGTTTCAGGAGTAGATAGTACCAACAGTCAAGCCATTTTGTTTACTGGCGTCAACCTGAACTTGGTTGCGAGTGCAGTGCGACAAGGGATTTACAGCGTGGCAAATACTCGTTGGGAGTGGGCATTCTCCAGCGGAGTGCTGGAAGTTGGCACCGTTCCTTGGGCACGCCTCTCAGGTATCCCCGCCTACGCTCAACGGTGGCCAACCTACCAAGAAGTCGGCTTAGGCAACTTAACTAACGAGCGCCAGCAACGCGCGCACCTCGGGACAGTTGGGTTTAACACCGACTGGAACTTGCTAAATTTAGCGGGGACATACACAGTCGGGGACTCTGGGAGTACCCCGAACTCCGTTCCAGGGAATGCGTACAAGTACGGCACGGCTGTCAGATTTGGGGCTGACGGTAATGCAAACCACGTGCAGGTATACTACCCACACAACGCTACAGCTGCAAGCACCCCCTCACGAGCAGATCTCTTGCCTCGGTTCCGCTCGGGGTTCAATACATGGGAATCCGCCGCGTGGCAAACACTCCCTACCCTAGAATATCTGACCGCGCAAGGGTATTACCACTCTGGCAACAAGCCAACAATGGAGGACGTAGGACTAATACCACGGGACATCCCTCGAAACAGCTTGTCTCTTGGTGTTGTGGACCTTAACAACATTACAGGGCCAGGGTTCTACTCTCAGCAGTCAGATGCGAACGCGAACGCAGGGAGTAACTACCCAGTCAAGCTTGCAGGATCCCTACGTGTTACCACAGGTGCAGGGATTGTGCAAGTGTACACCCTATATGGGAACAACGTAGAGACCTATGAACGAGCAATGTATAACGGCAACTGGTCGCCTTGGTACCGCCTGTATAGCACGAAGAAGCCTCCTACCCGCGCGGAAATCGGGCTAGACACGATTCATGACGGAAGCACCACCCAGTGGACAAACACTACCATGAATGGTAAGAAGGGCGGCTACGCGGGGATTTATTTCTCCGCCGCAGAAGCTTCTCTGATGATCGGGGAAGGCGCGCAGAAGGGCGGTTGGGGCATTTATGACAACGTCACCAATAAATGGAATATGTATTGGGTTAATGATGTTATGCAGCAAGGTCAAGTTCCTTGGGCACGCCTAACTGGGATTCCTGCAACAGCATCCCGCTGGCCGACACGCGCGGAAATCGGCCTCAACCAAGTGTACGACGGGACAGAGAACAACTGGGGTTCGATCACAGTCACTGGGGCGAAAACCGACTACGCGGGGATTCACTTCAAGGACATGAACCGCAACTTCATGGTACAAGCTGCTAACCAAGGGATCTTTGACCCAACCTCGGGGCAGTGGGACTGGAGGTTTAACAACGGTGTACTAGCCGCTGGGTATGTCCCTTGGGGCAGAGTTACAGGCCGTCCTGCAATGGCGGATCGTTGGCCGACTGCAGCGGAGGCTGGCGCGCTTGATCGATTCTTAGCTGCGCAAGAGTTTGGTGGCGGGTCGGTAATCACCACTGACCAGTTCCTAACGATACTCGAGCAAAAAGGAGCTTTCAACAAAGGATACTGGGCTGGGCGCGGGACTTGGTCCTATGCCAGCAACAGCTATGTCAATACTGGTATCGG
AACTATCCACTTAGCAGGAGCAGTCATCGAGGTTTGGGGGGGCTGGAGAGAGGAATGTACCGTTAAAGTTACGCTTCCAACTACTCAAGGCGGGCAAGGGACTTACACGCACCGAGAATTCACGTACGTTAACAACGGTTCTACGTACAACCCAGGTTGGCGTGCGGACTTTAACACCTCCACTGGTGTACCTTGGGATAGCGTTACTGCGAAACCTGCAACCGCAACTCGCTGGCCAACTCCCGCAGAAATTGGCCTCGGTAACGTAGTCAACAAAGGTTGGAACTACGGGGTCGCTGCCGACACATACGCTGTGCGAGACGCAAACGGTGACCTGACCGCTCGCCAGATGCACGCAACGGAGTTCTACGGAAGCGGTTACAACCTAACCAACCTCCAGTGGGCGCGGCTAGTAGGAGTCCCCGTGACAGCTACACGTTGGCCAACTTGGAACGAGGTGTCAGGTAAACCTGCGGATGTCGCGGACTGGGTAACCCCGTACAACTCGGCAGCAGGCTCCGCCACTAACTCGTGGCCACTCAACGCTGCCGTGCTGAACCACCTACGAGCAGGTGGCATGGTAGAGTTCCGTGTCTGGTTCAACCGCAGCAACGCACAACGCTCAGAGCGTTTCATGCGTCACGCAGCTGTAGACGGCGCGGATGGTTGGACTGGCACGCAATGGGATGCGTTTACTGATTGCGGTTGGCCTGGAGTATCTTCCTGCATCATCCGTAACGGCGCGTCTCTAGACGTCCAGTACCGGAACACGATCACGTCACAGTACTCCAACATTACGCGGGTCGACTACCGCCTCTTGTAA
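
The listed CDS begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the feature lies on the minus strand. A minimal sketch for translating it with the standard genetic code follows; the choice of the standard code table and the helper name `translate` are assumptions, not something the record specifies.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    # Remove any whitespace or line breaks inside the sequence.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 27 nucleotides of the CDS listed above.
    print(translate("ATGAAAGGATATTCTTTCTCTGAGTTG"))  # MKGYSFSEL
```

Translating the complete CDS this way should reproduce the protein sequence listed earlier in the record, which is a quick consistency check between the two fields.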

Genome Context

Tertiary structure

PDB ID
2030120472680651e283256af793ace84adb88e72983b5c13188d1661251e796
Source ESMFold
Method ESMFold
Resolution 0.5886
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
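
The confidence legend above maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping is shown below. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so assigning them to the lower band is an assumption, as is the function name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band from the legend.

    Scores exactly on a threshold fall into the lower band (assumption).
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.5, 42.0):
        print(value, plddt_band(value))
```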