Genbank accession
WLW39271.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 0,86
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,92
Protein sequence
MAVTTKIIVQQILNIDDTKATASKFPRYTVTLGNSISSITASELVSSIEAAAKSAAAAKDSEIAAKTSELNAKNSEQEAAISAGASEASATQSATSATQSAASATKSEESAAAAKISETNAKTSEDKAKASETNAKTSETNAKTSETNAKASEDKAKASETNAAASAIAAKTSEDKAKASETNAAASASDSKGYRDGAEIFSAQAAASASAAKTSETNAKASETNAKTSETNAKTSETNAAHSAASASQSVTTIQGLKSDVEQLKIDTQGIKDTAVAETVALKSDVEQLKIDTQGIKDAAVSETTTLKDAAAASAAQASNSAIEAGQQASNAAGSAQSASTDAGRAEVAAGKAEGIISKSLLKENNLSDLLDINASRQTLLIDSLVQDGVHTFLYSHNRLYRFVIRDDGLIVLQRNATGDGISWETLPLSTAAGGTGADTPEGARYNLGVDRLFQNSVETVLYSPNKNKYLTIPADGDHWGVYDATTDQWLPLGIQFGGTGAKDANGIRNNIGLGEKHAPKFLSLNVENETENAVTANAGIYHSKLKNTQGEDIGSSQSYFETQVGVGKHTIGVFHNGLAQYYQFNENGTFSGAKSISLAPGAGIYADGNERNASQLFSIMNPPINTWTGVSRYNWYDDYAIAGLIRRGDTHVESFGIELYQAGIQSYMHKFYPDGRTHSAQYTGKTQQMGWDDPNYWGNALVWGEIIPNNDGGWAPGLSWGTQSTGGYPIRATWGLIPQGNNAWPFCSLRLRGDGNFFCNFQFQPASNDITTWSSNGNFIFQKAANSDRDLKHDIIYTDGKESYDRVMQWLPTMFKYNGSNIQRFGLIAQDLLKIDPEYVKLIPGGDIFADVIGVNDDGEEYVDRQIVVDKADDTLALDNNVIMADLACAFRYQADKVNKLEQELTELKKLVSELIKPDNS
Physico‐chemical
properties
protein length:922 AA
molecular weight: 98128,62410 Da
isoelectric point:4,93564
aromaticity:0,07267
hydropathy:-0,44751

Domains

Domains [InterPro]
DC_0608
ATT
2–170
PTHR43049
Unmapped
42–347
IPR030392
CHP
788–842
WLW39271.1
1 922
Architecture
ATT
STR
ATT 2-170 | STR 194-922
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WLW39271.1
1 922
Domain Start End Length (AA) Confidence
N-terminal 1 455 455 0,9169
Central domain 456 654 200 0,2696
C-terminal 655 922 267 0,9627
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-455
Central
456-654
C-terminal
655-922

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage SUT_E520
[NCBI]
3065400 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WLW39271.1 [NCBI]
Genbank nucleotide accession
OQ990040.1 [NCBI]
CDS location
range 77696 -> 80464
strand -
CDS
ATGGCAGTAACTACTAAAATTATTGTACAACAAATATTAAATATTGATGATACTAAAGCTACTGCTAGTAAGTTTCCTAGATATACAGTAACTCTAGGTAATTCTATTAGCTCTATTACTGCAAGTGAGTTAGTATCTTCTATTGAGGCTGCAGCTAAGTCTGCTGCAGCTGCAAAAGATTCTGAGATAGCGGCTAAAACTTCGGAGCTTAATGCTAAGAACTCTGAACAGGAAGCTGCTATTTCTGCTGGAGCTTCTGAAGCCTCTGCCACTCAGTCCGCTACATCTGCTACTCAATCTGCTGCATCAGCTACTAAATCGGAGGAATCAGCAGCTGCAGCTAAAATATCTGAGACTAATGCTAAAACTAGCGAAGATAAGGCAAAGGCTAGTGAAACTAATGCAAAAACTAGTGAGACTAACGCAAAAACTAGTGAGACTAATGCAAAGGCTAGCGAAGATAAGGCAAAGGCTAGTGAAACTAATGCTGCTGCTTCTGCTATTGCAGCAAAGACTAGTGAAGATAAGGCAAAGGCTAGTGAAACTAATGCTGCTGCTTCTGCTTCGGATTCTAAAGGCTATAGAGATGGGGCAGAAATATTCTCTGCACAAGCCGCCGCATCAGCATCCGCTGCAAAAACCTCGGAAACTAATGCAAAAGCTAGTGAAACCAATGCAAAAACTAGTGAAACCAATGCAAAAACTAGTGAAACCAATGCAGCCCATTCCGCAGCCTCAGCTAGCCAATCAGTAACTACTATTCAAGGCTTAAAATCAGATGTTGAACAGCTAAAAATAGATACACAAGGTATTAAGGATACTGCAGTAGCAGAGACAGTAGCTTTAAAATCAGACGTTGAGCAGCTAAAAATAGACACACAGGGAATTAAGGACGCTGCTGTGTCCGAAACTACCACGCTAAAAGATGCTGCTGCTGCTTCCGCTGCACAAGCAAGCAATAGCGCTATCGAAGCAGGACAACAAGCTAGCAATGCTGCTGGTAGCGCACAAAGCGCATCTACAGACGCTGGGCGCGCGGAAGTGGCTGCTGGAAAAGCTGAAGGTATCATTAGTAAATCGTTATTAAAAGAAAATAATCTTTCAGATCTGTTGGATATCAATGCTTCTCGTCAGACACTCCTTATAGACTCACTTGTGCAAGATGGGGTTCATACTTTTTTATATTCACATAATAGATTATATAGATTTGTTATTAGAGATGATGGTCTTATTGTACTTCAGAGAAATGCAACAGGTGACGGAATTTCATGGGAAACTCTTCCATTATCAACTGCGGCTGGGGGTACAGGAGCAGATACTCCTGAAGGTGCTAGATATAATTTAGGAGTTGATAGGCTGTTTCAGAATTCTGTAGAAACAGTTTTATATTCTCCAAATAAAAACAAGTATCTAACCATTCCAGCTGACGGGGATCATTGGGGCGTTTATGATGCTACCACCGATCAATGGCTACCACTAGGTATTCAATTTGGTGGTACAGGTGCTAAGGATGCGAATGGTATCAGAAATAATATAGGGCTTGGTGAGAAACATGCGCCAAAGTTCCTTAGTCTTAATGTTGAAAATGAAACGGAAAATGCAGTTACTGCAAACGCAGGTATATACCATTCAAAACTAAAGAATACCCAAGGCGAAGATATTGGTTCTTCTCAGTCGTATTTTGAGACTCAAGTAGGCGTAGGAAAGCATACAATAGGCGTATTTCATAATGGGTTAGCACAGTATTATCAGTTTAATGAAAATGGTACTTTTTCGGGAGCGAAAAGTATTTCCTTAGCTCCTGGGGCTGGTATTTATGCAGACGGGAATGAACGCAATGCGTCACAACTATTCTCAATCATGAATCCACCAATAAACACATGGACTGGTGTAAGTCGTTACAACTGGTATGACGATTACGCTATCGCTGGTTTAATCAGAAGAGGCGACACACACGTTGAATCTTTTGGAATAGAATTGTATCAAGCAGGGATACAGTCCTATATGCACAAGTTTTACCCTGACGGCAGAACTCATTCTGCTCAATACACTGGCAAAACCCAACAAATGGGCTGGGATGATCCTAACTATTGGGGTAATGCTCTTGTTTGGGGCGAGATTATTCCTAACAATGATGGTGGATGGGCGCCTGGCCTTTCATGGGGTACACAATCAACTGGTGGTTATCCAATTCGTGCAACTTGGGGTCTTATCCCACAGGGAAATAATGCTTGGCCTTTTTGCTCGTTAAGACTTCGTGGTGATGGTAATTTTTTCTGTAACTTCCAGTTTCAACCAGCCTCAAACGATATAACCACATGGTCATCAAATGGAAACTTCATTTTTCAAAAGGCCGCTAACTCTGACAGAGATTTAAAGCACGATATAATCTATACAGATGGAAAAGAAAGCTATGATCGCGTTATGCAGTGGCTTCCGACAATGTTTAAATACAATGGAAGCAACATACAGCGATTTGGCTTAATAGCCCAAGACCTTCTAAAAATAGACCCGGAATATGTAAAATTAATTCCTGGTGGAGATATTTTTGCAGATGTTATAGGTGTTAACGATGACGGTGAAGAATATGTAGACCGCCAAATTGTTGTTGACAAAGCAGATGACACATTAGCACTTGATAACAACGTCATTATGGCAGATCTAGCTTGCGCATTTCGTTATCAAGCTGATAAAGTTAACAAACTTGAGCAAGAATTAACCGAACTGAAGAAACTTGTTAGCGAGCTTATTAAGCCAGATAACTCATAG

Genome Context

Genome Context

Tertiary structure

PDB ID
8209ec75cf164b4a1bdb376d573c67e259dfdade016e23c5b38485b0dec98899
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6033
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50