Protein

Genbank accession
WNL62872.1 [GenBank]
Protein name
tail fiber
RBP type
TSP
Evidence RBPdetect
Probability 0,67
TF
Evidence Phold
Probability 1,00
Protein sequence
MKKILDSARNYLKNNSRIKTASLISLELPGSTGTSTAFIYLTDYFRDVLYNGILYQAGKVKSISTHKQNRDLSIGSLSFTITGTAQDEVLKLVQNGVSFLDRLVSIHQAIITEDGSILSVDPDTNGPLLYFRGRITGGGIKDNISTSGVGTSTITWNCSNQFYDFDRVNGRFTDDASHRGLEVVAGQLLPSNGAKRLEYQEDYGFFHANKSISILAKYQVQEERYKLKSKKKLFGLSRSYSLKKYYETVTKEVDIDFNLAAKYIPVVYGVQKIPGIPIFADTELHNPNIVYVVYAFAEGEIDGFLDFSFGDNPMICMDSNDSSARTCFGTKKVAGDTMQRIASGTPSSSPSVHGQEYKYNDGNGDIRIWTYHGKPDQTASEVLVDIAKERGFYLQNMNGNGPEYWDARYKLLDTAYAVVRFTINENRTEIPEVSAEIQGKKVKVYHSDGRVTANSTSLNGIWQTLDYLTSDRYGANITLDQFPLQQLIQEAAILDIIDESYQVSWQPYWRYVGWTDPLAENRQIVQMNTILDTSESVFKNVQGLLESYGGAINNLSGQYRVTVEKYSNTPLEINFLDTYGDLELSDTTGRNKFNSVQASIVDPALSWKTNSITFYNSKYKEQDKNLDKKLQLSFANITNYYTARSFADRELKKSRYSRTLSFSLPYQFIGIEPNDAIAFTYDRYGWDKKYFLVDEVENSREGKINVTLQEYGEDVFINSEQVDNSGNDIPDISNNVLPPRDFKYTPTPGGLVGSIGKNGELSWLPSLTNNVVYYSIVHSGHAEPYIVQQLETNPNERMIQEIIGESAGLAIFEIRAVDINGRRSSPVTLSIELNSAKNLSVVSNFRVTNTASGDVTEFVGPDVKLAWDRIPEEDIIESIFYTLEIYDSQNRMLRSVRIENQYTYDYLLTYNKADFALHNSDALGINRKLYFRIRAEGDDGEQSVEWASI
Physico‐chemical
properties
protein length:949 AA
molecular weight: 107061,22230 Da
isoelectric point:5,13463
aromaticity:0,11275
hydropathy:-0,42603

Domains

Domains [InterPro]
WNL62872.1
1 949
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage Es2
[NCBI]
3074393 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WNL62872.1 [NCBI]
Genbank nucleotide accession
OR515477.1 [NCBI]
CDS location
range 84271 -> 87120
strand +
CDS
ATGAAAAAAATACTAGATAGTGCTAGAAACTACTTAAAAAATAATAGCAGAATAAAAACTGCTAGTCTAATTTCTTTAGAATTACCTGGCTCTACTGGTACTAGTACTGCTTTTATTTACTTAACTGACTATTTTAGAGATGTACTATATAATGGTATTTTGTACCAGGCCGGTAAAGTTAAGTCTATTAGCACACATAAACAAAATAGAGATTTATCTATTGGCAGCCTATCTTTTACTATTACAGGTACAGCGCAGGATGAAGTACTGAAATTAGTGCAGAATGGTGTATCTTTTTTAGACAGATTAGTATCAATTCACCAAGCAATTATTACAGAAGATGGTTCTATTTTATCTGTAGACCCAGACACAAATGGGCCTTTATTATACTTTAGAGGTAGAATTACTGGTGGAGGTATTAAGGATAATATTAGCACTTCGGGAGTTGGAACCTCCACAATTACTTGGAATTGTTCTAACCAATTCTATGATTTTGATAGGGTTAATGGTAGATTTACTGATGATGCTTCTCATAGAGGGCTTGAAGTTGTAGCAGGACAATTACTTCCATCTAACGGGGCTAAAAGACTTGAGTACCAAGAAGACTACGGTTTCTTTCATGCCAATAAAAGTATCTCTATTCTAGCAAAGTATCAGGTACAGGAAGAAAGATACAAACTAAAGTCTAAGAAAAAGCTATTTGGACTATCTAGAAGTTATAGTCTTAAAAAGTATTATGAGACTGTTACTAAAGAAGTAGATATAGATTTTAACCTTGCTGCTAAGTATATACCAGTAGTTTATGGTGTACAGAAAATACCAGGAATACCTATTTTTGCGGATACGGAATTACATAATCCTAACATAGTTTATGTAGTATATGCTTTTGCTGAAGGAGAGATAGACGGTTTTCTTGACTTTTCCTTTGGGGATAACCCTATGATTTGTATGGACTCTAATGATAGCTCCGCTAGAACCTGCTTTGGTACTAAAAAAGTGGCTGGGGACACCATGCAAAGAATAGCGTCAGGAACACCTTCTAGTAGTCCTTCCGTTCATGGGCAGGAGTATAAGTATAATGATGGTAATGGTGACATAAGAATTTGGACGTATCATGGAAAACCTGACCAAACGGCTTCTGAAGTACTAGTAGATATAGCTAAAGAACGTGGGTTCTACCTTCAGAATATGAATGGCAATGGACCGGAGTATTGGGATGCTAGGTATAAACTACTAGATACTGCATACGCAGTGGTGCGCTTCACTATTAATGAAAATAGGACTGAGATTCCTGAAGTTAGTGCTGAAATTCAAGGTAAAAAAGTAAAAGTCTATCATTCTGATGGTAGAGTAACTGCTAATAGTACTAGTTTAAATGGTATTTGGCAAACACTTGATTACTTAACCTCTGATAGATATGGCGCTAATATTACCCTTGATCAGTTCCCTCTTCAGCAACTAATACAGGAAGCAGCTATTTTAGATATTATAGATGAATCCTATCAGGTATCTTGGCAGCCGTATTGGAGATACGTTGGGTGGACTGATCCACTAGCAGAAAATAGACAAATAGTACAAATGAATACTATTCTGGATACATCTGAATCAGTATTTAAAAATGTGCAAGGTTTATTAGAGTCCTACGGTGGGGCTATTAACAATTTATCTGGCCAGTATAGAGTTACTGTAGAAAAATACTCTAATACTCCGCTAGAGATTAATTTTCTAGATACTTACGGTGATTTGGAGCTATCAGATACTACTGGTAGAAATAAATTCAATTCAGTTCAAGCATCTATCGTAGATCCCGCCCTTAGTTGGAAAACTAATTCCATTACATTCTATAATTCCAAGTATAAGGAACAGGACAAGAACCTAGATAAAAAATTACAACTGTCTTTTGCTAATATTACTAATTATTATACTGCAAGAAGTTTTGCTGATAGGGAGCTTAAGAAATCCAGATACTCAAGAACACTCTCTTTCTCATTACCATATCAATTCATTGGTATTGAACCTAATGATGCTATTGCATTTACATACGACCGTTATGGGTGGGATAAGAAGTATTTCCTAGTAGACGAAGTGGAAAACTCTAGGGAAGGAAAAATAAATGTTACCCTACAGGAATATGGAGAAGATGTATTTATCAATTCTGAGCAGGTTGATAATAGCGGTAATGATATTCCTGATATTAGTAATAATGTCCTTCCTCCTAGAGACTTTAAATATACTCCTACTCCTGGCGGTTTAGTAGGCTCTATAGGTAAAAATGGTGAGTTATCTTGGCTTCCGAGTCTAACCAATAATGTAGTCTATTACTCTATCGTGCATTCCGGTCATGCTGAACCTTATATTGTGCAGCAACTAGAAACAAATCCCAACGAACGGATGATCCAAGAGATAATTGGAGAGTCTGCAGGTTTAGCAATATTTGAAATAAGGGCTGTGGATATAAACGGTAGAAGAAGTTCTCCAGTAACATTATCTATAGAACTTAACTCTGCTAAAAACCTAAGTGTAGTATCTAATTTTAGGGTAACTAATACGGCTTCTGGGGATGTAACTGAGTTTGTAGGCCCAGATGTAAAACTAGCTTGGGATAGAATACCAGAAGAAGATATAATAGAGAGTATATTCTATACTCTGGAGATCTATGATTCCCAGAATAGAATGCTAAGAAGTGTACGTATTGAAAATCAGTATACTTATGACTATTTATTAACGTATAATAAGGCAGATTTTGCTCTTCATAACAGTGATGCTCTAGGAATCAATAGAAAACTATATTTTCGTATTAGAGCAGAAGGAGATGATGGAGAACAGTCTGTGGAGTGGGCATCCATTTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
1d92fe0708033a1cec9dfe7d8d423564597269cdd9150a302ed14a4c0c69a7b2
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8126
Evidence 0,8126

Literature

No literature entries available.