Protein
View in Explore- Genbank accession
- WWQ70493.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
-
TFTF
- Protein sequence
-
MSSGGGKAKTPTLLNDNLFHKQFYRVLDILSEGPIFGVVNQTAPLNSVMLNDTPITDASGNTSVPGVSLAWRNGTADQSPINGFNAIESTVIVNAKVTHDTPIIRTISDPNVNRVRLNLGVDSLVRSDDKGNQYNTSVMLMVDVKPSSSSTWSLVKDITIGPGKQSGEYLEAHIINAPDEKPFDIRVRRVTPDSTGDLLHNDTRWSSYSEIIDDNLSYPHTAVAGAVIDHDQYTDTPTRTYHLRGLIVDVPDNYDPETRTYSGLWLGGFKKAYTNNPAWLFRYLVKNERFGLARHAGYIDVDDGALYTLSQYCDQLVSDGYGGFEPRMTLNAYITEQMSARDLLDNIAGMFRGIALWDGQRLTVMIDAPQDPIATITNANVVDGAFTRSSIARAESYNAVIVSWTDPENGWEQSKEYVADDELIARDGYNETTLEAFGCTSRGQAYRAGKWLIETAKREPSKFTFKMARDAIHFTPGDIIEILDNNRAGARLGGRIVANSGKVITVDKVDSEYIAAGDTISLLDSDGKFKKHQITGVNGNNITLATAPAWIRNGTVFAVSTEAAKPVLCRIISVAETENNSVYTIEAAQHDPHKQAVVDEGAIFEVNNDTLNHFRVPNIENLKVLNVGSETVQCRATWETQTTTHRLTFEIRVYNAEGRVVAAYETTNYRYDFYGLDAGSYTLGIRGRNDTGMKGAESIVDLVIGAPAAPIGVNWVPGVFQATVYPISKTTLTTDTAYEFYFSGENQITDPSKVTTLAQFTGRGYQWTFGGMNTGHTYYVYVRTRNAFGVSDFVESSGKPTENFDEISDYVTKDVINSEQFKGMISDIKDLGDRADLIESATNDLKTATDNLKTATDNLTNITDDLMVDTDNLTNITDELRTDTDGLITETGAIKADTDTLKKQTEDLYKKVGENADDIGQHEARIDSLEVSSEKVGSELAQAKASLQNASLALINNSLAQTNTRVTLTAQYKKGRTETKAEIDRIDNVIAEEKKATAESLETITAEMNTMDSNLKGQISSVERAVADEASARAEAINGVNASISNLDKKTDASINRLDQAIADETNARTQAVSDVNASISSLDKKTDASVKRLDQAIADETSARTEAISGVNASISTLDSKVTSNVTRIDKAIADETKARTDAISSLNSSLTSTINSKVSEVSTALSTHEASSAEKFSQISASFESVNSSITEWSQSMATADEALSTKIDQLTVTVNGNKTAIETTSNALTDFKGNVDASYSIKLATDNNGMKYATGMSLGLTGNGTNFQSQCIFLVDRFVLMTSANGTYTTPFYVTNGAMYVKEAFIKDGSIDTAKIAQQIQSTVFVQDSKGWMINKDGWAQFNEVTVRGNVIVKGNQNFDVRNSDGRVIINNNGITVNLPSGGKIIIGVW
- Physico‐chemical
properties -
protein length: 1393 AA molecular weight: 151722,58690 Da isoelectric point: 4,84123 aromaticity: 0,07251 hydropathy: -0,35628
Domains
Domains [InterPro]
IPR053171
Unmapped
1–1372
Unmapped
1–1372
DC_0129
STR
10–959
STR
10–959
IPR055385
ATT
88–214
ATT
88–214
1
1393
Architecture
STR 10-87 | ATT 88-214 | STR 215-334 | ATT 335-496 | STR 497-708 | ATT 709-811 | STR 812-1127 | RBD 1128-1392 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage mEp515 [NCBI] |
3121470 | No lineage information |
| Host |
Escherichia coli K-12 [NCBI] |
83333 | Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
WWQ70493.1
[NCBI]
Genbank nucleotide accession
PP180003.1
[NCBI]
CDS location
range 46537 -> 50718
strand +
strand +
CDS
ATGTCTAGTGGCGGCGGTAAAGCAAAAACGCCAACTTTGCTTAATGATAATTTGTTTCATAAACAATTTTATCGGGTTTTAGATATTCTAAGTGAAGGACCAATTTTTGGAGTAGTTAACCAAACAGCGCCATTAAACAGCGTAATGCTTAATGACACGCCTATCACTGACGCAAGCGGAAACACATCAGTCCCTGGCGTTAGCTTAGCGTGGCGTAATGGTACGGCTGACCAGTCACCAATCAACGGCTTCAACGCTATTGAGTCAACCGTTATTGTCAACGCAAAAGTAACTCACGACACACCAATAATCAGGACCATTTCAGATCCCAACGTTAACCGCGTAAGGCTGAATCTTGGCGTTGACTCTTTGGTAAGGTCAGATGACAAGGGCAATCAATACAATACATCCGTTATGTTGATGGTTGATGTCAAACCTTCATCGTCATCAACGTGGTCACTGGTTAAAGATATTACTATTGGCCCAGGCAAACAAAGCGGTGAGTATCTAGAAGCCCACATTATCAACGCGCCGGATGAAAAACCGTTTGATATTCGCGTTCGTCGCGTAACGCCAGACAGCACGGGAGATCTTCTTCACAATGATACCAGGTGGAGCAGTTACAGCGAGATAATCGATGATAATTTGTCTTATCCCCATACCGCTGTTGCTGGCGCGGTAATTGACCACGATCAGTATACTGATACGCCTACCCGCACCTATCATCTGCGTGGGCTGATTGTTGATGTTCCTGATAACTACGACCCGGAAACGCGCACGTATTCAGGATTATGGCTTGGTGGCTTTAAAAAGGCGTATACCAATAACCCTGCATGGCTTTTCCGGTATCTGGTTAAAAATGAACGTTTCGGGCTTGCCCGTCATGCTGGTTACATTGATGTTGACGACGGCGCATTATATACGCTTTCTCAATACTGCGACCAGTTGGTTAGCGACGGATACGGCGGCTTTGAACCTCGCATGACGCTTAATGCTTACATCACGGAGCAAATGAGCGCACGCGACTTACTTGATAACATTGCAGGCATGTTCCGTGGTATCGCGTTATGGGACGGGCAACGCCTTACCGTGATGATTGATGCACCACAAGATCCGATCGCCACCATTACAAATGCAAACGTCGTTGATGGCGCGTTTACTCGTTCAAGTATTGCCCGCGCAGAATCTTACAACGCCGTGATTGTGTCATGGACTGACCCAGAAAACGGCTGGGAGCAATCAAAAGAATACGTGGCAGATGATGAATTGATCGCCCGCGACGGTTATAACGAAACAACGCTAGAGGCGTTCGGTTGCACGTCACGCGGGCAAGCGTACCGCGCTGGCAAATGGCTGATAGAAACGGCAAAACGCGAGCCATCAAAATTCACGTTTAAAATGGCCCGTGACGCAATTCACTTCACCCCCGGGGATATTATCGAGATACTCGACAATAACCGCGCTGGCGCTCGTTTAGGCGGCCGCATCGTGGCGAACAGTGGGAAGGTGATAACGGTAGACAAGGTTGATTCTGAATATATCGCGGCTGGTGACACTATCAGTTTACTTGATAGTGATGGCAAGTTTAAAAAACACCAGATCACCGGAGTTAACGGAAACAACATCACCCTTGCAACCGCCCCCGCATGGATTCGTAACGGGACCGTATTCGCTGTGTCAACTGAAGCGGCAAAACCTGTTCTGTGTCGAATTATCAGCGTAGCAGAAACAGAAAATAACAGCGTGTACACCATCGAGGCAGCACAACATGACCCACACAAACAGGCTGTAGTTGATGAAGGCGCAATCTTCGAGGTAAACAACGACACGCTTAATCACTTCCGCGTGCCGAACATTGAAAATCTGAAGGTGTTAAACGTTGGTTCTGAAACTGTTCAATGTCGCGCTACATGGGAAACACAGACGACAACGCATCGCCTGACCTTTGAAATACGCGTATATAACGCCGAAGGCCGCGTGGTGGCAGCTTATGAAACAACGAATTACCGCTATGATTTTTATGGCCTCGATGCTGGCAGCTACACACTTGGTATTCGCGGTCGTAATGACACTGGCATGAAGGGGGCGGAAAGTATTGTTGACCTGGTTATTGGCGCGCCAGCGGCCCCAATTGGAGTTAATTGGGTTCCCGGCGTTTTCCAGGCAACAGTATACCCGATCAGCAAAACAACACTGACAACTGATACAGCATACGAGTTCTATTTCTCAGGTGAAAATCAGATCACTGACCCATCAAAAGTAACAACTTTGGCGCAGTTTACCGGGCGCGGCTATCAGTGGACTTTTGGCGGAATGAACACGGGCCATACCTATTACGTTTATGTCCGCACACGTAACGCTTTTGGTGTGTCAGATTTCGTTGAGTCATCAGGAAAACCAACTGAAAACTTCGATGAAATTAGCGACTACGTCACCAAAGACGTGATTAACTCAGAACAGTTTAAAGGGATGATTAGCGACATTAAAGATCTTGGCGACCGCGCCGACCTTATCGAAAGCGCTACAAATGACCTTAAAACTGCTACCGACAACCTCAAAACTGCAACTGATAACCTGACAAACATCACTGACGATTTAATGGTTGACACTGACAACCTGACAAACATCACTGACGAATTGAGGACTGATACGGACGGCTTAATCACAGAAACAGGGGCCATAAAAGCTGATACGGACACACTGAAAAAACAAACGGAAGATCTTTATAAAAAGGTTGGGGAAAACGCCGATGATATTGGACAGCATGAGGCAAGAATAGATTCACTGGAGGTATCTAGCGAAAAAGTTGGCAGTGAACTTGCGCAAGCAAAAGCAAGCCTGCAAAACGCGTCATTGGCGCTTATCAATAACTCACTTGCACAGACTAACACTCGCGTAACTCTTACCGCTCAGTATAAGAAAGGCAGGACAGAGACGAAGGCTGAAATTGACCGTATTGACAACGTTATTGCTGAAGAGAAAAAGGCAACGGCTGAATCACTGGAAACCATCACGGCAGAAATGAACACGATGGATTCTAACCTTAAAGGTCAGATCTCAAGCGTGGAACGCGCAGTGGCTGACGAGGCCAGCGCACGCGCAGAAGCCATTAACGGTGTGAACGCATCAATAAGCAATCTAGACAAGAAAACAGACGCCAGCATCAATAGACTTGATCAAGCTATTGCAGACGAAACGAACGCCCGCACACAGGCAGTCAGTGACGTTAACGCGAGTATCTCATCGCTAGACAAGAAAACTGACGCCAGCGTTAAACGCTTGGATCAAGCAATTGCAGACGAAACCAGTGCAAGAACTGAAGCTATCAGCGGCGTGAACGCATCAATTTCCACTCTTGATAGCAAGGTAACAAGCAATGTAACGCGCATAGATAAGGCTATTGCAGACGAAACGAAGGCACGCACTGACGCGATAAGTAGCCTTAATTCATCGCTTACCAGTACGATTAATTCGAAAGTGTCTGAAGTATCAACGGCACTTTCTACGCATGAAGCATCAAGCGCTGAAAAATTTAGCCAGATCTCTGCGTCTTTCGAATCTGTAAACTCAAGTATTACAGAATGGTCACAATCTATGGCAACGGCTGACGAGGCATTGTCAACCAAAATTGATCAGTTAACAGTAACCGTTAACGGTAACAAAACGGCGATCGAGACGACATCGAATGCGCTAACCGATTTCAAGGGTAACGTTGATGCGTCATATTCAATCAAGCTCGCCACCGATAACAACGGCATGAAGTACGCGACAGGTATGTCGCTTGGCCTGACTGGTAACGGCACAAACTTTCAATCGCAGTGTATTTTCCTCGTTGACCGCTTCGTGTTAATGACCTCAGCAAACGGCACATATACAACACCTTTCTATGTCACCAATGGTGCGATGTATGTGAAAGAAGCGTTTATTAAAGACGGATCGATCGATACTGCAAAAATAGCGCAGCAAATCCAGTCAACTGTATTTGTTCAAGACTCAAAAGGTTGGATGATTAACAAAGATGGATGGGCGCAATTCAACGAAGTTACTGTAAGGGGTAACGTCATTGTTAAAGGCAATCAAAATTTCGATGTACGCAATAGTGACGGACGGGTAATTATTAACAATAATGGCATTACAGTAAATCTTCCTAGTGGTGGTAAAATTATTATCGGTGTTTGGTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
3cffe3957645d32bfd9ae54ebcdc8fc899263002dc1abb42b84e2e7751ee76d1
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| A novel group of Nus-dependent non-lambdoid bacteriophages | Negrete-Mendez,H., Valencia-Toxqui,G., Sepulveda-Robles,O., Rios-Castro,E., Hurtado-Cortes,J., Flores,V., Cazares,A., Fernandez-Ramirez,F., Martinez-Penafiel,E. and Kameyama,L. | 2025-02-24 | — | GenBank |