Genbank accession
YP_009208178.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,88
TF
Evidence RBPdetect2
Probability 0,84
Protein sequence
MIVYNNQAPDAVNNVGQFGATEGSIGAYKQAAEYAADSKYWALLAESKFGTIDDLIAEVERLYQQGVLMKQDIEDLKQDFKDQDARLMTLIAQTNAAVSDANNAVALINQKLIEVQNQLDVLLGMSVDVTTLPPGTPATGSFNPNTGVISLGIPEGEPGKDGSVKDLDTAPTGVPELGDLGFYVDKDDNTVHKTTLDNIANLIPSVRSVSINGGPALDGEVALTLNKETVGLGNVLNVAQYSRQEINDKFDKTTKTYQSKAEADADAQYRQVGEKVLVWEATKYEFYTVAANKTLTPVKTEGRILTVNSRSPDSSGNIDITIPTGNPSLYLGEMVMFPYDPSKNISYPGVLPADGRLVSKESALDLGPSLVSGQLPVVSETEWQAGSKQYFSWGKLADGITDADSTNFINIRLPDWTGGEAIRAPDSDKDSQYNGSVQAQKPYVVTVNNQAPDEITGNVTLSRSILGAASSGANSDITSLTGLTTALSIAQGGTGGKTPSEARANLNLERFQQDNSQTLIYSPDYARRVYVDNTGGSWGCQNVTDGGFIALGIPQGGTGAKDAAGARSNLGLGSVSTLNNIPVANGGTGATTAAGARSNLGLGSVSTLDNVPIANGGTGAADAAGARFNIGALSSTPANTGVGGTGNRVQHASGNGLFTLDMFNCYWYMQPEDTNFWVAHSVSYAGSGGEASGYGRITYAIKIADGTTKYVHCLTNKNTITDVSGFIKAASPVVNIYANGRYETNDESEGVNVIRQGVGEYLITGCLGLNADAKWGGIDGGFEIPIDRNKQPRVWLDYEVKEDGSILVKTYHRTHSTSPVFARNELEGFSDGDPIDIPADAFVSVRVEMPSN
Physico‐chemical
properties
protein length:852 AA
molecular weight: 90050,61520 Da
isoelectric point:4,59523
aromaticity:0,07277
hydropathy:-0,28955

Domains

Domains [InterPro]
Coil
Unmapped
73–93
DC_1205
STR
574–611
YP_009208178.1
1 852
Architecture
STR
STR 183-852
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage 172-1
[NCBI]
1598146 Uroviricota > Caudoviricetes > Mktvariviridae > Kuravirus > Kuravirus kv1721
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009208178.1 [NCBI]
Genbank nucleotide accession
NC_028903.1 [NCBI]
CDS location
range 19375 -> 21933
strand +
CDS
ATGATCGTTTATAATAACCAAGCACCTGATGCAGTGAATAACGTTGGGCAGTTTGGTGCTACTGAAGGCTCTATCGGAGCTTACAAGCAAGCAGCAGAATACGCAGCTGACTCCAAATATTGGGCACTGCTAGCGGAATCTAAGTTTGGTACAATTGATGACTTGATCGCTGAAGTAGAACGTCTGTATCAACAAGGTGTTCTGATGAAGCAGGATATTGAAGATCTTAAACAAGATTTTAAAGATCAAGATGCTCGTCTTATGACCTTGATTGCCCAGACTAACGCAGCTGTTTCTGATGCGAATAACGCTGTTGCTTTAATCAATCAGAAACTTATTGAAGTCCAGAATCAGCTTGACGTTCTGTTAGGAATGTCCGTTGATGTAACCACACTTCCTCCGGGAACTCCGGCTACTGGTTCTTTTAATCCTAACACTGGTGTAATCTCTCTAGGTATCCCAGAAGGCGAACCCGGAAAGGATGGTTCTGTTAAGGATTTAGACACAGCTCCTACTGGTGTTCCAGAGCTAGGTGATTTAGGTTTCTATGTTGACAAAGATGACAATACCGTCCACAAAACTACTCTAGATAACATTGCTAACTTAATCCCATCTGTTCGTTCTGTCTCTATTAACGGCGGTCCAGCTCTTGATGGAGAGGTTGCTCTAACACTTAACAAAGAGACGGTAGGTTTAGGAAATGTTCTGAACGTCGCTCAGTACAGTCGTCAAGAGATTAATGACAAATTTGACAAGACTACCAAGACATACCAATCAAAAGCAGAAGCTGATGCTGATGCTCAGTATCGTCAAGTAGGTGAGAAAGTTTTAGTTTGGGAAGCTACTAAGTATGAATTCTATACTGTTGCTGCTAACAAAACACTGACTCCTGTTAAAACTGAAGGTAGAATTCTTACCGTTAACTCTCGCTCTCCAGACTCAAGCGGTAATATCGATATCACGATTCCAACAGGTAACCCGTCTTTGTATCTTGGTGAAATGGTAATGTTCCCTTACGACCCATCTAAGAATATCTCCTACCCAGGAGTTCTTCCTGCTGATGGTCGTCTGGTATCAAAAGAATCTGCTCTCGACTTAGGCCCATCTCTTGTCAGTGGTCAGCTTCCTGTAGTCTCTGAAACTGAATGGCAAGCAGGGTCTAAACAATATTTCTCATGGGGTAAGCTAGCAGATGGGATTACTGATGCGGATTCTACTAATTTCATCAATATTCGACTTCCTGATTGGACTGGAGGGGAGGCAATAAGAGCACCAGATTCTGATAAAGATTCTCAATACAATGGGTCTGTACAGGCTCAGAAACCTTATGTTGTCACGGTAAATAACCAAGCTCCTGATGAGATTACAGGGAACGTAACCCTCTCCAGATCTATCTTGGGAGCAGCTTCTTCTGGTGCAAACTCTGATATAACATCCCTGACAGGACTCACTACAGCTCTCTCTATCGCTCAAGGTGGTACAGGAGGGAAAACTCCATCTGAAGCTAGGGCAAACCTAAATCTTGAAAGATTTCAACAAGACAACTCGCAGACTTTGATATATTCTCCTGATTATGCCCGTCGTGTTTATGTTGACAACACTGGTGGTTCTTGGGGGTGCCAGAACGTAACAGATGGCGGTTTTATTGCTCTTGGTATTCCTCAAGGGGGTACAGGAGCTAAGGATGCTGCTGGTGCACGAAGTAATCTCGGTTTGGGTTCAGTGTCCACACTAAACAATATACCCGTAGCTAATGGAGGTACTGGAGCAACTACTGCTGCTGGTGCTCGTTCCAACTTAGGTCTTGGGAGTGTTTCTACTCTGGATAATGTACCTATTGCTAATGGTGGGACAGGGGCAGCTGATGCTGCTGGTGCAAGGTTTAATATTGGAGCACTAAGTAGCACCCCAGCAAATACAGGTGTTGGTGGTACTGGTAATCGCGTCCAACATGCATCCGGGAATGGCCTGTTTACTTTAGACATGTTTAACTGCTACTGGTATATGCAGCCAGAGGATACTAACTTCTGGGTTGCCCACAGTGTATCATATGCAGGATCTGGTGGCGAAGCCTCTGGATACGGTCGTATAACTTACGCAATAAAGATTGCAGATGGAACAACAAAATATGTCCATTGTCTTACAAACAAAAATACTATTACCGATGTAAGCGGTTTTATCAAAGCGGCATCTCCGGTAGTTAATATTTACGCCAACGGTCGATATGAGACGAACGATGAATCTGAGGGGGTCAATGTCATTCGGCAAGGTGTCGGTGAGTATTTGATTACCGGTTGCCTAGGCTTAAACGCAGATGCAAAATGGGGAGGTATCGACGGCGGCTTTGAAATCCCTATTGACCGCAACAAACAGCCTAGGGTTTGGCTTGATTATGAGGTTAAAGAAGATGGTTCTATTTTAGTCAAGACCTATCACAGAACCCATTCAACATCCCCAGTTTTTGCTAGAAACGAGTTGGAAGGTTTTTCTGACGGTGACCCAATCGATATCCCTGCCGATGCTTTCGTATCAGTTCGTGTTGAAATGCCTTCTAATTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
29f26c283094057de6b1479e8894b6f7f555b93b608fc2932a83b36b93f7d3e5
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6247
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Completed Genome sequence of Escherichia coli Bacteriophage P172-1 Xu,J., Chen,M. and Zhang,W. 2015-10-26 GenBank