Protein

GenBank accession
XPK41954.1 [GenBank]
Protein name
tail fiber protein
RBP type    Evidence      Probability
TF          RBPdetect     0.89
TF          RBPdetect2    0.59
TF          Phold         1.00
Protein sequence
MIVYNNQAPDAVNNVGQFGATEGSIGAYKQAAEYAADSKYWALLAESKFGTIDDLIAEVERLYQQGVLMKQDIEDLKQDFKDQDARLMSLIAQTNAAVSDANNAVALINQKLIEVQNQLDVLLGMSVDVTTLPPGTPATGSFNPNTGVISLGIPEGEPGKDGSVKDLDTAPTGVPELGDLGFYVDKDDNTVHKTTLENIANLIPSVRSVSVNGGPALDGEVALTINKETVGLGNVLNVAQYSRQEINDKFDKTTKTYQSKAEADADAQYRQVGEKVLVWEATKYEFYTVAANKTLTPVKTEGRILTVNSRSPDSSGNIDITIPTGNPSLYLGEMVMFPYDPSKNISYPGVLPADGRLVSKESASDLGPSLVSGQLPVVSETEWQSGAKQYFSWGKLADGITDADSTNFINIRLPDWTGGEAIRAPDSDKDSQYNGSVQAQKPYVVTVNNQAPDEITGNVNISRSILGAASSGANSDITSLSGLTTPLSISQGGTGAKDAAGARSNLGLGSTATLNTIPVANGGTGATTVDVARSNLSIDRIDQASGESKLLSPNKETYLFVDNNGWGCYSTSAGRVGDVALSVERGGTGAKNAASARSNLGLGSVSTLDNVPIASGGTGAGDAAGARFNLGLGNSATMNTGTNSDNVLKVGDFGIGRPDGALVFDTTSQDQLLAGLDTYGLCVFRNNQQIAAPWDIWNYSSNLFFRAGDTYSMISIPFESAGKIKVFGGASGSGWKTSRTVYDTVNTTVDVNGFIKAASPIVKVFHDGSFETNEQSDGVSVKKISTGVYLISGCLGLNSDAGWGGVDGGFEIPIDRNKQPRVWLDYEVKEDGSLLIKTYHRTHSTSPAFARNELEGFSDGDPVDIPKDAFISVRVEMPSK
Physico-chemical properties
protein length: 880 AA
molecular weight: 92881.80490 Da
isoelectric point: 4.59972
aromaticity: 0.07159
hydropathy: -0.27148
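
As an illustration of how the aromaticity and hydropathy values above are commonly derived, here is a minimal Python sketch. It assumes aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not state which definitions or tools were used, so these are assumptions, and the function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name a scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Residues counted as aromatic (assumed definition: Phe, Trp, Tyr).
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence in this record.
    prefix = "MIVYNNQAPD"
    print(f"aromaticity: {aromaticity(prefix):.5f}")
    print(f"hydropathy:  {gravy(prefix):.5f}")
```

Passing the full 880-residue sequence instead of the ten-residue prefix would give values comparable to those listed, provided the database used the same definitions.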

Domains

Domains [InterPro]
XPK41954.1, residues 1 to 880

Taxonomy

Phage
Name: Escherichia phage LorandFenyves_Bas91 [NCBI]
Taxonomy ID: 3398422
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
No host information

Coding sequence (CDS)

GenBank protein accession
XPK41954.1 [NCBI]
GenBank nucleotide accession
PQ850614.1 [NCBI]
CDS location
range 7657 -> 10299
strand +
CDS
ATGATCGTTTATAATAACCAAGCACCTGATGCAGTGAATAACGTTGGGCAGTTTGGTGCTACTGAAGGCTCTATCGGGGCTTATAAGCAAGCAGCAGAATATGCAGCTGACTCCAAATATTGGGCACTGTTAGCAGAATCTAAGTTTGGTACAATTGATGATTTGATCGCTGAAGTAGAACGTTTATATCAGCAAGGTGTTCTGATGAAGCAGGATATCGAAGATCTTAAGCAGGACTTTAAAGATCAAGATGCTCGTCTGATGAGTTTGATTGCTCAAACTAACGCAGCGGTATCAGATGCGAATAATGCTGTAGCTCTAATTAATCAGAAACTTATTGAAGTCCAGAACCAGCTTGACGTTCTGTTAGGAATGTCCGTTGATGTAACCACACTTCCTCCGGGAACTCCGGCTACTGGTTCTTTTAATCCTAACACTGGTGTAATTTCTTTAGGTATCCCAGAAGGCGAACCCGGAAAGGATGGTTCTGTTAAGGATTTAGACACAGCTCCTACTGGTGTTCCAGAGCTAGGTGACTTGGGTTTCTATGTTGATAAAGATGACAATACCGTCCATAAAACTACTCTAGAGAATATTGCTAACTTAATCCCATCTGTTCGTTCTGTTTCTGTTAATGGGGGACCAGCTCTTGATGGAGAGGTTGCTCTAACAATCAACAAAGAGACAGTAGGTTTAGGAAATGTTCTGAATGTTGCTCAGTACAGTCGTCAAGAGATTAATGACAAATTTGACAAGACTACCAAGACATACCAATCAAAAGCAGAAGCTGATGCTGATGCTCAGTATCGTCAAGTAGGTGAGAAAGTTTTAGTTTGGGAAGCTACTAAGTATGAATTCTATACTGTTGCTGCTAACAAAACATTGACTCCAGTTAAAACTGAAGGTAGAATTCTTACCGTTAACTCCCGTTCTCCAGATTCCAGTGGTAACATTGACATTACCATTCCAACAGGAAACCCTTCTCTGTATCTTGGTGAGATGGTGATGTTCCCTTACGACCCATCTAAGAATATCTCCTACCCAGGAGTTCTTCCTGCTGATGGTCGTCTGGTATCAAAAGAATCTGCTTCAGATTTAGGCCCATCCCTTGTCAGCGGACAGCTCCCTGTTGTTTCAGAAACTGAATGGCAATCGGGGGCTAAACAGTACTTCTCTTGGGGCAAGTTAGCAGACGGTATTACCGATGCGGATTCTACTAATTTTATCAACATTCGACTCCCTGATTGGACTGGAGGGGAGGCAATAAGAGCACCAGATTCTGATAAAGACTCTCAGTACAATGGGTCTGTACAGGCTCAGAAACCTTATGTTGTTACTGTAAATAACCAAGCTCCTGATGAGATTACTGGTAATGTGAACATCTCCAGATCAATTTTAGGTGCTGCTTCTTCTGGAGCAAACTCTGACATTACATCTTTATCTGGACTCACAACACCGCTATCCATCTCTCAAGGTGGTACTGGAGCTAAAGATGCTGCTGGTGCTCGTTCTAATTTAGGTCTAGGCTCAACAGCTACCCTCAACACAATACCTGTAGCCAATGGTGGCACAGGAGCAACTACTGTTGATGTTGCTCGTTCCAATTTATCCATAGACCGAATAGATCAAGCCTCCGGGGAGAGTAAATTATTATCCCCTAACAAAGAAACCTACCTCTTTGTGGATAACAATGGGTGGGGCTGTTATAGTACCTCAGCTGGAAGGGTTGGTGATGTGGCTCTTTCTGTTGAAAGAGGCGGTACTGGAGCTAAAAATGCTGCTAGTGCCCGATCTAACTTGGGGTTGGGGAGTGTTTCCACTCTAGATAACGTCCCTATCGCTAGTGGTGGGACAGGGGCTGGAGATGCTGCTGGTGCAAGGTTTAATCTTGGGTTAGGTAACTCAGCCACGATGAATACCGGAACTAACAGTGATAATGTTCTTAAGGTTGGTGATTTTGGAATTGGAAGACCTGACGGCGCTCTTGTTTTTGATACTAC
TTCACAAGACCAACTTCTTGCTGGACTTGACACTTATGGGCTGTGTGTGTTTAGGAATAACCAACAAATAGCAGCACCTTGGGACATATGGAACTACTCCTCAAACCTTTTCTTTAGGGCGGGTGACACATACAGTATGATAAGTATTCCGTTTGAGTCTGCTGGTAAGATAAAAGTTTTTGGTGGTGCATCAGGCAGTGGGTGGAAAACCTCAAGGACGGTATATGATACTGTAAATACAACTGTTGACGTCAATGGCTTTATCAAAGCAGCGTCACCAATAGTTAAGGTATTCCATGACGGAAGTTTTGAAACAAACGAACAATCTGATGGAGTTAGTGTTAAGAAAATCTCCACCGGGGTTTATTTAATTTCAGGGTGTCTTGGTCTTAATTCTGATGCAGGATGGGGTGGTGTAGATGGAGGTTTTGAAATCCCAATAGACCGAAACAAACAACCTAGAGTTTGGCTTGACTATGAGGTTAAAGAGGATGGTTCTCTTTTAATTAAAACTTATCATAGAACCCATTCAACATCCCCAGCTTTTGCTAGAAACGAGTTGGAAGGTTTTTCTGACGGAGACCCTGTTGACATTCCTAAGGATGCGTTCATTTCGGTTCGTGTTGAAATGCCTTCTAAGTAA
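
The CDS location and the protein length listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the range 7657 -> 10299 is 1-based and inclusive (the usual GenBank feature convention) and that the CDS ends with a stop codon, as the trailing TAA suggests. The function names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by the range, minus the stop codon if present."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = n // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    print(cds_length(7657, 10299))      # 2643 nucleotides
    print(protein_length(7657, 10299))  # 880 amino acids
```

The result, 2643 nucleotides or 881 codons including the stop, agrees with the protein length of 880 AA given under the physico-chemical properties.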

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
bd43f666ffd6bada0eeaaff2b72aa15ca8aa002f8fd003df814061ce23d3863e
ESMFold
Source ESMFold
Method ESMFold
Resolution 0.6397
Evidence 0.6397

Literature

Title: Complete genome sequences of Escherichia coli phages Huey, Dewey, and Louie
Authors: Maffei, E., Willi, L. and Harms, A.
Date: not listed
PMID: not listed
Source: GenBank