Protein

Genbank accession
AGZ17541.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,95
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
Protein sequence
MADYNDKVVNAELILPEGGAGETFVDDGQFHTEDTTTVTLLGNGTSTTPLKATVIVDPASGNALVAGADGLSVGVSTISDNLLVLAGGKLYVAPPQIVIPISSKDGNTLVEVTDPGEEGLYVPPGEKGDKGDQGPQGEPGVGIYIQGILGDPSQLPPASTMQEGDTYVIGTHYWTVVNQQWVDLGDFAGPQGQDGIGLVIKGSFTDTAFLPTEGNTEGDTYIVQEQMWVWTGAADGWQPVGQVGPTGPQGATGATGPQGPKGDKGDRGEKGDQGVQGIQGLTGAQGPQGPKGDKGNDAAIVKLKGTKATTADLPAFGNAVADAWVVQTDNHVWVWTSDGAWEDIGPVQGPKGDDGDVGPQGPQGQKGDQGPTGPTGPQGPQGNDGAQGIQGPVGPKGDKGDTGLTGPVGPQGPVGPKGDRGETGYSARVLGTKGATSELPATGTSGDAWIIVPNLYVWSQSDAQWINVGPYVGPKGDKGDTGDTGPTGPQGPQGVDGPQGPKGDTGDTGPAGADGAQGPEGPMGVPLNPLGTVPTEADLPVGASHGDYYTTVDTGEGFAFTGSDWANLGVMRGPQGDQGIQGLQGVDGPQGPKGDAGPTGPTGPQGPKGDQGAGIKPLGTKASEADLPATGTEGDGWIIGTDLWVWSVTDNDWINVGGFVGPTGPAGPTGPQGPQGTQGIQGVKGDQGTLWLNFPRNPGPADGRVGDYFINKSTLEYFQKTSATVWASLGYMGGGNVYDTSSTTPQARTSSGWVDVPVLEAPADSGYYVRVNSAWKKLDRYDLLVTSSTGAMDVSVSQVFKVDGTANKTMSFTNLPANRAMTIVIVFTGSGASLTWPGNLAWSNGTAVTLGTTRTVVTILWDGTNLTGTTSLTVN
Physico‐chemical
properties
protein length:875 AA
molecular weight: 88769,27530 Da
isoelectric point:4,19826
aromaticity:0,06629
hydropathy:-0,41234

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage 4MG
[NCBI]
1391428 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli K-12
[NCBI]
83333 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AGZ17541.1 [NCBI]
Genbank nucleotide accession
KF550303 [NCBI]
CDS location
range 48314 -> 50941
strand -
CDS
ATGGCAGATTACAATGATAAGGTCGTCAACGCTGAACTGATCCTTCCTGAGGGCGGTGCAGGTGAGACCTTTGTTGATGATGGTCAATTCCATACGGAAGATACGACCACCGTCACACTTCTCGGTAATGGTACATCCACTACCCCACTGAAGGCAACTGTTATTGTAGATCCTGCTTCCGGAAATGCTCTTGTGGCTGGTGCTGACGGTCTTTCCGTTGGTGTCAGCACAATCAGTGACAACCTGCTTGTATTGGCGGGTGGAAAGCTGTATGTTGCTCCTCCTCAAATCGTTATTCCGATCTCAAGTAAGGATGGAAACACCCTTGTTGAGGTTACTGATCCAGGTGAAGAGGGGCTGTATGTCCCACCTGGTGAAAAGGGGGATAAGGGTGACCAGGGGCCGCAAGGTGAACCGGGTGTTGGTATTTACATCCAAGGTATTCTGGGAGACCCTTCCCAACTCCCACCTGCATCAACCATGCAGGAAGGCGACACATACGTTATTGGAACACACTACTGGACTGTTGTAAACCAGCAATGGGTTGACCTTGGAGACTTCGCAGGTCCACAAGGGCAGGACGGTATCGGTCTTGTAATCAAAGGCTCTTTCACAGACACAGCATTCCTGCCTACAGAAGGAAACACTGAGGGTGATACTTATATCGTCCAAGAACAGATGTGGGTATGGACTGGCGCTGCTGACGGTTGGCAACCAGTTGGTCAGGTAGGTCCAACAGGGCCTCAGGGTGCGACTGGTGCCACAGGGCCACAAGGACCTAAAGGTGACAAAGGTGATCGTGGTGAGAAAGGCGATCAGGGTGTTCAAGGTATCCAAGGGCTTACTGGCGCACAAGGACCTCAGGGTCCGAAGGGCGATAAAGGTAACGATGCCGCCATTGTTAAACTGAAAGGTACAAAAGCCACCACCGCAGATCTTCCTGCTTTCGGCAACGCCGTTGCTGATGCTTGGGTTGTTCAAACAGATAATCATGTTTGGGTGTGGACTTCTGACGGTGCATGGGAAGATATCGGTCCTGTTCAGGGGCCAAAAGGCGACGATGGTGACGTTGGTCCTCAGGGTCCACAGGGACAGAAAGGCGATCAAGGTCCAACCGGACCAACAGGACCTCAGGGTCCTCAGGGAAATGACGGCGCACAGGGTATCCAGGGACCCGTAGGTCCGAAAGGAGATAAGGGCGACACGGGCCTTACCGGACCAGTTGGACCTCAAGGACCAGTAGGTCCGAAGGGCGACCGTGGTGAAACAGGTTACAGTGCTCGCGTTCTCGGTACGAAAGGTGCCACATCTGAACTTCCTGCGACAGGAACATCTGGTGATGCTTGGATCATTGTACCAAACCTGTATGTCTGGAGTCAATCTGATGCACAGTGGATCAACGTTGGTCCATATGTAGGTCCGAAGGGCGATAAAGGTGACACTGGTGATACCGGGCCTACAGGCCCACAGGGTCCACAAGGTGTTGATGGACCTCAGGGTCCTAAAGGTGACACCGGAGATACAGGGCCTGCTGGTGCTGATGGTGCTCAAGGGCCTGAGGGTCCAATGGGTGTTCCGCTTAACCCTCTTGGAACAGTACCAACAGAAGCAGACCTTCCGGTTGGGGCATCTCATGGGGATTATTACACCACAGTAGATACCGGGGAGGGCTTTGCCTTTACAGGTTCTGACTGGGCCAACTTGGGTGTAATGCGAGGACCTCAGGGAGACCAGGGTATCCAAGGACTCCAAGGTGTTGATGGACCACAGGGTCCGAAGGGTGATGCAGGTCCTACCGGACCAACAGGCCCACAAGGACCTAAAGGAGACCAGGGCGCTGGTATTAAACCTCTAGGAACCAAAGCATCTGAAGCTGATCTACCAGCAACAGGTACTGAAGGTGACGGCTGGATCATCGGAACAGACCTGTGGGTTTGGAGTGTAACAGACAATGACTGGATCAACGTTGGTGGATTCGTAGGACCTACGGGGCCTGCTGGACCAACTGGACCTCAGGGTCCACAAGGTACTCAGGGTATCCAGGGTGTGAAAGGTGACCAAGGTACGCTTTGGTTGAACTTCCCACGTAACCCAGGGCCTGCTGATGGACGTGTTGGTGACTACTTCATCAACAAAAGCACCCTTGAGTATTTCCAGAAGACTTCTGCTACGGTATGGGCTTCTCTGGGATACATGGGTGGTGGTAACGTCTACGATACCTCCTCAACAACGCCACAGGCGCGTACAAGTTCTGGTTGGGTAGATGTGCCAGTTCTTGAAGCCCCTGCTGACAGCGGCTATTACGTTCGTGTAAACAGTGCATGGAAGAAACTGGATCGTTACGACCTTCTGGTAACTTCTTCAACTGGAGCTATGGATGTTAGCGTGTCTCAGGTCTTCAAGGTTGATGGTACAGCGAACAAAACCATGAGCTTTACTAATTTGCCTGCAAACAGGGCAATGACAATCGTTATCGTGTTCACAGGTTCTGGTGCGTCACTGACGTGGCCTGGAAACTTAGCATGGTCAAACGGTACGGCGGTTACTTTGGGAACTACCCGCACAGTTGTTACGATCCTTTGGGATGGTACAAACCTGACAGGTACAACCTCTCTTACTGTCAACTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
9ee19c695ed3bd71e8753733a5cd2a492e107ef31e6f43b7566dd78db18fb254
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6881
Evidence 0,6881

Literature

No literature entries available.