Protein

Genbank accession
CAA6800711.1 [GenBank]
Protein name
tail fibre protein
RBP type
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,96
TF
Evidence Phold
Probability 1,00
Protein sequence
MANVIKTVLTYQLDGSNRDFNIPFEYLARKFVVVTLIGVDRKVLTINTDYRFATRTTISLTKAWGPADGYTTIELRRVTSTTDRLVDFTDGSILRAYDLNVAQIQTMHVAEEARDLTTDTIGVNNDGHLDARGRRIVNLANAVDDRDAVPFGQLKTMNQNSWQARNEALQFRNEAETFRNQAEGFKNESSTNATNTKQWRDETKGFRDEAKRFKNTAGQYATSAGNSASAAHQSEVNAENSATASANSAHLAEQQADRAEREADKLENYNGLAGAIDKVDGTNVYWKGNIHANGRLYMTTNGFDCGQYQQFFGGVTNRYSVMEWGDENGWLMYVQRREWTTAIGGNIQLVVNGQIITQGGAMTGQLKLQNGHVLQLESASDKAHYILSKDGNRNNWYIGRGSDNNNDCTFHSYVHGTTLTLKQDYAVVNKHFHVGQAVVATDGNIQGTKWGGKWLDAYLRDSFVAKNEYSLWDIKFRPAALDPRGMTLVAGAFWADIYLLGVNHLTDGTSKYNVTIADGSASPKKSTKFGGDGSAAYSDGAWYNFAEVMTHHGKRLPNYNEFQALAFGTTEATSSGGTDVPTTGVNGTGATSAWNIFTSKWGVVQASGCLWTWGNEFGGVNGASEYTANTGGRGSVYAQPAAALFGGAWNGTSLSGSRAALWYSGPSFSFAFFGARGVCDHLILE
Physico‐chemical
properties
protein length:685 AA
molecular weight: 74950,76580 Da
isoelectric point:6,22963
aromaticity:0,10949
hydropathy:-0,46496

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage T7
[NCBI]
10760 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CAA6800711.1 [NCBI]
Genbank nucleotide accession
LR745708 [NCBI]
CDS location
range 34625 -> 36682
strand +
CDS
ATGGCTAACGTAATTAAAACCGTTTTGACTTACCAGTTAGATGGCTCCAATCGTGATTTTAATATCCCGTTTGAGTATCTAGCCCGTAAGTTCGTAGTGGTAACTCTTATTGGTGTAGACCGAAAGGTCCTTACGATTAATACAGACTATCGCTTTGCTACACGTACTACTATCTCTCTGACAAAGGCTTGGGGTCCAGCCGATGGCTACACGACCATCGAGTTACGTCGAGTAACCTCCACTACCGACCGATTGGTTGACTTTACGGATGGTTCAATCCTCCGCGCGTATGACCTTAACGTCGCTCAGATTCAAACGATGCACGTAGCGGAAGAGGCCCGTGACCTCACTACGGATACTATCGGTGTCAATAACGATGGTCACTTGGATGCTCGTGGTCGTCGAATTGTGAACCTAGCGAACGCCGTGGATGACCGCGATGCTGTTCCGTTTGGTCAACTAAAGACCATGAACCAGAACTCATGGCAAGCACGTAATGAAGCCTTACAGTTCCGTAATGAGGCTGAGACTTTCAGAAACCAAGCGGAGGGCTTTAAGAACGAGTCCAGTACCAACGCTACGAACACAAAGCAGTGGCGCGATGAGACCAAGGGTTTCCGAGACGAAGCCAAGCGGTTCAAGAATACGGCTGGTCAATACGCTACATCTGCTGGGAACTCTGCTTCCGCTGCGCATCAATCTGAGGTAAACGCTGAGAACTCTGCCACAGCATCCGCTAACTCTGCTCATTTGGCAGAACAGCAAGCAGACCGTGCGGAACGTGAGGCAGACAAGCTGGAAAATTACAATGGATTGGCTGGTGCAATTGATAAGGTAGATGGAACCAATGTGTACTGGAAAGGAAATATTCACGCTAACGGGCGCCTTTACATGACCACAAACGGTTTTGACTGTGGCCAGTATCAACAGTTCTTTGGTGGTGTCACTAATCGTTACTCTGTCATGGAGTGGGGAGATGAGAACGGATGGCTGATGTATGTTCAACGTAGAGAGTGGACAACAGCGATAGGCGGTAACATCCAGTTAGTAGTAAACGGACAGATCATCACCCAAGGTGGAGCCATGACCGGTCAGCTAAAATTGCAGAATGGGCATGTTCTTCAATTAGAGTCCGCATCCGACAAGGCGCACTATATTCTATCTAAAGATGGTAACAGGAATAACTGGTACATTGGTAGAGGGTCAGATAACAACAATGACTGTACCTTCCACTCCTATGTACATGGTACGACCTTAACACTCAAGCAGGACTATGCAGTAGTTAACAAACACTTCCACGTAGGTCAGGCCGTTGTGGCCACTGATGGTAATATTCAAGGTACTAAGTGGGGAGGTAAATGGCTGGATGCTTACCTACGTGACAGCTTCGTTGCGAAGAACGAATACAGCCTGTGGGACATCAAGTTTCGCCCGGCTGCGCTCGACCCGCGCGGCATGACGCTGGTTGCCGGCGCGTTTTGGGCAGACATCTATCTGCTAGGCGTCAACCACCTGACCGATGGCACCAGCAAATACAACGTGACAATTGCAGATGGTAGTGCATCACCTAAGAAATCTACCAAGTTCGGTGGAGACGGCAGCGCGGCCTACAGTGACGGAGCTTGGTACAACTTCGCTGAGGTCATGACTCATCACGGTAAGCGCCTGCCTAACTACAACGAATTCCAGGCGCTGGCTTTCGGCACGACCGAGGCTACGTCCAGCGGCGGCACCGACGTGCCCACCACCGGCGTGAACGGCACGGGCGCAACGAGTGCGTGGAACATCTTCACGTCCAAGTGGGGCGTTGTGCAGGCGTCCGGTTGCTTGTGGACGTGGGGTAACGAGTTCGGCGGCGTGAATGGCGCATCCGAATACACGGCCAACACTGGCGGCAGAGGATCGGTGTACGCCCAGCCCGCTGCTGCGCTATTCGGCGGCGCCTGGAACGGCACGTCGCTCTCGGGTTCTCGCGCTGCGCTCTGGTACAGCGGGCCGTCGTTCTCGTTCGCGTTCTTCGGGGCGCGCGGCGTCTGTGACCACCTGATTCTTGAGTAG

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
615e5e0ed8fa102bac933a2ee745403e6a5b9c88eb3fcdb7d56c94a8be12dde6
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6551
Evidence 0,6551

Literature

No literature entries available.