Protein

GenBank accession
AJT60503.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF (evidence: RBPdetect, probability: 0.52)
TF (evidence: GenBank, probability: 1.00)
TF (evidence: Phold, probability: 1.00)
Protein sequence
MPPAIIGAAIALGASAAAAASIISATTALVIGIAATAAGALLTKTPNMNFDAYKGQQERKQVLRAATASRSVVYGTTVASGLLAFAEEEAGDQDEGEWMHLVVVLASHKLEGIETIWLGDDGIDWFGENATWEFHNDRQTVDPFMLKHCPSWKEDMIGKGIAWLRLSLKFDAEKFPSGLPNVKVLKKGRRVYDPRSGQTVFSDNAALVILDYFRTYLKRKDENINWDQFKEAANICDEFVTNADNTTERRYRINGEFEVDEAPAKILDAMLEACGGELTYIGGKHGLLVGAYYGPATMTLDESCIAGDIKIIPETSYKERTNTITGTFVDPKQTYAEADFPPVVVKEWVEKDGGEITQDMDFRFVTSEYQAQRLANIILRRKRVGRTIEVPCNMKGYKFRPGMYVNVTITNIGMKNVEMRVTKWSFDPKGGINLVLRQDFLEMWDDAIGKPMERPDLVDLPSGAIAQPQNLQYQVLQISDVVQGVLTWSNIGQVAYNRVAVRQGGTTVWTAQVPGQNVRVTGLLRGAYTAHVQAVAYSGAISPEAYLQFNIQAPPPPTSVEVQQGYFAISLIPKSADIANVSTQYDFWTSGETRLSSVSTDVVEREATRKGMGTTWTSEGLKNDHTYYWYVRTINAFGSSAFVEVAALCFTTATDLMPQIDSEFKKTETYKELTTEITGVKDGVTQLSQTVVETEQRLSQSVSNVQQSVITLDGVVQDQGVTINEQGQLIDAQGKLIDTQGKTIETVSATVQETSKAVVDLEGNVNAQWGAKVQVDSKGQKYVAGMQLGMEGSGGAVQSYFMVSANNFAIYNPTNSVAELAFAVKNGQVFMKAAFIENGTIDSAKISNQIQSTNYASNSAGWMINKNGSAQFNNVTIRGTVYASNGSFTGTVNATSGKFKGTVEATSFVGDVANMCVIAESAIPNTQTTSSRTWTKTFKDSSGSTLSKDFVLLLSYSLTAYTSNQASRIIVTANIGGTSITRRIERAANGPAMDVTIPLAVKGKTASSVTISVKEEYTNVYGSYLRPSVILMTRGTGSWS
Physico-chemical properties
protein length: 1040 AA
molecular weight: 113251.44270 Da
isoelectric point: 5.37796
aromaticity: 0.08750
hydropathy: -0.18808
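
The record does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming the common definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). Under that assumed definition, an aromaticity of 0.08750 over 1040 residues would correspond to 91 aromatic residues. The function names and the 20-residue example prefix are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    return sum(residue in AROMATIC for residue in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[residue] for residue in seq) / len(seq)


# First 20 residues of the protein sequence above, used only as a short example.
prefix = "MPPAIIGAAIALGASAAAAA"
print(len(prefix), aromaticity(prefix), round(gravy(prefix), 3))
```

Running the same two functions on the full 1040-residue sequence should give values comparable to those listed, provided the database used these definitions.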

Domains

Domains [InterPro]
AJT60503.1, residues 1-1040
Legend: Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, Other

Taxonomy

Phage
Name: Escherichia phage CAjan [NCBI]
Taxonomy ID: 1610828
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Escherichia coli K-12 [NCBI]
Taxonomy ID: 83333
Lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

GenBank protein accession
AJT60503.1 [NCBI]
GenBank nucleotide accession
KP064094.1 [NCBI]
CDS location
range 15154 -> 18276
strand +
CDS
ATGCCACCAGCAATAATTGGAGCAGCGATAGCTTTAGGTGCGTCGGCTGCTGCTGCGGCTAGTATTATTTCGGCTACTACCGCTCTCGTTATCGGTATTGCTGCAACCGCGGCAGGTGCTCTTTTGACTAAGACACCAAATATGAACTTTGATGCTTACAAGGGGCAGCAAGAGCGTAAACAGGTTCTGCGTGCAGCTACAGCGTCACGTTCTGTTGTATATGGCACGACTGTAGCGTCCGGACTATTAGCATTTGCGGAGGAAGAAGCTGGAGATCAAGATGAAGGTGAATGGATGCATCTTGTTGTCGTTCTTGCTAGTCATAAATTGGAAGGTATTGAAACGATTTGGTTGGGTGATGACGGAATTGATTGGTTTGGTGAAAACGCTACTTGGGAGTTTCATAACGACCGTCAGACTGTTGACCCATTCATGTTAAAACATTGTCCTTCCTGGAAGGAGGATATGATTGGTAAGGGTATTGCGTGGCTACGTTTAAGCTTAAAGTTTGATGCTGAAAAATTCCCATCCGGTTTGCCAAACGTCAAGGTGTTGAAAAAGGGACGTAGAGTTTATGACCCACGTTCCGGTCAAACTGTCTTTAGCGATAACGCTGCTCTTGTTATTCTTGATTATTTCCGCACGTATTTAAAGCGCAAGGACGAGAATATTAACTGGGACCAATTCAAGGAAGCTGCTAACATCTGCGACGAGTTTGTAACTAATGCGGATAACACTACGGAAAGACGCTACCGCATTAACGGTGAGTTTGAAGTGGACGAAGCACCTGCTAAGATCCTTGATGCTATGCTTGAAGCTTGCGGCGGGGAACTAACCTATATTGGCGGTAAGCATGGTCTGTTAGTTGGTGCGTATTACGGTCCAGCCACCATGACTTTGGATGAAAGCTGCATTGCTGGGGATATCAAAATAATCCCTGAGACGTCTTATAAAGAGCGTACCAATACCATTACAGGTACATTCGTCGACCCGAAGCAAACATACGCTGAAGCAGATTTCCCTCCAGTTGTGGTTAAGGAATGGGTGGAAAAAGATGGCGGTGAAATAACGCAAGACATGGATTTCCGCTTTGTTACAAGTGAATATCAAGCTCAACGTCTTGCTAATATTATTTTACGTCGTAAGCGTGTTGGTCGAACAATTGAAGTGCCGTGTAACATGAAGGGGTATAAATTCCGCCCCGGCATGTATGTCAACGTAACCATTACAAATATCGGAATGAAAAACGTTGAGATGCGCGTTACTAAATGGTCATTCGATCCTAAAGGTGGGATTAATCTTGTTCTTCGTCAAGACTTCCTGGAAATGTGGGACGACGCTATTGGTAAACCAATGGAGCGCCCGGATCTGGTAGACCTTCCGTCTGGTGCTATTGCTCAACCGCAAAACCTTCAGTATCAGGTATTGCAGATTAGCGATGTTGTGCAGGGTGTTTTAACGTGGAGCAATATTGGACAGGTGGCATATAACCGCGTAGCCGTGCGACAGGGAGGTACAACGGTTTGGACCGCACAAGTCCCTGGCCAAAACGTTCGCGTTACTGGGCTGCTGCGTGGCGCGTATACAGCGCATGTTCAAGCGGTAGCTTATAGCGGGGCAATATCGCCGGAAGCATATCTTCAATTTAATATTCAGGCTCCGCCACCACCAACATCTGTTGAAGTGCAACAGGGTTACTTTGCTATAAGTTTGATCCCAAAATCTGCGGATATTGCGAACGTTAGCACACAATATGATTTCTGGACGTCCGGTGAAACAAGGTTGTCTTCCGTGTCTACAGACGTTGTTGAGCGTGAAGCAACTCGTAAAGGTATGGGCACAACCTGGACTTCGGAAGGTTTGAAGAATGACCACACTTATTACTGGTATGTGCGCACTATTAACGCTTTCGGATCTTCCGCATTTGTAGAAGTAGCTGCATTGTGCTTTACGACAGCTACAGATTTGATGCCACAAATTGATTCTGAGTTTAAGAAGAC
GGAGACTTACAAAGAGCTTACTACTGAAATCACAGGCGTTAAAGATGGAGTTACTCAGCTTAGCCAAACTGTGGTAGAAACTGAACAGCGTTTAAGTCAAAGCGTTAGCAATGTTCAACAAAGCGTTATCACTTTGGATGGTGTTGTTCAGGATCAGGGCGTTACCATTAACGAACAAGGACAGCTCATTGATGCACAGGGTAAGTTGATTGACACTCAAGGTAAGACTATTGAAACGGTTAGCGCGACCGTGCAGGAGACAAGTAAAGCAGTTGTAGATCTGGAAGGTAATGTTAACGCTCAATGGGGTGCTAAAGTACAGGTAGATAGTAAAGGGCAAAAATACGTTGCTGGTATGCAGTTAGGTATGGAAGGGTCCGGTGGAGCTGTTCAGTCTTACTTCATGGTAAGCGCTAACAACTTTGCTATCTATAACCCGACAAATAGCGTAGCAGAACTAGCATTTGCTGTTAAGAACGGTCAAGTGTTCATGAAGGCTGCGTTTATTGAAAATGGTACTATTGACAGCGCTAAAATTTCAAATCAAATTCAGTCAACAAACTACGCAAGCAATAGCGCGGGTTGGATGATTAACAAGAATGGTAGCGCTCAGTTTAACAACGTAACGATAAGAGGTACTGTATACGCTAGCAACGGATCTTTCACAGGCACAGTGAACGCAACAAGCGGTAAATTCAAGGGTACTGTGGAAGCCACCAGTTTTGTGGGTGATGTTGCGAATATGTGTGTCATTGCTGAGTCTGCAATACCAAACACACAAACTACAAGTTCAAGAACCTGGACAAAAACTTTCAAAGACTCATCTGGTTCAACATTATCTAAAGACTTTGTGTTGCTATTAAGTTATAGTTTAACCGCATATACATCAAATCAGGCGAGTCGTATAATTGTGACTGCTAACATAGGGGGGACATCAATTACAAGGAGGATTGAGCGAGCTGCCAATGGACCTGCGATGGATGTAACCATACCTTTAGCAGTGAAAGGAAAGACAGCGTCGAGCGTAACAATAAGCGTTAAGGAGGAGTATACGAACGTCTATGGGAGCTACTTGCGCCCGTCTGTAATATTAATGACTCGCGGAACTGGTAGTTGGTCGTAA
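
The CDS coordinates and the protein length can be cross-checked with a little arithmetic. A minimal sketch, assuming the range is 1-based and inclusive (the GenBank convention) and that the CDS ends with a single stop codon; the function names are illustrative. Translation uses the standard genetic code, whose codon assignments match the bacterial table (translation table 11).

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


n_nt = cds_length(15154, 18276)   # range given under "CDS location"
n_aa = n_nt // 3 - 1              # subtract the assumed terminal stop codon
print(n_nt, n_nt % 3, n_aa)

# First 30 nucleotides of the CDS above, used only as a short example.
cds_prefix = "ATGCCACCAGCAATAATTGGAGCAGCGATA"
print(translate(cds_prefix))
```

The range spans 3123 nucleotides, that is 1041 codons, which agrees with the listed protein length of 1040 AA plus one stop codon.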

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
b4417b24f238ed8e76f57144aa68298251f5079e5ae1a5053f820182c008986b
Source: ESMFold
Method: ESMFold
Resolution: 0.7257
Evidence: 0.7257

Literature

Title: Complete Genome Sequences of Four Novel Escherichia coli Bacteriophages Belonging to New Phage Groups
Authors: Carstens, A.B., Kot, W. and Hansen, L.H.
Date: 2015
PMID: 26184932
Source: GenBank