Protein

Genbank accession
AEV89341.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TF (Evidence: RBPdetect, Probability: 0.90)
TF (Evidence: RBPdetect2, Probability: 0.96)
TF (Evidence: GenBank, Probability: 1.00)
TF (Evidence: Phold, Probability: 1.00)
Protein sequence
MAAGTLSVTNNSKAVVGVGTTFTEYKAGDFLSLVVGQVPYTVAIASIESATALTLVLPFDGPTATGLAWDGIKRDTMSLATMGVTVQAQKALRLMIADENNWRAIFGEEEEITVTLPNGQVMQGMSWGYLSQLMKQIDPVEMRNLQQQAETAKNQAVTAKGQAESARDAANTAKTGAENARNQANTARDQANTAKTGAESARDAANTAKTGAENARSQAQGYRDEAEQFKNQINPSQFMLKSQNLSDVANKDTARDNLSLGRTQRAQFEGVDLSKSDWPGLRFITTSMSPTEVGYRVVFEHDSSDNRMALYWRNGSDANGQAAVHFTAPASGQTRFIAYKEEVNLPEITGWGVAAPSQGPRATNATNLAPGLYWGTVADVGNPISGTLGMSMLQTSGSSANYRCQLVFQDSAGGAMYLRSSNSSVFGNYKQVTTSAVSDERLKTVRGNLNLEGALDNINRMDFKIFSFLSDGPERSYRRGVISQQIRQIDKQYTKEIGGYYHLDQTPMLLDALAAIKALRARDEANKAEIAELKAAIAELKK
Physico-chemical properties
protein length: 542 AA
molecular weight: 58588.78350 Da
isoelectric point: 6.11567
aromaticity: 0.07011
hydropathy: -0.41956
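
The record does not say how the aromaticity and hydropathy values were derived. A minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle score, often called GRAVY), might look like the following. The function names are illustrative and not part of the source database.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for aa in seq if aa in AROMATIC) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code such as X or B.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Passing the protein sequence from this record to both functions gives values that can be compared with the ones listed above; any mismatch would suggest the database uses a different definition or scale.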

Domains

Domains [InterPro]

Taxonomy

Phage
Name: Shigella phage EP23 [NCBI]
Taxonomy ID: 1109721
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Shigella sonnei [NCBI]
Taxonomy ID: 624
Lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Shigella

Coding sequence (CDS)

Genbank protein accession
AEV89341.1 [NCBI]
Genbank nucleotide accession
JN984867 [NCBI]
CDS location
range 22032 -> 23660
strand +
CDS
ATGGCAGCGGGTACGCTCTCCGTAACGAACAATAGCAAGGCTGTAGTAGGGGTTGGCACAACGTTTACCGAGTACAAGGCTGGTGACTTCTTATCGCTGGTGGTTGGGCAAGTGCCTTACACCGTGGCGATCGCGTCCATCGAAAGCGCAACCGCGCTCACACTGGTGTTGCCGTTCGACGGCCCAACGGCGACGGGTCTGGCCTGGGACGGCATCAAACGTGACACCATGTCATTGGCGACGATGGGCGTAACCGTCCAGGCGCAAAAAGCATTGCGATTGATGATCGCAGATGAGAACAACTGGCGCGCAATCTTCGGAGAAGAAGAGGAAATAACAGTGACGTTACCTAACGGGCAGGTTATGCAGGGCATGTCATGGGGCTATCTGTCGCAGCTAATGAAGCAGATCGACCCCGTAGAAATGCGCAACCTGCAACAACAGGCCGAGACGGCAAAAAACCAGGCTGTTACTGCGAAGGGCCAGGCAGAATCAGCGCGCGATGCGGCTAATACAGCAAAAACTGGCGCGGAGAACGCCCGCAACCAGGCCAACACCGCACGTGATCAGGCCAACACCGCCAAAACAGGCGCAGAATCAGCGCGCGATGCGGCTAATACAGCAAAAACTGGTGCGGAGAACGCCCGCAGCCAGGCGCAAGGGTATCGCGACGAAGCAGAGCAGTTTAAAAACCAAATCAACCCATCACAGTTCATGCTTAAGTCGCAAAACCTTAGCGACGTAGCGAACAAAGATACGGCGCGGGACAATCTGTCACTAGGCCGCACTCAGCGTGCGCAATTTGAGGGGGTTGACCTGTCTAAAAGCGACTGGCCCGGCTTAAGGTTTATTACAACCAGCATGTCACCAACAGAGGTCGGCTATCGCGTTGTCTTTGAGCATGATTCGAGTGATAACCGCATGGCGCTCTACTGGCGAAATGGCTCAGATGCCAATGGGCAGGCCGCCGTCCACTTCACCGCGCCTGCCAGCGGGCAGACGCGATTTATTGCATACAAGGAAGAAGTAAACCTTCCAGAAATTACAGGCTGGGGCGTAGCGGCTCCCTCGCAAGGACCGAGGGCGACGAACGCCACAAACCTTGCGCCGGGGCTTTATTGGGGTACCGTCGCTGACGTCGGCAATCCAATTTCAGGAACACTCGGCATGTCGATGCTTCAAACATCAGGATCATCCGCCAATTACCGCTGCCAGTTGGTGTTTCAGGATAGCGCGGGCGGGGCGATGTATCTCCGTTCCAGTAATAGTAGCGTTTTTGGTAACTACAAGCAGGTAACTACCTCTGCGGTGTCGGATGAGCGCCTGAAAACTGTCCGGGGAAATCTTAATCTAGAAGGTGCGCTGGATAACATAAACCGAATGGATTTTAAAATTTTCTCGTTCCTGAGTGATGGGCCGGAACGTAGCTACCGACGCGGCGTTATCTCGCAGCAGATCCGCCAGATTGATAAGCAGTACACGAAAGAGATCGGCGGGTACTATCATCTTGATCAGACACCAATGCTACTGGACGCCCTAGCGGCAATTAAAGCATTGCGGGCGCGTGACGAGGCTAATAAAGCAGAGATTGCAGAGTTAAAAGCGGCGATTGCAGAATTGAAAAAATAA
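
The CDS coordinates can be checked against the reported protein length with simple arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS ends with a stop codon, as the trailing TAA in the sequence above suggests. The helper names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by the range, optionally excluding the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = n // 3
    return codons - 1 if has_stop else codons


print(cds_length(22032, 23660))      # 1629 nucleotides
print(protein_length(22032, 23660))  # 542 amino acids
```

The result, 1629 nucleotides or 543 codons including the stop, agrees with the protein length of 542 AA listed in this record.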

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
da1147e0f6a0bcd5eda23aff10db572135eea750d731a7184a1283be4cc47379
ESMFold
Source ESMFold
Method ESMFold
Resolution 0.7737
Evidence 0.7737

Literature

No literature entries available.