Protein

Genbank accession
AEV89340.1 [GenBank]
Protein name
phage tail fiber protein
RBP type
TSP
Evidence RBPdetect
Probability 0,86
TF
Evidence RBPdetect2
Probability 0,96
TF
Evidence Phold
Probability 1,00
Protein sequence
MTIIYDVTGHKGGGAKPHTPQETPDSLHSLAKARILLALGEGEFEGIPTDNELRQRVYLDGTPIQNADRSENFPGARVEFRPGTQHQDVIHGFSAVESEQTVGVKLEYGTPWVRQINDTSLDAVRIRVGIPALYTNEDNGDLVGGRIDYKIVVYTDNADPREFRFAAVGKTMSLYERDHRIELPSNVNTGWRVEVHRLTADSTSAKVVNDIRVQSITEIIDARLRYPLTALLFVEFDAKAFQNIPRVSIKCKGRKVLVPNNYDPINHTYSGDWDGTFKRAWTDNPAWHWYDICITERFGLGRRIKPQMLNRYALYQVAQRCDQLVSDGNGGQEIRFKNDMYIQSQTDAWTVLKDLAAIFAGMTWWGNQMLNIVSDQPVAAVSHTITNASVIDGRFDYASGSQKTRYSTFAVAYGNPKNHYEDAIATGQRVELVRRHKINRLDITAIGCTRESEAQRRGHWALISNQLDQQVSFKVGMEGLFFIPGSVVAIADTNISGGFETRGGRLLSDPGTRTVLNTDSEITFRPGDKFLVRTDSGNVETREIASVNGNKVTLKTALNADPIPDQPFCVDGDDIQLQKFRITDLEYDDATSTFSVRGIEYNDSKYDAVDNGARLDPGIFTQVPDGVMKGPESVIITPSQISSQGQLITNVDIVFPPVKDAVVYEIQWRRTSLQNMEIQWGNDWVNLPRTASNGAHIPNVFSGNYQARVRAIGMGEISSPWVESAITPVEGRLGGLNAPIITNAISGLHQILWKWNHNNAATDISYTELEVRKTGETEWKFLTNVPYPGAEYAQTSLEFGIYQQLRARVADKIGNLSDWSAPFEGQVSNKVDEYMKGLDDEFLTSEDGKRFQEAIDTMPQGIYEAMLTDAQQMFNARAEYQGIYAEIKVAYNVAADAHQAVAQLETLIGTRLDEAEAAIHTLQTAQSTQEQAFARYQQTVAAKFSEQEAAIQQVQTATADVAGALAEYKTQVAAQFGQQSAAIEQKMTSSFNHSGGSATYSLKAGVTYNGTYYDAGMQLSVVTSGNAVKSRIAFKADQFYIMHPSNGTLSSAFIVDGGQVYIDTARIKNASINFAQITDTLRSDNFVAGSRGWNLPKSGNAELNNVTVRGTVYANDGEFRGTVYARDGVFTGTVEATSFVGDVANMCVIGAKAIPNTARNGSRSWSVTFNDSSNSTKLKDFVLLLSYSLVGYTASQSSRIAITANIGGKVVSRTIERAASGNAIHTTIPLAAKGVTAGSVTITVKEEYTNVYGSTLHPSVILMTRGTGSWSQ
Physico‐chemical
properties
protein length:1272 AA
molecular weight: 139929,70270 Da
isoelectric point:5,59434
aromaticity:0,08962
hydropathy:-0,33066

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Shigella phage EP23
[NCBI]
1109721 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Shigella sonnei
[NCBI]
624 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Shigella

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AEV89340.1 [NCBI]
Genbank nucleotide accession
JN984867.1 [NCBI]
CDS location
range 17858 -> 21676
strand +
CDS
ATGACGATTATCTACGACGTCACGGGCCATAAAGGCGGCGGCGCAAAACCGCACACCCCACAGGAGACGCCTGACAGCCTGCATTCACTGGCTAAAGCCCGCATCTTGCTAGCACTGGGTGAAGGTGAGTTTGAAGGCATCCCGACCGACAACGAATTACGGCAGCGCGTTTATTTGGACGGCACGCCGATCCAGAACGCCGACCGATCCGAAAACTTCCCCGGCGCGCGCGTGGAGTTCCGCCCAGGCACACAGCACCAGGACGTTATTCACGGATTTTCCGCGGTGGAAAGCGAGCAGACAGTGGGTGTAAAACTGGAATACGGCACGCCGTGGGTGCGCCAGATTAACGACACCAGTTTGGACGCCGTGCGCATTCGAGTCGGTATACCGGCCCTGTACACTAACGAAGATAATGGCGACCTGGTGGGCGGGCGCATCGACTATAAAATCGTTGTGTATACGGATAACGCCGATCCGCGCGAGTTCCGCTTCGCTGCCGTTGGAAAAACAATGTCGCTGTACGAGCGCGACCACCGCATCGAGCTACCGTCTAACGTGAATACCGGCTGGCGCGTGGAGGTGCACCGCTTAACGGCTGATTCCACCTCTGCGAAAGTGGTCAATGATATCCGGGTGCAATCCATTACGGAGATTATCGACGCCCGCCTGCGATACCCGCTAACCGCGCTGTTGTTCGTGGAGTTCGACGCTAAGGCTTTCCAGAATATCCCGCGCGTGTCAATCAAGTGCAAAGGTCGCAAGGTGTTGGTCCCGAACAACTACGACCCGATTAATCACACCTATTCGGGGGACTGGGATGGCACGTTTAAACGCGCATGGACGGATAACCCTGCGTGGCACTGGTACGATATTTGTATTACTGAGCGCTTTGGCCTCGGTCGTCGTATCAAACCGCAAATGCTAAACCGATACGCGCTTTACCAGGTTGCGCAGCGCTGCGATCAGCTGGTCAGCGACGGCAACGGCGGGCAGGAAATCCGATTTAAGAATGATATGTACATCCAGTCGCAGACCGACGCCTGGACCGTGCTTAAGGATTTAGCCGCTATCTTTGCCGGTATGACCTGGTGGGGCAATCAGATGTTAAATATCGTCAGTGACCAGCCGGTAGCCGCGGTATCGCACACTATCACAAACGCCTCGGTGATTGATGGTCGATTCGACTACGCATCCGGTAGCCAGAAAACCCGGTATTCCACGTTCGCGGTAGCATACGGCAACCCGAAGAACCACTACGAAGACGCCATCGCAACGGGCCAGCGCGTCGAACTGGTGCGCCGCCATAAGATTAACCGCCTCGATATTACGGCGATCGGCTGTACGCGTGAGTCCGAAGCGCAACGCCGCGGGCACTGGGCGCTAATCTCCAACCAGCTTGACCAGCAAGTTAGCTTTAAGGTTGGCATGGAAGGGCTGTTTTTTATCCCGGGAAGCGTAGTTGCGATCGCGGATACTAATATTTCCGGCGGCTTCGAGACACGCGGCGGACGCCTGTTGTCAGATCCAGGGACGCGTACTGTGCTGAACACGGACAGCGAAATCACGTTCCGCCCTGGCGATAAATTCTTGGTGCGCACCGATAGCGGGAATGTGGAGACTCGCGAGATTGCCAGCGTCAACGGCAACAAGGTTACGCTGAAAACAGCACTGAACGCTGACCCGATTCCCGATCAACCGTTTTGCGTTGATGGCGACGATATCCAGTTGCAGAAGTTCCGCATCACCGACCTGGAATATGACGACGCTACCAGCACTTTCTCGGTGCGCGGGATTGAATACAACGACAGCAAATACGATGCCGTTGATAATGGCGCTCGCCTTGATCCGGGCATCTTCACGCAAGTTCCTGACGGCGTAATGAAGGGGCCGGAGTCGGTAATCATCACCCCGTCGCAGATCTCATCGCAAGGCCAGCTAATCACCAACGTGGATATTGTTTTCCCGCCGGTGAAAGATGCCGTTGTGTACGAAATCCAGTGGCGGCGCACCAGCTTGCAGAATATGGAAATCCAGTGGGGTAACGACTGGGTGAACCTTCCACGCACGGCATCGAACGGCGCGCACATCCCTAACGTGTTCTCCGGCAACTACCAGGCACGCGTCCGCGCGATCGGTATGGGTGAGATTTCATCCCCGTGGGTGGAGTCCGCCATCACGCCGGTGGAAGGTCGCCTCGGCGGGCTTAACGCGCCGATCATCACCAACGCGATTTCTGGCCTCCACCAGATTTTGTGGAAGTGGAACCATAATAACGCCGCGACGGATATCTCATACACCGAGCTTGAAGTGCGCAAGACCGGTGAGACGGAATGGAAATTTCTTACCAACGTCCCATATCCAGGGGCGGAGTATGCACAAACGTCTCTGGAGTTCGGTATCTATCAGCAGTTGCGCGCCCGCGTAGCGGATAAAATCGGCAACCTGTCGGACTGGTCGGCCCCGTTTGAAGGGCAGGTAAGTAACAAAGTTGACGAGTACATGAAGGGGCTTGACGATGAGTTCTTGACTTCTGAGGATGGTAAACGCTTCCAGGAAGCGATCGACACCATGCCGCAAGGCATTTACGAGGCCATGCTCACCGACGCGCAGCAAATGTTCAACGCCCGCGCTGAGTACCAGGGGATTTATGCGGAAATCAAGGTGGCGTATAACGTGGCAGCAGACGCCCACCAGGCCGTCGCGCAACTGGAGACGTTGATCGGCACGCGCCTTGACGAAGCTGAAGCGGCGATTCACACGTTGCAGACGGCGCAGAGTACGCAAGAGCAGGCCTTCGCCCGGTATCAGCAGACTGTTGCCGCTAAGTTCTCGGAGCAGGAAGCCGCTATCCAGCAGGTACAGACGGCAACGGCGGACGTAGCTGGCGCACTGGCGGAGTATAAGACCCAGGTCGCAGCACAGTTTGGGCAGCAGTCTGCCGCTATTGAGCAGAAGATGACGTCTTCGTTTAACCACTCCGGTGGTAGCGCCACGTACAGCCTTAAGGCGGGCGTGACGTATAACGGGACTTACTACGATGCCGGTATGCAGCTTTCCGTCGTGACGTCGGGCAACGCGGTTAAATCACGCATCGCATTCAAGGCGGACCAGTTCTACATCATGCACCCGTCCAACGGCACGCTGTCATCGGCGTTTATTGTGGATGGGGGACAGGTGTATATAGACACAGCCCGCATCAAGAACGCGTCCATAAACTTCGCGCAGATCACGGACACGCTGCGGTCAGATAACTTCGTAGCGGGATCACGCGGGTGGAACCTGCCTAAATCTGGTAATGCGGAGCTTAACAATGTGACAGTCCGCGGCACCGTTTATGCGAATGATGGTGAGTTCCGGGGCACGGTTTATGCAAGGGATGGGGTATTTACCGGAACGGTCGAAGCCACCAGTTTTGTCGGTGACGTTGCAAATATGTGTGTTATCGGTGCGAAAGCAATACCCAACACAGCCAGGAATGGATCAAGGTCGTGGTCGGTGACTTTCAACGACTCATCCAATTCAACAAAACTCAAAGACTTTGTATTGCTGTTAAGCTACTCACTGGTCGGGTACACCGCAAGCCAGTCAAGCCGCATTGCTATTACCGCCAATATTGGCGGGAAAGTAGTCTCAAGGACTATTGAACGAGCGGCAAGCGGAAACGCAATCCATACAACGATTCCTTTAGCTGCGAAAGGTGTAACGGCGGGAAGTGTGACAATTACTGTGAAAGAAGAGTATACAAACGTCTACGGCAGCACATTGCACCCGTCTGTAATTTTAATGACTCGTGGGACAGGTAGCTGGTCGCAATAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
1fe09112ad86faba89d41b4b89b1738f9bf64182c11e220b73df7a5cd8b6ac8f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7562
Evidence 0,7562

Literature

Title Authors Date PMID Source
Comparative genomic analysis of bacteriophage EP23 infecting Shigella sonnei and Escherichia coli Chang,H.W. and Kim,K.H. 2011 22203555 GenBank