Protein

GenBank accession
AGX01802.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TF (Evidence: RBPdetect; Probability: 0.75)
TF (Evidence: RBPdetect2; Probability: 0.96)
TF (Evidence: GenBank; Probability: 1.00)
TF (Evidence: Phold; Probability: 1.00)
Protein sequence
MNTEYTMANGQTPVQGSIMSMPVLADWTGQEYVPVVDETGNNKRVLLEDLKGRDAYEVAKEAGYTGTLQEWLASLKGEKGDKGDKGDNGADGKSNYDLAVEQGYQGDLASWLLTQKGADGNDGKDGVDGMLPDFTPVAVETDHVTEADFLTLMTAAVNGAKPADVTSKMLMIDHEGAVGESSLGGWTIYPTESGRYVYVFGRNALGNGQQGNYSAYMPDGGSPQFLSWSASEKGEQGIQGIDGKPGISVVPKGTTPSQVALPVASESNAGWMFTNDGSVTASDKGTLYVSDGTQWVIQGNILGPQGLKGDKGDKGDKGDTGTAGKNAVGMQIVGIVDKEEDLPPVADFTAGDTYVVGTHLWTKVGTEWVDIGDFTGPDGLSAYQVAKANGFVGTEAEWLSSLKGADGIGLKIIGSLPSKDNLPEVGEKSGDAYIINSVMWVWDTVQWSPVGQVGPQGKSAYQSALDTGYVGTEAAWIASLKGNKGDKGDKGDQGEKGEKGDNAAAVMLRGEKADEASLPSTGNTVSDAWLVGGNMFVWTGTAWFDAGPIQGPQGIKGDKGDKGDTGDTGKSAFEVAVAGGFSGTQTQWLASLVGNAVKAKGTLADFANLAGVVSPEAGWAYNITGGASAGHQFIYDGATWVDMGDVRGAKGEKGDQGIKGDTGDAGVDGATAYEIAQGAGFTGTEAEWLKSLIGPGLVAKGHVTDSADLLNVTNPVAGWVYNVLSGDDAGHQFIYNGSDWIDMGNVRGDQGIQGDKGDKGDKGDTGATGNAVNYRGTVATSDLLPSSGNQVSDAYFVGVNLWVWNGTTWIDNGSFQGPQGLKGDKGDKGDKGDTGTTGKSAYQSAVDSGFSGTESAWVTSLKGTTGKSAYQSAVDGGYAGTEAQWVASLKGTNGTNGTSLVPKGTVADLTALNAVANPVAGWLYNMTSTGHAYVYSGSAWVDQGDWRGLQGIQGLTGDQGIKGDTGDGYRYKGTVATTGDLPTTGQVKGDTYFVGTNMKVWNGTGWDDGGNFQGPQGIQGEKGDIGAGIKILGKKDTEEDLPATADAAGDGYMVGTNFWVWDGTAFVNVGAIQGPKGDQGLRGIQGLKGDKGDKGDKGDTGDKGTAWVVLARPPAAADGRIGDYYLNSSTLQFFVKTSDVLWGPLGYLGGGNVYDGPQDGKAYARKDGTWVLVDVLEAPDDGKQYVRKGKAWVSFDHYDMPLITVTAGAVDASKGNAYKLDATVNTTIAFSNLPANRVQTLVLTMMGKGGNLTWPAALKWSNAKAPTLGTTLTNIVVYWDGTNLTGSVGQTV
Physico-chemical properties
protein length: 1292 AA
molecular weight: 134027.004 Da
isoelectric point: 4.51707
aromaticity: 0.08669
hydropathy: -0.37895
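
The record does not state how aromaticity and hydropathy were computed. A minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), is shown below. Under that assumption the reported aromaticity corresponds to about 112 aromatic residues (0.08669 x 1292 is roughly 112). The function names are hypothetical, and the demo string is only the first 20 residues of the protein sequence, so its output will not match the full-length values.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq):
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over all residues; assumed definition."""
    return sum(KD[aa] for aa in seq) / len(seq)


# Illustration only: first 20 residues of the protein sequence in this record.
demo = "MNTEYTMANGQTPVQGSIMS"
print(round(aromaticity(demo), 5), round(gravy(demo), 5))
```

Passing the full 1292-residue sequence to both functions would show whether these assumed definitions reproduce the values listed in the record.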

Domains

Domains [InterPro]

Taxonomy

Phage
Name: Erwinia phage PhiEaH1 [NCBI]
Taxonomy ID: 1401669
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Erwinia amylovora [NCBI]
Taxonomy ID: 552
Lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Erwinia

Coding sequence (CDS)

GenBank protein accession
AGX01802.1 [NCBI]
GenBank nucleotide accession
KF623294 [NCBI]
CDS location
range 66573 -> 70451
strand +
CDS
GTGAATACGGAGTACACGATGGCTAACGGTCAAACCCCTGTACAGGGTAGCATCATGTCGATGCCGGTACTCGCCGACTGGACGGGACAGGAGTACGTCCCGGTCGTTGATGAGACTGGCAACAACAAACGTGTTCTTCTCGAAGACCTGAAAGGGCGTGACGCATACGAGGTAGCCAAGGAAGCGGGCTACACGGGCACTCTCCAAGAGTGGCTGGCGTCCCTGAAAGGTGAGAAGGGCGATAAAGGCGATAAGGGCGATAACGGAGCCGACGGTAAGTCGAACTACGACCTCGCCGTGGAACAGGGTTATCAGGGCGACCTGGCATCCTGGCTGCTCACGCAAAAAGGCGCGGACGGTAACGACGGTAAAGACGGTGTTGATGGCATGTTGCCAGACTTCACTCCGGTGGCTGTTGAAACCGACCACGTCACCGAAGCTGACTTCCTGACATTGATGACCGCTGCGGTCAACGGTGCGAAGCCAGCAGATGTCACCAGCAAGATGCTGATGATTGACCATGAAGGCGCGGTGGGTGAAAGCTCACTGGGCGGTTGGACCATTTACCCTACCGAATCCGGTCGTTACGTTTACGTCTTCGGTCGTAATGCGCTGGGTAACGGTCAGCAAGGTAACTATTCAGCCTACATGCCGGATGGCGGTAGTCCGCAATTCCTGTCCTGGTCCGCTTCTGAAAAGGGCGAACAGGGTATCCAGGGTATTGACGGTAAGCCGGGTATCTCCGTCGTACCGAAAGGTACCACCCCTTCTCAGGTAGCTTTGCCGGTTGCAAGTGAGTCTAACGCAGGTTGGATGTTCACGAACGACGGTAGCGTAACTGCTTCTGATAAAGGTACGCTCTACGTCTCCGACGGTACTCAGTGGGTAATCCAGGGGAACATCCTGGGTCCGCAGGGCCTGAAGGGAGACAAAGGCGATAAAGGCGACAAAGGTGATACCGGTACAGCCGGTAAGAACGCCGTCGGGATGCAGATTGTTGGCATCGTTGACAAAGAAGAAGACCTGCCTCCGGTCGCAGACTTCACCGCGGGTGATACCTACGTAGTAGGCACACACCTCTGGACGAAAGTCGGCACTGAGTGGGTCGACATCGGCGACTTCACTGGCCCTGATGGTCTGTCTGCGTACCAGGTTGCGAAAGCGAACGGGTTCGTGGGTACCGAAGCTGAATGGCTGTCAAGCCTGAAAGGTGCTGACGGTATCGGTCTGAAAATCATCGGTTCTCTGCCGTCAAAGGATAACCTCCCAGAAGTGGGCGAGAAGTCTGGTGATGCTTACATCATCAACTCCGTCATGTGGGTGTGGGATACCGTTCAGTGGTCCCCGGTAGGTCAGGTTGGCCCGCAAGGTAAATCTGCTTACCAGTCTGCGCTCGACACCGGCTATGTCGGTACTGAAGCGGCCTGGATTGCCTCCCTGAAAGGGAACAAAGGCGATAAAGGCGATAAAGGCGACCAGGGTGAAAAAGGTGAGAAGGGCGATAACGCTGCCGCAGTCATGCTGCGTGGCGAGAAGGCCGACGAAGCATCGCTGCCTTCTACCGGTAACACCGTCTCTGACGCGTGGCTGGTTGGCGGTAACATGTTCGTCTGGACCGGTACTGCCTGGTTCGATGCAGGTCCTATCCAGGGCCCTCAGGGCATCAAGGGCGATAAGGGTGATAAAGGCGACACCGGTGACACCGGTAAGTCTGCATTCGAAGTTGCCGTTGCTGGCGGTTTCTCCGGTACCCAAACCCAATGGTTGGCGTCTCTCGTAGGCAATGCTGTTAAGGCGAAAGGTACACTGGCTGACTTCGCTAACCTGGCGGGTGTGGTTTCTCCGGAAGCCGGTTGGGCCTACAACATCACTGGCGGCGCATCTGCTGGCCATCAGTTCATCTACGACGGGGCAACCTGGGTAGACATGGGTGATGTTCGCGGAGCCAAAGGCGAGAAAGGTGACCAAGGTATTAAAGGTGATACCGGCGATGCCGGTGT
AGACGGTGCAACCGCTTACGAAATCGCCCAAGGTGCTGGCTTTACTGGTACCGAAGCTGAATGGCTGAAGTCCCTGATTGGTCCGGGTCTGGTAGCGAAAGGTCACGTAACCGATTCTGCTGACCTGTTGAACGTCACTAACCCAGTTGCGGGTTGGGTCTACAATGTTCTCTCCGGCGACGATGCCGGTCACCAATTCATCTACAATGGTTCCGATTGGATTGACATGGGCAACGTCCGTGGCGACCAGGGTATCCAGGGTGATAAAGGTGATAAAGGGGACAAGGGCGACACGGGTGCAACCGGTAATGCGGTTAACTACCGTGGTACTGTAGCCACCTCTGACCTGCTGCCTTCCAGCGGTAACCAGGTTTCCGATGCGTACTTCGTGGGTGTAAACCTGTGGGTATGGAACGGTACCACCTGGATTGACAACGGTAGCTTCCAAGGCCCACAAGGTCTGAAAGGCGATAAGGGTGACAAAGGGGATAAAGGTGATACTGGTACTACCGGTAAATCTGCGTACCAGTCCGCCGTTGATTCCGGGTTCTCTGGCACTGAATCTGCGTGGGTAACCTCGCTGAAAGGTACCACTGGTAAGTCCGCTTATCAGTCTGCGGTTGACGGCGGTTACGCTGGCACCGAAGCACAGTGGGTTGCTTCCCTGAAAGGGACCAACGGTACTAACGGTACGTCACTGGTTCCGAAAGGAACTGTTGCTGACCTGACCGCACTGAACGCAGTGGCTAACCCGGTTGCTGGCTGGCTGTACAACATGACCTCTACGGGTCACGCGTACGTCTACAGCGGTAGCGCATGGGTCGACCAGGGCGATTGGCGTGGCCTCCAGGGCATCCAAGGTCTGACTGGTGACCAAGGTATTAAAGGTGATACCGGCGACGGCTATCGCTATAAAGGTACCGTGGCAACGACTGGCGACCTGCCAACCACCGGCCAAGTGAAAGGTGACACCTACTTCGTAGGCACCAACATGAAGGTCTGGAACGGCACTGGCTGGGATGACGGCGGTAACTTCCAGGGTCCGCAAGGTATCCAGGGTGAAAAAGGTGACATTGGTGCGGGCATCAAGATTCTCGGTAAGAAAGACACCGAAGAAGACTTGCCAGCAACTGCCGATGCAGCCGGTGACGGTTACATGGTAGGCACCAACTTCTGGGTGTGGGACGGCACGGCATTCGTGAACGTCGGCGCAATCCAGGGGCCGAAAGGTGACCAAGGTCTCCGCGGTATCCAGGGCCTGAAAGGTGATAAAGGTGACAAGGGTGATAAGGGTGACACCGGTGACAAAGGCACCGCGTGGGTTGTCCTGGCACGTCCCCCAGCTGCGGCTGATGGTCGTATCGGTGACTACTACCTGAACTCCTCTACTCTCCAGTTCTTCGTGAAAACCTCTGACGTTCTGTGGGGCCCTCTGGGTTACCTCGGCGGCGGTAACGTGTACGACGGTCCTCAGGACGGTAAAGCGTACGCCCGTAAAGACGGTACATGGGTTCTCGTTGACGTTCTGGAAGCCCCAGACGACGGTAAGCAGTACGTACGTAAAGGTAAAGCGTGGGTTAGCTTCGACCATTACGACATGCCGCTGATTACGGTCACTGCGGGTGCGGTAGATGCGAGCAAAGGTAACGCCTATAAGCTCGATGCCACCGTGAACACCACGATTGCGTTCAGTAACCTCCCAGCCAACCGCGTGCAGACTCTGGTCCTCACGATGATGGGCAAAGGTGGTAACCTGACGTGGCCAGCAGCACTCAAGTGGTCGAATGCGAAAGCACCGACTCTGGGTACTACGCTGACCAACATCGTGGTTTACTGGGACGGCACGAACCTGACCGGCTCTGTTGGTCAGACCGTGTAA
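
The CDS coordinates and the protein length can be cross-checked with simple arithmetic. Assuming the range is 1-based and inclusive (as in GenBank feature locations), 66573 to 70451 spans 3879 nucleotides, or 1293 codons: 1292 amino acids plus the terminal TAA stop codon. The CDS starts with GTG, an alternative bacterial start codon that is read as M at the initiation position, which agrees with the protein sequence beginning with M. The helper name below is hypothetical.

```python
def cds_length(start, end):
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    return end - start + 1


# Values taken from this record.
n_nt = cds_length(66573, 70451)
n_codons, remainder = divmod(n_nt, 3)
protein_length = 1292

# The CDS should be a whole number of codons: one per residue plus the stop.
assert remainder == 0
assert n_codons == protein_length + 1

print(n_nt, n_codons)  # prints "3879 1293"
```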

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
3c9ff4b6a9a310b54e26cb95534aa037a06d9df0b4eca13aedd36ee95318d98f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0.6464
Evidence 0.6464

Literature

No literature entries available.