Protein

Genbank accession
QAY01016.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,96
TF
Evidence Phold
Probability 1,00
Protein sequence
MAEFKLSELNSISEIRSDDLLHVRVKKRTEMLGDEDRRMTYADFLASFRLERFLQLVGGTMTGNLGIVKLTYGGKDLLDPTGNSEIVFGDTAKTFKLNANGLKLTIADASRSATVYHTLNKPSPNELGMRTNDENDARFAIKGTQNTFTASQIFQTNNAGLTLKNTTESALYYEGRDVANNVRFTLGNLGAGTDVTLANTRQNTNIVLGAGAVGINKSLQVTGQVVPSDFGNFDARFYTRDAADQRFVQLAAANTITGNNTFRGTTTLLGDGARLTLKNLTSGKALYIEGQNTDASMKWAVGNYADGSSSVVLTNISNDTYVLLDNMVRVNKSLAITGQVQPTDWTNIDSRYVLIGTYANLAKKNEANTFTLQQNIQSDGEALRIYSKVQNNASYIVGKNTDNSNKWYIGCGTNGSGVASFHNYLTGARIDLSDEVLLSKSLQITGQVKPSDWGNLDARFLKVSGSTATDLVISSTGHGATVGRIISIATRADWNKPQGYSAMLQTPAASVIPSYPADSAAYLHVFAKRDTGVGQLAFLANYSAKGIWLARNLTETDTVTFEKIYTDGTKPTPSELGAYTKAEVDQKIASAVTDSTDLKKVYPVGIVTWFATNQNPNTTLAGMTWTYLNEGVGRTVRIGAANGSDVKSVGGADTVTLSVGHIPSHTHSFSATTSSFDYGTKTSNTTGAHTHSVSGSAASAGSHNHAISWIGQNSTHYSGGGGGKIGTGPGYTDAGGAHTHTISGTAASAGNHAHTVGIGAHTHTVSGNTGGTGSGSAFSVVNLFIRLMAWVRTA
Physico‐chemical
properties
protein length:794 AA
molecular weight: 84247,57760 Da
isoelectric point:8,66379
aromaticity:0,07809
hydropathy:-0,26348

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EcSzw_1
[NCBI]
2419742 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QAY01016.1 [NCBI]
Genbank nucleotide accession
MH791413 [NCBI]
CDS location
range 46797 -> 49181
strand -
CDS
ATGGCAGAATTTAAATTGAGTGAATTAAACTCGATTAGTGAGATTCGTTCTGATGACCTATTGCATGTCAGAGTAAAGAAAAGAACTGAGATGCTAGGTGATGAAGACCGTAGAATGACTTATGCAGATTTCCTTGCATCATTTAGATTGGAAAGATTTCTGCAATTAGTTGGTGGAACCATGACAGGTAATTTAGGAATTGTTAAACTTACCTATGGAGGAAAAGACTTACTTGACCCAACAGGTAATTCTGAAATTGTCTTTGGCGACACTGCAAAAACTTTCAAACTTAATGCAAATGGTCTTAAACTAACTATTGCAGATGCTTCGAGAAGTGCAACAGTTTATCACACTCTGAATAAGCCATCTCCTAACGAACTTGGGATGAGAACTAATGATGAGAATGATGCAAGATTTGCTATCAAAGGTACTCAAAACACCTTCACGGCAAGTCAAATTTTCCAGACCAATAACGCAGGTTTAACTTTAAAAAACACTACAGAGTCTGCGTTGTATTATGAGGGAAGAGATGTAGCAAATAACGTCAGATTCACCCTTGGTAACCTTGGTGCTGGAACAGATGTGACTTTGGCTAATACAAGGCAGAACACTAATATCGTTCTTGGTGCTGGTGCTGTAGGTATTAACAAGTCTCTTCAGGTAACTGGACAGGTTGTTCCTTCAGATTTTGGTAACTTTGATGCAAGATTCTACACTCGTGATGCAGCAGACCAAAGATTTGTTCAGTTGGCAGCGGCTAACACGATTACTGGAAACAACACCTTCAGAGGTACTACAACGTTACTTGGTGATGGTGCAAGATTAACTCTGAAGAACTTGACAAGTGGTAAGGCACTTTACATCGAAGGTCAGAACACAGATGCTTCTATGAAGTGGGCTGTAGGTAACTATGCAGATGGTAGTAGCTCAGTTGTCTTGACAAACATCTCAAATGATACCTATGTGTTGCTTGACAACATGGTTAGGGTGAACAAGAGTCTTGCAATCACTGGACAGGTTCAGCCTACCGATTGGACTAATATTGACTCTCGCTATGTTTTGATTGGTACATATGCTAATCTTGCTAAGAAGAATGAAGCTAACACGTTCACTCTTCAACAAAACATTCAATCAGATGGCGAAGCACTAAGAATCTACAGCAAAGTTCAGAATAATGCTTCTTACATCGTTGGGAAGAATACTGATAACTCTAACAAGTGGTATATTGGTTGTGGTACAAATGGTTCTGGTGTTGCATCTTTCCACAACTACTTGACTGGTGCAAGAATTGATTTGTCTGATGAAGTCCTTTTGTCTAAATCTCTACAGATTACTGGACAGGTAAAACCTTCTGATTGGGGCAACTTAGATGCAAGATTCTTGAAAGTGAGTGGAAGCACAGCGACAGACTTGGTGATTAGTTCTACAGGTCATGGTGCAACAGTTGGTAGAATTATCTCTATTGCTACAAGAGCAGACTGGAACAAGCCTCAAGGCTACTCAGCAATGCTACAGACACCTGCTGCAAGTGTTATCCCAAGCTATCCAGCAGATTCAGCAGCATATCTTCATGTTTTTGCGAAGAGAGATACAGGTGTTGGTCAGTTGGCTTTCTTAGCTAACTACTCAGCAAAAGGTATCTGGCTTGCAAGAAACTTGACAGAAACTGACACAGTAACTTTTGAGAAGATTTACACAGACGGTACTAAGCCAACCCCTTCAGAACTTGGTGCGTACACTAAAGCTGAAGTTGACCAGAAAATTGCATCTGCTGTAACTGACTCAACAGACCTTAAGAAAGTCTATCCAGTGGGGATTGTAACATGGTTTGCAACTAACCAGAACCCTAACACGACTCTTGCAGGTATGACTTGGACGTATCTGAATGAGGGTGTTGGAAGAACAGTTAGGATTGGTGCAGCAAACGGTTCAGATGTTAAATCAGTTGGTGGTGCGGATACTGTGACGTTATCTGTAGGGCATATCCCTTCACACACTCACAGCTTCTCAGCTACAACATCAAGCTTTGACTACGGTACTAAAACATCTAACACAACGGGTGCTCATACTCACTCTGTGAGTGGTTCTGCTGCATCTGCTGGTAGCCATAACCATGCAATCTCGTGGATTGGTCAGAACTCTACACACTATTCTGGTGGTGGTGGTGGTAAGATTGGTACAGGGCCAGGTTATACTGATGCAGGTGGTGCTCACACCCACACAATCTCTGGTACTGCTGCCTCTGCTGGTAACCATGCCCACACAGTAGGTATCGGTGCTCACACCCACACAGTTAGTGGTAACACTGGTGGTACAGGTTCTGGTTCAGCATTCAGTGTTGTAAACCTGTTTATTAGATTGATGGCATGGGTTAGAACGGCGTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
c203bb68d4f0dbd675e6f633412961c3574945dfd72c84ea942e7875537342d2
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6775
Evidence 0,6775

Literature

Title Authors Date PMID Source
EcSzw_1, Complete genome sequences of 3 novel enterobacteria, Pakpunavirus like phages Yuan,S., Ma,Y. and Liu,Q. 2020-07-27 GenBank