GenBank accession
WWQ72104.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF | Evidence: GenBank | Probability: 1.00
TF | Evidence: RBPdetect | Probability: 0.90
Protein sequence
MTTPKYGGLLTDIGAAALIAASEAGKKWQPTHMLIGDAGGAPGETADPIPSAAQTKLIRQRYRAQLNRLFVSEQSANVLVAELVLPMAIGGFWIREIGLEDADGKFVAVANCPPSFKASVESGSARTQTIRVQIILSGMEHVELIIDDGIVYATQDWVTAKVAADFKGRKVLAGNGLVGGGDLSADRTIALPASGVGAGTYRAVTVNANGIVTAGSNPTTLGGYGITDALHASEAVTTPTANKLLRLNAAGLLPASITGNAATASRLAAPITLSASGDATWSVRFDGATNVNGVLTLANSGVTAGTYAKVTVNAKGLVTGASGLVASDIPALDAGKITSGILPAARGGTGNGIGQAATAVKLAAPRTIYLGGDASGSTTFDGSANAGITVTLANSGVNAGSYPKVTVNAKGLVTGGGGLTAADIPALDASKIATGRLDLERLPLVSQGLATAVHTSVDPNSVVIPLVLTNHANGPVAGRYYYIQTMFYPSVEGNATQIATGYAGVADMYVRYAYGSPATTDPSKREWSAWVRCDLGGAFAHAPDGVLGGGVNLDSMIASGWWHQPFSANAQNGANYPVGEAGILTVHAPTSSMIYQTYRGYAAGGLYWRCRYNGTWGGWFRAWDSGNFNPANYVAKSEYNWSSLPGKPATFPPAGHNHDASQITSGILPLARGGLGANNATTARSNIGAGTIATASLGASGWWRDNDTGYIRQWGRVTVPGDGTAAITFPIAFPNVCLGGFAGQTANFHPGTDASTSFYNPSTTGATLENGYQFQAVLLWEAFGR
Physico-chemical properties
protein length: 785 AA
molecular weight: 80409.19680 Da
isoelectric point: 7.67874
aromaticity: 0.08153
hydropathy: 0.04153
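
The aromaticity and hydropathy figures above are commonly derived as follows: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The record does not state how its values were computed, so this is a minimal sketch under that assumption; the function names and the 20-residue example (the first 20 residues of the protein sequence) are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(sequence: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(sequence.count(aa) for aa in "FWY") / len(sequence)


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# First 20 residues of the protein sequence in this record.
prefix = "MTTPKYGGLLTDIGAAALIA"
print(aromaticity(prefix), gravy(prefix))
```

Applying both functions to the full 785-residue sequence is one way to compare against the values listed in the record.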

Domains

Domains [InterPro]
IPR022225 | ATT | 5–156
Sequence: WWQ72104.1 (residues 1–785)
Architecture
ATT 1-156 | STR 157-411 | STR 435-534 | STR 545-623 | RBD 694-784
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
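
The architecture line lists domain segments as 1-based inclusive ranges, and the stretches between them correspond to the "Unmapped" legend entry. A minimal sketch of how one might parse that line and list the unmapped stretches, assuming segments do not overlap; the function names are hypothetical and not part of any database API.

```python
import re

ARCHITECTURE = "ATT 1-156 | STR 157-411 | STR 435-534 | STR 545-623 | RBD 694-784"
PROTEIN_LENGTH = 785  # from the physico-chemical properties of this record


def parse_architecture(text):
    """Return (label, start, end) tuples with 1-based inclusive coordinates."""
    return [
        (m.group(1), int(m.group(2)), int(m.group(3)))
        for m in re.finditer(r"([A-Z]+)\s+(\d+)-(\d+)", text)
    ]


def unmapped_regions(segments, length):
    """Return the (start, end) ranges not covered by any segment."""
    gaps = []
    position = 1  # next residue that still needs to be covered
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > position:
            gaps.append((position, start - 1))
        position = max(position, end + 1)
    if position <= length:
        gaps.append((position, length))
    return gaps


segments = parse_architecture(ARCHITECTURE)
gaps = unmapped_regions(segments, PROTEIN_LENGTH)
# Valid as a coverage count only because the segments are assumed not to overlap.
mapped = sum(end - start + 1 for _, start, end in segments)
print(gaps)
print(mapped, PROTEIN_LENGTH - mapped)
```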

Taxonomy

Role | Name | Taxonomy ID | Lineage
Phage | Pseudomonas phage Sirocco | 3097010 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host | Pseudomonas aeruginosa | 287 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Pseudomonadales

Coding sequence (CDS)

GenBank protein accession
WWQ72104.1 [NCBI]
GenBank nucleotide accession
OR683410.1 [NCBI]
CDS location
range 14856 -> 17213
strand +
CDS
ATGACGACTCCCAAGTACGGCGGCCTGCTCACCGACATCGGCGCGGCAGCGCTGATCGCGGCGAGCGAAGCCGGGAAGAAGTGGCAGCCCACCCATATGCTCATCGGTGACGCCGGCGGCGCGCCCGGCGAGACGGCTGACCCCATCCCCTCGGCCGCTCAGACCAAGCTGATCCGCCAGCGCTACCGCGCTCAACTGAACCGTCTGTTCGTCTCCGAGCAAAGTGCAAACGTGCTGGTCGCCGAGCTGGTACTGCCGATGGCCATCGGCGGCTTCTGGATACGGGAGATCGGCCTCGAGGACGCCGACGGGAAGTTCGTGGCGGTCGCCAACTGCCCGCCCAGCTTCAAGGCCAGCGTCGAGAGCGGGAGCGCGCGCACCCAGACCATCCGCGTGCAGATCATCCTATCCGGCATGGAGCACGTCGAACTGATCATCGACGACGGCATCGTCTACGCCACCCAGGACTGGGTGACGGCGAAGGTTGCCGCGGACTTCAAGGGACGCAAGGTGCTGGCCGGCAACGGCCTGGTCGGCGGTGGCGATTTGTCTGCGGATCGCACCATCGCCTTGCCAGCCTCCGGCGTGGGTGCCGGCACCTACCGTGCGGTCACCGTCAACGCCAATGGCATAGTCACCGCCGGCAGCAACCCGACCACGCTGGGCGGCTACGGCATCACGGACGCACTGCATGCCAGCGAGGCGGTCACTACCCCGACGGCGAACAAGCTGCTCAGGCTGAACGCGGCCGGACTACTACCGGCCTCGATTACGGGCAACGCAGCCACTGCCAGCCGGCTTGCAGCGCCCATCACGCTCAGCGCGAGCGGCGATGCAACGTGGTCAGTTCGGTTCGATGGGGCCACAAACGTCAACGGAGTCCTGACGCTGGCCAACTCCGGCGTCACCGCCGGGACCTACGCGAAAGTCACGGTGAACGCCAAAGGACTGGTTACCGGAGCCAGTGGGCTTGTAGCGAGTGATATTCCGGCCCTTGATGCCGGAAAAATCACCTCCGGCATCTTACCTGCAGCCAGAGGCGGTACCGGCAATGGTATTGGTCAGGCTGCAACGGCGGTCAAACTCGCCGCCCCTCGTACGATCTACCTCGGTGGGGACGCCAGCGGCTCGACAACGTTCGACGGTAGCGCGAACGCTGGAATCACGGTCACGCTGGCGAACTCGGGTGTAAATGCCGGCTCCTACCCCAAAGTCACTGTTAACGCCAAGGGGCTGGTTACCGGTGGCGGTGGACTGACGGCAGCGGACATTCCTGCGCTGGATGCTTCGAAGATTGCTACCGGCCGACTCGATCTTGAGCGCTTGCCGTTGGTCTCTCAGGGACTGGCCACGGCTGTGCATACCAGCGTTGATCCCAACTCGGTAGTCATTCCGCTTGTGCTGACCAACCACGCGAATGGCCCAGTGGCTGGCCGCTACTACTACATCCAGACGATGTTCTACCCGAGCGTCGAAGGCAACGCGACGCAGATCGCAACCGGCTATGCCGGCGTGGCTGATATGTACGTTCGTTATGCCTACGGCTCCCCCGCAACGACCGATCCTTCCAAGCGAGAGTGGTCAGCATGGGTCCGCTGCGATCTGGGAGGGGCGTTCGCTCATGCGCCGGATGGCGTCCTGGGTGGCGGAGTCAACTTGGATTCAATGATTGCGTCGGGCTGGTGGCATCAACCGTTCAGCGCGAACGCACAGAACGGCGCGAACTATCCAGTGGGCGAGGCCGGCATATTGACGGTGCATGCTCCAACCTCCTCGATGATCTATCAGACCTATCGTGGCTATGCCGCTGGCGGTCTGTACTGGCGCTGCAGATACAACGGCACCTGGGGAGGATGGTTCCGGGCATGGGACTCCGGCAACTTCAACCCGGCCAACTACGTGGCCAAGTCGGAGTACAACTGGTCTTCGCTGCCAGGGAAGCCGGCAACCTTCCCGCCGGCAGGGCATAACCATGACGCTAGCCAGATTACCTCCGGCAT
CCTGCCGCTGGCTCGAGGCGGCCTTGGGGCGAACAATGCCACGACGGCGCGTAGCAACATCGGAGCCGGCACCATCGCCACAGCATCCTTGGGAGCAAGCGGTTGGTGGAGGGACAACGATACGGGTTACATCCGGCAATGGGGCCGGGTGACTGTGCCTGGTGATGGTACCGCGGCGATCACCTTCCCCATCGCGTTCCCGAATGTCTGCTTGGGCGGGTTCGCTGGTCAAACTGCGAATTTCCACCCAGGAACCGACGCGAGCACCTCGTTCTATAACCCGTCGACGACAGGTGCAACTTTGGAAAACGGGTATCAATTCCAGGCGGTTTTGCTTTGGGAGGCATTCGGTCGATGA
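
The CDS coordinates can be cross-checked against the reported protein length. The sketch below assumes the range is 1-based and inclusive and that the final codon is a stop codon (the CDS above ends in TGA); the function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive range."""
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded, optionally excluding the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    codons = n // 3
    return codons - 1 if has_stop else codons


# Coordinates from this record: range 14856 -> 17213 on the + strand.
print(cds_length(14856, 17213))
print(expected_protein_length(14856, 17213))
```

Under these assumptions, 2358 nucleotides give 786 codons, which is 785 amino acids plus the stop codon, matching the protein length listed in the record.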

Genome Context


Tertiary structure

PDB ID
3394135c62f0ae0e90a37527be8577a3e540faa25c825fd55ea27696384b41ea
Source ESMFold
Method ESMFold
Resolution 0.7161
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
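
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. The legend uses strict inequalities and so does not say where the exact values 90, 70 and 50 fall; this sketch assumes they go to the lower band, and the function name is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    Assumption: a score exactly on a threshold (90, 70, 50) is placed
    in the lower band, since the legend leaves those values unassigned.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT out of range: {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 90.0, 71.6, 42.0):
    print(value, plddt_band(value))
```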

Literature

Title | Authors | Date | PMID | Source
Prophages of P. aeruginosa: insights into their role through their activity, abundance and persistence | Kyrkou, I., Bartell, J.A., Lechuga, A., Lood, C., Lavigne, R., Molin, S. and Krogh Johansen, H. | 2024-01-16 | | GenBank