Protein
- Genbank accession
- QHR68868.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
- TSPTSPTF
- Protein sequence
-
MADIKVVRIESLPATTAVTEDDYLVVQQPDLTRRVKIGDVVHVDGTVSHVISFKEGGKLNGPMDFAYFEEEDLYLRWKGEFPHTVPALSSPYSDGGITDAAWMVYTDPSLREELESTIGASMIMTAEGQSVQDVMDVTVQTANDAKALAQRVDFGTVHTVGDVIHLDNFVGPAVIEEGRTTNYPSVAAGEKFLNGVVSRRDTTTVDGIFRGATTGAMYTIAVTNGVATTKRIALRDEFKRLEANNPTRTVIRAGDDLNTAGYLQLDATGRWGMWNQQTASWQPLAIEQGGTGARDAAGVRINIGAFYKQRAALEPNFNINNLTGNQDGVYYQPMTAYATEVNGYPAGSGAGHLIVWQNNANGGTGCRQEYYPFSNVDVWYLRTYQANTNQWTAWQPMVRPRNDDTFRSHIGLGKNNSPAFGHLYLAQYSGDVKSASGILHGDKYNTDGVLEHGYRIYSEVRNDNKAWLTIHLHKGAKGSETHKYLGFREDGVLDCPKYMQVGDLTGQLTNWGLGEWIRSSGAERGFWGSKKAAKMVIWDGGMDESGNGTLEWGVYNNRKAKWEPLPQAAGGTGATTLADAQNLFKVPIAAGVKDFLTLPRTAGMEDGKYYPIIVRTDPYYAPATGTDITIVTRSSSGGGPMNCATLQCHYRTGGWTDRGDSFYGVVNFYQNEKAILGMVAPTRGKQEYVAFYAEARAFPVSIYASRNVVEVFTREQDYQVGSVTDNQDGVKFVAPLQSADLNLAVLGDNNTNTRPIVDFKGTSGFYTGGGTQWHYIGTAERYAVMSKMNMPKVELWADGLDYLCYGSPRKALFSNAGFQCASDGTEDLTNGTFTSKCGNGAGLKGQAEFRSTPEAGQVIVRDVVGSAHRFYNFNKDGTFSAPGGFVCHTGADWNNQFGNNNPSKIMAGNVNGPEGSMVVGGLSVAFSGNYAFQIAGRLDQLYTRSIEQGNHRAWNKVIQHRGQGLGTSDLNDYKADREGIYHQEANANATAERHYPPGQQMAGTLIVLRNSANEGTGCVQIYKMYLGGTWERYYNNTGSGMNWSPWKRTSFPENTTTPVMPDLWLPLTSNLKPVLGEGEMIFSRPSTATYFTKRGVMAVAQANQPRFERDGLLIEGQRTNLMLNSEDPSKWGAQSQITVGNTVTNTNGTKGARFTVSNVSGVETTALNLATVPATRGADVTGAEKFCTGSIIARGGKAHQRLRVRFDMYNGSTTVFQGDAYVNLSTLEVKTTGGAAGRIKVKAERWQTAGPAWIRVVATFEAVDSDMNIGCQFQIAPPDGSQHAVGDWVDVAIPQFELGSCESSFIPTGSSPVTRAADLCKFPMTDNLAPRPFTIAATVDANWRGWGKAPNADPRVIDTEGHQSGAAFIMAFGSAANIAEDGYPYCDIGGSNRRVYEMAKTRKLKIGFRIKDDGKTCSFANGLVSTETQSSWEFLAGGALIRLGGQTATGERHLFGHIKDVRVWNSALTDTQLMMESVE
- Physico-chemical properties
- protein length: 1479 AA; molecular weight: 161038.97610 Da; isoelectric point: 6.06736; aromaticity: 0.09939; hydropathy: -0.38851
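The record does not say how these values were computed. As a rough sketch, two of them can be reproduced with the Python standard library, assuming that aromaticity is the fraction of Phe, Trp and Tyr residues and that hydropathy is the mean Kyte-Doolittle value per residue (GRAVY). The function names below are illustrative and not part of the database. Molecular weight and isoelectric point are left out because they need residue-mass and pKa tables whose exact choice is not stated here.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Assumption: "aromatic" means Phe, Trp and Tyr only.
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    seq = seq.upper()
    return sum(1 for aa in seq if aa in AROMATIC) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def summarize(seq: str) -> dict:
    """Collect the sequence-derived values in one dictionary."""
    return {
        "length": len(seq),
        "aromaticity": aromaticity(seq),
        "hydropathy": gravy(seq),
    }


if __name__ == "__main__":
    # Short demo on the first residues of the protein sequence above;
    # paste the full sequence to compare with the listed values.
    print(summarize("MADIKVVRIESLPATTAVTEDDYLVVQQPDLTRRVKIGDVVHVDG"))
```

Under that assumption, the listed aromaticity of 0.09939 over 1479 residues would correspond to roughly 147 aromatic residues.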
Domains
Domains [InterPro]
Legend: Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, Other
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Escherichia phage naswa [NCBI] | 2696428 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | | |
Coding sequence (CDS)
Genbank protein accession
QHR68868.1 [NCBI]
Genbank nucleotide accession
MN850595.1 [NCBI]
CDS location
range 38340 -> 42779
strand +
CDS
ATGGCTGATATTAAAGTTGTCAGAATTGAATCTCTTCCTGCCACTACCGCAGTGACAGAGGATGATTACCTGGTTGTTCAACAACCAGACCTGACCCGTCGTGTAAAAATTGGCGATGTTGTCCATGTTGATGGGACTGTTTCTCATGTAATCTCCTTTAAGGAAGGTGGTAAGTTAAACGGCCCAATGGATTTTGCCTACTTCGAAGAGGAAGACCTCTACCTGCGTTGGAAAGGCGAATTTCCACACACTGTTCCTGCACTGTCTTCACCGTACTCCGATGGTGGGATTACTGACGCTGCATGGATGGTATATACTGACCCGTCCTTAAGGGAGGAGCTGGAATCTACCATTGGCGCATCTATGATCATGACAGCCGAAGGGCAGTCTGTCCAGGATGTAATGGATGTTACTGTTCAGACAGCTAACGATGCCAAAGCACTTGCACAGCGTGTAGATTTTGGTACAGTGCACACTGTAGGCGATGTTATTCACCTTGATAACTTTGTTGGCCCTGCAGTTATTGAAGAGGGAAGAACTACCAACTACCCTTCTGTAGCAGCTGGTGAAAAATTTCTGAACGGTGTGGTATCTCGTCGTGATACTACAACTGTTGATGGTATTTTCCGTGGCGCTACCACCGGAGCCATGTACACCATTGCAGTAACCAACGGTGTAGCAACAACGAAAAGAATAGCACTTCGTGATGAATTTAAACGCCTAGAGGCAAACAACCCAACAAGAACAGTTATCCGTGCTGGAGATGATTTAAATACGGCAGGGTACTTGCAGCTTGATGCAACAGGCCGTTGGGGTATGTGGAACCAACAAACAGCTTCGTGGCAACCTCTTGCTATAGAGCAAGGCGGCACAGGGGCTAGAGATGCCGCTGGAGTCCGTATCAATATCGGTGCTTTCTATAAGCAACGTGCAGCCCTTGAGCCAAATTTCAATATCAATAACTTGACTGGTAATCAGGATGGTGTATACTACCAGCCAATGACTGCTTATGCAACTGAGGTAAATGGCTACCCTGCAGGTTCTGGTGCTGGTCACTTGATTGTTTGGCAGAACAATGCTAACGGAGGTACAGGTTGTCGTCAGGAATACTATCCATTCTCTAACGTAGATGTTTGGTATTTGAGAACCTATCAGGCCAACACAAATCAGTGGACTGCATGGCAGCCGATGGTCAGACCTCGTAACGATGATACCTTCAGATCTCATATTGGCCTTGGTAAAAACAACTCGCCAGCCTTCGGGCACCTTTACTTAGCTCAATACTCTGGAGATGTTAAATCGGCCTCAGGTATTCTCCATGGAGATAAATATAACACCGATGGTGTTCTTGAGCATGGCTATAGGATCTACTCTGAGGTAAGAAACGACAATAAGGCTTGGCTGACAATCCACCTGCACAAAGGTGCAAAAGGATCTGAAACTCATAAATATTTAGGTTTCCGCGAAGACGGAGTATTAGACTGCCCTAAATATATGCAGGTTGGGGATCTTACTGGTCAGCTGACAAACTGGGGTCTTGGAGAATGGATCCGTAGTTCAGGAGCAGAAAGAGGCTTCTGGGGGTCCAAGAAAGCCGCCAAGATGGTTATCTGGGATGGTGGGATGGACGAATCCGGTAACGGCACTCTGGAATGGGGTGTTTATAACAACCGGAAGGCCAAGTGGGAACCTCTACCTCAAGCCGCAGGTGGCACTGGGGCTACAACTTTAGCAGATGCTCAGAATCTGTTCAAAGTTCCTATAGCAGCAGGTGTAAAAGACTTCTTAACACTGCCAAGAACCGCAGGGATGGAAGATGGGAAATACTACCCAATTATCGTTAGAACAGATCCGTACTATGCTCCAGCAACTGGCACTGATATTACCATAGTTACCAGATCGTCATCCGGCGGTGGCCCTATGAACTGTGCGACCTTACAGTGTCACTATAGAACTGGTGGTTGGACAGACAGAGGAGACTCTTTCTACGGGGTAGTAAA
TTTCTACCAGAATGAAAAAGCAATTCTTGGGATGGTTGCTCCAACAAGGGGTAAACAAGAATATGTTGCTTTCTATGCAGAGGCGCGTGCTTTCCCTGTCAGCATATACGCAAGTAGAAATGTTGTCGAGGTGTTTACCAGGGAGCAAGATTACCAGGTTGGCTCTGTAACAGACAATCAGGACGGTGTTAAGTTCGTTGCACCTCTCCAGTCAGCAGATTTGAATCTTGCTGTTCTTGGAGATAATAATACCAACACCAGACCTATTGTTGACTTCAAAGGGACTTCAGGGTTCTACACTGGTGGCGGCACACAGTGGCACTATATAGGCACTGCTGAACGCTATGCTGTGATGAGCAAAATGAATATGCCAAAAGTCGAGCTATGGGCAGACGGTCTTGACTATTTGTGCTACGGCAGTCCTAGAAAGGCGTTATTCTCTAATGCAGGCTTCCAGTGTGCATCAGATGGGACAGAGGATCTGACTAACGGTACGTTTACCTCTAAATGCGGTAATGGTGCTGGTCTCAAAGGCCAGGCAGAGTTCAGATCTACTCCAGAGGCTGGTCAAGTTATTGTTCGTGATGTTGTAGGTTCTGCTCACAGATTCTACAACTTCAACAAAGATGGCACTTTCTCAGCTCCTGGTGGTTTTGTATGCCACACAGGTGCAGACTGGAACAACCAGTTCGGGAATAACAACCCTTCTAAAATAATGGCTGGTAACGTCAACGGACCTGAAGGTTCGATGGTTGTTGGCGGGTTGTCTGTGGCATTCTCTGGGAACTATGCTTTCCAGATAGCAGGTCGCTTAGATCAGCTGTATACTCGTTCCATAGAGCAGGGTAACCACAGGGCGTGGAACAAAGTTATTCAGCACCGTGGTCAAGGGCTGGGAACTAGCGACCTTAACGATTACAAAGCGGATCGCGAAGGTATTTACCATCAAGAGGCAAATGCCAACGCCACAGCGGAGAGACACTACCCACCAGGACAGCAGATGGCTGGTACGCTTATCGTGCTTAGAAACTCTGCCAACGAGGGTACAGGGTGTGTCCAGATATATAAAATGTATCTTGGTGGAACGTGGGAAAGGTATTATAATAACACTGGCAGTGGAATGAACTGGAGCCCTTGGAAGAGAACGAGCTTCCCAGAAAATACAACTACCCCAGTCATGCCAGACCTGTGGCTACCTTTAACCTCTAACCTGAAACCTGTGTTAGGTGAGGGGGAGATGATCTTCTCCAGACCTTCTACAGCAACATATTTCACTAAGCGCGGTGTCATGGCTGTTGCACAGGCAAACCAACCACGTTTCGAAAGAGATGGTCTGCTCATTGAAGGTCAAAGGACAAATCTTATGCTTAACAGTGAAGACCCGAGCAAATGGGGAGCCCAGTCACAGATTACGGTCGGTAACACTGTAACAAACACTAACGGAACAAAAGGTGCAAGATTTACAGTAAGCAACGTATCCGGTGTAGAAACAACAGCGTTAAACCTGGCTACAGTTCCAGCCACCCGTGGTGCAGATGTAACAGGCGCTGAGAAATTTTGTACTGGGTCTATTATTGCGAGAGGTGGTAAAGCTCACCAGAGACTGCGTGTAAGATTCGACATGTATAACGGATCCACTACAGTATTCCAGGGTGATGCCTATGTTAACCTGTCAACACTTGAAGTTAAAACAACAGGTGGTGCAGCTGGGAGAATTAAAGTAAAAGCTGAGAGATGGCAAACAGCCGGACCTGCTTGGATAAGAGTTGTTGCTACGTTTGAGGCAGTGGACTCTGACATGAATATTGGGTGCCAGTTTCAGATTGCCCCACCGGATGGTTCTCAACATGCCGTAGGGGATTGGGTTGACGTCGCAATACCTCAGTTCGAGTTAGGATCTTGTGAGTCCTCGTTTATCCCTACAGGGTCTTCACCAGTGACTAGGGCAGCAGACCTTTGTAAATTCCCAATGACCGATAATTTAGCACCTAGACCGTTCA
CCATCGCCGCTACGGTGGATGCCAACTGGAGAGGTTGGGGAAAAGCTCCTAACGCAGATCCTAGAGTTATTGATACAGAGGGCCACCAATCCGGTGCTGCATTTATCATGGCATTTGGCTCTGCAGCAAATATTGCAGAAGATGGTTATCCATATTGTGATATTGGCGGTTCAAACAGACGTGTCTATGAGATGGCTAAAACACGTAAATTGAAAATTGGGTTCAGGATAAAAGATGACGGTAAAACCTGCTCTTTTGCTAACGGATTGGTGAGTACGGAAACGCAATCTTCTTGGGAATTCCTGGCAGGAGGCGCTCTCATCCGTCTCGGAGGTCAAACGGCGACAGGTGAAAGACATTTGTTTGGTCATATTAAAGATGTAAGAGTTTGGAACTCTGCATTGACCGACACCCAGCTTATGATGGAGAGTGTTGAATAA
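The CDS coordinates, the nucleotide sequence and the protein length can be checked against each other. The sketch below assumes the range is 1-based and inclusive (the GenBank convention) and uses the standard codon assignments, which the bacterial and phage table 11 shares for amino acids. The helper names are illustrative only.

```python
# Standard genetic code in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    n_nt = cds_length(38340, 42779)
    n_codons = n_nt // 3
    # One codon is the stop codon, so the protein is one residue shorter.
    print(n_nt, n_codons, n_codons - 1)
    # First twelve codons of the CDS listed above.
    print(translate("ATGGCTGATATTAAAGTTGTCAGAATTGAATCTCTT"))
```

The range 38340 -> 42779 spans 4440 nucleotides, or 1480 codons. With the final TAA stop codon excluded, that gives the 1479 residues listed for the protein.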
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
22d5ccc20582706f173edc0f59b02d7e2a9ccc1602bc79f69a7bc8b79569de31
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Exploring the Remarkable Diversity of Culturable Escherichia coli Phages in the Danish Wastewater Environment | Olsen,N.S., Forero-Junco,L., Kot,W. and Hansen,L.H. | 2020 | — | GenBank |