Protein
- GenBank accession
- AVZ45100.1 [GenBank]
- Protein name
- putative tail fiber protein
- RBP type
-
TSPTSPTSPTF
- Protein sequence
-
MALYPIKSLGAVGVIADQAPTDLAPNAFTNAMNARFVEQRVFKTGGNAPLSYVEEDKDLTPLSFVSMPFDYYSAGNSFLVVGTDKKLYKLTDESLTDISRKVATVTKKASAIIKIYPVVSRIVPKESTITMNFNQTKELEVQVFPEDANNANLTWEVSNPSYASIAVNPTDSKKATLTTLSTEGTLSITVSIEDESVTAQISVNIVDGDTGIFLSQDTITIRRGGTTTLTAISGKTPITWISSNGGALSVTPNANTLTAVLNAMGEGTFTVTADNGSKSATCTVNVIPQIDSISLSQTDVQMDRGTQYVLTATVNPADAPNKAITWTSSNPNIATVSGTSTEATITGLLAGFTEITAVTEEGSRSAVCTVRVNLAGRMLNTRSLAMAASAPLVEEFKEEEEPVVQNEEVVYFMSDSMGIDTSGMAEGNNFFDYSNVFDMEGFARAAENSRAAPLTNVTLDIVEASLDVGEEIVITATAAPEGDYSYQWVVDKSGYVSTTSTTGRSLKLTAVRKGEIKVTCTASQMTQRDYDAFDDYPWYHAVISNCAVATTHYETPQVKEFESEYFTDLPGWGEQTIVDGDGNPSVRKFNWKCERVRAFNNRLFALNMRESNASGVTTHYPLRLRWSNFANENKAPTLWDDYAYDRLTTSDLSANIVGQTEALENGYAGYIDLADSNGSLIDVLPLKDYLFVYTEFETYIGSPTNNTYQPLMFKKLFNDSGILAPECVVEVEGGHFVVTQNDVILHNGASKKSIASNRVKNMLINEVCLVNPLATRVHLHQDKKEVWVMYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERGPVWTDFQTITWDDPAIDKLVWRKDATNFRQRITIVGSFLRGFYQVDVGALDYFYDRANDKIIERPLEMRLERTGIDFDNVTNEWNQKHINRFRPQTTGSGTYIFEAGGSQFSNEYGHNHTTKSYTIGVDRHVAVRLNHPYLFYNVIDNDVNSNAAINGLTIEFNVGGRR
- Physico-chemical properties
- protein length: 1006 AA; molecular weight: 110891.69280 Da; isoelectric point: 4.80701; aromaticity: 0.09642; hydropathy: -0.24503
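As a rough illustration of how two of the values above can be derived from a sequence, the sketch below computes aromaticity and mean hydropathy in plain Python. It assumes aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not state which definitions were used, so both are assumptions, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Residues counted as aromatic (assumption: Phe, Trp, Tyr only).
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence in this record, used only as a demo.
fragment = "MALYPIKSLGAVGVIADQAP"
print(len(fragment), aromaticity(fragment), round(gravy(fragment), 3))
```

Passing the full 1006-residue sequence to the same functions would allow a comparison against the reported aromaticity (0.09642) and hydropathy (-0.24503), provided the assumed definitions match those used by the database.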
Domains
Domains [InterPro]
| Domain | Residues |
|---|---|
| IPR003343 | 118–202 |
| IPR008964 | 118–204 |
| G3DSA:2.60.40.1080 | 118–206 |
| IPR008964 | 290–372 |
| IPR003343 | 291–370 |

Sequence axis: 1–1006
Legend (source databases): Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, Other
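The domain hits listed above overlap heavily, so it can help to collapse them into distinct regions. The following is a minimal sketch that merges overlapping residue ranges. The `merge_intervals` helper is a hypothetical name, and treating the ranges as 1-based inclusive is an assumption.

```python
def merge_intervals(hits):
    """Merge overlapping (start, end) residue ranges into distinct regions."""
    merged = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Residue ranges copied from the domain annotations in this record.
hits = [(118, 202), (118, 204), (118, 206), (290, 372), (291, 370)]
print(merge_intervals(hits))  # [(118, 206), (290, 372)]
```

Under these assumptions the five hits reduce to two annotated regions, residues 118–206 and 290–372.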
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Escherichia phage EP335 [NCBI] | 2070199 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | | |
Coding sequence (CDS)
GenBank protein accession
AVZ45100.1
[NCBI]
GenBank nucleotide accession
MG748548
[NCBI]
CDS location
range 16657 -> 19677
strand +
CDS
ATGGCGTTATACCCTATAAAATCACTTGGGGCTGTCGGTGTTATCGCTGATCAGGCACCAACTGATTTAGCACCTAACGCTTTCACCAACGCTATGAACGCTCGGTTTGTTGAGCAGAGAGTTTTTAAGACGGGGGGCAATGCCCCTCTTTCTTACGTGGAAGAAGACAAAGATCTGACTCCACTCTCTTTTGTCTCCATGCCTTTCGATTATTATAGCGCAGGAAATAGCTTCCTTGTAGTAGGTACAGATAAGAAGTTATATAAACTGACAGATGAAAGCTTAACTGATATCAGTCGTAAAGTTGCTACGGTAACTAAGAAAGCTTCTGCTATCATAAAGATTTATCCAGTGGTCTCAAGGATTGTTCCTAAAGAGAGTACTATCACAATGAACTTTAACCAGACAAAAGAGTTAGAAGTTCAGGTTTTTCCAGAGGATGCTAATAATGCTAATCTGACTTGGGAAGTAAGTAACCCTTCTTATGCCAGTATTGCAGTAAATCCTACAGATTCTAAAAAAGCCACCCTCACTACATTATCTACAGAAGGAACACTGTCCATTACTGTTTCCATTGAAGATGAATCTGTGACAGCTCAAATCTCCGTTAACATTGTTGATGGGGATACGGGTATCTTCTTGAGTCAAGACACAATCACAATTCGAAGAGGTGGTACAACAACTCTTACTGCTATCTCAGGTAAGACTCCTATCACTTGGATTAGTAGCAATGGTGGTGCTTTGTCTGTGACACCCAATGCCAATACACTAACTGCTGTTCTCAATGCCATGGGAGAAGGAACTTTTACTGTTACAGCTGATAATGGCTCTAAGTCTGCTACCTGTACAGTTAATGTGATACCTCAGATTGATTCTATTTCTCTGAGTCAGACAGATGTTCAGATGGATAGAGGGACTCAGTATGTTCTAACTGCAACAGTCAACCCTGCTGATGCTCCTAATAAAGCAATCACTTGGACTTCTTCCAATCCTAATATTGCTACAGTATCAGGGACAAGCACAGAGGCTACAATTACTGGACTTCTGGCTGGGTTTACAGAGATTACAGCAGTAACAGAGGAAGGTAGTCGTTCAGCTGTTTGTACTGTTCGTGTCAACCTAGCAGGTAGAATGCTAAATACCAGAAGCCTAGCGATGGCTGCTAGTGCACCTCTAGTGGAAGAGTTCAAAGAGGAAGAAGAACCAGTTGTGCAGAATGAAGAAGTTGTTTACTTCATGTCTGACTCTATGGGAATTGATACCTCTGGCATGGCTGAAGGCAATAACTTCTTTGACTACTCTAACGTATTTGATATGGAAGGTTTTGCTCGTGCTGCGGAGAACTCAAGAGCTGCTCCTCTGACAAATGTGACACTAGATATTGTTGAAGCTTCTCTAGATGTAGGTGAAGAAATTGTCATAACTGCTACAGCAGCTCCAGAAGGGGATTACTCCTATCAGTGGGTTGTTGACAAGAGTGGTTATGTTTCTACTACCTCAACAACTGGAAGATCTTTGAAACTTACAGCTGTTCGTAAAGGCGAGATTAAAGTTACATGTACGGCGAGTCAGATGACTCAAAGAGACTACGATGCTTTTGATGATTACCCTTGGTATCATGCAGTAATCTCTAACTGTGCAGTAGCGACAACTCACTATGAAACTCCTCAGGTTAAAGAATTCGAATCTGAATACTTTACAGACCTTCCGGGCTGGGGTGAACAAACAATTGTTGATGGTGATGGGAACCCTTCTGTTCGTAAGTTTAACTGGAAGTGCGAAAGGGTTAGAGCTTTTAACAACAGATTGTTTGCTCTGAATATGAGGGAATCTAATGCCTCTGGTGTTACCACTCACTATCCTTTACGTCTTCGCTGGTCTAACTTTGCGAACGAGAACAAGGCTCCTACTTTGTGGGATGATTATGCTTACGATCGACTGACAACTTCTGATCTTTCAGCGAACATTGTTGGGCAGACTGAAGCTCTTGAGAATGGTTA
TGCAGGGTATATTGATCTGGCTGACTCTAACGGTAGTTTGATTGATGTTCTCCCTTTGAAAGATTACTTATTTGTTTACACCGAGTTTGAAACCTACATCGGTTCTCCTACTAACAACACATACCAGCCTCTGATGTTTAAGAAGCTGTTTAACGATTCAGGTATTCTTGCTCCTGAGTGTGTGGTTGAAGTAGAGGGTGGTCACTTTGTTGTAACACAGAACGATGTGATTCTTCATAACGGTGCATCTAAGAAATCTATTGCATCTAACCGTGTCAAGAACATGCTTATTAATGAAGTGTGTTTGGTAAACCCTCTAGCTACTAGAGTTCACTTGCACCAAGATAAGAAAGAAGTTTGGGTCATGTATGTTGGGCCGGGAGAGCCGAAAGAAAGTTTTGCTTGTACGAAGGCTGCGGTCTGGAATTACGAGTTTGATACTTGGTCTTTCCGTACTATCCCGTATGCTCAATGTATTGGTCTTGTTGATCCTCCTGTTCTCGAAAGAGGTCCAGTGTGGACTGACTTCCAAACTATCACTTGGGACGACCCTGCTATTGATAAACTGGTGTGGAGAAAGGATGCAACTAACTTCCGTCAGAGAATTACTATCGTAGGCTCTTTCTTAAGGGGTTTCTATCAAGTAGATGTTGGTGCTTTGGATTATTTCTATGACAGAGCGAATGACAAAATAATAGAGCGCCCTCTGGAAATGAGGTTAGAGAGAACAGGGATTGATTTTGATAACGTCACTAACGAATGGAATCAAAAACACATCAACCGGTTCAGACCTCAGACTACAGGTTCTGGTACGTATATCTTTGAAGCTGGAGGTAGTCAATTCTCTAACGAGTATGGTCACAACCACACAACTAAGAGTTATACGATTGGAGTTGATAGGCACGTAGCTGTGAGACTGAACCATCCATACCTATTCTATAATGTTATAGATAATGATGTTAACAGTAACGCAGCCATAAATGGGCTGACAATAGAGTTTAATGTTGGCGGTCGAAGATAA
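The CDS coordinates and the protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS includes its stop codon. The helper names are illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Number of encoded residues, optionally discounting the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n // 3 - (1 if includes_stop else 0)


print(cds_length(16657, 19677))         # 3021
print(protein_length(16657, 19677))     # 1006
# Last nine nucleotides of the CDS in this record; the final codon is a stop.
print("CGAAGATAA"[-3:] in STOP_CODONS)  # True
```

The result, 1006 residues, agrees with the protein length reported in the physico-chemical properties.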
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
4853aef1eb2ab4040ab135726b6911b36134690d30c9c9f8453570c4453dff78
Literature
No literature entries available.