Protein

Accession
AZF89198.1 [Not found in db]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1.00
Protein sequence
MSEFETPLNPLRIQAQLGKDVKRLYKEGSNIVTLSFAKVVKVNYKYNTVDVITTRHKNSTTKNPNDNGKFSAKLPIAFGGRTANGNVYGTNTLVTVGSTVLIGFLEGQVDNPIVINIYGENDNQSMLTRTTMTSGDDSEEIVQRELWQLFNLYPSMTYENIDGRGNREVTFSGKTFLYITDTDQENAYVQDGAFDYMDLPSSRYANGELIEPESPKAPTLLYVHQGIYDDHRFTVFIKSDGTFRVGSRHTKGDRHGITYQQMNPDGSFSIVKKNDTTDPEEESYDQSSMEILPNGNVLLQNPQHKFEITEEGVLVDGKEIGSGGGNGGGDYDEAIKEINDALSRISITVKAVEGGLETKVEKDTYEIDLDEVKGAQERLLAEIKTNIADLKNALEDLRTFIGSGFPDGQVTDARKVELNKKLQNIDVLKSVVDGKYDEVMADPFIGDTSKIPLQNAKNKIDGYHQALHNVIDASITDGVLTAQEKSDINKAITNYVTALADIEIVFSSSIQESIKARLKEAVDNPVNYTSKEMLRQSSVLTQLFNSLTLKVSSEQLTSQVQNLESKMATKEEQKEIKNELEKVNEKVDGTLSNLPYRVEVSSTNGTIFVNDYIDTRIYAKIYKGAEDVTAQAALKDIIWTRVSDDTAADTAWNSSHVNIGDSFQASVNDVKDRATFFCAYKTPAVATGSITVANLRDITVSDKEPANPRNGTMWFDTKEGKLKIFLNNKWEVSSKDLDFNIRNLFLNSRDCTGGGWTLQNATRTNNQYQGTYIIETAINWGSADYACQNLFTRGVVKKGDKVTYCVLARLTGATGVSKDLRFYCENAAVNGSIIGQVTTEWKQFYVTIDIVDAMNTAGSKMRVEVADLEAANLKLQTTSPILVQGEVAVNWIPAPEDTQRDIDDLNTNVNSLGDDSKLTRFERSLVRTSLADITGVYYNPTDTPATIAQIDTTGYGKGKLYALRQQARNLGLDTTKSPNYKKLGDAYTALVTYLSGFTPKAWDTTSGAIVAIPDRAVWNKLWNDYNNFYALFEIEVQDRQKEFTEQETLKMQKETIGAISQVGNYDTATLVNPTTTVTPPIATLGLPEFNGRTIDAHTWNGRNYITNSLKEYKDEGKLNTAQFKDLTLASGIASTLNRQVVTFFFDYRLTNIVYGTTNPWIGMQLTIRFSDNSVVYPTCAGGTTLGTGTTTDFKTVRATYTIDPTKTITSISAMAGGRDHTGTLEIKNFKLEIGSKNDSQIVFTPSIEEAKFSIGNRVRPITNPTFYNGTQLTIFGKFHGMYGYADRFYWNENGVATKEKYWEDFHLDTQQSWSLLENRDTADYRLFKSSSFLWYTPHANGTSQMIDSAGYYMTSKAAIDNQNQFYFDAANKNLTISVGNNATGFDKLYKTPTADEIKAFFLGWKLCDGTSVNSPYKGSGVKVWFPIGDTNLDRKFQSNDGKPPKDVSPSFQERLVCQPYQFVGLLATPVALEVEFEGILELLPKANAVSVAYPDWTPEISIGKYKYGINLATVNQDTRYLVPAMQKRISNAEQKITDKAITSTVINSIEYQLGLKEKANASDLSGLASKNELDKLSHDVDDRIQKNIDKLDFTPYVTKSDLEQTSLAWAARFLATGGMNIVNNSIGFAGMDFWFVDLPSTSNIPTVVATALLDSLGFGKGFQFKADNTKPKSMSQDLSTIPNQPYAISWYLDKFTGGANLVDHRFNIEILEQNDAGAWVVVTQLNNNNAKVTDSFEAAYMTFTPTKGKVRLRITAGKSCEAIISGIMVNIGDVPIPWTLSTGELYNTNIQLNINGIRVSQLDANKNEIGYTMISPTEFAGYYSNNGKYEKVFWLNGDETVTKKLRATQEINLGNVKVLDVNGVNTGWAFISNY
Physico-chemical properties
protein length: 1874 AA
molecular weight: 208105.63630 Da
isoelectric point: 5.16760
aromaticity: 0.09658
hydropathy: -0.40827
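
The aromaticity and hydropathy entries above can be cross-checked directly from the protein sequence. The record does not state how they were computed, so the sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle value (GRAVY). The function names and the 20-residue N-terminal excerpt are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (an assumption: the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence, used only to keep the example short.
excerpt = "MSEFETPLNPLRIQAQLGKD"
print(len(excerpt), round(aromaticity(excerpt), 5), round(gravy(excerpt), 5))
```

Running the same two functions on the full 1874-residue sequence would give values to compare with the entries above, provided the database used the same definitions.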

Domains

Domains [InterPro]
AZF89198.1, residues 1-1874

Taxonomy

Phage
Name: Bacillus phage vB_BthM-Goe5 [NCBI]
Taxonomy ID: 2491346
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: No host information

Coding sequence (CDS)

Genbank protein accession
AZF89198.1 [NCBI]
Genbank nucleotide accession
MK215646.1 [NCBI]
CDS location
range 52543 -> 58167
strand +
CDS
ATGAGCGAATTCGAAACACCATTAAACCCGTTGCGTATACAAGCGCAATTGGGTAAAGATGTAAAAAGACTTTACAAAGAGGGTAGCAATATTGTTACTCTTTCTTTTGCTAAGGTCGTTAAAGTTAACTACAAATATAATACGGTTGATGTAATCACAACTCGCCACAAAAACTCAACAACTAAAAATCCTAATGACAACGGTAAGTTCTCTGCTAAACTACCTATAGCTTTCGGTGGACGTACAGCAAATGGTAATGTCTACGGAACTAACACACTAGTTACAGTTGGTTCTACAGTCCTTATAGGATTCTTAGAAGGTCAAGTAGATAATCCTATCGTAATCAATATCTACGGAGAGAATGACAACCAGTCAATGCTAACTCGTACAACAATGACAAGCGGAGACGATTCCGAAGAAATTGTACAACGCGAACTATGGCAGCTATTCAATCTATATCCTTCTATGACATATGAAAATATAGATGGACGTGGTAACCGTGAAGTCACTTTCTCAGGTAAAACATTCTTGTATATTACCGATACAGACCAAGAGAACGCATATGTACAAGATGGTGCATTTGACTACATGGATTTACCAAGTTCAAGATACGCTAATGGTGAGCTAATTGAGCCAGAGTCTCCTAAAGCACCCACACTTCTATATGTACACCAAGGAATCTACGATGACCATAGATTTACCGTGTTCATTAAATCAGATGGTACATTCCGTGTAGGTAGTCGTCATACAAAAGGTGACCGTCATGGCATTACATATCAGCAAATGAATCCAGATGGTAGTTTTTCTATTGTGAAGAAGAACGATACAACCGACCCAGAAGAAGAGTCTTACGACCAGTCTTCTATGGAAATCCTTCCAAATGGTAACGTCCTATTACAGAATCCACAACACAAGTTCGAAATTACCGAAGAGGGTGTCCTTGTCGATGGTAAAGAGATTGGCTCTGGTGGTGGTAACGGAGGCGGAGACTATGACGAAGCCATTAAAGAAATCAATGACGCATTGAGCAGAATTTCTATCACAGTTAAAGCCGTTGAAGGTGGTTTGGAGACAAAGGTTGAGAAGGACACATACGAGATTGACCTTGATGAAGTAAAAGGAGCACAAGAGAGATTACTAGCGGAAATCAAGACTAACATCGCTGACCTTAAAAACGCATTAGAAGACTTACGTACATTCATTGGCTCTGGATTCCCAGATGGTCAAGTAACAGATGCACGTAAAGTAGAACTGAATAAAAAGCTACAAAACATTGATGTTCTTAAATCTGTAGTTGATGGTAAGTACGATGAGGTTATGGCTGACCCGTTTATTGGGGATACATCTAAAATCCCTTTACAGAACGCTAAAAACAAAATAGATGGATATCACCAAGCTCTACATAACGTTATCGATGCCTCTATTACAGACGGAGTGTTAACAGCTCAAGAGAAGTCAGATATCAACAAAGCTATTACAAACTACGTGACTGCTCTAGCGGATATCGAAATTGTGTTTTCGTCATCTATCCAAGAGTCTATTAAAGCACGATTAAAAGAAGCGGTAGACAACCCAGTTAACTACACAAGTAAAGAGATGCTTCGACAAAGTTCTGTGCTGACACAACTGTTTAATTCTTTAACTCTTAAAGTTAGTTCTGAACAGCTAACATCTCAGGTACAAAACCTAGAATCAAAAATGGCTACAAAAGAAGAACAAAAAGAGATTAAGAATGAACTAGAGAAAGTAAACGAGAAGGTCGATGGTACGCTATCAAATCTTCCATACCGTGTAGAAGTATCATCTACGAATGGTACAATCTTCGTAAACGACTATATCGATACAAGAATATACGCTAAGATTTATAAAGGTGCGGAAGATGTCACAGCTCAAGCAGCATTGAAGGATATTATCTGGACTCGTGTATCGGATGATACTGCTGCGGATACTGCGTGGAATAGTAGCCATGTAAATATTGGTGATTCGTTCCAAGCCTCTGT
AAACGATGTGAAAGATAGAGCTACGTTCTTCTGTGCATATAAAACTCCAGCAGTTGCTACAGGTAGCATCACAGTAGCTAACTTACGAGACATCACTGTATCAGATAAAGAGCCAGCAAACCCTAGAAACGGTACGATGTGGTTTGATACGAAAGAAGGTAAACTTAAAATCTTCTTAAATAATAAGTGGGAAGTATCTTCTAAAGACCTAGACTTCAATATCCGTAACTTGTTCTTAAACTCTCGTGACTGTACAGGTGGGGGTTGGACGCTTCAAAATGCAACAAGAACAAACAACCAGTACCAAGGTACATACATCATAGAGACAGCTATTAACTGGGGGAGCGCCGATTACGCTTGTCAAAACCTATTCACACGTGGAGTAGTTAAAAAAGGAGATAAAGTTACATATTGTGTACTTGCTAGACTAACAGGCGCTACAGGAGTATCAAAAGATTTACGATTCTACTGTGAAAATGCCGCAGTAAACGGAAGTATCATAGGTCAAGTTACAACGGAGTGGAAGCAGTTCTATGTAACAATAGATATTGTAGATGCCATGAATACTGCTGGTAGTAAAATGCGTGTCGAGGTTGCAGACTTAGAAGCAGCTAACCTTAAGTTACAGACTACGAGCCCTATCCTAGTACAAGGTGAAGTAGCAGTAAACTGGATTCCTGCTCCAGAGGATACTCAACGTGACATCGACGACCTTAACACAAATGTAAACTCTTTAGGTGACGATTCTAAATTAACTCGTTTCGAGAGAAGTTTAGTACGTACATCATTAGCAGATATCACAGGTGTCTACTACAATCCAACCGACACTCCAGCAACTATCGCACAAATTGATACAACAGGGTATGGTAAGGGTAAATTATACGCATTACGTCAACAAGCTCGCAACTTAGGGTTAGACACAACAAAAAGCCCTAACTACAAGAAGCTAGGAGACGCGTACACAGCTCTTGTAACATACTTGAGTGGTTTTACACCTAAAGCATGGGATACGACGTCTGGAGCGATTGTAGCTATCCCAGACAGAGCGGTATGGAACAAGCTATGGAATGACTACAATAACTTCTACGCATTATTCGAAATCGAAGTACAGGATAGACAAAAAGAGTTTACAGAGCAAGAGACATTAAAGATGCAGAAGGAGACTATTGGAGCAATTAGTCAAGTAGGTAATTACGATACTGCGACTCTTGTAAACCCTACTACAACTGTCACACCACCTATTGCTACATTAGGTTTACCAGAGTTTAACGGTAGAACAATTGACGCGCACACATGGAACGGTAGGAACTATATCACAAACTCTTTAAAAGAGTATAAGGATGAAGGTAAACTTAACACGGCACAGTTCAAAGATTTGACATTGGCTTCTGGTATTGCTTCCACTCTTAACAGGCAGGTTGTCACATTCTTCTTTGATTATAGACTTACGAATATAGTTTACGGTACAACGAACCCGTGGATTGGTATGCAGTTGACAATTAGGTTCTCCGACAATTCTGTAGTTTACCCAACTTGTGCAGGTGGCACAACACTTGGGACAGGCACAACAACAGACTTCAAGACAGTTAGAGCTACCTACACAATTGACCCAACAAAAACGATAACATCCATATCTGCAATGGCAGGTGGTCGTGACCATACAGGTACACTAGAAATAAAAAACTTCAAGCTAGAGATAGGTTCTAAAAATGATAGCCAAATTGTATTTACACCATCTATTGAAGAAGCTAAATTTTCTATTGGTAATCGTGTACGACCTATAACAAACCCTACGTTCTATAACGGAACACAGCTTACTATTTTCGGTAAGTTTCACGGGATGTATGGGTACGCTGATAGGTTCTACTGGAATGAGAACGGGGTAGCTACAAAAGAGAAGTACTGGGAGGATTTCCACTTAGATACTCAGCAGAGCTGGTCACTACTAGAAAACCGTGATACAGCAGACTACAGACTATTTAAATCGTCGTCATTCTTGT
GGTATACGCCTCATGCAAATGGTACATCTCAGATGATAGACTCTGCCGGTTATTATATGACTTCCAAGGCAGCAATTGACAATCAAAACCAGTTCTACTTTGATGCCGCAAATAAAAACCTTACAATTAGTGTAGGTAACAATGCAACTGGTTTCGACAAGCTGTACAAAACACCTACCGCCGATGAGATTAAGGCTTTCTTCTTAGGATGGAAGTTATGTGATGGGACTTCTGTGAATAGCCCCTACAAAGGCTCAGGAGTCAAGGTATGGTTTCCTATCGGGGATACAAACCTTGACAGAAAGTTCCAATCGAACGACGGTAAGCCTCCGAAAGACGTATCTCCATCTTTCCAAGAAAGATTGGTATGTCAACCGTATCAATTCGTAGGGTTACTAGCAACTCCAGTCGCTCTTGAAGTAGAGTTCGAAGGTATTTTGGAGTTATTACCAAAAGCTAACGCGGTATCAGTAGCCTACCCAGACTGGACGCCAGAGATTTCGATAGGTAAGTACAAGTATGGTATAAACCTAGCAACAGTTAACCAAGATACACGATACCTTGTACCTGCTATGCAAAAACGTATCTCTAATGCGGAGCAGAAGATTACAGATAAAGCTATCACAAGTACTGTAATCAACTCTATTGAATATCAGTTAGGTCTTAAAGAAAAAGCAAATGCATCGGATTTATCAGGACTAGCATCTAAAAATGAGCTTGACAAGTTATCCCATGACGTAGACGACAGAATCCAGAAAAACATAGACAAACTAGACTTCACTCCCTATGTAACAAAATCTGATTTAGAGCAAACGTCTTTAGCGTGGGCTGCACGATTCTTAGCTACTGGTGGTATGAACATTGTTAATAACTCTATCGGTTTTGCTGGTATGGACTTCTGGTTTGTAGACCTCCCAAGTACATCCAACATACCTACAGTTGTCGCAACAGCATTACTAGACAGCTTAGGGTTCGGTAAAGGATTCCAGTTCAAAGCAGATAACACGAAACCGAAGTCTATGAGTCAGGACTTATCGACGATACCTAACCAACCTTACGCGATTAGTTGGTATTTAGATAAGTTTACGGGCGGAGCAAACCTAGTAGACCATCGATTCAATATCGAGATACTAGAGCAGAACGATGCAGGAGCATGGGTTGTTGTTACGCAGTTAAATAATAACAACGCCAAAGTAACCGACAGTTTCGAAGCTGCTTACATGACATTCACTCCGACGAAGGGTAAGGTTAGACTTCGTATAACGGCTGGTAAGTCATGTGAGGCAATTATCTCTGGTATAATGGTTAATATAGGGGACGTACCGATTCCTTGGACTTTATCAACTGGAGAATTGTATAACACGAACATCCAATTAAACATCAACGGTATCCGTGTTTCTCAGTTAGATGCGAACAAAAACGAAATTGGCTACACGATGATTTCGCCGACAGAGTTCGCAGGTTACTACTCTAATAACGGTAAATACGAGAAAGTCTTCTGGTTAAACGGGGACGAGACAGTTACTAAGAAACTTCGTGCAACACAGGAGATAAACTTAGGTAACGTTAAAGTTCTCGACGTTAACGGAGTAAATACAGGGTGGGCGTTCATTTCAAACTACTAA
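
The CDS coordinates and sequence can be checked against the protein entry with a little arithmetic. The inclusive range 52543 to 58167 spans 5625 nucleotides, which is 1875 codons, or 1874 residues plus one stop codon. The sketch below performs that check and translates the first ten codons of the CDS. It assumes the standard codon assignments, and the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def protein_length_from_range(start: int, end: int, has_stop: bool = True) -> int:
    """Residue count implied by an inclusive, 1-based CDS range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = n_nt // 3
    return codons - 1 if has_stop else codons


def translate(nt: str) -> str:
    """Translate a nucleotide string codon by codon ('*' marks a stop)."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))


print(protein_length_from_range(52543, 58167))      # 1874
print(translate("ATGAGCGAATTCGAAACACCATTAAACCCG"))  # MSEFETPLNP
```

Both results agree with the record: the implied length matches the stated protein length of 1874 AA, and the translated prefix matches the start of the protein sequence.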

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
e8e7c62ea3d9e07c7d352626624f975672f015fa709c9e30229c2e21f859d817
ColabFold
Source ColabFold
Method ColabFold
Resolution 0.7792
Evidence 0.7792

Literature

No literature entries available.