GenBank accession
CAM0066943.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1.00
Protein sequence
MTIRLLGVLPDPFGAPCPKAIIRFTALTSEGSVLKGTDAEHTTTKKGEYDFPLEQGRYLLEIQYSDELMESGSVLVDEFTPSSLTLVELIRYTTPHTPPLVDPNPTDWSVLFQEVIDNDEWDRQSEQQVRDEDVLVNEDKTIHKDEESYLSKETLTNTTGSSTTVQQTLNYKDATGREAGYNRREISTDLAKAHSSEEVYSDVDNYESKQSSLLEGAESASSQERSVTDTSVTETNKSTLGDQQITSERSLTDSGSEVTEEVAVSGAKVTSRKYIDANALVTALLEQGVVVENVEAFDRLSVSSTGSQRDIQVDRFNVGETLKVDTEQEIVTIAGQLKLTNGDDYKGPVGDSQYLEYAYATTEGKETGDWHIDNPDDPYDLPEGTVWRKHRTVYVKDGETTYSQWSNPYLLTGEDGASGDTLYWKYKYNTVSVISDPEATNPVGWDDELKDGHLYRIERLVSNGVYQGLWSEPAKIAGTDGDNGELVFEEYMYSPYGVSTPATSPEHWHTNFSNGDYFRSSRVIKYAAGTPIPIPDGTEPVMVTNWTTPALITPRKGYEYSDGLSQIVVHMYIRSTTSPALPTNTLTYNFNTLTLSGDTQGWSTEVPEGIGNLWITIATASSNTGSDAIAPNEWTDPQLSSTQAYNQASLSLYKRTSAGTVPDAPSTTLSYDFTTGALTGDLEGWSNGTIPAGTDDLYIATAEVRSQTTTADVLPNYWGVGVLTSTAFRQQTVNIYKRGDNAPPSADVTYNFTDGSITGLSGGWSLTIPSGDQDVFIGVAVASAAGVSDVIHPSDWSVGPLGTSGHQAQVINLYQTAQDVPAKPQSDIVYDFTQGKVTSNVSPWSTTVPQNTQGKIYVTTATANASALADSDIVPKEEWTFPQVIVESGINAVPVTLFRKHSLDSAPEKHTNTLTYSFDNAVLLNQPNNGWSATPMSVDLGEAVWSMVATAQGLATDATDDIAPTEFSAPVIYTATGSEVFTVYQYAEASTGPWYDEFTAERVWRIQATSINGVVSDWSEPVKLTGEDGATGDTIYVEYNYSEDLSNWHPVLVEGDIWRRERIVTNDVADEWSSPARLKGNEQYIEYQYSASLDLGEAGWHTNFSSGDYYRRERTVINGTYGEWSEGVQMVPLKDVDYSDGAGGDTIYEVYQYSVDGVTDWVYDFTDEHIYRRTAVVINGALGPWSEPAKLSGVDGADGDTIYMEYEYSVDGVNWHADMEDGDIWRHEREHTVGVTVPSDPWGNRTRIRGIDGAYYEYLYNEDADNYPTEPEDAFGTWHANFSEGDYYRIERLVQEASTGNWTTPTKLKPKKGEDYSDGIQGPKGEDGVTTYTWIKYADDASGSGMSESPVDKPYMGIAYNKTTDQESNDPNDYAWSKVEGDQGIQGENGYMWVQYSNYPNGMNGTAPNMHQDPVDPSTGRPYLYIGISYNNTSPTEGNDPTAYTWSKYVGDEIYYEYSYSPDQISWDMELDANDVWRRERRVENGQYGDWSDAIRIVGTEGPQGIPGNDGNDGNDGDSLYTWVKYADSDTGSGLSNNPEGKGYIGFAYNKTTPVESNDRNDYVWSLVKGTDGTDGKDGENGSDGAQGIPGPIGPDGKTLYTWIKYSPNANGYPLTEAPDENTKYLGISTNQQEQTESTDPDDYVWSPYTGADGQYYEDEFSVNGDPVDWETWHYPAQSTDKFKRTRLIDSKGEPVVDDTTDSAGWVYTQIAPIKDVDYGDGNSGDTIYEVYQYSVDGVMNWEDDFRDEHVFRRTAVVINGSQSAWSDAARISGKDGLQSNVITLYQVKSEQYSWKAEELIQTDETYLILTGSFVGYDPQDGINGWTLSVPAVLTAGEALYQVRVAALSEAGQLEVPIPADDWAYPVQVSASGISGEDGKHGSGSYILNWEDSYNSGQGYKDLNTGIGGQPTEGDVEYWFRELSTRESQAGDILTIQQKEVESAAPKQWLRDTGAWTEFVLSVDGNAIVNGTLGAEALKSGTTLTDVLYVSNDEDSHEMT
LSGSGEWEDVTGTIQKDDDYRIWLGNRDPKKAPFSVTRQGSANVYGHLTATSLELMDNADIPAEMDNERRYETYNIVRDNGEFMVSEETYGDSGHGWTTDPTNLVPAAYAHPTSATSYDYHDCYVETRYFADGAPYMFMNSASENDTPRFYTSNVYNNYNITVDPRSSIVVKFKAKFSKFGNTQAAKPKVKMTFLYDDVNVSDPATWKGREHEWDSGLDWDTYYNFEYTLDLNLEPQSNNPTNPGTLRFEIAPHSQMIIFGVEVLQRVYKPYVPEYTFHEEPIYGVEEGDVLSLGVEYVGSETDLTWGLRSFNSKGQVIWERGHKVSDINTSLWELDIQEDITVDTIGDGGLFLFLTKNSAPFNWGNSLHSTVHVRNFQVVHGDSYKEHYVPVSDPNAYPDQEAGTYPQITGGGWTGPESESGWGVLSNGDAYFNNLTVTNGTLSSGTIIGADIYAGNTYHKIDYNTNNSSDTVYYKKYPNTVPLYASTSFSGRDYGTRTAVGLNGPLTDVYVRPWDVVSCLETRGEVNTRRFRWGKVRAGALSCKVQLPSTYVTSLGVDVYIIDDNGWERGVSGYPVISQGRYYEGTMSLGPMIFDVVINNTGSTYYVEIKNRQCQQFGEIDNDLYNGKFYVKVRAGIDNNKRVDATYDLQYTVDNDTFPG
Physico-chemical properties
protein length: 2662 AA
molecular weight: 294870.02250 Da
isoelectric point: 4.31319
aromaticity: 0.11345
hydropathy: -0.61597
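
The record does not say which tool produced these values. As a minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle score, often called GRAVY), the two dimensionless properties could be computed as follows. The function names are illustrative, and the toy input is just the first ten residues of the sequence above.

```python
# Sketch only: assumes aromaticity = fraction of F/W/Y and
# hydropathy = mean Kyte-Doolittle score (GRAVY). The record does not
# confirm that these definitions were used.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


toy = "MTIRLLGVLP"  # first ten residues of the protein sequence
print(f"aromaticity={aromaticity(toy):.5f} hydropathy={gravy(toy):.5f}")
```

Running the same two functions over the full 2662-residue sequence would be the way to check them against the values listed here.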

Domains

Domains [InterPro]
PTHR24637 | Unmapped | 1309–1655
CAM0066943.1 | 1–2662
Architecture
STR 107-983 | STR 1056-1223 | STR 1270-1422 | STR 1454-1588 | STR 1597-2210
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
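
To put the architecture in proportion, the sketch below sums the five STR segments and expresses them as a fraction of the 2662-residue protein. It assumes the ranges are 1-based and inclusive, which the record does not state; the variable names are illustrative.

```python
# Sketch only: assumes 1-based inclusive residue ranges.
PROTEIN_LENGTH = 2662
STR_SEGMENTS = [(107, 983), (1056, 1223), (1270, 1422), (1454, 1588), (1597, 2210)]

# Length of each segment, counting both end points.
segment_lengths = [end - start + 1 for start, end in STR_SEGMENTS]
covered = sum(segment_lengths)
coverage = covered / PROTEIN_LENGTH

print(segment_lengths)
print(f"{covered} of {PROTEIN_LENGTH} residues annotated as STR ({coverage:.1%})")
```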

Taxonomy

  | Name | Taxonomy ID | Lineage
Phage | Vibrio phage D445 [NCBI] | 3105211 | Viruses >
Host | No host information

Coding sequence (CDS)

GenBank protein accession
CAM0066943.1 [NCBI]
GenBank nucleotide accession
OZ196552.1 [NCBI]
CDS location
range 9898 -> 17886
strand +
CDS
ATGACAATCAGACTACTTGGGGTTTTACCAGATCCGTTTGGAGCCCCATGTCCTAAAGCTATTATTCGGTTTACTGCCCTCACTTCAGAGGGTAGCGTGCTTAAAGGTACAGATGCCGAACACACCACGACCAAAAAAGGTGAATACGATTTTCCTCTAGAGCAAGGCCGATACCTTTTAGAGATTCAGTACTCTGATGAACTAATGGAATCTGGTTCAGTTTTAGTGGATGAGTTTACTCCTTCATCTCTTACCCTTGTCGAATTAATTCGTTACACGACTCCCCATACACCTCCTTTGGTGGATCCGAATCCTACGGACTGGTCGGTACTATTCCAAGAAGTCATCGACAATGATGAGTGGGATCGCCAGAGCGAACAGCAGGTGCGTGACGAAGATGTTCTAGTCAACGAAGATAAGACCATCCATAAAGACGAAGAAAGCTACCTCAGCAAGGAGACCCTTACTAATACCACTGGTTCAAGTACGACAGTACAGCAGACACTGAACTACAAGGACGCTACTGGACGAGAGGCAGGGTACAACAGACGAGAGATCTCCACAGACCTAGCTAAGGCGCACTCTAGTGAAGAAGTATATTCGGACGTTGATAACTATGAGTCTAAGCAGTCTTCCCTCCTAGAAGGCGCTGAAAGCGCCTCTAGCCAAGAACGAAGCGTAACAGACACATCGGTAACAGAGACCAACAAGTCTACTTTAGGGGATCAGCAGATCACCTCAGAACGCAGCCTTACTGATTCTGGTTCAGAGGTAACTGAAGAAGTAGCTGTGTCTGGTGCCAAGGTAACCTCTCGTAAATACATTGACGCTAATGCTCTAGTTACTGCCCTATTGGAACAAGGTGTGGTGGTTGAGAATGTGGAAGCATTTGATCGCCTATCAGTAAGCTCTACTGGTTCACAGCGAGATATTCAGGTTGACCGTTTCAACGTAGGTGAGACGCTTAAAGTCGACACTGAACAAGAGATAGTGACTATTGCAGGACAACTGAAGTTGACCAATGGCGATGACTATAAAGGTCCTGTAGGCGACTCTCAGTACCTTGAGTACGCTTATGCGACTACCGAAGGCAAAGAGACAGGGGATTGGCATATCGACAACCCTGATGACCCTTATGACTTGCCTGAAGGGACTGTATGGCGTAAACACAGAACTGTTTACGTGAAAGATGGTGAGACGACTTACAGTCAATGGTCTAATCCTTACCTATTAACAGGCGAAGACGGGGCATCAGGCGATACTTTGTACTGGAAATATAAGTACAACACAGTGTCTGTGATCTCTGATCCAGAAGCGACTAACCCTGTAGGATGGGATGACGAATTAAAAGATGGACATTTATACCGAATTGAACGTTTGGTCAGCAACGGAGTTTATCAAGGACTTTGGTCGGAACCGGCTAAGATTGCAGGTACTGACGGTGACAACGGTGAATTAGTTTTTGAGGAGTACATGTATTCTCCTTATGGGGTAAGTACACCTGCTACCAGTCCTGAACATTGGCACACCAACTTCTCTAACGGGGACTACTTCCGATCGTCACGAGTAATTAAGTACGCTGCCGGTACTCCTATCCCTATCCCTGATGGGACTGAACCAGTAATGGTAACTAACTGGACTACTCCTGCGCTCATTACACCAAGAAAAGGCTATGAGTATTCGGATGGTTTATCGCAGATAGTGGTTCATATGTACATTCGTAGTACGACATCTCCTGCATTACCGACGAATACCCTAACGTACAACTTCAATACGCTGACCTTGTCTGGCGATACCCAGGGTTGGTCTACTGAAGTTCCTGAAGGCATAGGTAACTTATGGATCACCATCGCCACAGCATCTTCTAATACTGGTTCAGATGCTATTGCGCCCAATGAATGGACAGATCCCCAGTTAAGTTCAACTCAGGCGTATAACCAAGCGTCATTGAGCCTCTATAAGCGTACCTCAGCAGGCACAGTGCCAGATGCTCCTAGTAC
TACTCTGAGCTACGACTTTACGACAGGTGCTCTTACAGGCGACCTAGAAGGGTGGTCTAACGGTACTATCCCTGCAGGTACAGACGACCTGTATATTGCTACGGCAGAAGTACGAAGCCAGACCACTACAGCGGACGTACTGCCTAACTATTGGGGTGTAGGCGTTCTTACCTCTACTGCTTTCCGTCAGCAGACGGTGAACATCTATAAGCGAGGAGACAATGCCCCTCCAAGTGCAGACGTGACGTATAACTTCACCGATGGTTCAATTACCGGTTTATCTGGTGGGTGGAGCCTGACTATTCCGTCTGGGGATCAGGATGTGTTTATTGGCGTAGCGGTAGCATCGGCAGCAGGTGTTAGCGATGTTATTCACCCTTCAGATTGGTCAGTAGGTCCACTGGGAACCAGTGGACATCAAGCTCAGGTAATTAACCTGTACCAGACTGCCCAAGATGTACCGGCTAAGCCTCAGTCAGATATCGTGTATGACTTTACTCAGGGTAAGGTTACTTCGAATGTAAGCCCTTGGAGCACAACAGTACCTCAAAATACACAAGGTAAAATCTACGTTACTACGGCTACTGCGAATGCTTCTGCTCTGGCAGATTCAGACATCGTGCCTAAAGAAGAGTGGACATTCCCTCAGGTGATCGTAGAAAGCGGTATTAACGCAGTACCGGTAACTCTGTTCCGTAAGCACTCTTTAGATAGCGCTCCTGAGAAACATACGAATACCCTGACTTACTCTTTCGATAATGCAGTACTACTGAACCAACCTAATAATGGATGGTCAGCTACGCCTATGTCGGTAGATCTAGGTGAAGCAGTTTGGAGCATGGTAGCAACAGCACAAGGCTTAGCTACTGACGCTACTGATGACATTGCACCTACTGAGTTCTCAGCACCGGTTATCTACACTGCTACCGGTTCAGAAGTGTTTACTGTTTACCAGTACGCAGAAGCATCTACTGGCCCATGGTATGACGAGTTCACTGCTGAACGTGTTTGGCGTATTCAAGCCACCTCTATCAATGGGGTAGTTAGTGATTGGTCTGAACCAGTTAAGCTTACCGGTGAAGACGGTGCTACCGGTGATACGATTTACGTAGAGTATAACTATTCAGAAGACCTGAGTAATTGGCATCCTGTACTAGTGGAAGGGGATATTTGGCGAAGAGAACGCATCGTCACTAATGACGTAGCTGACGAATGGTCATCTCCTGCACGACTGAAAGGCAACGAGCAGTACATTGAATACCAATACTCAGCCTCTCTGGATCTAGGCGAAGCAGGTTGGCACACTAACTTCAGTTCTGGCGACTACTACCGTAGAGAACGCACGGTGATCAATGGCACTTACGGAGAATGGTCTGAAGGTGTTCAGATGGTTCCTCTAAAAGACGTGGATTACTCCGATGGAGCTGGGGGAGACACGATCTACGAAGTGTATCAATATTCAGTAGATGGCGTTACAGACTGGGTATATGACTTCACGGATGAACACATCTACCGTAGAACGGCAGTGGTAATCAACGGTGCTCTGGGGCCATGGTCTGAGCCTGCTAAATTATCAGGTGTTGATGGTGCAGATGGCGATACAATCTACATGGAGTACGAGTACTCAGTGGATGGTGTTAACTGGCATGCTGATATGGAAGATGGAGACATCTGGAGACATGAACGTGAGCATACCGTAGGTGTTACTGTACCTAGTGACCCTTGGGGAAACCGTACACGAATCCGTGGTATTGATGGTGCTTACTACGAGTACCTATACAATGAAGATGCGGATAATTATCCAACTGAACCAGAAGACGCTTTCGGTACTTGGCATGCTAACTTCTCCGAAGGTGACTATTACCGTATTGAGAGACTGGTTCAAGAAGCTTCTACAGGTAACTGGACTACCCCCACTAAGCTAAAGCCTAAGAAAGGTGAAGACTACAGTGACGGTATCCAAGGTCCTAAAGGCGAAGATGGTGTTACTACGTACACTT
GGATTAAGTACGCTGATGATGCTTCTGGTTCGGGAATGTCTGAAAGTCCTGTTGATAAACCATACATGGGTATTGCCTATAACAAGACTACAGATCAAGAGTCAAATGATCCTAATGATTATGCTTGGTCTAAGGTCGAAGGTGATCAAGGTATCCAAGGCGAAAACGGGTACATGTGGGTACAATACTCTAACTACCCTAATGGTATGAACGGCACTGCCCCTAACATGCACCAAGACCCTGTTGACCCGTCTACAGGCCGTCCTTACCTATATATCGGCATCTCCTACAATAATACCTCTCCTACTGAAGGGAACGACCCTACAGCCTACACATGGTCTAAGTATGTGGGTGATGAGATCTACTACGAGTACAGCTACTCTCCTGATCAGATCTCTTGGGACATGGAGTTGGATGCGAACGATGTATGGCGTAGAGAAAGACGTGTTGAGAATGGACAATATGGAGATTGGTCAGACGCTATTCGTATTGTAGGAACTGAAGGCCCTCAAGGTATCCCGGGCAACGACGGTAATGACGGTAACGATGGAGACTCTCTGTACACATGGGTGAAGTACGCTGATTCAGATACTGGTTCAGGGCTAAGTAATAATCCAGAAGGTAAGGGCTACATCGGTTTTGCGTACAACAAAACTACGCCAGTGGAATCGAACGATCGTAATGACTATGTCTGGTCGTTGGTTAAGGGTACTGACGGTACCGATGGTAAGGACGGGGAGAACGGATCAGACGGTGCACAAGGTATCCCGGGACCTATTGGCCCAGATGGTAAGACGCTGTATACATGGATCAAGTATTCGCCTAATGCTAACGGATACCCTCTTACTGAAGCCCCTGACGAGAATACTAAATATCTGGGTATTTCTACGAATCAACAGGAACAGACTGAGTCTACGGATCCTGATGATTATGTATGGTCTCCTTACACAGGTGCAGACGGCCAATACTACGAAGATGAGTTCTCTGTAAACGGTGACCCTGTTGATTGGGAAACTTGGCACTATCCTGCACAGTCTACGGATAAATTTAAACGTACTCGTTTAATCGATTCTAAAGGCGAACCAGTAGTGGATGACACGACTGATAGTGCAGGGTGGGTGTACACTCAGATTGCTCCTATCAAAGACGTAGACTACGGAGATGGCAATTCAGGAGATACTATTTACGAGGTATATCAATACTCGGTAGACGGTGTCATGAATTGGGAAGATGATTTTCGAGACGAGCACGTATTCAGACGTACTGCCGTAGTAATTAATGGTTCACAAAGTGCATGGTCTGATGCAGCGAGAATCTCCGGCAAGGACGGTCTGCAATCTAACGTGATCACCTTGTATCAAGTTAAGAGTGAGCAGTACTCTTGGAAGGCGGAAGAGTTGATCCAGACAGACGAAACTTACCTGATTCTTACTGGTTCATTTGTAGGGTATGACCCTCAAGATGGCATCAATGGGTGGACGTTAAGTGTTCCTGCTGTACTCACCGCAGGTGAGGCGCTGTACCAAGTTCGTGTTGCAGCATTGTCTGAGGCAGGACAACTAGAAGTACCTATTCCTGCAGACGACTGGGCCTATCCTGTGCAGGTATCTGCAAGCGGTATCTCAGGGGAAGATGGTAAACATGGTTCAGGCAGCTACATCTTAAACTGGGAAGATTCGTATAACTCAGGCCAAGGGTACAAAGACCTGAACACTGGAATAGGAGGGCAGCCTACCGAAGGGGATGTTGAGTACTGGTTCAGAGAACTATCTACGAGAGAATCTCAGGCAGGGGACATTTTAACTATTCAACAGAAAGAAGTGGAATCTGCGGCGCCTAAACAGTGGTTAAGGGACACCGGAGCATGGACAGAGTTTGTATTGTCTGTAGATGGTAACGCTATTGTAAACGGTACATTGGGTGCTGAAGCACTTAAATCAGGCACTACGCTTACGGATGTGCTGTATGTGTCCAATGACGAAGATAGTCATGAGATGACT
CTCTCTGGTTCAGGGGAATGGGAAGACGTTACTGGTACTATTCAAAAGGACGACGATTACCGTATCTGGTTAGGTAATAGAGATCCTAAGAAAGCGCCTTTCTCTGTTACTCGACAGGGTAGCGCTAATGTCTATGGGCATCTAACGGCAACTAGCCTAGAGCTTATGGATAATGCAGATATCCCTGCTGAGATGGACAATGAACGCCGGTACGAAACGTATAATATCGTAAGAGATAATGGGGAGTTCATGGTCTCTGAGGAGACGTATGGGGATTCAGGGCATGGTTGGACAACAGACCCAACTAACCTAGTACCTGCCGCATACGCTCACCCTACTTCGGCTACTTCGTATGATTATCATGACTGTTATGTGGAAACCCGCTACTTTGCAGACGGTGCTCCGTACATGTTTATGAACTCAGCAAGTGAGAACGATACACCAAGATTTTATACCTCTAATGTGTATAACAACTACAACATTACTGTAGATCCCCGTAGTTCTATTGTTGTTAAGTTTAAAGCTAAGTTTTCTAAGTTCGGTAATACGCAAGCAGCTAAACCGAAAGTGAAGATGACTTTCCTTTACGATGATGTGAACGTATCGGACCCTGCTACTTGGAAAGGCAGAGAACATGAATGGGATTCAGGGCTCGATTGGGATACTTACTATAATTTTGAGTACACGTTGGACCTCAATTTAGAGCCGCAATCTAATAACCCTACTAACCCCGGAACTTTAAGGTTTGAGATTGCACCTCATTCTCAGATGATCATCTTTGGGGTTGAAGTGCTACAGCGTGTGTATAAGCCTTATGTGCCTGAGTATACTTTCCATGAAGAGCCTATCTACGGAGTAGAAGAGGGAGACGTGCTGAGCTTAGGGGTTGAGTATGTTGGTTCAGAAACAGACCTTACTTGGGGCTTACGTAGCTTTAACTCTAAAGGTCAAGTCATATGGGAAAGAGGACATAAAGTCTCTGACATTAATACCAGTCTTTGGGAGTTAGATATTCAGGAAGACATTACGGTAGATACGATCGGAGATGGAGGCTTGTTCTTGTTCCTGACGAAGAACTCTGCACCTTTTAACTGGGGAAATTCATTACACAGTACTGTTCATGTACGTAACTTCCAAGTGGTTCACGGAGATTCGTATAAGGAGCACTATGTTCCTGTAAGTGACCCGAATGCATATCCTGACCAAGAAGCAGGGACTTACCCTCAAATTACCGGTGGTGGTTGGACTGGTCCAGAATCAGAGTCAGGATGGGGTGTGCTCTCTAATGGTGATGCGTACTTTAATAACTTGACCGTGACCAATGGTACTCTCTCAAGCGGTACGATCATTGGAGCTGATATCTACGCAGGTAACACGTACCACAAGATTGACTACAACACGAACAATAGCAGCGATACGGTGTATTATAAGAAGTACCCTAATACAGTACCTCTGTATGCTTCTACAAGCTTCTCTGGCAGAGATTACGGCACAAGGACTGCCGTTGGTTTGAATGGCCCACTGACAGACGTGTACGTCAGACCTTGGGATGTTGTGAGCTGTCTGGAGACACGAGGAGAAGTGAATACTCGTAGATTCCGTTGGGGTAAAGTAAGAGCAGGAGCGCTGAGTTGTAAGGTACAATTGCCAAGCACCTACGTAACTTCTCTAGGCGTTGATGTCTATATTATCGATGACAACGGGTGGGAACGAGGAGTATCAGGATACCCTGTTATTTCTCAAGGGCGATACTACGAAGGCACTATGTCTTTGGGTCCTATGATTTTTGATGTAGTTATTAATAACACCGGCTCTACTTACTATGTCGAAATCAAAAATAGACAATGTCAGCAATTTGGGGAGATAGATAATGATCTCTACAATGGCAAGTTCTATGTGAAGGTACGTGCAGGTATTGACAACAACAAACGTGTTGACGCTACTTATGACTTACAATACACAGTAGATAACGACACTTTCCCGGGATAA
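
The CDS coordinates and the protein length can be cross-checked with simple arithmetic. This is a minimal sketch, assuming the range is 1-based and inclusive and that the CDS includes one terminal stop codon (the sequence above ends in TAA); the function name is hypothetical.

```python
def expected_protein_length(start: int, end: int) -> int:
    """Residues encoded by a CDS given 1-based inclusive coordinates.

    Assumes the CDS contains exactly one stop codon at its 3' end.
    """
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a multiple of 3")
    return n_nt // 3 - 1  # subtract the stop codon


# Coordinates from this record: range 9898 -> 17886, strand +
cds_nt = 17886 - 9898 + 1
print(cds_nt, expected_protein_length(9898, 17886))
```

With the coordinates of this record the CDS is 7989 nt long, that is 2663 codons, which gives 2662 residues once the stop codon is subtracted and so agrees with the listed protein length.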

Genome Context


Tertiary structure

PDB ID
9f4e41cefd66244d5e78a159cae44823d6b3426ce90977f6fe66d244b6cdf2fe
ColabFold
Source ColabFold
Method ColabFold
Resolution 0.4002
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
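
The confidence legend can be applied to per-residue pLDDT scores with a small lookup. This is a sketch under one assumption: the legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so here they fall into the lower band. The function name is illustrative.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    Assumption: boundary values (exactly 90, 70, 50) go to the lower band,
    since the legend leaves them unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for s in (95.2, 81.0, 63.5, 42.0):
    print(s, plddt_band(s))
```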