UniProt accession
A0AA87CIK5 [UniProt]
Protein name
Tail fiber protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence Phold
Probability 1,00
Protein sequence
MELYRTFDRMTRSITSDRKGYAGQTYDSGSTRIHFKIQDGGEDWDFKADGYVPYIVFAVYDEVGNPYVYGPDSSPVFSGFDFPIPYEITSRASSLRVEYNLWFVKAEVADNFNGTPDGLLVTEYLLSATDGVAFRASCIKPPKPGCGCKAPPYTPATAPTVIGALEALKSLAVIRPVAKEAHINPYGEEQGLDLYFKSISGEFQQMWLNVPTLSDDGKLKASQLPTGNGVDSIPLLKSMVGNGDAIVYDGAKQGFVAKKVSATAGAALAAPASLVQASAMGQRMEYVKRLDGEAGAHYLRLLDGNGQEICHVDLPLESMISKAYYDASKKSLIFEVDGAEEPIVVPVHDLVDTFRPGDENITIELVASGDAENPTIHTISLSSSFLARIKDDELDLAAHKDDVQNPHRVTKAQVGLGNVENLSPANMPVSDATAAAIASAKQDVSADVEAVEARVGVIETQLNGDSGEGGIIEKHDRDIARIDSETTAIRAELALKASSQDLATAVGAKQDRLIPGSNIAISDTNVISYVGPNIDVDSKMSATSTNPVQNRVLVAELDKLQPKLTAGTNIKIENGRIDATVPPTTVDDAMSYSSKNPVQNRVIARELDKKANIGEGVSVWKGITQDGLYLYNTGDVVVYDNALYISRVDNNDHHPDDETCWSVVRGATTTQVVGITPATYIGVFGNTSDTVYTIEHRMNTRNIVFSFMRNDGSYQFVYPTMVSAPTLSTMRVQLPSPPGNNALIVNMVKARTVTPSGVKSYPAVIEFATPDREWQVYNDTGKPLYVKAYNTEGVEAEGDVIQDSATEYSPVTIDFGEATAGKLFLAESDIVKEYNGTSLDIPIASGDRYLVQCFRDGEGQSRLDIIQTDGNVHIGSSSPWAGTVAMFKATASKSWKAADMKQENGRYKISYQHNKGRLVGAQVYTSEGMAMTELSCTDNVITVYTNSAIDGELYII
Physico‐chemical
properties
protein length:956 AA
molecular weight: 103334,57150 Da
isoelectric point:4,79888
aromaticity:0,07741
hydropathy:-0,24059

Domains

Domains [InterPro]

No domain annotations available.

Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Caudoviricetes sp. vir335
[NCBI]
3068357 Uroviricota > Caudoviricetes >
Host Methanomassiliicoccales
[NCBI]
1235850 cellular organisms > Archaea > Methanobacteriati > Thermoplasmatota > Thermoplasmata >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
DBA35590.1 [NCBI]
Genbank nucleotide accession
BK063680 [NCBI]
CDS location
range 31141 -> 34011
strand -
CDS
ATGGAACTGTATCGTACATTCGACAGGATGACCCGCTCGATAACGAGCGACCGCAAAGGATACGCGGGACAGACCTACGATTCCGGGAGCACCCGCATCCACTTCAAGATCCAGGACGGAGGGGAGGATTGGGACTTCAAGGCGGACGGGTATGTGCCGTACATCGTGTTCGCGGTATACGACGAGGTCGGGAATCCCTACGTCTACGGCCCCGATTCCTCGCCCGTGTTCTCCGGATTCGACTTCCCCATCCCCTACGAGATAACCTCCAGGGCCTCGTCCCTGCGTGTGGAGTACAACCTTTGGTTCGTCAAGGCGGAGGTCGCGGACAACTTCAACGGAACCCCCGACGGCCTTCTCGTGACCGAGTACCTCCTCAGCGCGACGGACGGCGTGGCGTTCCGCGCGTCCTGCATAAAGCCTCCCAAGCCCGGATGCGGATGCAAGGCGCCGCCGTACACTCCTGCAACGGCTCCTACCGTGATAGGGGCTCTGGAGGCCCTGAAGTCGCTGGCGGTCATCCGTCCCGTCGCCAAGGAGGCCCACATCAACCCCTATGGAGAGGAGCAGGGGCTGGACCTGTACTTCAAGTCCATCTCCGGCGAATTCCAGCAGATGTGGCTGAACGTCCCGACCCTGTCCGACGACGGGAAGCTCAAGGCATCCCAGCTCCCGACGGGCAACGGCGTCGATAGCATCCCGCTGCTCAAGAGCATGGTCGGGAACGGCGATGCCATCGTCTATGACGGCGCCAAGCAGGGCTTCGTCGCCAAGAAGGTCTCCGCCACCGCAGGCGCGGCCCTCGCCGCTCCCGCATCGCTCGTGCAGGCCTCCGCGATGGGTCAGAGGATGGAGTACGTCAAGAGGCTGGACGGAGAGGCCGGCGCCCACTATCTGCGCCTTCTCGACGGCAACGGCCAGGAGATATGCCATGTGGACCTTCCGCTGGAGAGCATGATATCCAAGGCGTACTACGACGCGTCCAAGAAGTCCCTCATCTTCGAAGTGGACGGAGCCGAGGAGCCCATCGTCGTCCCCGTCCACGATCTCGTGGACACGTTCAGGCCCGGCGATGAGAACATCACCATCGAGCTGGTGGCGTCCGGCGATGCCGAGAATCCGACCATCCACACCATCTCCCTGTCCTCGTCCTTCCTCGCCCGCATAAAGGACGACGAGCTGGACCTCGCGGCCCACAAGGACGACGTGCAGAACCCGCACCGCGTCACCAAGGCGCAGGTCGGCCTGGGCAACGTGGAGAACCTGTCTCCTGCCAACATGCCCGTGTCCGATGCGACCGCGGCGGCCATCGCCAGCGCCAAGCAGGACGTGTCCGCGGACGTCGAGGCCGTGGAGGCCAGGGTGGGAGTCATCGAGACGCAGCTCAACGGCGACAGCGGGGAGGGCGGCATCATCGAGAAGCACGACAGGGACATCGCCCGCATCGATTCCGAGACCACCGCCATCAGGGCGGAGCTGGCGCTGAAGGCATCCTCGCAGGACCTTGCGACCGCGGTCGGAGCGAAGCAGGACAGGCTCATCCCCGGCAGCAACATCGCCATCAGCGACACCAACGTCATCTCCTACGTCGGCCCGAACATCGATGTGGATTCCAAGATGTCCGCCACCTCGACCAATCCCGTCCAGAACAGGGTGCTGGTCGCAGAGCTGGACAAGCTCCAGCCGAAGCTCACGGCAGGGACGAATATCAAGATCGAGAACGGCAGGATAGACGCCACCGTCCCTCCGACCACCGTGGACGACGCGATGTCCTATTCGTCCAAGAATCCCGTCCAGAACAGGGTCATCGCACGCGAGCTGGACAAGAAGGCCAACATCGGCGAGGGAGTCAGCGTCTGGAAGGGAATCACCCAGGACGGCCTGTATCTGTACAATACAGGCGATGTCGTCGTCTACGACAACGCGCTCTACATCTCCCGCGTCGATAACAACGACCATCACCCGGACGACGAGACCTGCTGGTCCGTCGTGCGCGGAGCCACGACCACACAGGTCGTCGGAATCACGCCTGCCACATACATCGGCGTGTTCGGGAACACCTCCGACACCGTCTACACGATAGAGCATAGGATGAACACGCGCAACATCGTCTTCAGCTTCATGAGGAACGACGGCAGCTACCAGTTCGTCTACCCCACGATGGTGTCCGCGCCGACGCTGAGCACCATGCGCGTCCAGCTCCCGTCGCCTCCGGGCAACAACGCCCTCATCGTCAACATGGTGAAGGCGAGGACGGTCACGCCGTCGGGCGTCAAGAGCTATCCCGCGGTCATCGAGTTCGCAACCCCCGACAGGGAGTGGCAGGTCTACAACGACACGGGCAAGCCTCTGTACGTGAAGGCGTACAACACGGAAGGAGTCGAAGCGGAAGGCGATGTCATCCAGGACAGCGCTACGGAGTATTCTCCTGTCACCATAGATTTCGGCGAAGCGACGGCGGGAAAGCTCTTCCTTGCAGAGTCGGATATCGTGAAGGAGTACAACGGAACATCTTTGGACATCCCGATCGCATCGGGCGACAGATACCTTGTGCAGTGCTTCAGGGACGGCGAGGGGCAGTCCAGGCTGGATATCATCCAGACCGATGGGAATGTGCATATCGGATCTAGCAGCCCTTGGGCAGGTACCGTCGCCATGTTCAAGGCGACCGCGTCCAAGAGCTGGAAGGCTGCCGATATGAAGCAGGAGAACGGCAGGTACAAGATATCCTACCAGCATAACAAGGGCAGATTGGTCGGAGCACAGGTCTATACTTCCGAAGGGATGGCCATGACCGAGCTGTCCTGTACCGATAATGTCATCACCGTCTATACCAACTCGGCGATAGACGGCGAGCTCTACATAATCTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
302e0761eee151d95816e725de51eb8e16b0bb1d776edcfc2950f5fb67c42f63
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5579
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50