GenBank accession
YP_008059175.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1.00
Protein sequence
MTGKIQDQVAENRNQTRRLDAQFVNVSTQNDRNLTFTNNYIEVEVQVEIYTRDLSTSLISGHPNAKHGSGRGESGDLRTDWTQESVTVSSEVLVRGGRNAIRDALDGQTGAVNQIGVGTGSDDAASGDTALTSETTKNFCWGQKDTFNVTRARSSPFLFAEYGDTVQEFGVFDEAARLLTRSVLDSSLNPTSEKELRVDVTFTFTGDGIGNSVITDDGEEALADSIASVGTATGLKEINYGTGTTTPSTSDTALAAEEIAKDCLRQLDAEQITTQAKLFDSEPATQPVDLTEIGVTDNNGRLVWRTLIKAFEKNSQFEVNTTIGFIIQSK
Physico-chemical properties
protein length: 330 AA
molecular weight: 35702.60230 Da
isoelectric point: 4.48735
aromaticity: 0.06364
hydropathy: -0.45515
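The aromaticity and hydropathy values above can be recomputed from the protein sequence. The following is a minimal sketch, assuming that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the record does not state which definitions it uses, and the function names are illustrative. Molecular weight and isoelectric point are left out because they depend on mass and pKa tables not given here.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # Hypothetical toy peptide; paste the record's protein sequence here instead.
    peptide = "MTGKIQDQVAENRNQTRRLDAQFVNVSTQ"
    print(len(peptide), round(aromaticity(peptide), 5), round(gravy(peptide), 5))
```

The reported aromaticity of 0.06364 corresponds to 21 aromatic residues out of 330, which is consistent with the fraction-based definition assumed here.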

Domains [InterPro]

No domain annotations available.

Taxonomy

Role   Name                           Taxonomy ID  Lineage
Phage  Halovirus HCTV-5 [NCBI]        1273748      No lineage information
Host   Haloarcula californiae [NCBI]  244363       Archaea > Euryarchaeota > Halobacteria > Halobacteriales > Halobacteriaceae > Haloarcula

Coding sequence (CDS)

GenBank protein accession
YP_008059175.1 [NCBI]
GenBank nucleotide accession
NC_021327.1 [NCBI]
CDS location
range 88094 -> 89086
strand +
CDS
ATGACGGGCAAAATTCAAGACCAAGTTGCGGAGAACAGGAATCAGACGAGACGTCTGGACGCGCAGTTCGTCAACGTCTCTACGCAGAACGACCGGAATCTTACGTTCACCAACAACTACATCGAGGTGGAGGTGCAGGTCGAGATTTACACGCGCGACCTTAGCACATCTCTCATCTCGGGACACCCGAACGCGAAGCATGGGTCTGGGCGTGGTGAATCGGGAGACCTTCGTACTGACTGGACGCAGGAATCCGTTACCGTGTCCTCGGAGGTCCTCGTGCGTGGAGGACGTAACGCCATCAGAGACGCGCTTGACGGGCAGACAGGCGCGGTCAATCAGATAGGCGTAGGTACGGGCAGCGACGACGCGGCTTCGGGAGACACCGCGCTCACGTCGGAGACGACGAAGAACTTCTGTTGGGGGCAGAAGGATACGTTCAACGTGACTCGTGCCCGTTCGTCGCCGTTCCTTTTCGCAGAGTACGGAGACACGGTTCAGGAGTTTGGCGTCTTCGACGAGGCAGCGAGGCTGCTCACGAGGTCTGTTCTCGATAGTTCACTCAACCCGACGAGTGAGAAGGAACTGCGCGTGGACGTGACTTTCACGTTCACTGGTGACGGAATCGGCAACTCGGTCATCACCGACGACGGTGAGGAAGCTCTTGCAGACAGCATCGCTTCGGTCGGTACCGCGACGGGTCTGAAAGAGATTAACTACGGGACTGGGACGACGACGCCCAGCACGTCTGACACAGCGCTCGCCGCCGAGGAGATAGCAAAGGACTGTCTTCGACAGCTCGACGCTGAGCAGATTACGACACAGGCGAAACTGTTTGATAGCGAACCTGCAACTCAGCCCGTAGACCTCACGGAAATCGGTGTTACTGACAACAACGGTCGTCTAGTCTGGCGCACGCTCATCAAGGCGTTCGAGAAGAATAGTCAGTTCGAGGTTAACACGACCATCGGGTTTATCATCCAGTCGAAGTAA
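The CDS coordinates, the nucleotide sequence and the protein length can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and the standard genetic code (whose sense codons are the same as in NCBI translation table 11); the record does not state the translation table, and the function names are illustrative.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a CDS with 1-based inclusive coordinates."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n = cds_length(88094, 89086)
    print(n, n // 3 - 1)  # nucleotides, residues excluding the stop codon
    print(translate("ATGACGGGCAAAATTCAAGACCAAGTTGCGGAG"))  # first 11 codons of the CDS
```

The range 88094 -> 89086 spans 993 nucleotides, that is 331 codons: 330 residues plus the terminal TAA stop codon, matching the reported protein length of 330 AA.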

Genome Context

Tertiary structure

PDB ID
e3a9bb362ce28ddbc3a44ab9aa44ee636ed4fb0fef490ce0f2bda69849e62cd2
Source ESMFold
Method ESMFold
Resolution 0.8408
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
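The confidence bins above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch: the function name is illustrative, and because the legend does not say which class the exact boundary values 90, 70 and 50 belong to, the code assigns each boundary to the lower class as an assumption.

```python
def plddt_class(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence class used in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower class,
    since the legend uses strict inequalities and leaves them unassigned.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Hypothetical per-residue scores, for illustration only.
    for score in (95.2, 81.0, 63.5, 42.0):
        print(score, plddt_class(score))
```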