Genbank accession
UBF22183.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TF
Evidence RBPdetect
Probability 0,84
Protein sequence
MAYQSKVKTYGAPADERAERPDSWRWKQNEPPVAEYLNDLNYNVIEDIKHLVALTNAIDPDNDGMVANADKLDGKHADELGGFKYVQTETPNTPAVGNSWFKDTNGLILVGDGEKYQPQPAVGYQETGDFTAEGYSVSHETVPRTRLDGSGHIALLNEQVVVDFEDGNTTPEHSAWSWTDSSGLSAQGATVISGTQSGEYSVAGTLDAITLNREAPIIQDFEVTFQVGSDTGNISDYSELIVKAQDGTLIGGVRFNDGNGSPVVLDDSRAPIENIQSAWSVGQNYSFEWDWDFGNSQYDLYMDGALVGTYSVPAGVSDFGEFTVRQDNSSSGSTRSVFIDDLHTGAREYGEAVITFAEPDQRIVSWDIIRLTKTLAGESVVVDVEDSSGTKLVSDIENEGDLSAYVSGSENFQLRVKISRSNTANQPSLDYVYRRWTMRPGDTGLSEKEQEELNKKILATALMDF
Physico‐chemical
properties
protein length:465 AA
molecular weight: 50908,87940 Da
isoelectric point:4,28705
aromaticity:0,09247
hydropathy:-0,50129

Domains

Domains [InterPro]
UBF22183.1
1 465
Architecture
ATT
STR
RBD
ATT 1-185 | STR 209-396 | RBD 397-463 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Halorubrum virus HRTV-2
[NCBI]
2878000 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Halorubrum sp.
[NCBI]
1879286 cellular organisms > Archaea > Methanobacteriati > Methanobacteriota > Stenosarchaea group > Halobacteria

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UBF22183.1 [NCBI]
Genbank nucleotide accession
MZ334517 [NCBI]
CDS location
range 24942 -> 26339
strand +
CDS
ATGGCATACCAAAGCAAAGTCAAAACCTACGGCGCTCCGGCGGACGAGCGGGCCGAGCGGCCCGATTCGTGGCGCTGGAAACAAAACGAGCCGCCTGTCGCTGAGTACCTGAACGACCTGAACTACAACGTCATTGAGGACATCAAGCACCTCGTGGCGCTGACGAACGCCATCGACCCCGACAACGACGGGATGGTGGCGAACGCCGACAAGTTGGACGGCAAGCACGCCGATGAACTTGGCGGCTTCAAGTACGTTCAGACCGAGACCCCGAATACGCCCGCAGTGGGTAATTCGTGGTTCAAAGACACGAACGGCCTCATCCTCGTTGGTGACGGTGAAAAGTACCAACCTCAGCCAGCGGTCGGCTACCAAGAAACCGGCGACTTCACCGCCGAAGGGTACTCTGTTTCGCACGAAACAGTCCCTCGAACGAGACTCGATGGGAGTGGTCACATTGCGCTCCTCAACGAGCAGGTCGTCGTAGACTTCGAGGACGGCAATACCACCCCCGAACACTCGGCGTGGTCGTGGACGGACTCATCTGGTCTCTCCGCACAGGGAGCAACGGTCATCTCGGGTACTCAAAGCGGCGAGTATTCCGTCGCTGGCACGCTCGACGCAATTACACTAAATCGTGAAGCGCCCATCATCCAAGACTTCGAGGTGACGTTCCAAGTCGGCTCTGACACCGGGAACATCAGCGACTACTCGGAACTCATCGTCAAAGCACAAGACGGAACGCTCATCGGGGGCGTTCGGTTCAACGACGGTAATGGCTCACCTGTTGTACTTGATGATTCCAGAGCGCCCATCGAAAACATCCAGTCCGCGTGGTCTGTCGGACAGAACTACTCGTTCGAGTGGGACTGGGACTTCGGAAACTCCCAGTACGACCTGTATATGGACGGTGCGCTGGTCGGTACGTACTCTGTTCCGGCGGGCGTGAGCGACTTCGGGGAGTTCACTGTCCGACAAGACAACTCCTCGTCCGGTTCCACCCGGTCGGTCTTCATAGACGACCTCCACACCGGGGCGCGTGAATACGGCGAAGCGGTCATTACGTTCGCCGAACCCGACCAGCGAATCGTTTCGTGGGACATCATCCGGCTCACGAAAACGCTCGCAGGAGAGAGCGTCGTCGTGGACGTGGAGGATTCCAGCGGAACGAAACTCGTCTCAGACATCGAGAACGAGGGCGACCTCTCGGCCTATGTTTCCGGCTCGGAGAACTTCCAACTCCGGGTGAAAATTAGTCGGTCGAACACGGCAAATCAACCCTCTCTCGACTACGTGTATCGGCGCTGGACGATGCGCCCCGGAGACACTGGACTCAGCGAAAAGGAGCAGGAAGAATTAAACAAGAAAATCCTCGCTACCGCATTGATGGACTTCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
e41c9c0994fbf20ac0d32c59a476553dba6d136b87e73f00cd95776e8820466b
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7339
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50