Genbank accession
WMM95241.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,91
Protein sequence
MAQINIGRVRMGWKGTWSSGTTYVAQDAVYYNGETFVAKTDVPVGTATTNTTYWQQVAQKGTNGTNGVDGATGPQGPTGAAGIQGDTGPQGIQGATGPTGRTCATGPAGPQGDVGDTGPTGPQGPIGNTGPTGPQGVAGDDGDTGPTGPQGPIGNTGPTGPQGDAGDDGATGATGPQGPIGNTGPTGPQGPPGDDAATGATGPQGPIGNTGATGAQGPEGAQGDTGPQGSQGLTGNTGPTGPQGPQGDDGPTGATGPQGPIGNTGPTGPQGNIGNTGPQGPQGDDGPTGATGPQGPQGSTGSTGATGPSGPAPSHGWSGYTLRFMNPNGTWGSYTNLRGATGATGSTGPQGATGAQGPIGNTGPTGSTGPQGSTGPSGPAPSHGWSGYSLRFMNPNGTWGSYVNLRGATGATGPQGTTGAQGPTGNTGATGSTGPQGIQGPEGDQGPTGPQGATGPQGSQGVKGDTGNTGATGSAGPQGPTGPTGATGATPSHSWSGTSLRFTNSNGSWGSYVNLKGDTGNTGAAGSTGATGPQGATGPAGSAGPAGATGASGTGLGLWTLHASGYANNTSTAYNFGSTTGTFFIQLYHTRSSVKSFRVYNTAGGITFTGGRNGNGSSASVTSGTAYSGGVLSYNTINFIVKTNGSNTYVQGFGSEPMSYDIWRIS
Physico‐chemical
properties
protein length:666 AA
molecular weight: 63352,42470 Da
isoelectric point:4,96718
aromaticity:0,05856
hydropathy:-0,65976

Domains

Domains [InterPro]
DC_2181
ATT
4–154
DC_0341
STR
234–336
WMM95241.1
1 666
Architecture
ATT
STR
ATT 4-154 | STR 155-659 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Roseobacter phage CRP-114
[NCBI]
3072842 No lineage information
Host Rhodobacteraceae bacterium
[NCBI]
1904441 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Alphaproteobacteria > Rhodobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WMM95241.1 [NCBI]
Genbank nucleotide accession
OR420740.1 [NCBI]
CDS location
range 33485 -> 35485
strand +
CDS
ATGGCTCAAATTAATATTGGTCGAGTGCGTATGGGTTGGAAAGGAACTTGGAGTTCTGGAACAACCTATGTTGCACAAGATGCCGTCTACTATAACGGCGAAACCTTTGTTGCTAAGACTGACGTTCCCGTTGGTACAGCGACAACTAACACAACCTACTGGCAGCAAGTTGCTCAGAAGGGGACTAACGGAACCAACGGTGTTGATGGTGCTACAGGACCACAAGGCCCAACAGGCGCTGCTGGCATCCAAGGTGATACAGGTCCCCAAGGTATCCAAGGCGCTACAGGACCTACAGGACGTACGTGTGCCACAGGACCAGCAGGCCCCCAAGGTGATGTAGGTGATACAGGACCAACAGGACCACAAGGTCCTATCGGCAACACAGGACCTACAGGTCCTCAAGGTGTTGCGGGTGACGATGGTGATACAGGACCAACAGGACCTCAAGGTCCTATCGGCAATACAGGACCTACAGGTCCTCAAGGTGATGCGGGTGACGACGGTGCTACAGGTGCTACAGGACCACAAGGTCCTATCGGCAACACAGGACCTACAGGTCCTCAAGGACCTCCAGGAGACGATGCTGCTACAGGTGCTACAGGACCACAAGGTCCTATTGGCAATACAGGCGCGACTGGCGCTCAAGGCCCAGAGGGAGCACAAGGTGATACAGGACCACAAGGTTCACAAGGTTTGACCGGAAATACAGGACCTACAGGTCCACAAGGACCTCAAGGTGATGATGGTCCAACAGGAGCTACAGGTCCACAGGGTCCCATCGGTAACACAGGCCCTACAGGCCCACAGGGTAATATCGGTAATACAGGCCCACAAGGACCTCAAGGTGATGATGGTCCTACTGGCGCTACAGGCCCACAGGGTCCTCAAGGCTCGACAGGCTCAACAGGTGCAACAGGTCCGTCAGGCCCAGCGCCATCACACGGTTGGTCTGGTTATACCTTACGGTTCATGAACCCTAACGGTACATGGGGTAGCTATACAAACTTGCGTGGCGCTACAGGTGCTACTGGTTCAACAGGACCTCAAGGTGCTACAGGCGCACAAGGCCCAATTGGTAATACTGGTCCTACAGGTTCTACCGGACCTCAAGGTTCAACAGGCCCATCAGGCCCAGCTCCATCTCATGGATGGTCAGGGTACAGCCTACGGTTCATGAATCCCAATGGAACTTGGGGTAGCTATGTAAACTTGCGCGGAGCTACTGGCGCTACAGGCCCACAGGGTACTACAGGGGCGCAAGGCCCAACGGGTAACACCGGAGCTACTGGTTCTACTGGACCACAGGGTATTCAGGGACCAGAAGGTGACCAAGGTCCCACAGGCCCACAGGGTGCGACAGGTCCTCAAGGTTCTCAGGGTGTAAAGGGCGATACTGGTAACACAGGTGCTACTGGCTCGGCTGGTCCTCAAGGTCCAACAGGTCCAACAGGTGCCACAGGTGCAACGCCTAGCCACAGCTGGTCAGGTACTTCGCTTCGCTTTACAAACTCTAACGGGTCATGGGGCAGCTATGTAAACCTGAAAGGTGATACTGGTAATACAGGTGCTGCAGGTTCTACGGGTGCTACAGGTCCTCAAGGGGCTACTGGTCCAGCAGGTTCTGCAGGTCCGGCGGGTGCAACTGGAGCAAGCGGTACTGGCCTTGGTCTATGGACACTCCACGCCTCTGGATATGCCAATAATACCTCAACTGCTTACAACTTTGGTTCTACTACAGGAACATTTTTTATTCAGCTGTACCATACCCGTTCCTCTGTAAAATCTTTTCGTGTTTATAATACGGCTGGGGGTATAACTTTCACAGGCGGAAGAAATGGTAATGGGAGTTCAGCGTCAGTAACGAGCGGCACTGCATACTCAGGCGGTGTTCTAAGCTACAACACAATCAATTTTATTGTGAAAACCAATGGTTCAAATACATATGTACAAGGATTTGGTAGCGAACCAATGAGCTATGACATTTGGAGGATAAGCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
01d97862893c0605ae3b52bc161d27747b75ab51b6adfc8770699a77e662be53
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6008
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50