GenBank accession
WJN63887.1 [GenBank]
Protein name
tail fiber
RBP type
TF | Evidence: GenBank | Probability: 1.00
TF | Evidence: RBPdetect | Probability: 0.89
Protein sequence
MASGSLHGLQGGFQRELANMGPVGQTLLAGIGAASIGGEVISTMNDYLGNAQDFGSSNPIQFDAQQQGMEMLGLNKAQAQRANETVHSAFNMMANGDPSAAAQMSVATRGLITLADIRESGGDPIKLNRIFVERARSRGWSQERIAGAAQMAGLDGFARTASTSDRIRNAAEGLDDTRGRQDTSEFAGAVREDNATRAAVSPDYFVQRYGAQDYNALVGGLSGGMSKAYQLMDKAEQVTHAKSLAEATQMLESGGRDYDDKGNPLTSSTGAKYSMQVLPSTARDPGYGVKPAQSDTPEEYNRVGRELLDKMVGKYAGDYDKAAAAYTDGAGTVDRAVKQWGNDWLKHMPAQAQKRVADLHKLATGANGFAGGSTGPSVGTIQVNVTATVNGKQATATANVGNQSQSHTINVGGAVSQKR
Physico-chemical properties
protein length: 419 AA
molecular weight: 43995.17620 Da
isoelectric point: 6.33513
aromaticity: 0.05728
hydropathy: -0.50072
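
The record does not state how these values were computed. As a minimal sketch, assuming the common definitions (aromaticity as the fraction of Phe, Trp and Tyr residues; hydropathy as the mean Kyte-Doolittle value, often called GRAVY), two of them could be reproduced from the protein sequence as follows. The function names `aromaticity` and `gravy` are illustrative, not part of any tool named on this page. Under that definition the reported aromaticity of 0.05728 is consistent with 24 aromatic residues out of 419.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y). Assumed definition."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues. Assumed definition."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue.
    return sum(KD[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    toy = "MASGSLHGLQGGFQ"  # first 14 residues of the protein sequence
    print(len(toy), aromaticity(toy), gravy(toy))
```

Passing the full 419-residue sequence instead of the toy fragment gives values that can be compared with the ones listed above.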

Domains [InterPro]

Domain ID | Type | Range
DC_0159 | STR | 1–245
G3DSA:1.10.530.10 | RBD | 229–362
G3DSA:1.10.530.10 | RBD | 229–355
IPR023346 | STR | 242–344
IPR008258 | ENZ | 266–340
Architecture
STR 1-344 | RBD 345-419

Taxonomy

Role | Name | Taxonomy ID | Lineage
Phage | Erwinia phage Calisson [NCBI] | 3056594 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host | Erwinia amylovora [NCBI] | 552 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
WJN63887.1 [NCBI]
GenBank nucleotide accession
OQ818696.1 [NCBI]
CDS location
range 53275 -> 54534
strand -
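
The coordinates above can be checked against the stated protein length with simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and a CDS that ends with a stop codon; the helper name `cds_protein_length` is hypothetical.

```python
def cds_protein_length(start: int, end: int, has_stop: bool = True) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for an inclusive CDS range.

    Assumes 1-based inclusive coordinates and, by default, a terminal stop
    codon that is not counted as a residue.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    nt_length = end - start + 1
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    return nt_length, codons - (1 if has_stop else 0)


if __name__ == "__main__":
    # Range from this record: 53275 -> 54534
    print(cds_protein_length(53275, 54534))  # (1260, 419)
```

The result, 1260 nt and 419 residues, agrees with the protein length listed under the physico-chemical properties.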
CDS
TTGGCAAGCGGCTCACTGCATGGCCTACAGGGAGGATTCCAGCGTGAACTAGCAAACATGGGGCCGGTTGGTCAAACTCTGTTGGCTGGTATCGGAGCGGCGTCAATTGGTGGTGAAGTCATTTCAACAATGAATGACTATCTTGGAAATGCTCAGGACTTTGGTTCAAGCAATCCAATTCAATTTGACGCTCAACAGCAAGGCATGGAAATGCTTGGGCTTAATAAAGCTCAGGCTCAACGTGCAAATGAAACAGTTCACTCTGCTTTCAACATGATGGCGAATGGCGACCCAAGTGCTGCCGCTCAAATGTCTGTTGCAACTCGTGGACTTATAACGCTTGCTGATATCCGTGAAAGCGGTGGCGACCCAATCAAGTTAAACCGCATCTTTGTTGAACGTGCCCGTTCTCGTGGCTGGAGTCAGGAAAGAATTGCTGGTGCTGCTCAAATGGCAGGACTTGATGGATTTGCTCGCACTGCCTCAACTTCTGACCGAATCCGAAATGCTGCGGAAGGTCTTGATGACACTCGTGGTCGTCAGGATACTTCTGAGTTTGCTGGTGCTGTTCGTGAAGACAATGCAACTCGTGCTGCTGTATCTCCTGACTATTTCGTTCAACGTTATGGTGCTCAGGATTACAACGCATTGGTTGGTGGTTTGTCTGGTGGTATGTCAAAAGCATATCAGCTTATGGACAAAGCCGAACAAGTTACTCATGCCAAATCTCTTGCAGAAGCGACACAAATGCTTGAGTCTGGTGGCCGTGATTACGACGATAAAGGTAATCCTTTAACAAGTTCTACCGGTGCCAAGTATTCAATGCAGGTTCTTCCTTCTACTGCACGTGACCCAGGATATGGAGTCAAGCCTGCTCAAAGTGATACACCAGAAGAATACAATCGTGTTGGTCGGGAACTTCTCGATAAGATGGTTGGTAAGTATGCTGGTGATTATGACAAAGCGGCTGCTGCATATACTGACGGTGCCGGTACTGTAGACAGAGCGGTTAAGCAATGGGGCAATGACTGGCTTAAACATATGCCAGCACAGGCTCAGAAACGTGTTGCTGATTTGCATAAACTGGCAACTGGTGCAAATGGATTTGCTGGTGGTTCTACTGGCCCATCTGTTGGAACTATCCAAGTTAATGTCACTGCGACTGTTAATGGTAAGCAAGCAACTGCAACTGCAAATGTCGGCAATCAATCTCAGTCTCACACTATTAATGTGGGTGGCGCTGTTTCTCAGAAGCGTTAA
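
The CDS above is shown in coding orientation: it begins with TTG and ends with the stop codon TAA, while the protein sequence begins with M. The following is a minimal sketch of how such a CDS could be translated, assuming the bacterial genetic code (NCBI translation table 11), in which TTG is an accepted start codon read as methionine. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are illustrative, and only three of the table 11 start codons are included.

```python
BASES = "TCAG"
# Amino acids of the standard code in TCAG order (first, second, third base);
# table 11 uses the same codon-to-amino-acid assignments.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding-strand CDS; the start codon is always read as M."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("CDS does not begin with a recognised start codon")
    protein = ["M"]
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the CDS above, followed by its stop codon.
    print(translate_cds("TTGGCAAGCGGCTCACTGCAT" + "TAA"))  # MASGSLH
```

The output matches the first seven residues of the protein sequence in this record.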

Tertiary structure

PDB ID: 5ee39abdee53b79fb274c5a49e8777c8d8ce7ba7ea556331c514afa448d021a7
Source: ESMFold
Method: ESMFold
Resolution: 0.6471
Oligomeric state: monomer
Model confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
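
The confidence classes above can be applied to per-residue pLDDT scores with a simple threshold lookup. The following is a minimal sketch, assuming scores on a 0 to 100 scale; because the legend uses strict inequalities, the handling of the exact boundary values 90, 70 and 50 (assigned here to the lower class) is an assumption. The function name `plddt_class` is hypothetical.

```python
def plddt_class(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence class in the legend.

    Assumption: a score exactly on a boundary falls into the lower class.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 41.9):
        print(score, plddt_class(score))
```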