Genbank accession
XRQ88083.1 [GenBank]
Protein name
tail fiber protein host specificity
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,82
Protein sequence
MPIGVNVSGTNRQATMKVNVGGVWKYPIGWVRENGEWKKFQNPEFTYTISQNTANFNLATAVGSREEVIINLVINSGVSVYSTSVSTPAIVIPDSFAGKTINIINNGNIYGQGGVGGTGAGQAGGPALRVQTSQKINLTNNGTIAGGGGGGGKGGTGGNGFFTTQSTQRDPSSGTWTYATGNHIEFRSRNCILRMGNSEIYRWYGPDFSTVVTYGITITVGEWTYYASNFYGDGVGVLKSNAMYRTRTVTNTTYTTGGAGGNGGNGQGFSQAATNGLAGANGGTNAGRGGNGGNGGTFGVAGATGATGASGNQTAGLGGQAGGAAGAAVDGTSKVNYVNAGKLLGPLIN
Physico‐chemical
properties
protein length:349 AA
molecular weight: 35213,25450 Da
isoelectric point:9,62708
aromaticity:0,08883
hydropathy:-0,25702

Domains

Domains [InterPro]
DC_1762
ATT
1–180
IPR007932
RBD
66–156
XRQ88083.1
1 349
Architecture
ATT
RBD
ATT 1-180 | RBD 196-348 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage Rostov-5342
[NCBI]
3414238 Viruses >
Host Vibrio cholerae O1 El Tor
[NCBI]
172918 Chordata > Actinopteri > Cypriniformes > Cyprinidae >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XRQ88083.1 [NCBI]
Genbank nucleotide accession
PV403754.1 [NCBI]
CDS location
range 2018 -> 3067
strand -
CDS
ATGCCGATTGGAGTAAATGTTTCTGGTACTAACAGACAGGCTACAATGAAAGTTAATGTCGGTGGAGTTTGGAAATATCCGATTGGTTGGGTTCGTGAAAACGGGGAATGGAAGAAATTCCAAAACCCCGAATTTACTTACACAATCAGTCAGAATACAGCAAACTTCAACCTAGCTACGGCAGTAGGTAGTAGAGAAGAGGTTATTATTAATCTGGTAATTAACTCAGGTGTGAGTGTGTACTCAACAAGTGTTAGCACTCCAGCAATCGTGATTCCAGATAGCTTTGCAGGAAAGACAATTAACATCATCAACAATGGTAATATCTATGGTCAGGGTGGCGTTGGTGGAACCGGAGCAGGTCAGGCTGGAGGCCCTGCCCTACGAGTACAGACTTCCCAAAAGATTAACTTGACCAATAACGGCACAATCGCCGGTGGAGGCGGTGGTGGTGGTAAAGGCGGTACAGGTGGTAACGGTTTCTTTACTACTCAATCTACACAACGTGACCCATCATCTGGAACTTGGACATACGCTACAGGGAACCATATCGAATTTCGTTCTCGTAACTGTATCCTACGTATGGGTAACTCCGAGATATATCGTTGGTATGGTCCAGACTTTAGTACTGTTGTTACCTATGGTATCACCATAACCGTGGGAGAATGGACGTACTATGCCTCCAACTTCTACGGTGACGGTGTGGGTGTTTTGAAGTCGAATGCAATGTACAGAACTAGAACAGTGACCAATACTACATACACTACCGGTGGTGCTGGTGGTAATGGAGGAAACGGTCAAGGATTCTCACAAGCAGCAACCAACGGTCTAGCTGGTGCTAACGGTGGGACTAACGCAGGTCGAGGTGGTAACGGAGGCAATGGTGGTACTTTCGGGGTGGCAGGTGCAACCGGAGCTACCGGTGCATCAGGAAACCAGACTGCTGGACTCGGTGGTCAGGCTGGAGGTGCTGCTGGAGCTGCCGTTGATGGTACGTCTAAAGTTAACTATGTCAATGCTGGCAAACTGTTAGGTCCACTAATCAACTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
e18184b40558197bd414a4c92fd313bec38b8a5789d1e3baf67fd0690cc15e1c
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8147
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50