GenBank accession
QKN84465.1 [GenBank]
Protein name
central tail fiber J
RBP type
TF (Evidence: Phold; Probability: 1.00)
TF (Evidence: RBPdetect; Probability: 0.55)
Protein sequence
MYTVRSYGNFNPSKHKKGLYKGDTKIHEAGRSYNLCIYNPSTKQWTNSTYDVWGYISESNRLTAAINAAEDDEIIVLYAFDEPRTNRSAGGLPNAVYSIGGSKEMFSAKIGRWYAYALVGRKNLGPGNGLEAVSDQHEGHNPDFAEVSLSFVISPQGTVDFDYKGKLTDSIDGAKASISSLSDVVVSNNEALSRRIDSVTAKAGANESAIQSEITARTDADSALSSRIDSVTAKAAQNEAAITAEATARADADKAFASSMQTISAEVTSNKENAGDLYTVRSYGSFNPSKHKKGLYKGDTKIHEAGRSYNLCIYNPSTKQWTNSTYDVWGYISESNRLTAAINAAADDEIIVLYAFDEPRTNRSAGGLPNAVYSIGGSKEMFSAKIGRWYAYALVGRKNLGPGNGLEAVSDQHEGHNPDFAEVSLSFVISPQGTVDFDYKGKLTDSIDGAKASIATLNQTIADTEHTLAQQITQLESEFDGKTAAIQQELTTTVNKVGEVESKYSVKVDNNGYVSGFGLISTENNGVPTSEFAVRADKFFIAAPAGSNYDGGDNKYPFIVQNGKVYMRNAVIQNGAIDSAKIANVIQSTNYVPGKSGWKLPKNGAAEFNSDVVVNAQIQANSIVGDIVSAISKPASPKTQYKHATYTGLFGTVSITHARPFDRTLVISVGFRVAAISDSEGGTNVQTGWVECVSSKYGNSKSFRRRVQSIKDDPVNHATQDGVEQIVYSIPANTKGSISIYGYSEISEIGGYCRIYPVNGNPESESTSGVFFAQLFKNGADLV
Physico-chemical properties
protein length: 783 AA
molecular weight: 84255.12730 Da
isoelectric point: 5.77276
aromaticity: 0.09323
hydropathy: -0.38825
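The record does not say how aromaticity and hydropathy were computed. The sketch below assumes the common definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the GRAVY score (mean Kyte-Doolittle value per residue). The function names are illustrative only. Under the first assumption, the reported aromaticity of 0.09323 would correspond to 73 aromatic residues out of 783 (73/783 ≈ 0.09323).

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    # Raises KeyError on non-standard residue codes such as X or B.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Passing the full protein sequence from this record to both functions gives values that can be compared against the reported ones; a mismatch would suggest the database uses a different definition.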

Domains

Domains [InterPro]
InterPro ID  Type  Residues
IPR039477    STR   31–123
IPR015406    RBD   485–613
QKN84465.1 (residues 1–783)
Architecture
STR 1-255 | STR 307-399 | RBD 437-745
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
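The architecture line is easy to turn into structured data. The sketch below parses it into (type, start, end) tuples and counts how many residues the annotated segments cover. It assumes 1-based inclusive coordinates, and the function names are hypothetical, not part of any database API.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'STR 1-255 | STR 307-399 | ...' into (type, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*([A-Z]+)\s+(\d+)\s*-\s*(\d+)\s*", part)
        if match:  # empty fragments (e.g. after a trailing '|') are skipped
            segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def covered_residues(segments: list[tuple[str, int, int]]) -> int:
    """Total residues in the segments, assuming inclusive, non-overlapping ranges."""
    return sum(end - start + 1 for _, start, end in segments)


segments = parse_architecture("STR 1-255 | STR 307-399 | RBD 437-745 |")
print(segments)                    # [('STR', 1, 255), ('STR', 307, 399), ('RBD', 437, 745)]
print(covered_residues(segments))  # 657 of the 783 residues
```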

Taxonomy

Role   Name                         Taxonomy ID  Lineage
Phage  Vibrio phage Marilyn [NCBI]  2736287      Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host   Vibrio harveyi [NCBI]        669          cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

GenBank protein accession
QKN84465.1 [NCBI]
GenBank nucleotide accession
MT448615.1 [NCBI]
CDS location
range 40012 -> 42363
strand -
CDS
TTGTACACGGTTAGATCTTATGGCAATTTCAATCCATCAAAGCACAAAAAAGGTCTATATAAAGGCGACACAAAAATACATGAAGCTGGTCGTAGCTACAACCTATGTATTTACAACCCATCAACAAAGCAATGGACAAATTCAACCTACGATGTTTGGGGTTATATTTCAGAATCAAACAGATTGACAGCTGCAATTAACGCCGCGGAAGATGACGAAATTATCGTTCTTTACGCTTTTGATGAACCTAGAACAAACAGATCGGCTGGCGGTTTGCCAAATGCGGTTTATAGCATTGGCGGTTCAAAAGAAATGTTTTCCGCAAAAATTGGTAGATGGTATGCGTATGCGCTTGTTGGTAGAAAAAACCTTGGTCCAGGTAACGGATTGGAAGCAGTATCAGACCAACACGAAGGGCACAATCCAGATTTTGCCGAAGTAAGCTTAAGCTTTGTTATTAGCCCACAAGGCACTGTTGATTTTGACTACAAGGGAAAGCTAACCGACTCTATTGATGGAGCAAAAGCAAGCATTTCATCATTAAGCGATGTCGTAGTGAGCAACAACGAAGCGCTATCTAGACGCATTGACTCTGTTACCGCCAAAGCTGGTGCAAACGAATCTGCCATTCAATCAGAAATTACTGCGCGTACTGATGCTGATAGCGCACTATCAAGTCGTATCGACTCTGTAACAGCAAAGGCTGCTCAGAACGAAGCGGCAATCACGGCAGAGGCTACAGCTAGAGCTGATGCAGATAAGGCTTTTGCATCATCAATGCAAACTATTAGCGCAGAGGTAACAAGCAATAAAGAAAATGCAGGTGATTTGTACACGGTTAGATCTTATGGCAGTTTCAATCCATCAAAGCACAAAAAAGGTCTATATAAAGGCGACACAAAAATACATGAAGCTGGTCGTAGCTACAACCTATGTATTTACAACCCATCAACAAAGCAATGGACAAATTCAACCTACGATGTTTGGGGTTATATTTCAGAATCAAACAGATTGACAGCTGCAATTAACGCCGCGGCAGATGACGAAATTATCGTTCTTTACGCTTTTGATGAACCTAGAACAAACAGATCGGCTGGCGGTTTGCCAAATGCGGTTTATAGCATTGGCGGTTCAAAAGAAATGTTTTCCGCAAAAATTGGTAGATGGTATGCGTATGCGCTTGTTGGTAGAAAAAACCTTGGTCCAGGTAACGGATTGGAAGCAGTATCAGACCAACACGAAGGGCACAATCCAGATTTTGCCGAAGTAAGCTTAAGCTTTGTTATTAGCCCACAAGGCACTGTTGATTTTGACTACAAGGGAAAGCTAACCGACTCTATTGATGGAGCAAAAGCAAGTATTGCTACGTTGAACCAAACTATAGCCGACACAGAGCACACTCTAGCGCAACAAATTACACAACTAGAGTCTGAGTTCGACGGGAAAACTGCTGCAATCCAGCAAGAGTTAACCACTACCGTTAACAAGGTTGGTGAGGTTGAGTCGAAATACTCCGTTAAAGTTGACAATAACGGATACGTTTCTGGTTTTGGCTTGATTTCCACAGAAAATAACGGGGTGCCGACTAGTGAGTTTGCGGTAAGGGCAGATAAATTTTTTATCGCTGCACCAGCTGGCAGCAACTACGACGGCGGCGACAATAAATACCCGTTCATTGTGCAGAACGGAAAAGTTTACATGCGCAATGCGGTTATCCAGAACGGCGCGATTGATTCAGCAAAAATTGCTAACGTAATCCAGTCGACGAACTATGTACCTGGAAAGTCTGGCTGGAAATTGCCAAAAAATGGCGCTGCAGAGTTTAACAGTGATGTTGTTGTCAATGCGCAAATTCAAGCCAACAGCATTGTTGGTGATATCGTTTCTGCGATTAGTAAGCCAGCGTCACCAAAAACTCAATACAAACACGCTACCTATACAGGATTATTTGGCACGGTATCAATCACGCACGCTAGACCATTTGATAGAACTCTGGTTAT
TTCTGTTGGGTTCAGGGTGGCGGCAATATCAGACTCGGAGGGTGGCACAAACGTGCAGACCGGTTGGGTTGAGTGTGTATCTTCAAAATATGGTAACTCCAAATCTTTCAGACGAAGAGTTCAATCAATAAAAGACGATCCGGTCAACCACGCAACACAAGATGGCGTTGAGCAGATTGTTTATTCAATACCAGCAAACACGAAGGGATCGATATCAATTTACGGTTATTCAGAAATATCGGAGATTGGTGGCTACTGCAGAATTTACCCGGTAAACGGAAACCCTGAAAGCGAATCAACTAGTGGCGTGTTCTTTGCTCAATTGTTTAAGAACGGCGCCGATCTGGTGTAA
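A quick consistency check ties the CDS coordinates to the protein length. The sketch below assumes the range is 1-based and inclusive and that the CDS includes its stop codon; the helper names are illustrative. With those assumptions, 40012 -> 42363 spans 2352 nt, or 784 codons, which gives 783 residues once the stop codon is excluded, matching the reported protein length. The CDS shown starts with TTG, which is an alternative initiation codon in the bacterial genetic code and is translated as methionine at the start position.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by the range, dropping the stop codon if present."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    codons = n // 3
    return codons - 1 if includes_stop else codons


def has_terminal_stop(cds: str) -> bool:
    """True if the last three nucleotides form a standard stop codon."""
    return cds.upper()[-3:] in STOP_CODONS


print(cds_length(40012, 42363))               # 2352
print(expected_protein_length(40012, 42363))  # 783
```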

Genome Context

Tertiary structure

PDB ID
147e8862d2a746abd9be71a6eff412c0857d0e29527888b7886c4dc1f9387ac9
Source ColabFold
Method ColabFold
Resolution 0.3826
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
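The confidence bands can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch: the legend uses strict inequalities and leaves the exact values 90, 70 and 50 unassigned, so the code assumes a boundary value falls into the lower band. The function name is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:  # assumption: exactly 90 counts as "High"
        return "High"
    if score > 50:  # assumption: exactly 70 counts as "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 counts as "Very low"


print([plddt_band(s) for s in (96.2, 81.0, 63.5, 42.0)])
# ['Very high', 'High', 'Low', 'Very low']
```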