Protein
- Genbank accession
- WPJ54210.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
- TFTF
- Protein sequence
-
MSKVITAMGKGGSSTPRQPKEMPDNLISKDRIKLLLAVADGEVENDFNLKQLMFGGVPVQNESGEFNYPDVIAEFRPGTQTQDFIQGFTESSSEVTVARDVKANQPYSLTVTNKNLSAIRFRLLWPRVLQQKDNGDMVGSKVEYAFDMAVDGAAFKEVLRDNIDGKNTTGGYDRSIRINLPQDFASQVILRVRRITPDADGVKTVDAFKVQSYAEVIDARFRYPLTALLYVEFDSDLFENNIPTISIKKKWKIIQVPSNYDPIGRTYNGTWNGTFKWAWSNNPAWVLYDLITNQRYGLDQKELGIPVDKWSLYEVAQYCDQKVPDNKGGEEPRYLCDMVVQDQVEAYRLIRDVCSIFRGMSFYNGESISIVVDKPRNPVYLFTNDNVIDGYFGYTFASEKSMYTTVNVMFDDEENAYTQDVEPVYDAVASRRFGHNPTDITAIGCVRRTEANRRGRWIIKTNLRSETVNFATGLQGMIPTCGDVIQVLDAHYQSNFGLILSGRVSEVSGLQVFLPMKTDAKAGDFIIMNKPDGAPVKRTIASVSSDGMTLTLNVGFGFDVAPDAVFAIERTDLAMKLYVVMSISKGSDEEEFQYNISAVEYNPGKYDEIDYGVITDERPTSIVEPDSMPAPQDVTISSFSRVVQGLSLETMVVGWSKVRYASVYEMQWRKDGGNWLNTPRTATTETQVEGIYSGVYEVRVRAINAGGVASPWSEIVAKSLTGKVGKPKAPTGITASDNEVLGIRVKWSMPEGSGDTAYIELHQSPDNNDANSSLLTMVPYPAFEYWHSTLRSGQVVWYKVRAVDRIGNVSDWTKLTRGMATDDVDQIMDTIKIDIQGTEGYKELQKNIFETNERIDEAEQTIAKVDEDAKQGITEAKKDAADAKKRAEEVNTAAQKGINEAKAAAKAADDHATQVGNDATAGLAEANAKTEQVRKDAEQGIAEAKNDAKAAKDEALRVEAKADKGISEAKDDAKKASEAAAAAGDKAGQAIDDAQGALNESIKNAGNIDALGNAVIENAQSQSEMYIHFEKENGDRKAEYDQAVTMVVNESEARVEQVERLRVEMGDSITASNTELKQAIATETEARATQMNQLTATMNEKFTATDKQWREAVASEEQARVSAIGEMKAQVDKDIAATSKTLTEAIATETEARIQAINELEAKIGDGIEADLTEVKKAIATETEARVEGDRRLEAKFDKGLSDSNAKITANEKAIAEEGKARVEQYNQLKATVDSNKKATDASITQLDKAIATEKEARVSQYGELKASIEKNDKDINAKVDASVKTLTEAIATEEAARVKQYNELNAKVDNNKKATDAAISELTEVVATDREASASKITELTASVSAVAEGVVENALANDKTSKESAAGIKEVREVIANETEARAEMMTQLQTSFDAEIGKVNGEITNLSQAISDESGARVQQYNELKASIEGQDFKTESYYYSLSQAISTEEESRVKQYEELNAKLESGQGGTDAAIKNLQEAIATEEKARAQAVSDLDANMKSEFGKTNANVKNVSDALAEEKQVRAQQYTELSGKISTTDKELAAQVKRLDTAIVTEKDARTEQYNQLKATVDANKKTAEANYQENKTAISNETQARVKAVSDLDASMKSEFGKTNANVKTVSDALAEETRVRAQQYTELNGKITAAGTESAAQIKRLDEAIAEEKQARTTQYNALNATIEKNNKDINTKVDASVKQLTEAIATEEAARVKQYNELKATVDSNKKAVDAAITEMNEVIATEKEATASKITDLTASMGANGEAALENALANDQESKRREAQYRNVVEVIANETQSRVTQMEELTATFVASDATTNGRITNLQEVITNDKEANAKQFTEIDAKFERADFYANANYSKLNQAIASETEARIKQYEELKASIGDDIQGEINNLQQAIADETQARTQAIQNLDSKFNTEIGKTNASVKTVSDALATEKSSTAQRFSDVNAKVDGLEKSTNASVKRLDEAIATETDARSQQYTGLSATVTKNNTDINKKVDDK
DAAINKKVNDNNTAINTKVDANYKQLNTAIADETQARTQAITRLESSIGSNSAAIEQKLDSWVDANSTGAMYGVKLGMKYQGNYYQAGMNMQLVGSGNQMKSQILFQADRFVIFPYPDREDIKTIPFLVDGDQVYMQSALIKDGSITNAKIGNEIKSNNFVWDQQGWGLSKEGWFQMNGQAGGGRTLINQNGVQVFDGNNRLRVRLGLW
- Physico-chemical properties
- protein length: 2209 AA
- molecular weight: 242569.78170 Da
- isoelectric point: 4.86044
- aromaticity: 0.06338
- hydropathy: -0.56546
Domains
Domains [InterPro]
| ID | Type | Range |
|---|---|---|
| DC_0129 | STR | 2–974 |
| IPR053171 | Unmapped | 4–1195 |
| IPR055385 | ATT | 92–219 |
| IPR036116 | STR | 629–716 |
| IPR003961 | STR | 629–716 |
| IPR003961 | STR | 630–724 |

Sequence axis: 1–2209
Architecture
STR 2-91 | ATT 92-219 | STR 220-1352 | RBD 1353-1433 | STR 1434-1785 | RBD 1786-1845 | STR 1846-1918 | RBD 1919-2209
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
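The Architecture line above can be checked mechanically. The following is a minimal sketch, not part of the database itself: it parses the `LABEL start-end` segments, confirms that they tile the sequence without gaps or overlaps, and totals the residues per label. The function names (`parse_architecture`, `is_contiguous`, `residues_per_label`) are hypothetical helpers introduced only for illustration, and the ranges are assumed to be 1-based and inclusive.

```python
import re

# Architecture string copied from the record above.
ARCHITECTURE = (
    "STR 2-91 | ATT 92-219 | STR 220-1352 | RBD 1353-1433 | "
    "STR 1434-1785 | RBD 1786-1845 | STR 1846-1918 | RBD 1919-2209"
)


def parse_architecture(text):
    """Split 'LABEL start-end | ...' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*([A-Za-z]+)\s+(\d+)\s*-\s*(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def is_contiguous(segments):
    """True if every segment starts one residue after the previous one ends."""
    return all(prev[2] + 1 == cur[1] for prev, cur in zip(segments, segments[1:]))


def residues_per_label(segments):
    """Total residues per label, assuming inclusive 1-based ranges."""
    totals = {}
    for label, start, end in segments:
        totals[label] = totals.get(label, 0) + (end - start + 1)
    return totals


segments = parse_architecture(ARCHITECTURE)
print(is_contiguous(segments))       # True
print(residues_per_label(segments))  # {'STR': 1648, 'ATT': 128, 'RBD': 432}
```

The three totals sum to 2208, which matches residues 2 to 2209 of the 2209 AA protein; residue 1 is not assigned to any segment in the Architecture line.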
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Klebsiella phage RCIP0073 [NCBI] | 3094241 | Viruses > unclassified bacterial viruses > |
| Host | Klebsiella pneumoniae [NCBI] | 573 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Genbank protein accession
WPJ54210.1
[NCBI]
Genbank nucleotide accession
OR532867.1
[NCBI]
CDS location
range 15544 -> 22173
strand +
CDS
ATGTCAAAAGTAATAACCGCAATGGGTAAGGGTGGCAGCAGCACACCGCGCCAGCCAAAGGAGATGCCCGATAACCTCATCTCCAAAGACAGGATTAAGCTTCTTCTCGCGGTTGCTGACGGTGAAGTAGAGAACGATTTCAACCTTAAGCAATTGATGTTCGGCGGCGTCCCGGTTCAAAACGAAAGCGGGGAATTCAACTACCCGGACGTGATCGCAGAATTCCGGCCAGGTACGCAGACTCAAGACTTTATTCAAGGCTTCACCGAATCAAGTTCTGAAGTAACCGTTGCGCGCGACGTTAAGGCTAACCAGCCATACTCACTGACCGTTACCAACAAAAACCTTTCTGCTATTCGCTTCCGCCTCTTGTGGCCTCGCGTGCTTCAGCAGAAAGATAACGGCGACATGGTAGGCTCAAAGGTTGAATATGCCTTTGATATGGCGGTTGACGGAGCGGCGTTTAAAGAGGTACTCCGCGACAATATCGACGGAAAGAATACCACTGGCGGATATGATCGAAGCATTCGCATTAACTTGCCGCAGGACTTCGCCAGCCAAGTAATTTTGCGAGTTCGCCGAATCACGCCGGATGCAGACGGGGTAAAAACGGTTGACGCCTTTAAGGTTCAGAGTTACGCCGAAGTTATTGATGCCCGCTTCCGCTACCCGTTAACCGCGCTTCTCTATGTTGAGTTTGATTCGGACTTGTTCGAGAACAACATCCCGACAATCTCCATTAAGAAGAAGTGGAAGATAATCCAGGTTCCGAGCAACTACGACCCGATCGGCAGGACTTACAACGGGACGTGGAACGGTACATTTAAATGGGCCTGGAGCAACAACCCGGCGTGGGTACTTTATGACCTGATAACCAATCAGCGATACGGACTCGATCAGAAAGAGTTAGGAATCCCGGTTGATAAGTGGTCACTGTATGAAGTTGCGCAGTATTGCGATCAGAAGGTTCCAGACAACAAAGGCGGCGAAGAGCCTCGTTACCTTTGTGATATGGTTGTACAGGATCAGGTTGAAGCTTATCGGTTGATTCGCGACGTTTGCTCTATCTTCCGTGGAATGAGCTTCTACAACGGCGAAAGCATCTCAATCGTTGTTGATAAGCCTCGGAATCCGGTTTACCTTTTCACTAACGACAACGTTATTGATGGTTATTTCGGTTACACGTTCGCCTCCGAAAAGAGCATGTATACGACCGTTAACGTGATGTTTGATGACGAAGAGAACGCCTATACTCAGGACGTTGAGCCTGTTTATGATGCGGTAGCTAGTCGCCGATTCGGTCACAACCCAACGGACATTACAGCGATCGGATGTGTTCGACGCACTGAAGCCAACCGTCGCGGGCGCTGGATTATCAAAACCAACCTTCGCAGCGAAACGGTAAACTTTGCAACCGGGTTGCAGGGCATGATCCCGACTTGCGGCGACGTTATCCAGGTTCTTGATGCGCATTACCAAAGCAACTTTGGCCTCATCCTTAGCGGAAGGGTTAGCGAAGTTTCCGGCCTTCAAGTTTTCTTGCCGATGAAGACAGACGCAAAAGCGGGTGACTTCATTATCATGAACAAGCCGGACGGCGCGCCAGTTAAGCGCACGATCGCAAGCGTATCAAGCGATGGCATGACGTTAACGCTTAACGTTGGATTCGGATTTGACGTTGCGCCTGATGCAGTATTCGCGATTGAACGTACAGACCTGGCGATGAAGCTTTACGTTGTAATGAGCATCTCCAAGGGCAGCGACGAAGAAGAGTTTCAGTATAATATTTCAGCGGTTGAGTATAACCCTGGCAAGTACGACGAGATCGATTATGGAGTCATCACCGATGAGCGACCAACGAGCATCGTTGAGCCGGATTCGATGCCAGCGCCGCAGGATGTAACAATCTCTTCATTCTCTCGCGTGGTTCAGGGCTTAAGCCTGGAAACAATGGTTGTAGGGTGGAGCAAGGTTCGTTACGCATCGGTTTACGAGATGCA
ATGGCGTAAGGATGGCGGCAACTGGCTTAACACTCCGCGCACGGCAACAACCGAAACTCAAGTGGAGGGGATTTACTCCGGGGTTTATGAAGTAAGGGTTCGCGCGATTAATGCTGGTGGGGTTGCTTCTCCGTGGTCTGAGATAGTAGCTAAGTCGCTGACTGGTAAAGTTGGCAAGCCGAAAGCGCCGACCGGGATCACAGCATCGGATAACGAAGTTTTAGGTATTCGCGTTAAGTGGTCTATGCCTGAAGGCTCAGGCGATACTGCCTATATTGAATTGCACCAGTCGCCAGATAACAACGATGCTAACTCATCGCTGTTAACTATGGTTCCTTATCCTGCTTTCGAATATTGGCATAGCACGCTAAGAAGCGGTCAGGTTGTTTGGTACAAGGTTAGAGCGGTTGACCGCATCGGCAACGTATCTGACTGGACGAAACTCACTCGCGGCATGGCGACCGACGATGTAGATCAGATAATGGATACCATTAAGATCGACATTCAAGGCACGGAAGGATACAAGGAGTTACAGAAGAATATCTTCGAAACAAACGAGCGGATTGATGAAGCCGAGCAGACCATCGCCAAGGTTGACGAAGACGCTAAGCAGGGAATCACAGAAGCCAAAAAGGATGCAGCAGACGCCAAGAAGCGAGCCGAAGAAGTTAACACGGCAGCGCAGAAAGGAATCAACGAAGCGAAGGCGGCAGCTAAAGCAGCAGACGACCACGCAACGCAAGTTGGAAATGATGCTACGGCTGGACTGGCGGAAGCCAATGCAAAAACCGAACAAGTAAGGAAGGATGCAGAGCAGGGAATCGCCGAGGCGAAGAACGACGCCAAAGCGGCGAAGGATGAGGCGCTAAGGGTTGAGGCTAAGGCTGATAAAGGAATCAGCGAAGCCAAAGACGATGCAAAGAAAGCAAGCGAAGCAGCGGCGGCGGCGGGAGATAAGGCAGGGCAGGCGATTGATGATGCGCAAGGGGCGCTAAACGAATCAATCAAGAACGCCGGAAATATCGACGCGCTAGGCAATGCCGTTATAGAAAACGCACAAAGCCAAAGCGAGATGTACATACACTTTGAAAAGGAGAATGGAGACAGAAAAGCAGAATACGATCAGGCTGTAACGATGGTTGTTAACGAGTCGGAAGCCAGGGTTGAACAAGTTGAACGCCTCCGGGTTGAAATGGGAGACAGCATCACCGCCAGCAACACGGAACTGAAGCAGGCGATCGCGACCGAAACGGAAGCGAGAGCAACGCAGATGAACCAGCTAACGGCTACCATGAATGAGAAATTCACGGCTACAGATAAGCAGTGGCGGGAGGCGGTAGCCAGCGAAGAGCAGGCCAGGGTTTCAGCAATTGGCGAAATGAAGGCGCAAGTTGATAAGGACATTGCAGCGACAAGCAAGACGCTAACCGAAGCAATCGCAACCGAAACCGAAGCAAGGATTCAGGCCATTAACGAGCTTGAAGCTAAGATCGGCGACGGCATAGAGGCTGACTTGACAGAGGTCAAGAAGGCGATAGCAACGGAAACGGAAGCGCGCGTTGAAGGTGATAGACGCCTTGAGGCTAAGTTTGATAAAGGCTTATCTGACTCTAACGCCAAGATCACCGCCAACGAGAAAGCCATTGCTGAAGAAGGAAAAGCCAGGGTTGAGCAGTACAACCAGCTTAAGGCAACGGTTGACAGCAACAAGAAGGCGACAGATGCAAGCATCACTCAGCTTGATAAGGCGATAGCTACCGAAAAGGAAGCTAGAGTTTCGCAGTACGGAGAGCTTAAGGCTAGCATCGAAAAGAACGATAAGGACATTAACGCCAAGGTAGACGCCAGCGTTAAAACGCTAACGGAAGCGATCGCGACAGAAGAGGCGGCAAGGGTTAAGCAGTACAATGAACTCAACGCCAAGGTTGACAACAACAAGAAAGCCACGGATGCGGCAATCTCCGAGTTAACCGAAGTAGTCGCCACTGACCGGGAAGCCA
GCGCCAGCAAAATCACAGAGCTAACGGCAAGTGTTAGCGCAGTAGCTGAAGGCGTGGTTGAGAACGCATTAGCCAACGACAAGACAAGCAAGGAGTCGGCAGCGGGAATTAAGGAGGTAAGAGAGGTAATCGCCAACGAAACCGAAGCGCGAGCGGAGATGATGACTCAGTTACAAACATCATTCGACGCAGAGATCGGGAAGGTTAACGGCGAGATAACCAATCTAAGCCAGGCTATTAGTGATGAGTCAGGGGCGAGGGTTCAGCAGTACAACGAGCTTAAGGCGAGCATTGAGGGGCAGGATTTCAAAACAGAATCCTATTACTACTCACTAAGCCAAGCCATAAGCACCGAGGAAGAGTCGCGAGTTAAGCAGTACGAAGAGCTTAACGCAAAACTTGAATCCGGTCAGGGTGGTACTGATGCCGCAATTAAGAACCTTCAGGAAGCGATCGCAACCGAAGAGAAAGCGAGAGCGCAGGCAGTTAGCGATCTTGATGCCAACATGAAATCGGAGTTTGGCAAAACAAACGCCAACGTTAAAAACGTATCCGACGCGCTGGCAGAAGAGAAGCAGGTTAGAGCGCAGCAGTACACGGAGCTAAGCGGCAAGATCTCCACAACGGATAAGGAGCTTGCAGCGCAGGTTAAGCGCTTAGATACGGCGATTGTTACGGAAAAGGACGCCAGAACAGAGCAGTACAACCAGCTTAAAGCCACGGTTGACGCCAACAAAAAGACGGCTGAAGCCAACTATCAGGAGAACAAAACGGCGATCTCTAACGAGACGCAAGCGAGAGTTAAAGCAGTAAGCGACCTTGACGCGAGCATGAAATCCGAATTCGGAAAGACTAACGCGAACGTTAAAACGGTATCTGATGCCCTAGCGGAAGAAACCAGAGTTCGGGCGCAGCAGTACACCGAATTGAACGGCAAGATCACGGCGGCTGGAACGGAAAGCGCCGCGCAGATTAAGCGACTGGATGAGGCAATCGCTGAAGAGAAGCAGGCCCGCACTACTCAGTACAATGCATTGAATGCTACGATTGAGAAGAACAACAAGGACATTAACACCAAGGTTGATGCCAGCGTTAAGCAGTTAACGGAGGCGATCGCCACCGAGGAGGCGGCAAGGGTTAAGCAGTACAACGAACTTAAGGCAACGGTTGACAGCAACAAAAAAGCGGTTGACGCTGCAATCACTGAAATGAACGAAGTGATAGCGACGGAGAAGGAAGCAACCGCCAGCAAGATAACCGACCTTACCGCCAGCATGGGGGCAAATGGAGAGGCGGCGCTTGAAAACGCTTTAGCCAACGATCAGGAATCGAAACGCCGTGAGGCGCAATATCGAAACGTGGTTGAGGTTATTGCTAACGAGACTCAATCGCGAGTAACTCAAATGGAAGAGCTTACAGCGACATTTGTTGCAAGCGACGCAACTACAAACGGAAGGATAACTAACCTTCAGGAGGTGATTACTAACGACAAAGAGGCTAATGCCAAACAGTTTACGGAGATTGACGCGAAGTTTGAGCGGGCGGATTTTTACGCAAACGCGAACTACTCCAAGTTGAACCAGGCAATAGCTAGTGAAACAGAGGCGAGAATTAAGCAGTACGAAGAGCTTAAGGCCAGCATCGGAGACGATATCCAGGGCGAGATAAACAACTTGCAGCAGGCCATCGCAGACGAAACGCAAGCCAGAACACAAGCGATTCAGAACCTTGATTCTAAGTTTAACACTGAGATCGGTAAGACAAACGCCAGCGTTAAAACCGTATCCGATGCACTGGCTACGGAAAAGAGTTCCACGGCGCAAAGGTTCTCGGATGTTAACGCAAAAGTTGACGGTCTGGAGAAGTCAACTAACGCTTCAGTTAAGCGACTGGATGAGGCAATCGCCACGGAAACAGACGCGAGATCGCAGCAGTACACGGGGTTAAGCGCTACTGTTACAAAAAACAATACGGACATTAACAAAAAGGTTGACGACAAG
GATGCAGCTATCAACAAAAAGGTTAATGACAACAACACCGCCATTAACACAAAGGTTGATGCGAACTACAAACAGCTAAACACCGCGATCGCGGATGAAACGCAGGCCAGGACGCAGGCGATCACAAGGCTGGAATCGTCAATCGGTAGCAACTCGGCGGCGATAGAGCAAAAACTTGATTCGTGGGTTGATGCCAACTCTACGGGGGCAATGTACGGCGTTAAGCTTGGCATGAAGTATCAAGGTAATTACTACCAAGCCGGGATGAACATGCAGCTAGTCGGGAGCGGGAATCAGATGAAGTCACAAATCCTATTCCAGGCTGATCGGTTCGTGATTTTCCCGTACCCGGACAGGGAGGATATAAAAACCATCCCTTTCCTTGTTGATGGTGATCAGGTTTACATGCAGAGCGCACTTATCAAGGATGGCTCCATAACTAACGCCAAGATCGGGAACGAAATAAAATCCAACAACTTTGTTTGGGATCAGCAGGGATGGGGGTTAAGCAAAGAGGGTTGGTTCCAGATGAACGGACAAGCGGGAGGTGGTAGAACGTTGATTAATCAGAATGGAGTTCAGGTTTTCGACGGCAACAATAGATTGAGGGTTAGATTAGGATTGTGGTAA
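The CDS location and the reported protein length can be cross-checked with simple arithmetic. The sketch below assumes the GenBank convention of 1-based, inclusive coordinates and assumes that the CDS includes one terminal stop codon (the sequence above ends in TAA). The function names are hypothetical and serve only as an illustration.

```python
def cds_length_nt(start, end):
    """Length in nucleotides of a CDS given 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(start, end, has_stop_codon=True):
    """Number of translated residues, optionally excluding a terminal stop codon."""
    length = cds_length_nt(start, end)
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = length // 3
    return codons - 1 if has_stop_codon else codons


print(cds_length_nt(15544, 22173))      # 6630
print(protein_length_aa(15544, 22173))  # 2209
```

The result, 2209, agrees with the protein length listed under the physico-chemical properties.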
Genome Context
Tertiary structure
PDB ID
ea3b56b0565d0e3ebd4c9d057e4b92008a0dfcfc14a107c0cd313d6f25c523c5
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
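The confidence bands above map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The function name `plddt_band` is hypothetical, and the handling of the exact boundary values (90, 70 and 50) is an assumption, since the legend only gives strict inequalities; here a boundary score falls into the lower band.

```python
def plddt_band(score):
    """Return the confidence label for a pLDDT score in the range 0 to 100.

    Assumption: scores exactly equal to 90, 70 or 50 are placed in the
    lower band, because the legend does not say where they belong.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 81.0, 63.5, 42.0):
    print(value, plddt_band(value))
```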