Protein
- Genbank accession
- CAO2432697.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
-
TFTSPTF
- Protein sequence
-
MAKHMISGSKGGSKKPYVPKEMEDNLISINKIKVLLAVSDGECDPDFTLRDLYLDDVPVIASDGTVNYEGVTAEYRPGTQTQDYIQGFTDTSSEVTVSRDITADNPYIISVTNKNLSAIRIKILMPTGIKQEDNGDLVGVRVQYAVDMAVDGGSYNEVMRDVIDGKTRSGYDRSRRIDLPKFEERVLIRVKRLTPDSTSSKVTDKIKLQSYAEVVDAKFRYPLTGLVFVEFDSELFPTQIPNISIKKKWKIINVPSNYDPISREYHGSWDGTFKKAWSNNPAWVLYDLVTNQRYGLDQRELGIQIDKWSLYEAGVYCDQKVPDGKGGTEPRYLCDVVIQNQVEAYQLIRDICSIFRGMSFWNGESLSIVIDKPRDPSYVFTNENVINGDFQYTAASEKSMYTQCNVTFDDEQNMYQQDVEGVFDTEAALRFGYNPTSITAIGCTRRSEANRRGRWVLKTNLRSTTVNFATGLEGMIPSIGDVIAIADNFQSSNLTLNLSGRVMEVSGLQVFVPFKVDARPGDFIIINKPDGKPVKRTISKVSADGKTIELNIGFGFDVKPDTVFAIDRTDLALQQYVVTTISKGDDENEFTFSITAVEYDPNKYDEIDYGVNIDDRPTSIVQPDVMVAPENVQISSYSRVVQGVSVETMVVSWDKVPYASLYEMQWRKGDGNWLNTPQTANKEVEVEGIYSGNYQVRVRSVSASGSTSPWSKIATATLTGKVGEPGAPINLTASDNEVFGIRVKWGMPEGSGDTAYIELHQSPDGTVENSSLLTLIPYPQYEYWHSTLPAGQVVWYRIRSVDRIGNVSGWTDFVRGMASDDVESVLGDILDKIFDTEAGQEIKENAIDSANKIKDQAQSIIQNALANDADVKWTRVQNGKRKAEYGHALELIATETEARVTQIEELKASIDEDIVSSIKTVQEAIANESETRATQVQQLDSKFTKEIDGVKRDTAASIGEVRQTIANESEARAQAVQQLDAKFTKEINDLDEVIKTEVEANISEVKQAIANETEARVQADQALTAKFGDVESALAEKLDSWANVDSVGAKYAMKLGLTYKGQKYSAGMIMQLSQSSQGLISQILFDANRFAIMTSSTGGTFTLPFVVENNQVFINSLLVKNGSITNAMIGNVIQSNNFVQNQQGWRLDKNGNFENYGTRAGEGATKFTNEGLKVKDANGQLRVEIGRITGSW
- Physico-chemical properties
- protein length: 1192 AA
- molecular weight: 132299.64850 Da
- isoelectric point: 4.84197
- aromaticity: 0.08473
- hydropathy: -0.38951
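Aromaticity and hydropathy are commonly defined as the fraction of F, W and Y residues and as the mean Kyte-Doolittle hydropathy (GRAVY), respectively. The page does not state which definitions it uses, so the sketch below is an assumption, and the function names are hypothetical. It runs on a short fragment of the sequence rather than the full protein.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 30 residues of the protein sequence listed above, for illustration only.
fragment = "MAKHMISGSKGGSKKPYVPKEMEDNLISIN"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Passing the full 1192-residue sequence instead of the fragment would give whole-protein values to compare with the figures listed here.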
Domains
Domains [InterPro]
| Entry | Type | Residues |
|---|---|---|
| DC_0014 | STR | 1–964 |
| IPR053171 | Unmapped | 2–872 |
| IPR055385 | ATT | 92–217 |
| IPR003961 | STR | 628–722 |
| IPR036116 | STR | 628–716 |
| IPR003961 | STR | 628–717 |
| IPR003961 | STR | 638–707 |
Architecture
STR 1-91 | ATT 92-217 | STR 218-339 | ATT 340-491 | STR 492-964 | RBD 965-1191 |
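The architecture string can be read as a list of labelled, 1-based inclusive residue ranges. The sketch below assumes that reading; `parse_architecture` and `is_contiguous` are hypothetical helper names, not part of any database API. It parses the string and checks that consecutive segments abut without gaps or overlaps.

```python
import re

ARCHITECTURE = (
    "STR 1-91 | ATT 92-217 | STR 218-339 | ATT 340-491 | "
    "STR 492-964 | RBD 965-1191 |"
)


def parse_architecture(text):
    """Return (label, start, end) tuples from a 'LABEL a-b | ...' string."""
    return [
        (m.group(1), int(m.group(2)), int(m.group(3)))
        for m in re.finditer(r"([A-Za-z]+)\s+(\d+)-(\d+)", text)
    ]


def is_contiguous(segments):
    """True if each segment starts one residue after the previous one ends."""
    return all(nxt[1] == prev[2] + 1 for prev, nxt in zip(segments, segments[1:]))


segments = parse_architecture(ARCHITECTURE)
for label, start, end in segments:
    # Inclusive coordinates, so the length is end - start + 1.
    print(f"{label}\t{start}-{end}\t{end - start + 1} AA")
print("contiguous:", is_contiguous(segments))
```

The last segment ends at residue 1191, one short of the 1192 AA protein length reported for this entry.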
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 510 | 510 | 0.9295 |
| Central domain | 511 | 1066 | 557 | 0.0356 |
| C-terminal | 1067 | 1192 | 125 | 0.3728 |
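To find which segmented domain contains a given residue, the Start and End columns can be treated as 1-based inclusive bounds. That reading is an assumption, and `domain_of` is a hypothetical helper. A minimal sketch using a binary search over the domain start positions:

```python
from bisect import bisect_right

# (name, start, end) taken from the Start and End columns of the table,
# assumed to be 1-based inclusive coordinates.
DOMAINS = [
    ("N-terminal", 1, 510),
    ("Central domain", 511, 1066),
    ("C-terminal", 1067, 1192),
]
STARTS = [start for _, start, _ in DOMAINS]


def domain_of(position: int):
    """Return the name of the domain containing the residue, or None."""
    i = bisect_right(STARTS, position) - 1
    if i >= 0 and DOMAINS[i][1] <= position <= DOMAINS[i][2]:
        return DOMAINS[i][0]
    return None


for pos in (1, 510, 511, 1066, 1067, 1192):
    print(pos, domain_of(pos))
```

Positions outside 1 to 1192 return `None` rather than raising, which keeps the helper usable when filtering per-residue annotations.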
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
- N-terminal: 1-510
- Central: 511-1066
- C-terminal: 1067-1192
Taxonomy
Coding sequence (CDS)
- Genbank protein accession: CAO2432697.1 [NCBI]
- Genbank nucleotide accession: OZ346203.1 [NCBI]
- CDS location: range 15902 -> 19480, strand +
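The CDS coordinates can be checked against the reported protein length with a little arithmetic. The sketch below assumes the range is 1-based and inclusive and that the CDS includes a terminal stop codon; the function names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in a 1-based inclusive range."""
    return end - start + 1


def protein_length(cds_nt: int, has_stop_codon: bool = True) -> int:
    """Residues encoded by a CDS, optionally discounting the stop codon."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_nt // 3
    return codons - 1 if has_stop_codon else codons


nt = cds_length(15902, 19480)
print(nt, protein_length(nt))  # prints: 3579 1192
```

The result, 3579 nt encoding 1192 residues plus a stop codon, agrees with the protein length of 1192 AA given for this entry.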
CDS
ATGGCTAAACATATGATAAGCGGCAGTAAGGGCGGAAGCAAAAAGCCATACGTGCCAAAAGAGATGGAAGATAACCTGATCTCGATAAACAAGATTAAAGTTTTGCTGGCTGTATCTGATGGCGAGTGCGATCCAGATTTCACGTTGCGTGATCTTTATCTTGATGATGTTCCTGTAATTGCCAGCGATGGCACTGTTAACTATGAGGGGGTAACGGCTGAATATAGGCCAGGCACACAGACGCAAGATTACATCCAGGGGTTTACTGACACATCAAGCGAGGTGACAGTTTCGCGAGATATTACAGCAGACAATCCCTATATTATCTCTGTAACAAACAAAAATCTTTCCGCAATCAGAATCAAGATTCTGATGCCAACTGGCATAAAACAGGAGGATAACGGCGATCTTGTTGGTGTAAGGGTTCAATATGCCGTAGATATGGCTGTTGATGGCGGTTCTTATAACGAGGTTATGAGAGATGTAATTGACGGCAAGACAAGATCAGGGTATGACCGCAGCAGAAGGATTGATCTTCCTAAGTTTGAGGAGCGCGTTTTAATCAGAGTTAAGCGACTGACTCCAGATAGCACATCTTCAAAGGTGACTGATAAAATCAAACTGCAAAGTTACGCTGAGGTTGTGGATGCAAAATTCCGTTATCCTCTGACTGGACTTGTATTCGTAGAATTTGACAGCGAATTGTTTCCTACGCAAATCCCTAACATTTCTATAAAAAAGAAATGGAAGATTATTAATGTGCCAAGCAACTACGATCCAATATCAAGAGAATATCACGGGTCATGGGATGGTACTTTTAAAAAAGCGTGGTCAAATAATCCTGCATGGGTTCTTTATGATCTGGTGACGAATCAGCGTTATGGACTTGACCAGCGAGAGTTAGGAATACAGATCGACAAGTGGAGCTTATACGAGGCAGGTGTTTACTGTGATCAGAAAGTTCCAGACGGCAAGGGCGGGACAGAGCCTCGCTACCTATGCGATGTGGTGATCCAGAATCAAGTAGAGGCTTATCAGCTAATCCGTGACATTTGCTCAATCTTTCGCGGAATGAGCTTTTGGAATGGAGAGAGCTTATCAATCGTGATTGATAAACCGCGCGATCCATCATACGTGTTTACTAATGAAAACGTCATCAACGGTGATTTTCAGTACACAGCAGCAAGCGAAAAAAGCATGTACACGCAATGTAATGTGACGTTTGACGACGAACAAAACATGTATCAACAGGACGTCGAGGGTGTTTTTGATACTGAGGCGGCATTACGATTTGGATACAATCCAACAAGCATTACAGCGATCGGGTGTACACGCAGGAGCGAAGCGAATCGTCGCGGTCGGTGGGTTTTGAAAACAAACCTTAGAAGCACTACTGTAAACTTTGCTACTGGACTGGAGGGGATGATTCCATCAATAGGTGATGTGATTGCTATCGCTGATAATTTTCAGAGCAGCAACCTAACGTTAAACCTATCGGGCCGAGTGATGGAAGTTTCAGGATTGCAGGTTTTCGTTCCGTTTAAGGTTGATGCTCGTCCTGGTGATTTTATTATCATCAACAAGCCGGACGGCAAGCCAGTTAAGCGAACGATCTCAAAGGTGAGCGCAGACGGAAAAACCATTGAGTTAAATATTGGATTTGGTTTTGATGTTAAGCCTGATACTGTTTTTGCGATTGACCGTACTGACCTTGCGTTGCAGCAATACGTTGTGACAACTATCAGCAAGGGTGATGACGAAAACGAGTTTACCTTTTCAATCACGGCTGTGGAGTACGATCCGAACAAATACGACGAGATTGATTATGGCGTAAACATTGATGACAGGCCAACTTCAATTGTTCAGCCTGACGTGATGGTAGCGCCTGAGAACGTGCAGATCTCTTCTTATTCTCGCGTCGTGCAGGGTGTAAGCGTTGAAACTATGGTTGTGTCGTGGGATAAAGTGCCTTATGCATCGCTGTATGAAATGCAGTGGCG
TAAAGGTGATGGAAACTGGCTGAATACACCACAGACCGCAAACAAAGAAGTTGAGGTTGAAGGTATTTATTCAGGTAACTACCAAGTAAGGGTGAGATCTGTTTCTGCTTCCGGTTCTACGTCGCCGTGGTCAAAGATTGCAACCGCCACCCTGACAGGTAAAGTTGGCGAGCCAGGAGCGCCGATTAATCTTACAGCTTCTGATAATGAAGTTTTTGGCATTCGTGTTAAATGGGGTATGCCGGAAGGATCAGGCGATACGGCTTACATTGAGCTTCACCAATCGCCGGATGGGACGGTGGAAAACTCAAGTTTGCTTACGCTGATCCCGTATCCTCAATATGAGTATTGGCATAGCACGTTACCAGCGGGTCAAGTTGTATGGTATAGAATCCGTAGCGTTGACAGGATCGGCAACGTTTCCGGCTGGACTGACTTTGTTCGCGGCATGGCGTCAGATGATGTTGAATCTGTTTTAGGCGACATTCTGGACAAGATTTTTGATACCGAAGCGGGTCAAGAAATCAAAGAGAACGCCATAGACAGTGCAAACAAAATCAAAGACCAGGCGCAATCAATCATCCAGAACGCATTGGCAAACGATGCCGATGTGAAGTGGACGCGAGTACAAAACGGAAAGCGTAAGGCTGAATATGGTCATGCACTGGAGCTTATTGCTACCGAAACGGAGGCGCGAGTAACTCAAATCGAAGAGTTAAAGGCGTCAATTGATGAAGATATAGTTTCAAGTATCAAAACCGTTCAGGAAGCGATTGCCAACGAATCAGAGACGCGAGCTACTCAAGTGCAGCAGCTTGATTCTAAATTTACGAAGGAAATAGACGGCGTAAAACGAGATACGGCTGCAAGTATTGGCGAGGTAAGGCAAACAATTGCCAATGAATCAGAAGCGCGCGCCCAGGCAGTTCAGCAGCTTGACGCCAAGTTTACGAAAGAGATAAACGACCTTGACGAAGTTATCAAGACCGAAGTCGAGGCTAACATCTCAGAAGTGAAACAGGCGATCGCTAATGAGACAGAGGCGAGGGTTCAGGCTGACCAGGCTTTAACAGCCAAATTCGGAGACGTTGAATCAGCACTAGCCGAAAAACTTGATTCGTGGGCTAACGTTGATTCGGTTGGCGCTAAGTACGCTATGAAATTGGGCCTTACTTACAAGGGACAGAAGTACAGCGCAGGAATGATCATGCAGTTGTCGCAATCTTCTCAAGGCTTGATCTCGCAAATCTTGTTTGATGCTAACAGGTTCGCGATCATGACTAGCTCTACTGGCGGAACGTTTACTTTGCCTTTCGTTGTTGAAAACAACCAGGTTTTCATTAACAGCCTGCTAGTGAAAAACGGGTCCATTACTAACGCAATGATTGGTAACGTGATTCAGTCAAATAACTTTGTTCAGAATCAGCAAGGATGGAGGCTTGATAAAAATGGTAACTTTGAGAACTACGGAACGAGAGCGGGGGAAGGTGCTACAAAATTCACTAACGAAGGATTAAAGGTGAAAGATGCAAATGGACAATTGAGGGTTGAAATAGGGAGAATAACCGGAAGTTGGTAA
Genome Context
Tertiary structure
PDB ID
b9dbee43117cec36b620ec8aeb24057cc405d8b73975d68b9ef89412d0059975
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
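The confidence bands can be applied to per-residue pLDDT scores with a small classifier. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong. The sketch below assigns those boundary values to the lower band, which is an assumption, and `plddt_band` is a hypothetical name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) fall into the
    lower band, since the legend itself leaves those values unassigned.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```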