Protein
- Genbank accession
- QXV76504.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
- TFTSPTF
- Protein sequence
-
MIKKVITGSKGGGGKPHTPVEMEDNLISINKIRVLLAVSDGEVDPDFDMKMLLLNDVPVMNDDGTYNYEGITAEFRPGTQTQDYIQGFTDSASEVTMNRVIDRDTPYSINVTNKSLSAIRIKMYMPRGVKVESNGDKNGVRVEYKVELAVDGAAFTEYFTDVIDGKTMSGYDRSRRINLPAFNSQVIVRVSRVTPDSDDSNLIDEMQLKSYAEVIDAKFRYPLTGLLFVEFDSKMFPTQIPNISLRKKWKVINVPSNYNPITRDYNGSWDGTFKKAWSNNPAWVLYDIITNQRYGLDQRELGINVDKWALYEAGQYCDQMVPDGKGGTEPRYLCDVVIQSQIEAFKLVRDICSIFRGMSFWNGESLSVVIDKPRDPSYIFTNENVVGGNFSYSFASEKSMYTTCNVQFDDHENLYQQDVEPVFDTDAARRFGNNPTSITAIGCTRRSEANRRGRWILRTNLRSTTVNFATGLEGMIPQIGDVVAIADNFWSSNLTLNLSGRVMEVSGLQIFTAFRVDARAGDFIVVNKPDGKPVKRTISSVSPDGKTIEINVGFGFDVAPSTIFAIDRTDVALQQYVVTDIKKGDGDEEFVFSITAVEYDPNKYDAIDYGVNVDDRPTSIVDPDKLPAPQNVAVESFSRIVQGLSVETMVISWEKVDYAAFYEVQWRKNNGNWINVPRTQTNETFVEGIYAGEYSVRVRAISSSENASVWSEVVTVGLTGKVGEPGAPINFTASDDVVFGIRLKWGMPEDSGDTAYIEVQQSPNNSVENASLLTLVPYPQHEYFHTPLEAGKMIFYRIRAIDKIGNVSPWTDLIPGMASTDVESIIGEIKVDIENSEGYQWLKENATDAIARIQNTAESAIENALANDKDIRIQRVKNGKFTAQIKESLQLIANETEARVTQVSQMEADFDGKINAQNTELRQVIATETEALSEEIEELKASIGDDIQAQLTQVQQAIANETEARTTADTALSARIGNNEAALNQKLDSWANVDSVGAMYGVKLGLTYNGQQYSAGMAMSLIASGNDVKSQLLFEADRFAIYNGANNHIRYPFIVEGGQVILSSAVIKDGFITNAMIGGYIQSNNYVWNQSGWHLGKDGTFLNFGSTPGEGSMKQDNQTISVRDQNGVLRVQIGRITGVW
- Physico-chemical properties
- protein length: 1140 AA; molecular weight: 126615.38760 Da; isoelectric point: 4.75370; aromaticity: 0.09211; hydropathy: -0.32667
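The page does not say how the aromaticity and hydropathy values were derived. The sketch below assumes two common definitions: aromaticity as the fraction of Phe, Trp and Tyr residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The function names and the choice of scale are assumptions for illustration, not taken from the source.

```python
# Kyte-Doolittle hydropathy scale (assumed; the page does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence listed above.
prefix = "MIKKVITGSKGGGGKPHTPV"
print(aromaticity(prefix), round(gravy(prefix), 3))
```

Applied to the full 1140-residue sequence, the same functions could be compared against the listed values, provided the page uses these definitions.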
Domains
Domains [InterPro]
| Domain ID | Type | Range |
|---|---|---|
| DC_0014 | STR | 1–1140 |
| IPR053171 | Unmapped | 3–867 |
| IPR055385 | ATT | 92–217 |
| IPR003961 | STR | 627–716 |
| IPR003961 | STR | 628–722 |
| IPR003961 | STR | 638–708 |
| IPR036116 | STR | 643–813 |
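Several hits share an identifier and overlap (IPR003961 appears three times). A minimal sketch of collapsing such hits into one span per identifier is shown here, assuming 1-based inclusive coordinates. The `merge_hits` helper is hypothetical and is not part of the database.

```python
from collections import defaultdict

# (identifier, start, end) hits copied from the domain list above.
HITS = [
    ("DC_0014", 1, 1140),
    ("IPR053171", 3, 867),
    ("IPR055385", 92, 217),
    ("IPR003961", 627, 716),
    ("IPR003961", 628, 722),
    ("IPR003961", 638, 708),
    ("IPR036116", 643, 813),
]


def merge_hits(hits):
    """Merge overlapping or directly adjacent spans that share an identifier."""
    by_id = defaultdict(list)
    for ident, start, end in hits:
        by_id[ident].append((start, end))

    merged = {}
    for ident, spans in by_id.items():
        spans.sort()
        out = [list(spans[0])]
        for start, end in spans[1:]:
            if start <= out[-1][1] + 1:
                # Extend the current span instead of opening a new one.
                out[-1][1] = max(out[-1][1], end)
            else:
                out.append([start, end])
        merged[ident] = [tuple(span) for span in out]
    return merged


print(merge_hits(HITS)["IPR003961"])  # [(627, 722)]
```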
Architecture
STR 1-91 | ATT 92-217 | STR 218-342 | ATT 343-495 | STR 496-1140
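The architecture string can be parsed and checked for gaps or overlaps. The sketch below assumes the `TYPE start-end | ...` layout shown above and 1-based inclusive coordinates. The helper names are illustrative.

```python
import re

ARCHITECTURE = "STR 1-91 | ATT 92-217 | STR 218-342 | ATT 343-495 | STR 496-1140"


def parse_architecture(text):
    """Return a list of (type, start, end) tuples from the architecture string."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)-(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def tiles_protein(segments, length):
    """True if the segments cover residues 1..length with no gap or overlap."""
    expected_start = 1
    for _, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


segments = parse_architecture(ARCHITECTURE)
print(tiles_protein(segments, 1140))  # True
```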
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 505 | 505 | 0.8971 |
| Central domain | 506 | 1077 | 573 | 0.0458 |
| C-terminal | 1078 | 1140 | 62 | 0.8559 |
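The Length column can be cross-checked against Start and End. The sketch below assumes 1-based inclusive coordinates, which the page does not state. Under that assumption the N-terminal row agrees, while the central and C-terminal rows come out as 572 and 63 rather than the listed 573 and 62. Both sets sum to 1140, so the difference may only reflect which side the boundary residue is counted on.

```python
# (domain, start, end, listed length) copied from the table above.
ROWS = [
    ("N-terminal", 1, 505, 505),
    ("Central domain", 506, 1077, 573),
    ("C-terminal", 1078, 1140, 62),
]


def inclusive_length(start, end):
    """Number of residues in a 1-based inclusive range."""
    return end - start + 1


for name, start, end, listed in ROWS:
    print(f"{name}: computed {inclusive_length(start, end)}, listed {listed}")
```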
Note: Constraints were applied during segmentation.
Fixed 41 C-terminal predictions appearing before Central domain
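The note does not describe how the constraint was applied. One possible reading, assumed here purely for illustration, is that the segmentation starts from per-residue labels and that any C-terminal call made before the first central-domain call is reassigned to the N-terminal domain. The single-letter labels (`N`, `M`, `C`) and the function name are hypothetical, and this sketch should not be taken as the tool's actual procedure.

```python
def fix_early_c_terminal(labels):
    """Relabel 'C' calls that occur before the first 'M' (central) call as 'N'.

    Returns the corrected labels and the number of calls that were changed.
    """
    fixed = list(labels)
    if "M" not in fixed:
        return fixed, 0
    first_central = fixed.index("M")
    count = 0
    for i in range(first_central):
        if fixed[i] == "C":
            fixed[i] = "N"
            count += 1
    return fixed, count


# Toy per-residue prediction with two stray C-terminal calls near the start.
labels = list("NNCNNCNMMMMCCC")
fixed, n_fixed = fix_early_c_terminal(labels)
print("".join(fixed), n_fixed)  # NNNNNNNMMMMCCC 2
```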
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
- N-terminal: 1-505
- Central: 506-1077
- C-terminal: 1078-1140
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Escherichia phage BrunoManser [NCBI] | 2851976 | Uroviricota > Caudoviricetes > Drexlerviridae > Sertoctavirus > Sertoctavirus brunomanser |
| Host | Escherichia coli K-12 [NCBI] | 83333 | Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia |
Coding sequence (CDS)
- Genbank protein accession: QXV76504.1 [NCBI]
- Genbank nucleotide accession: MZ501053.1 [NCBI]
- CDS location: range 16575 -> 19997, strand +
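The CDS coordinates can be checked against the reported protein length of 1140 AA. The sketch assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function name is illustrative.

```python
def cds_protein_length(start, end):
    """Residues encoded by a CDS with 1-based inclusive coordinates, excluding the stop codon."""
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_nt // 3 - 1


print(cds_protein_length(16575, 19997))  # 1140
```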
CDS
ATGATTAAAAAAGTGATAACCGGATCGAAAGGTGGTGGCGGAAAGCCTCATACACCTGTCGAAATGGAAGATAACTTGATTTCAATCAACAAAATACGCGTGTTGCTGGCAGTGTCAGACGGCGAAGTCGATCCAGACTTTGACATGAAAATGTTGTTGCTTAACGACGTTCCAGTTATGAACGACGACGGCACATACAACTACGAAGGCATTACGGCGGAATTCCGTCCCGGTACGCAAACGCAAGATTACATACAGGGATTCACCGACAGCGCAAGCGAAGTAACGATGAATCGCGTTATCGATCGCGATACCCCGTACAGCATCAACGTTACGAACAAATCGCTTTCGGCTATCCGTATCAAAATGTATATGCCGCGTGGCGTGAAGGTGGAAAGCAACGGCGACAAGAACGGCGTTCGCGTCGAATACAAAGTCGAATTAGCTGTCGATGGTGCGGCGTTTACCGAATACTTCACCGACGTAATCGACGGTAAAACAATGTCGGGCTATGACCGCAGCCGCCGAATCAACCTTCCGGCGTTCAACAGTCAAGTAATCGTGAGAGTGTCGCGCGTTACTCCTGATTCCGACGATTCTAACTTGATCGACGAAATGCAGTTGAAAAGCTATGCGGAAGTAATCGACGCTAAATTCCGTTACCCGCTGACTGGTCTGTTATTCGTGGAATTCGATTCGAAGATGTTCCCGACCCAGATCCCGAATATTTCGCTTCGTAAAAAGTGGAAGGTGATTAACGTCCCGTCGAACTATAACCCGATTACCCGCGACTATAACGGAAGCTGGGACGGCACGTTCAAAAAGGCGTGGAGTAATAACCCGGCATGGGTTCTTTACGATATCATCACGAACCAACGTTACGGGTTAGATCAGCGTGAGTTAGGGATTAATGTCGATAAGTGGGCGTTATACGAAGCCGGGCAATATTGCGATCAGATGGTTCCAGACGGAAAAGGCGGAACCGAACCGCGTTACCTTTGTGATGTGGTTATTCAGTCGCAGATTGAAGCGTTTAAACTGGTTCGCGATATCTGTTCAATCTTCCGTGGTATGTCGTTTTGGAATGGCGAAAGTTTATCTGTCGTGATCGATAAGCCGCGCGACCCGTCTTACATTTTCACCAACGAAAACGTAGTCGGCGGTAACTTCTCTTACTCGTTCGCGAGCGAAAAGAGCATGTACACGACTTGCAACGTGCAGTTTGATGATCATGAAAACTTATATCAACAGGACGTTGAACCTGTATTCGATACTGACGCGGCGCGACGCTTCGGCAATAACCCGACAAGCATTACAGCGATCGGATGTACGCGACGCAGCGAGGCTAACAGGCGCGGACGCTGGATTCTGCGAACGAACCTTCGCAGCACGACGGTTAACTTTGCTACCGGGCTGGAAGGTATGATCCCGCAAATCGGCGACGTCGTGGCGATTGCAGATAATTTCTGGTCAAGCAATCTTACGTTAAACCTTTCCGGGCGAGTCATGGAAGTATCAGGATTGCAGATTTTTACGGCGTTTCGTGTTGACGCTCGCGCAGGTGATTTTATTGTCGTAAATAAGCCGGATGGGAAGCCCGTCAAGCGCACAATTTCTTCGGTGTCACCAGACGGCAAGACAATAGAAATAAACGTAGGATTCGGCTTTGACGTCGCACCAAGCACTATCTTTGCTATCGACCGAACCGACGTTGCTTTGCAACAGTACGTTGTAACCGATATCAAGAAAGGCGACGGCGACGAAGAGTTCGTATTCAGTATCACGGCGGTTGAGTATGACCCGAACAAATACGACGCAATCGACTACGGCGTAAACGTTGACGACAGGCCGACAAGTATCGTAGATCCTGACAAGTTGCCAGCGCCGCAAAACGTAGCTGTCGAATCGTTTTCGCGTATCGTGCAGGGCTTGAGCGTCGAAACGATGGTCATTAGCTGGGAGAAGGTCGATTATGCGGCATTCTATGAGGTTCAATGGAG
AAAGAATAACGGTAACTGGATTAACGTTCCGCGCACCCAAACGAACGAAACATTCGTCGAAGGGATTTACGCGGGCGAATACTCCGTCCGAGTTCGCGCTATTTCATCTTCGGAAAACGCTTCCGTATGGTCTGAGGTGGTAACGGTCGGATTAACTGGCAAGGTAGGCGAACCGGGCGCACCGATTAACTTCACTGCGTCCGACGACGTTGTTTTCGGTATCCGGTTAAAATGGGGTATGCCTGAAGACTCCGGCGACACGGCTTACATTGAAGTGCAGCAATCGCCAAACAACAGCGTAGAAAATGCATCGTTGCTAACGTTAGTCCCGTATCCGCAACATGAGTATTTCCATACTCCGCTTGAGGCCGGGAAGATGATTTTCTATAGAATTCGCGCCATCGACAAAATCGGCAACGTGTCTCCTTGGACTGACTTAATACCCGGAATGGCGTCTACCGACGTGGAAAGTATCATCGGTGAAATTAAGGTTGATATCGAAAATTCAGAGGGATATCAATGGCTGAAGGAAAACGCAACGGACGCAATAGCGAGAATCCAGAACACGGCAGAATCGGCGATAGAGAATGCGTTAGCCAATGATAAGGACATAAGAATCCAGCGAGTGAAGAACGGCAAATTCACGGCGCAGATTAAAGAGTCGTTACAGCTTATCGCCAACGAAACCGAAGCGCGTGTGACGCAAGTTTCACAGATGGAAGCGGACTTCGACGGCAAGATTAACGCTCAAAATACTGAGCTTCGACAGGTTATCGCAACGGAAACGGAAGCGCTATCCGAAGAGATCGAAGAACTGAAAGCCTCAATCGGTGACGATATTCAGGCGCAGTTAACGCAGGTTCAACAGGCGATCGCCAACGAAACGGAAGCGAGAACCACGGCGGACACGGCGTTAAGCGCAAGAATCGGAAACAACGAAGCGGCATTGAATCAGAAGCTTGATTCGTGGGCTAACGTTGATTCTGTTGGTGCGATGTATGGCGTTAAGCTTGGATTGACATATAACGGACAGCAGTATAGCGCGGGTATGGCTATGTCTCTGATTGCTTCCGGTAACGATGTTAAGTCACAACTTCTGTTTGAGGCCGACAGATTCGCGATCTACAATGGTGCTAATAATCATATTAGATACCCATTTATCGTTGAGGGCGGGCAGGTAATACTAAGTAGCGCGGTGATTAAGGATGGATTCATCACTAATGCCATGATTGGTGGATACATTCAATCAAATAACTACGTGTGGAACCAATCTGGTTGGCACTTAGGGAAGGACGGAACGTTCCTTAACTTCGGTTCAACTCCGGGCGAAGGTTCGATGAAACAGGATAACCAGACGATCAGCGTTAGAGATCAGAACGGTGTTCTTCGCGTCCAGATTGGTAGAATTACTGGCGTATGGTAA
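The relationship between the CDS and the protein sequence can be illustrated by translating the first and last codons. The sketch assumes the standard genetic code and builds the codon table from the conventional TCAG ordering. The `translate` helper is written for this example and does not come from any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATTAAAAAAGTG"))  # MIKKV, the first five residues
print(translate("GGCGTATGGTAA"))     # GVW, the last three residues before the stop
```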
Genome Context
Tertiary structure
PDB ID
fedb1c6bce61f7ea0fb8a684b2b95b0f9483f45ee6d60f124584de58391750b8
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
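The confidence bands can be expressed as a small lookup. The legend leaves the exact boundary values (90, 70, 50) unassigned, so this sketch assumes they fall into the lower band. The function name is illustrative.

```python
def plddt_category(score):
    """Map a per-residue pLDDT score (0-100) to the confidence bands in the legend.

    Assumption: a score exactly on a threshold is placed in the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


print([plddt_category(s) for s in (95.2, 81.0, 63.5, 32.1)])
```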
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Systematic exploration of Escherichia coli phage-host interactions with the BASEL phage collection | Maffei,E., Shaidullina,A., Burkolter,M., Heyer,Y., Estermann,F., Druelle,V., Sauer,P., Willi,L., Michaelis,S., Hilbi,H., Thaler,D.S. and Harms,A. | 2021 | — | GenBank |