Protein
View in Explore- UniProt accession
- C6ZCZ5 [UniProt]
- Protein name
- Tail:host specificity protein
- RBP type
-
TFTFTFTSPTF
- Protein sequence
-
MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPIEGPVDGLKSVLLNSTPVLDTEGNTNISGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVMGNLPPRPFNIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKTWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPNNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISTGGRVLAVNSQTRTLTLDREITLPSSGTALISLVDGSGNPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWELKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGEQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVTADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKQIADIRQVETSTRYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAFVEAVGRASDDAEGYLDFFKGKITESHLGKELLEKVELTEDNASRLEEFSKEWKDASDKWNAMWAVKIEQTKDGKHYVAGIGLSMEDTEEGKLSQFLVAANRIAFIDPANGNETPMFVAQGNQIFMNDVFLKRLTAPTITSGGNPPAFSLTPDGKLTAKNADISGSVNANSGTLSNVTIAENCTINGTLRAEKIVGDIVKAASAAFPRQRESSVDWPSGTRTVTVTDDHPFDRQIVVLPLTFRGSKRTVSGRTTYSMCYLKVLMNGAVIYDGAANEAVQVFSRIVDMPAGRGNVILTFTLTSTRHSADIPPYTFASDVQVMVIKKQALGISVV
- Physico‐chemical
properties -
protein length: 1132 AA molecular weight: 124420,46210 Da isoelectric point: 5,67079 aromaticity: 0,08392 hydropathy: -0,30451
Domains
Domains [InterPro]
IPR053171
Unmapped
1–840
Unmapped
1–840
DC_0014
STR
1–1128
STR
1–1128
IPR055385
ATT
86–207
ATT
86–207
IPR055383
STR
610–714
STR
610–714
IPR036116
STR
617–719
STR
617–719
IPR003961
STR
618–710
STR
618–710
IPR003961
STR
618–701
STR
618–701
IPR003961
STR
620–715
STR
620–715
1
1132
Architecture
STR 1-85 | ATT 86-207 | STR 208-330 | ATT 331-498 | STR 499-715 | ATT 716-818 | STR 819-1132
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1132
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 993 | 993 | 0,8979 |
| Central domain | 994 | 1121 | 129 | 0,2224 |
| C-terminal | 1122 | 1132 | 10 | 0,9867 |
Note: Constraints were applied during segmentation.
Fixed 45 C-terminal predictions appearing before Central domain|Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Fixed 45 C-terminal predictions appearing before Central domain|Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-993
1-993
Central
994-1121
994-1121
C-terminal
1122-1132
1122-1132
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage DE3 [NCBI] |
482822 | Uroviricota > Caudoviricetes > Lambdavirus > Escherichia virus DE3 > |
| Host |
Escherichia coli [NCBI] |
562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Coding sequence (CDS)
No CDS data available.
Genome Context
Genome Context
Tertiary structure
1 / 9
PDB ID