Protein
View in Explore- Genbank accession
- YP_007151754.1 [GenBank]
- Protein name
- tail protein
- RBP type
-
TFTSPTF
- Protein sequence
-
MATATAIKGRKGGSSKTRTPTEQPDDLQSVAKAKILIALGEGEFSGQLTGKDIYLDGTALENADGSQNFSGVTWEFRAGTQAQKYIQGIPGTENEISVGTEVSSATAWTRTFTNTQLSAVRLRLKWPSLFKQEDDGDLVGYSVNYAIDLQTDGGTWQTVLNTSVTGKTTSGYERSHRIDLPQAGSTWTIRLRKITSDANSAKIGDTMMLQSFTEVIDAKLRYPNTALLYVEFDSSQFNGSIPQISCEPRGRVIRVPDTYDPETRTYSGTWTGAFKWAWTDNPAWIFYDLVVSDRFGLGHRLTAANIDKWTLYQVAQYCDQMVPDGKGGNGTEPRYTCNVYIQDRNDAYTVLRDFAAIFRGMTYWGGDQIVALADMPRDVDYSYTRANVVGGRFTYSSSTTKSRYTTALVSWSDPGNAYADAMEPVFEQALVARYGFNQLEMTAIGCTRQSEANRKGRWGILTNNKDRVVSFDVGLDGNIPQPGYIIAVADELLSGKVMGGRISAVNGRVIKLDRVADAAPGDRLILNLPSGASQSRTIQAVNGESVTVTTAYSETPQAEAVWVVESDELYAQQYRVVSVSDNNDGTFSITGAWHDPDKYARIDTGAIIDQRPVSVIPPGNQSPPANIVISSFSVVQQNISVETMRVSWDQAQNAIAYEAQWRRNDGNWVNVPRTSTTSFDVPGIYAGRYLVRVRAINAAEISSGWGYSEEKTLTGKVGNPPKPVGFIASENVVFGIELNWGFPANTDDTLKTEIQYSLTGSEDDAILLSDVPYPQRKYQQMGLKAGQIFWYRAQLVDRTGNESGYTDWVRGQASIDVSDITDVILEDIKESDTFKELIESAVDSNEKIAGMADDIRQNADDLEQQALAIKENADGLAQAEVKIDEISVSMDGMTGGVKNSSIAVIQNSLAQVTSRRSQTATNAGNSASIDRIDTTIADTSQAVARALVTLDASAGGNVSNATDLTETLADFTQASATKINSLTVTVNGQTAAINQTAQAVADVNGNLSAMYNIKVGVSSNGQYYAAGMGIGVENTPSGMQSQVIFLADRFAVTTAAGNSVALPFVIQNGQTFIRASFIQDGTISNAKIGNFIQSNNYVAGSAGWKLDKGGTFENYGSTAGEGAMKLTNQTISVKDGSNVLRVQVGRLTGVF
- Physico‐chemical
properties -
protein length: 1151 AA molecular weight: 124620,91850 Da isoelectric point: 4,81605 aromaticity: 0,08514 hydropathy: -0,30695
Domains
Domains [InterPro]
DC_0014
STR
1–1151
STR
1–1151
IPR053171
Unmapped
6–847
Unmapped
6–847
IPR055385
ATT
92–218
ATT
92–218
IPR013783
STR
622–714
STR
622–714
IPR003961
STR
623–717
STR
623–717
IPR003961
STR
628–702
STR
628–702
IPR036116
STR
637–702
STR
637–702
1
1151
Architecture
STR 1-91 | ATT 92-218 | STR 219-342 | ATT 343-505 | STR 506-1151
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1151
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 540 | 540 | 0,8099 |
| Central domain | 541 | 1090 | 551 | 0,0286 |
| C-terminal | 1091 | 1151 | 60 | 0,9212 |
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-540
1-540
Central
541-1090
541-1090
C-terminal
1091-1151
1091-1151
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage HK542 [NCBI] |
432200 | Uroviricota > Caudoviricetes > Hendrixvirinae > Wongtaivirus HK542 > |
| Host |
Escherichia coli [NCBI] |
562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
YP_007151754.1
[NCBI]
Genbank nucleotide accession
NC_019769
[NCBI]
CDS location
range 13999 -> 17454
strand +
strand +
CDS
ATGGCAACTGCAACCGCGATTAAAGGCCGCAAAGGCGGTAGTTCAAAGACACGCACTCCAACTGAACAGCCCGATGATCTGCAGTCAGTTGCGAAGGCGAAAATTTTAATCGCCCTTGGCGAGGGCGAGTTCTCCGGGCAATTAACCGGCAAAGATATCTACCTGGACGGAACAGCGCTGGAGAATGCTGACGGCTCCCAAAACTTCAGCGGGGTAACGTGGGAGTTTCGCGCGGGAACGCAGGCGCAAAAATATATTCAGGGTATTCCCGGTACCGAAAACGAGATCAGCGTAGGAACTGAGGTATCAAGTGCCACAGCCTGGACGCGCACGTTTACCAATACGCAGCTTTCAGCAGTTCGCCTGCGTCTTAAATGGCCCTCGCTTTTCAAACAGGAAGACGACGGCGATCTGGTGGGTTACTCGGTCAATTATGCGATTGACCTGCAGACTGACGGCGGCACATGGCAGACGGTACTCAATACCAGCGTGACCGGCAAAACGACGTCTGGTTATGAGCGCAGCCACCGTATCGATTTACCGCAGGCTGGCAGCACATGGACAATACGTCTGCGTAAGATTACCTCTGACGCCAACAGCGCGAAGATCGGCGACACGATGATGCTGCAGAGCTTCACCGAGGTGATTGACGCCAAACTGCGCTACCCGAACACCGCGCTGCTCTACGTCGAATTCGACTCAAGCCAGTTCAACGGCTCTATTCCTCAAATTTCATGCGAACCGCGCGGCCGCGTTATCCGCGTTCCAGATACCTACGACCCTGAAACCCGCACTTATAGCGGTACATGGACCGGTGCGTTTAAGTGGGCATGGACGGATAACCCTGCGTGGATTTTTTACGACCTGGTTGTTTCTGACCGGTTCGGCCTTGGGCACCGTTTGACCGCTGCGAATATTGATAAATGGACGCTTTATCAGGTTGCTCAGTATTGTGATCAGATGGTACCAGACGGCAAAGGGGGCAACGGTACAGAACCACGCTATACCTGCAACGTGTACATTCAGGACCGGAACGACGCCTACACAGTCCTGCGTGATTTTGCCGCTATCTTCCGTGGCATGACCTACTGGGGCGGGGATCAGATTGTGGCCCTGGCTGACATGCCGCGCGATGTTGATTACAGCTATACGCGCGCTAACGTTGTTGGCGGTCGCTTCACCTATTCGAGCAGCACCACGAAAAGCCGCTACACCACAGCGCTGGTTTCATGGTCAGACCCGGGTAACGCTTATGCCGACGCGATGGAGCCGGTATTTGAGCAGGCGCTGGTGGCGCGGTACGGCTTCAATCAGCTGGAAATGACAGCCATCGGCTGCACCAGGCAGTCAGAGGCGAACCGAAAGGGGCGCTGGGGTATTCTCACCAATAACAAGGATCGCGTTGTTTCGTTTGATGTCGGGCTGGACGGAAACATTCCGCAGCCTGGCTACATCATCGCCGTGGCAGACGAGCTGCTTTCCGGAAAGGTTATGGGCGGCCGCATCAGCGCCGTTAACGGTCGCGTTATCAAACTTGACCGCGTAGCTGATGCAGCACCAGGTGATCGCCTTATTCTCAACCTTCCCTCCGGAGCGTCGCAGAGCAGGACCATTCAGGCCGTGAACGGGGAATCAGTCACAGTCACCACGGCATACAGTGAGACGCCACAGGCCGAAGCTGTTTGGGTGGTTGAATCTGACGAGCTCTACGCGCAGCAGTATCGAGTTGTCAGCGTTTCCGATAACAATGATGGCACTTTCTCGATTACCGGCGCATGGCACGACCCGGATAAATATGCCCGTATCGATACCGGAGCCATCATTGACCAACGGCCGGTGAGCGTGATCCCGCCGGGCAACCAGTCGCCGCCTGCGAATATCGTGATCAGCTCGTTTTCCGTGGTTCAGCAAAATATCAGCGTCGAAACAATGCGCGTGAGCTGGGACCAGGCGCAGAACGCTATCGCCTATGAAGCGCAATGGCGCCGCAACGACGGGAACTGGGTTAACGTGCCGCGCACCTCCACCACGTCATTCGACGTCCCGGGGATTTATGCCGGGCGCTACCTGGTGCGGGTGCGCGCAATCAATGCCGCAGAAATTTCATCCGGATGGGGCTATTCAGAAGAGAAAACGCTGACGGGTAAAGTGGGCAATCCACCGAAGCCGGTTGGCTTTATCGCCTCTGAAAACGTGGTGTTCGGTATCGAGCTGAACTGGGGATTCCCGGCGAATACCGACGACACGCTGAAGACGGAAATTCAGTACAGCCTGACCGGGAGCGAAGATGATGCCATTCTTCTGAGCGATGTTCCCTATCCGCAGCGCAAGTATCAGCAGATGGGCCTGAAGGCGGGGCAAATTTTCTGGTACCGGGCGCAGCTGGTGGACAGGACAGGCAATGAGTCGGGTTATACCGACTGGGTACGTGGACAGGCCAGTATCGATGTGTCGGATATCACCGATGTTATCCTGGAGGACATTAAAGAATCGGACACGTTCAAGGAACTGATCGAAAGCGCAGTAGACAGCAACGAAAAAATTGCTGGTATGGCTGACGATATCAGACAGAACGCTGACGATCTGGAGCAACAGGCGCTGGCCATCAAGGAAAACGCCGATGGGCTCGCCCAGGCCGAGGTGAAGATTGACGAAATCTCTGTCTCGATGGATGGCATGACGGGAGGCGTTAAAAACTCCTCTATCGCGGTTATTCAGAACAGCCTCGCGCAGGTCACCAGCCGTCGATCCCAGACAGCCACCAACGCCGGGAACAGCGCCAGCATCGACCGTATCGATACCACCATTGCAGATACCAGCCAGGCGGTTGCCCGTGCGCTGGTTACGCTTGATGCTTCTGCCGGTGGTAATGTCTCAAACGCGACCGATCTCACCGAAACCCTTGCTGATTTCACGCAGGCCTCGGCCACGAAAATCAACTCCCTGACGGTTACGGTAAACGGCCAGACAGCGGCTATTAACCAGACCGCGCAGGCGGTGGCTGATGTGAACGGTAACCTCAGCGCGATGTATAACATCAAGGTTGGCGTTTCCAGCAACGGTCAGTATTACGCCGCGGGCATGGGGATTGGTGTTGAGAATACGCCGTCCGGGATGCAGTCGCAGGTCATCTTCCTGGCTGACCGCTTCGCCGTCACCACGGCAGCAGGTAACAGCGTGGCTTTGCCGTTTGTGATCCAGAATGGACAGACATTCATCCGGGCCAGTTTCATCCAGGACGGCACTATCAGCAACGCAAAGATTGGTAATTTTATCCAGTCGAACAATTATGTTGCTGGTTCTGCTGGCTGGAAGCTTGATAAAGGGGGGACGTTTGAGAACTACGGTTCGACTGCTGGTGAGGGAGCCATGAAACTGACAAATCAGACGATCAGCGTCAAAGATGGCAGTAATGTTCTTAGGGTGCAGGTTGGACGATTAACGGGAGTATTCTGA
Genome Context
Genome Context
Tertiary structure
PDB ID
3a45c913112fda2619a82a4bacadadbbca83032c0ef3423076d987a28a9d405d
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| The genomes of several lambdoid coliphages | Refardt,D., Gencoglu,M., Kunzli-Gontarczyk,M., Bruggmann,R. and Kropinski,A.M. | 2012-04 | — | GenBank |