Protein
View in Explore- Genbank accession
- NP_037717.1 [GenBank]
- Protein name
- tail protein
- RBP type
-
TFTSPTF
- Protein sequence
-
MATEKVLKGRKGGSSSSRTPTEQPDDLQSVAKAKILVALGEGEFAGQLTGKDIYLDGTALENADGSQNFSGVTWEFRAGTQAQKYIQGIPGTENEISVGTEVSSATTWTRTFTNTQLSAVRLRLKWPSLFKQEDDGDLVGYSVNYAIDLQTDGGTWQTVLNTSVTGKTTSGYERSHRIDLPQAGSTWTIRLRKITSDANSAKIGDTMMLQSFTEVIDAKLRYPNTALLYVEFDSSQFNGSIPQISCEPRGRVIRVPDTYDPETRTYSGTWTGAFKWAWTDNPAWIFYDLVVSDRFGLGHRLTAANIDKWTLYQVAQYCDQMVPDGKGGNGTEPRYTCNVYIQDRNDAYTVLRDFAAIFRGMTYWGGDQIVALADMPRDVDYSYTRANVVGGRFTYSSSTTKSRYTTALVSWSDPGNAYADAMEPVFEQALVARYGFNQLEMTAIGCTRQSEANRKGRWGILTNNKDRVVSFDVGLDGNIPQPGYIIAVADELLSGKVMGGRISAVNGRVIKLDRVADAAPGDRLILNLPSGASQSRTIQAVNGESVTVTTAYSETPQAEAVWVVESDELYAQQYRVVSVSDNNDGTFSITGAWHDPDKYARIDTGAIIDQRPVSVIPPGNQSPPANIVISSFSVVQQNISVETMRVSWDQAQNAIAYEAQWRRNDGNWVNVPRTSTTSFDVPGIYAGRYLVRVRAINAAEISSGWGYSEEKTLTGKVGNPPKPVGFIASENVVFGIELNWGFPANTDDTLKTEIQYSLTGTEDDAMLLADVPYPQRKYQQMGLKAGQIFWYRAQLVDRSGNESGYTGWVRGQASIDVSDITDVILEDIKGSETFKDLIENAVDSNEKIAGMADDIKQANDELELQAQEIAKNAQDIGQVQTSVNELSSTVGDVSSSLSDLEQTVATADTALGQRIDSISVSMDGMTGGVKNSAIAIIQGNLAQVAARKTLSASVAGNSAQLDRLDEVIVSEKEATARSLLSLQTDVNGNKASINSLNQTLSDYQQATATQINGITATVNGHTSAITTNAQAIANVNGELSAMYNIKVGVSSNGQYYAAGMGIGVGNTPSGMQSQVIFLADPFAVTTAAGNSVALPFVIQNGQTFIRASFIQDGTIENAKIGNYIQSNNYAAGSAGWKLNKAGDAEFNNVTVRGVVYASGGSFTGEIQATSGKFKGTVEAQSFIGDIANMHTGTNVSRSSNGLLEKVITYTDSSSSGHARHVCVIANVRGNGAGTININGSEGSSSVQDVERLIMHSAVVTGPNVTVRITVSAQNNRGASISSPTIIVSHGSGSFTG
- Physico‐chemical
properties -
protein length: 1296 AA molecular weight: 139335,92470 Da isoelectric point: 4,94223 aromaticity: 0,07870 hydropathy: -0,28472
Domains
Domains [InterPro]
DC_0014
STR
1–1293
STR
1–1293
IPR053171
Unmapped
4–880
Unmapped
4–880
IPR055385
ATT
92–218
ATT
92–218
IPR013783
STR
622–714
STR
622–714
IPR003961
STR
623–717
STR
623–717
IPR003961
STR
628–702
STR
628–702
IPR036116
STR
637–702
STR
637–702
1
1296
Architecture
STR 1-91 | ATT 92-218 | STR 219-342 | ATT 343-505 | STR 506-1293 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1296
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 539 | 539 | 0,7640 |
| Central domain | 540 | 1123 | 585 | 0,0327 |
| C-terminal | 1124 | 1296 | 172 | 0,6749 |
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-539
1-539
Central
540-1123
540-1123
C-terminal
1124-1296
1124-1296
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Enterobacteria phage HK97 [NCBI] |
2681617 | Uroviricota > Caudoviricetes > Hendrixvirinae > Byrnievirus HK97 > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
NP_037717.1
[NCBI]
Genbank nucleotide accession
NC_002167
[NCBI]
CDS location
range 15141 -> 19031
strand +
strand +
CDS
ATGGCTACAGAGAAAGTATTAAAGGGCCGCAAGGGCGGCAGCTCCAGTTCACGAACTCCTACCGAACAACCTGATGATCTGCAATCTGTAGCGAAGGCCAAAATCCTCGTTGCGCTTGGGGAAGGTGAGTTTGCAGGGCAGCTCACCGGCAAAGATATCTACCTGGACGGAACAGCGCTGGAGAATGCTGACGGCTCCCAAAACTTCAGCGGGGTAACGTGGGAGTTTCGCGCGGGAACTCAGGCGCAAAAATATATTCAGGGTATTCCCGGTACCGAAAACGAGATCAGCGTAGGAACTGAGGTATCAAGTGCCACAACCTGGACGCGCACGTTTACCAATACGCAGCTTTCAGCAGTTCGCCTGCGTCTTAAATGGCCCTCGCTTTTCAAACAGGAGGACGACGGCGATCTGGTGGGTTACTCGGTCAATTATGCAATTGACCTGCAGACGGACGGCGGCACATGGCAGACGGTACTCAATACCAGCGTGACCGGCAAAACGACGTCTGGTTATGAGCGCAGCCACCGTATCGATTTACCGCAGGCTGGCAGCACATGGACAATACGTCTGCGTAAGATTACCTCTGACGCCAACAGCGCGAAGATCGGCGACACGATGATGCTGCAGAGCTTCACCGAGGTGATTGATGCCAAACTGCGCTACCCGAACACCGCGCTGCTCTACGTCGAATTCGACTCAAGCCAGTTCAACGGCTCTATTCCTCAAATTTCATGCGAACCGCGCGGCCGCGTTATCCGCGTTCCAGATACCTACGACCCTGAAACCCGCACTTATAGCGGTACATGGACCGGTGCGTTTAAGTGGGCATGGACGGATAACCCTGCGTGGATTTTTTACGACCTGGTTGTTTCTGACCGGTTCGGCCTTGGGCACCGTTTGACCGCTGCGAATATTGATAAATGGACGCTTTATCAGGTTGCTCAGTATTGTGATCAGATGGTACCAGACGGCAAAGGGGGCAACGGTACAGAACCACGTTATACCTGCAACGTGTACATTCAGGACCGGAACGACGCCTACACAGTCCTGCGTGATTTTGCCGCTATCTTCCGTGGCATGACCTACTGGGGCGGGGATCAGATTGTGGCCCTGGCTGACATGCCGCGCGATGTTGATTACAGCTATACGCGCGCTAACGTTGTTGGCGGTCGCTTCACCTATTCGAGCAGCACCACGAAAAGCCGCTACACCACAGCGCTGGTTTCATGGTCAGACCCGGGTAACGCTTATGCCGACGCGATGGAACCGGTATTTGAGCAGGCGCTGGTGGCGCGGTACGGCTTCAATCAGCTGGAAATGACAGCCATCGGCTGCACCAGGCAGTCAGAGGCGAACCGAAAGGGGCGCTGGGGTATTCTCACCAATAACAAGGATCGCGTTGTTTCGTTTGATGTCGGGCTGGACGGAAACATTCCGCAGCCGGGCTACATCATCGCCGTGGCAGACGAGCTGCTTTCCGGAAAGGTTATGGGCGGCCGCATCAGCGCCGTTAACGGTCGCGTTATCAAACTTGACCGCGTAGCTGATGCAGCACCAGGTGATCGCCTTATTCTCAACCTGCCTTCCGGAGCGTCGCAGAGCAGGACCATTCAGGCCGTGAACGGGGAATCAGTCACAGTCACCACGGCATACAGTGAGACGCCACAGGCCGAAGCTGTTTGGGTGGTTGAATCTGACGAGCTTTACGCGCAGCAGTATCGTGTTGTCAGCGTTTCCGATAACAATGATGGCACTTTCTCGATTACCGGCGCATGGCACGACCCGGATAAATATGCCCGTATCGATACCGGAGCCATCATTGACCAACGGCCGGTGAGCGTGATCCCGCCGGGCAACCAGTCGCCGCCTGCGAATATCGTGATCAGCTCGTTTTCCGTGGTTCAGCAAAATATCAGCGTCGAAACAATGCGCGTGAGCTGGGACCAGGCGCAGAACGCTATCGCCTATGAAGCGCAATGGCGCCGCAACGACGGGAACTGGGTTAACGTGCCGCGCACCTCCACCACGTCATTCGACGTCCCGGGGATTTATGCCGGGCGCTACCTGGTGCGGGTGCGCGCAATCAATGCCGCAGAAATTTCATCCGGATGGGGCTATTCAGAAGAGAAAACGCTGACGGGTAAAGTGGGCAATCCACCGAAGCCGGTTGGCTTTATCGCCTCTGAAAACGTGGTGTTCGGTATCGAGCTGAACTGGGGATTCCCGGCGAATACCGACGACACGCTGAAGACGGAAATTCAGTACAGCCTGACCGGTACTGAAGACGATGCCATGCTGCTGGCCGATGTGCCTTACCCGCAGCGCAAATATCAGCAGATGGGCCTTAAGGCTGGGCAGATTTTCTGGTACCGCGCGCAGCTGGTTGACCGCAGCGGTAACGAGTCAGGTTATACCGGCTGGGTTCGTGGGCAGGCCAGTATCGATGTTTCTGACATCACAGATGTGATCCTTGAAGACATCAAAGGGTCTGAGACGTTCAAAGACCTGATCGAGAACGCTGTGGACAGCAATGAAAAAATTGCTGGCATGGCTGACGACATCAAACAGGCCAACGATGAACTTGAACTCCAGGCGCAGGAAATCGCAAAAAACGCGCAGGACATCGGGCAGGTTCAGACCAGCGTTAATGAGCTTTCTAGCACGGTCGGTGATGTGTCGTCTTCTCTCTCAGATCTTGAGCAGACTGTTGCGACTGCTGATACCGCACTGGGCCAGCGAATCGACAGCATCAGCGTGTCTATGGACGGCATGACGGGCGGGGTGAAGAACTCTGCTATCGCAATTATTCAGGGCAACCTGGCGCAGGTAGCCGCGCGTAAAACGTTGTCTGCATCTGTAGCAGGGAACAGCGCTCAGCTGGACCGTCTAGACGAGGTGATCGTCAGTGAGAAGGAAGCCACAGCACGTTCATTGCTGAGCCTGCAGACGGACGTCAACGGCAACAAGGCATCCATCAACAGCCTGAATCAGACGCTCTCCGACTATCAGCAGGCCACCGCCACGCAGATAAACGGCATCACGGCGACCGTGAACGGGCATACCTCCGCCATCACCACTAACGCTCAGGCTATAGCCAACGTTAATGGCGAACTCAGCGCGATGTACAACATCAAGGTTGGTGTCTCCAGCAACGGGCAGTATTACGCCGCGGGCATGGGGATTGGCGTTGGGAACACGCCGTCAGGCATGCAGTCGCAGGTTATCTTCCTGGCTGACCCGTTCGCCGTCACCACGGCAGCAGGTAACAGCGTGGCTTTGCCGTTCGTGATCCAGAACGGGCAGACATTCATCCGGGCCAGCTTCATCCAGGACGGCACCATTGAGAACGCCAAAATCGGCAACTATATCCAGTCCAACAACTATGCAGCTGGTTCTGCTGGTTGGAAGTTGAATAAAGCTGGAGATGCTGAATTCAACAATGTGACCGTCAGAGGTGTAGTATATGCTAGTGGCGGTAGCTTTACTGGTGAGATCCAAGCAACAAGCGGTAAATTCAAAGGGACTGTGGAAGCCCAAAGTTTTATCGGGGACATTGCAAATATGCACACCGGAACTAACGTTAGTCGGTCTAGTAACGGTCTTTTAGAAAAGGTAATAACTTATACGGATTCATCCAGTTCTGGGCACGCAAGACACGTCTGTGTTATAGCAAACGTGAGAGGGAATGGTGCAGGTACGATAAATATTAACGGATCTGAGGGCTCTTCCAGCGTACAGGATGTAGAACGACTTATTATGCATTCTGCTGTTGTAACTGGTCCAAACGTTACGGTAAGAATTACAGTTTCCGCTCAGAACAATAGAGGGGCGTCTATATCCTCACCTACTATTATTGTTTCGCACGGATCCGGTTCATTCACTGGTTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
9402ed7760eb0c702ace54a06b6c21c12b178a7334558badb66d4d402705b448
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50