Protein
View in Explore- UniProt accession
- A0A8S5UWH4 [UniProt]
- Protein name
- Tail protein
- RBP type
-
TSPTF
- Protein sequence
-
MINVVLVRNPFKPDQHETQYRPYKANMPLSFYAKQDGDWVYSINGQEATIDTIVNDGDCIVAMPQIDGKFFGIILTIGLSIATGGIASGAIFGIQSLIWRTVLSMAIGMIGNMLVNKLTQPKADRSHTDSAQANTYGWGGAKTVTGQGYPLAVTYGRMKSAGLLLSRHIISDGEKQYLNLLYCAGEGELSKIEDIRINANPISNYQDVQVDIRLGTNDQTVIPNFNDNYADQVLNYELKTGWSTQRVQGDACNAIELTISFPNGLYYSNDTGGMDATSVTLDAEIRKVGENEEWHKLPLSNQKGMQAFVKKSGDGWSFTRQKSDAEIAEGDYKGKVTEATNTAFYRVYRFDNLDKAQYEVRVRCSSKDGSSIRYNNKVYWNQLTQIIYDDFVHPGKALIGIKALATSQLNGSDPEVSWIQERSAVYVFNPYQQKYEVQRADNPAWAAYDLLHMARKFGDEYVVFGQPHGRMDYDAFKAWASNCDKNGFTFNYIYDSASRLWDALKYPENVGRGKVIPQGTRFTCVSDYKSTPVQLFTVANIKQGSFSEEFQGIQSRANSVEISFLNKDKDYERDVIPVYGDTYDESDTLTNPAQIELMGCTSLDQAFKHGKHYLRCNKYEVRTVTIEAFTDAIACTIGDIILIQHDVPEWGEGGRVVAVTGSTITLDKEVSTLPGKQYQLLIRNSATDAVTTLTVLSVIGRNVTVKETITVEPGSVYAFGELTKAAKPFRVLAITEGGTDLTRKIQCMEYYPEVYSSDDGTVPTIDYKSEVGSDIEDIGLVSDVYGANGIMYSRIAVRWQLPRDGKITNVVVNYRNAKSDTWKYVGNFPASPNSTEISDVLLGATYEVKVQAINDLGQLTTGVTKEIVIPKMQAPGDVQNLHVISRYNLTADKSVYYDLQVMFEPPANPGNFDSAEVWYKLKSKNGQAVTGQDWQFAGSSNSQVIIKALGPGEEYEIKAVAVDRFGNRSDTAQVVDVVVKAMDEVPDMPSNFTVAFKDHATASWNDVLNADVDYYELRTDNDPGKDTNALLAKVKDTSANLPLTKRSGTVYLYARSTLGKYSTPATYSYNLPQLEAPTFEVKDQLGGFSLYFGAKPPQAYVIRCHVIGDDRTDDLETTSSMLTYSNKAGVYRVRCEYVDVFGSSLVAEKSVTIKDRVDKSLLDAEALGLKAMDESIQAMSSEVGTMKTSVNGFESKLVQLDKGITQKVTDLNQNLSGQITTLANGIDLQVTQAIGNLSGKDIVSRINLSPEGTRIDGKLLHVTGQALFDNNIITEGMLQANSVSADKIQALSISSDKLQADSVTADKLKVNSLDAITATIGTLRTKTSGARVEISDNLIQVFDDNNVLRVRLGLWDD
- Physico‐chemical
properties -
protein length: 1357 AA molecular weight: 149798,53300 Da isoelectric point: 5,08870 aromaticity: 0,09064 hydropathy: -0,31680
Domains
Domains [InterPro]
DC_0129
STR
1–943
STR
1–943
IPR053171
Unmapped
193–976
Unmapped
193–976
IPR055385
ATT
231–388
ATT
231–388
NF040662
Unmapped
310–755
Unmapped
310–755
IPR003961
STR
792–858
STR
792–858
IPR003961
STR
793–857
STR
793–857
IPR036116
STR
793–861
STR
793–861
1
1357
Architecture
STR 1-230 | ATT 231-388 | STR 389-529 | ATT 530-659 | STR 660-943 | RBD 944-1350 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1357
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 288 | 288 | 0,8215 |
| Central domain | 289 | 487 | 200 | 0,2074 |
| C-terminal | 488 | 1357 | 869 | 0,0361 |
Note: Constraints were applied during segmentation.
Fixed 21 C-terminal predictions appearing before Central domain
Fixed 21 C-terminal predictions appearing before Central domain
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-288
1-288
Central
289-487
289-487
C-terminal
488-1357
488-1357
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Myoviridae sp. ctj4n23 [NCBI] |
2825159 | Uroviricota > Caudoviricetes > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
DAF98839.1
[NCBI]
Genbank nucleotide accession
BK016156
[NCBI]
CDS location
range 27577 -> 31650
strand +
strand +
CDS
ATGATTAATGTAGTGCTAGTAAGGAATCCGTTTAAACCGGATCAGCATGAAACACAATACCGCCCTTATAAGGCAAACATGCCATTAAGCTTTTATGCTAAACAAGATGGCGACTGGGTATACTCCATTAATGGCCAAGAGGCAACGATTGATACCATCGTTAACGATGGCGATTGTATCGTGGCCATGCCACAGATTGACGGTAAGTTCTTTGGAATTATTTTAACCATAGGCCTTAGTATCGCCACAGGCGGTATCGCAAGTGGCGCGATATTTGGTATCCAAAGTCTAATATGGCGTACAGTACTCTCCATGGCCATTGGTATGATTGGCAATATGCTCGTCAATAAGTTAACTCAACCAAAGGCTGACCGGTCACATACGGACTCTGCACAGGCTAACACGTATGGGTGGGGCGGTGCTAAGACTGTAACCGGGCAAGGGTACCCTCTAGCCGTTACGTACGGCCGTATGAAGAGCGCGGGGCTCCTTTTATCTCGTCACATTATTAGTGACGGCGAAAAGCAGTACCTTAACCTTTTATATTGTGCCGGTGAAGGTGAGTTATCAAAAATCGAGGATATCCGTATCAATGCTAACCCTATTAGTAACTACCAGGATGTGCAAGTGGATATCCGATTAGGTACCAACGACCAAACTGTTATCCCTAATTTCAATGATAACTACGCAGACCAAGTACTCAACTATGAGCTAAAAACCGGGTGGAGTACGCAACGTGTACAAGGCGACGCGTGCAATGCTATCGAGCTAACTATCAGCTTCCCTAATGGATTGTATTACTCCAACGATACAGGCGGGATGGACGCTACGTCGGTTACTCTTGACGCGGAAATCCGGAAAGTAGGTGAAAACGAAGAGTGGCATAAGTTACCACTCTCTAATCAAAAGGGCATGCAAGCCTTCGTTAAGAAATCCGGAGACGGATGGTCCTTTACTCGTCAAAAGTCTGATGCAGAAATCGCTGAAGGCGACTATAAGGGCAAGGTTACCGAGGCTACAAACACCGCGTTCTATCGAGTGTACCGATTTGATAACCTCGATAAGGCGCAGTATGAAGTCCGTGTTCGTTGCTCCAGTAAGGATGGTAGCTCTATTCGATACAACAATAAAGTGTACTGGAACCAGTTAACGCAGATTATATATGATGACTTCGTACATCCAGGCAAGGCTCTTATCGGTATTAAAGCGTTGGCCACATCTCAACTTAACGGCTCTGACCCTGAAGTATCTTGGATACAGGAACGCTCCGCCGTGTATGTGTTTAACCCATATCAACAAAAGTACGAGGTCCAACGTGCGGATAATCCGGCATGGGCGGCGTATGATCTACTTCATATGGCGCGTAAGTTTGGCGATGAATACGTCGTGTTTGGCCAACCGCATGGACGCATGGATTATGACGCGTTCAAAGCCTGGGCAAGTAATTGCGATAAGAACGGATTCACATTCAACTATATCTACGATAGCGCTAGTCGGTTATGGGATGCGCTCAAATATCCGGAAAACGTAGGCCGAGGTAAAGTCATTCCACAGGGGACTAGGTTCACATGTGTTAGCGATTATAAGTCAACACCTGTACAGTTATTTACGGTGGCCAACATTAAGCAAGGCAGTTTCTCAGAAGAGTTCCAAGGTATCCAAAGCCGGGCCAACTCCGTAGAAATCTCCTTCCTTAATAAGGATAAGGACTACGAACGTGATGTTATCCCGGTATACGGCGATACATACGACGAATCGGATACGCTTACCAACCCTGCACAAATAGAGCTCATGGGATGTACTAGCCTAGACCAAGCTTTCAAACATGGTAAGCACTACCTACGATGCAATAAGTACGAGGTGCGTACTGTTACTATCGAAGCTTTCACCGACGCCATTGCATGTACGATAGGGGATATTATCCTTATCCAACATGACGTACCTGAATGGGGCGAAGGTGGTCGAGTGGTAGCGGTTACAGGTAGCACGATTACTCTTGATAAGGAAGTGTCGACCCTACCTGGCAAGCAGTACCAGCTACTGATTCGTAACAGCGCTACCGATGCGGTGACTACGCTCACAGTACTCAGCGTGATTGGCCGTAATGTAACCGTTAAGGAAACGATTACAGTCGAACCCGGTAGTGTGTACGCCTTTGGCGAGTTAACCAAAGCGGCTAAACCATTCCGGGTGCTAGCAATCACAGAGGGCGGTACCGACCTTACTCGTAAAATACAGTGCATGGAATACTATCCAGAAGTGTATTCGAGTGATGATGGGACCGTACCAACTATCGACTATAAGTCGGAGGTTGGCAGTGATATCGAGGATATAGGCCTCGTGAGTGATGTATACGGTGCTAACGGCATTATGTACTCACGAATTGCCGTCCGTTGGCAACTACCTCGTGACGGCAAGATAACCAACGTAGTGGTTAACTATCGTAACGCTAAAAGCGATACCTGGAAATACGTGGGGAACTTCCCCGCATCACCTAATAGCACGGAGATATCCGATGTACTATTAGGGGCTACTTACGAGGTTAAGGTGCAAGCGATTAACGATTTAGGGCAACTCACTACAGGTGTTACGAAGGAAATCGTTATCCCTAAGATGCAAGCGCCTGGCGATGTGCAGAACCTACATGTCATTAGCCGATATAATCTAACCGCTGATAAGAGCGTGTACTATGACCTTCAAGTGATGTTCGAGCCACCAGCTAACCCTGGCAACTTCGACAGCGCTGAGGTGTGGTACAAACTTAAATCTAAGAATGGCCAGGCCGTAACCGGTCAAGATTGGCAGTTCGCGGGCAGTAGTAACAGCCAGGTCATTATCAAGGCGTTAGGCCCTGGTGAAGAGTACGAGATTAAGGCCGTAGCCGTGGATAGGTTCGGTAATCGTTCCGATACCGCCCAAGTCGTTGACGTAGTAGTCAAGGCTATGGACGAGGTACCGGATATGCCTAGTAACTTCACCGTAGCTTTCAAGGACCACGCCACCGCATCATGGAACGATGTTCTAAACGCTGACGTGGATTACTACGAACTACGCACCGATAATGACCCAGGGAAGGATACCAACGCACTACTTGCGAAGGTGAAAGATACCTCGGCTAACTTACCGCTTACGAAACGAAGTGGTACGGTGTACTTGTATGCGCGAAGTACGCTAGGCAAGTACTCAACGCCGGCAACGTATTCGTATAATTTGCCACAGTTAGAGGCACCTACGTTTGAGGTCAAGGACCAACTTGGAGGATTCAGCTTGTACTTTGGGGCGAAGCCTCCACAGGCTTACGTTATCCGTTGCCACGTTATTGGTGATGATCGTACAGACGATTTAGAGACAACGTCTAGCATGCTCACCTATTCCAATAAAGCCGGGGTATATCGTGTGCGGTGTGAATATGTCGACGTGTTCGGTAGTAGCTTAGTCGCTGAGAAGTCGGTCACTATTAAGGACAGGGTTGATAAGAGCCTACTTGATGCGGAAGCATTAGGGCTAAAAGCTATGGACGAATCAATCCAAGCGATGAGCTCTGAAGTTGGAACGATGAAAACCTCTGTTAATGGGTTCGAATCTAAATTGGTTCAACTTGATAAGGGAATTACTCAAAAGGTAACTGACCTTAATCAGAACCTATCCGGTCAAATTACTACGCTAGCCAATGGTATTGACCTTCAGGTAACACAGGCTATCGGTAACCTGAGTGGTAAGGATATTGTTAGCCGGATTAACTTATCCCCTGAAGGTACTCGAATCGACGGCAAGCTATTACACGTAACTGGCCAAGCTCTGTTCGATAATAACATCATCACAGAAGGTATGCTCCAAGCTAACTCCGTGAGTGCGGATAAGATACAAGCCCTATCCATTAGTAGTGACAAACTTCAAGCAGATAGCGTTACCGCTGATAAATTAAAGGTGAATAGCCTTGACGCTATCACGGCAACGATTGGTACGCTCCGCACTAAGACGAGTGGCGCTCGTGTTGAGATATCCGATAACTTAATCCAGGTGTTTGACGATAACAATGTACTGAGAGTGAGGTTAGGCCTATGGGACGATTGA
Genome Context
Genome Context
Tertiary structure
PDB ID
79e093a8deb1b24db60688779884e30cf483d4ba8b74b5583775d33de3628fc1
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50