Protein
View in Explore- Genbank accession
- XOB77785.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
-
TFTSP
- Protein sequence
-
MNFTKTRNLLGVLALLIASAIGMNGQVIGTATPLPGAKALPGVIADYDFLQGSGTTVPDISGNGNNGTFGTSAPVWTPTGLSFTDPSQSVNLPIALNAAQAFMFAVYINPTTQTSATGQMYDLLISSTLGVDGTNLLISSFDKGSFSPASFSGHAFTDSCTTPIAGFHVLTYVLGFHNSSVDHFYIDGQECTSPTNTGTSYGFQTSGNYVLGSSPDSNFGTTSAFPGTVYRVAAWPNVLDATDVATFSQQVLGEVTNRGVAVKPQPQPQQGSTFFAIGDSITFGLGVTTAWPEGMTLASQSFIIQDYGIPSIELSTILGSEENRVAPMCAGGYGPNVAAVFAGTNDFAQGKSVAQVMSYLAGEVSVLKKAGCRVLVATMLDRTGADATKDAYDAAILTQAKGMGADGVIDFAAEPLIGADGSSVNTTNFQDGIHPTQPLQTRMSQIASNSLNYYFGSNAASPTVVTSATYQMLSGDGYVAAQPTANMTLTLPDCTGPSGAIYSISNGQSAFTVGVVSGSSAQPINGFLTGTVVTVPSNGSISLRDVPNMKNVSGCHWEATLPGSGSGVSTPSVPPIPGATADYFFTEGTGTTLVDHSGNGNDGTFAAAPPTWGPKGLVFTNIGQGVNLPAALNGTKTFYLAFYVQGPAITNASNIGNDVPQLVTSSLGQAGTNILMTDFAGSNATFTPVLENNGTLGTGAQTAISGFNVLTYTLGTSGTTVDHIYLNGVELPYFGVTADFGGQTSGNYILGSSNLDFFAGSGFLGTFYRAAFTPNEDTPAQVAANYLQIMADITSRGVPTTPQALPQVGPALFAVGDSITFGLGVDPTQAWPEILTLTNQPTYTIVNGGIPSISARASAGAEPNRFAPSCLTSSGQAVAILFAGTNDIGAFSQTADGTMGSIASFVTKMKKAGCTVFVGTMISRPFQGDDSRKDALDAKILSKWKSIGADGLVDFAADPNLGADMQGLNATYFQPDNIHPTVVGQTLLANAASNSLNYYYGASASAPTIVTTATYQMASGDGYVMANPTVNEVLTLPDCIGPSGATYTISNIQNSVTVGVTTGNANELINSFPQATVIPVPSNGSITLRDVPNPKQVSGCHWEAATLPLLGTTAPQSLGPGACGTSSVTIPGLTSTMVISMSAQGLLGTGVTIGQPFYTDPSVANVNVCTGAGTSTQNIVFNVRAE
- Physico‐chemical
properties -
protein length: 1186 AA molecular weight: 121566,33740 Da isoelectric point: 4,40396 aromaticity: 0,08432 hydropathy: 0,10936
Domains
Domains [InterPro]
IPR013320
STR
34–247
STR
34–247
G3DSA:2.60.120.200
STR
41–250
STR
41–250
PF13385
LEC
87–245
LEC
87–245
IPR013830
ENZ
814–985
ENZ
814–985
1
1186
Architecture
STR 34-250 | STR 257-453 | STR 579-789 | STR 809-1000 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1186
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 649 | 649 | 0,9531 |
| Central domain | 650 | 854 | 206 | 0,2144 |
| C-terminal | 855 | 1186 | 331 | 0,0051 |
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-649
1-649
Central
650-854
650-854
C-terminal
855-1186
855-1186
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Tunturiibacter phage Tunturi_5 [NCBI] |
3378199 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
XOB77785.1
[NCBI]
Genbank nucleotide accession
PP885688.1
[NCBI]
CDS location
range 294888 -> 298448
strand +
strand +
CDS
ATGAACTTCACGAAGACAAGGAACTTGCTGGGCGTACTCGCACTGCTGATCGCATCGGCGATCGGCATGAATGGACAGGTTATCGGCACCGCGACTCCGCTCCCAGGAGCGAAGGCCCTCCCCGGCGTCATCGCGGACTACGACTTTCTTCAGGGTTCGGGAACAACCGTTCCTGACATCTCGGGCAACGGCAACAACGGAACGTTCGGGACATCCGCGCCAGTGTGGACGCCGACTGGTCTTAGCTTCACCGACCCGTCCCAGTCGGTGAATCTGCCAATCGCGTTGAACGCAGCACAGGCTTTCATGTTCGCGGTGTACATCAACCCGACGACTCAGACCAGCGCAACAGGACAGATGTACGACCTGCTCATAAGCTCCACCCTCGGGGTGGACGGAACGAACCTGTTGATCTCCAGCTTTGACAAAGGCTCCTTTTCTCCAGCGAGCTTCTCGGGTCACGCCTTCACGGATAGCTGCACGACGCCAATTGCGGGCTTCCACGTACTGACGTATGTGTTGGGGTTCCACAACTCGAGCGTAGACCACTTCTACATCGACGGGCAGGAGTGTACCTCACCAACGAACACCGGAACATCCTATGGCTTCCAGACAAGCGGCAACTACGTGTTGGGTTCTAGCCCAGACTCCAACTTCGGTACAACCTCAGCCTTTCCCGGCACGGTATACCGCGTGGCAGCATGGCCGAATGTGCTTGACGCTACCGATGTCGCAACCTTCAGTCAGCAAGTTCTCGGGGAGGTCACAAACCGAGGAGTAGCTGTAAAGCCGCAGCCGCAACCACAGCAGGGGTCGACGTTCTTCGCTATTGGAGACTCGATCACATTCGGTCTGGGGGTAACGACAGCATGGCCAGAAGGCATGACGCTTGCTAGTCAGTCGTTCATCATTCAGGACTATGGTATCCCGTCTATCGAGCTCTCTACGATACTTGGTTCAGAGGAGAATCGTGTAGCGCCGATGTGTGCTGGGGGCTATGGGCCGAACGTAGCGGCTGTGTTCGCAGGCACGAATGACTTCGCCCAGGGAAAATCAGTGGCGCAGGTCATGTCTTACCTAGCAGGGGAAGTGTCCGTGCTGAAGAAGGCTGGTTGCAGAGTGCTTGTCGCAACGATGCTAGATCGTACGGGTGCGGACGCCACCAAGGACGCTTACGACGCAGCCATACTGACCCAAGCCAAAGGAATGGGCGCAGACGGTGTGATCGACTTCGCCGCAGAGCCATTGATCGGCGCTGATGGATCTTCGGTCAACACGACCAATTTCCAAGACGGCATCCACCCGACCCAGCCGCTGCAGACTCGCATGAGTCAGATCGCCAGCAACTCCTTGAACTACTACTTTGGCTCGAACGCTGCCAGTCCAACTGTGGTCACCTCCGCGACCTATCAGATGCTTTCAGGCGACGGGTACGTGGCAGCGCAGCCGACCGCGAACATGACTCTCACCCTTCCGGATTGCACCGGACCGAGCGGCGCGATCTACTCGATCAGCAATGGGCAGTCCGCGTTCACCGTCGGAGTAGTCAGTGGAAGCTCAGCTCAGCCGATCAACGGTTTCCTCACAGGAACGGTTGTTACGGTACCCTCGAACGGCTCGATCAGCCTGCGTGACGTACCCAACATGAAGAACGTCTCCGGCTGCCACTGGGAAGCAACACTCCCCGGTAGCGGAAGCGGTGTGTCAACTCCGAGTGTTCCTCCCATCCCCGGCGCGACGGCTGACTACTTCTTCACCGAGGGTACCGGCACCACACTCGTCGACCACAGCGGGAACGGTAACGACGGAACCTTCGCCGCCGCTCCTCCTACGTGGGGACCCAAGGGACTCGTGTTCACCAACATCGGTCAGGGTGTCAATCTTCCGGCTGCACTCAACGGTACGAAGACTTTTTATTTGGCCTTCTACGTACAGGGCCCGGCCATCACCAACGCGAGCAACATTGGTAACGATGTACCGCAACTGGTCACGAGCTCGCTCGGCCAGGCTGGTACCAACATCCTCATGACCGACTTCGCTGGTTCGAACGCCACGTTCACCCCCGTGCTTGAAAACAACGGGACGCTTGGAACGGGAGCCCAGACTGCCATCTCCGGATTCAATGTCCTGACCTACACCCTGGGCACGAGTGGCACCACAGTGGACCATATCTACCTCAACGGTGTGGAGCTGCCCTACTTCGGTGTCACTGCAGACTTTGGTGGCCAGACGAGCGGTAACTACATCCTCGGATCAAGCAACCTCGACTTCTTTGCGGGCAGCGGCTTCCTTGGAACGTTCTATCGCGCCGCGTTCACTCCCAACGAAGATACTCCAGCTCAGGTCGCCGCCAACTATCTGCAGATCATGGCCGACATCACCTCTCGCGGCGTCCCCACGACTCCGCAGGCACTGCCTCAGGTAGGCCCGGCCCTGTTCGCTGTAGGAGACTCCATTACGTTCGGACTTGGTGTTGACCCCACTCAGGCGTGGCCGGAGATTCTGACCCTGACCAATCAGCCGACCTACACCATCGTCAACGGCGGTATCCCCAGCATCAGCGCCCGTGCCTCGGCTGGAGCTGAGCCGAACCGCTTCGCACCCTCCTGCCTCACCTCCTCGGGTCAGGCCGTCGCGATCCTCTTTGCGGGAACTAACGATATCGGCGCATTCAGTCAGACAGCTGACGGCACCATGGGCAGCATCGCCAGCTTCGTCACCAAGATGAAGAAGGCTGGCTGCACCGTATTCGTCGGCACGATGATCTCTCGCCCCTTCCAGGGTGATGACAGCCGCAAAGATGCCCTCGATGCCAAGATCCTTTCGAAGTGGAAGTCGATCGGTGCGGATGGTCTCGTGGACTTCGCTGCGGACCCTAACCTGGGTGCGGACATGCAGGGCCTCAACGCGACTTACTTCCAGCCAGACAACATCCACCCCACGGTTGTCGGTCAGACGCTGCTTGCAAACGCGGCCAGCAACTCTCTCAACTACTACTATGGTGCGAGCGCGTCGGCTCCGACGATCGTAACGACAGCGACCTATCAGATGGCATCAGGGGACGGCTATGTCATGGCAAACCCCACCGTCAACGAGGTGCTCACGCTGCCCGACTGCATTGGTCCGAGCGGTGCGACCTACACGATCAGCAACATTCAGAACTCCGTCACTGTCGGTGTGACAACTGGCAATGCAAATGAGCTGATCAACTCCTTCCCGCAGGCTACCGTGATTCCAGTCCCGTCGAACGGATCCATCACGCTGCGTGACGTGCCCAATCCGAAGCAGGTCTCAGGGTGTCACTGGGAGGCGGCAACTCTTCCGCTGCTCGGTACAACCGCGCCGCAATCCCTCGGACCGGGAGCCTGCGGAACGTCGAGCGTCACGATCCCCGGACTCACCAGCACCATGGTGATCTCGATGTCGGCGCAGGGACTTCTCGGCACAGGCGTAACGATTGGTCAGCCCTTCTACACCGACCCGAGCGTGGCCAATGTCAACGTATGCACCGGCGCTGGTACCTCGACACAAAACATCGTGTTCAACGTACGGGCGGAGTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
46286e4f855ed331514e647bcb533b0c0a6b6794ae06d41229f52669f00e2b86
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Tunturi virus isolates and metagenome-assembled viral genomes provide insights into the virome of Acidobacteriota in Arctic tundra soils | Demina,T., Marttila,H., Pessi,I.S., Maennistoe,M.K., Dutilh,B.E., Roux,S. and Hultman,J. | 2024 | — | GenBank |