Protein
- GenBank accession
- XTK82899.1 [GenBank]
- Protein name
- hypothetical protein
- RBP type
- TSP
- Protein sequence
MEQNKLTLGSLFDGSGGFPLGGLLSGITPVWASEIEPFPIRVTTKRMPFMKHYGDVSKMDGADVEPVDIITFGSPCQDMSIAGRREGLDGSRSSLFYEAVRIVKEMRCATDGKYPRYIVWENVPGAFSSNKGADFQSVLEEICSVKGYEIDPARPARWPAAGEIVADDFSLAWRVFDAQYWGVPQRRKRIYLVADFADGSAGKILFESEGVSGYTPQGFRPWQGTAGTFKESAGASGCVCLNDQGGSRMAVTENAAATLRAENHGHPPCVMGAAGFCTEHSAQARGIGYEEETSPTLRAGTVPAAVYENHSQDTRYTGPLETAPTVMSTYGTGGNNQPFVVETPKTLKIRSGCNGGGKGALIQENKSATLGCNNDQTVFVPFVKGTRPHSPDEGQQWKPSDVANTLNTYDVGEARCNELAVRVYGICSKQSHAMLSDNPHSGFYEADTSRCLDANGGNPTCNQGGMAVVAVQGSMIGRADKNGPQGSGVNEDVSFTLDAADRHAVAYCMTTGSYTQALEEQSPTLMARDYKDPPVVNETEPEYIVRRLTPTECARLQGFPDWWCDGLGTDEPSEEEIEFWTEVFETHRTVLGTSSKPKSRNQIIKWLKGPHSDSAEYKMWGNGVALPSVYFVLSGIVYYAQFPEG
- Physico-chemical properties
  - Protein length: 645 AA
  - Molecular weight: 69971.29760 Da
  - Isoelectric point: 5.08944
  - Aromaticity: 0.09302
  - Hydropathy: -0.42388
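The page does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle value, often called GRAVY), is shown below. The function names are illustrative, and the short sequence is only the start of this entry's protein sequence, so its values will differ from the full-length figures.

```python
# Kyte-Doolittle hydropathy values per residue (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = seq.upper()
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # Only the N-terminal prefix of the entry, used as a small example input.
    prefix = "MEQNKLTLGSLFDGSGGFPLGG"
    print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Pasting the full protein sequence in place of `prefix` makes it possible to compare against the values listed for this entry, provided the database used the same definitions.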
Domains
Domains [InterPro]
| InterPro ID | Type | Range |
|---|---|---|
| IPR029063 | STR | 1–214 |
| IPR029063 | STR | 6–636 |
| IPR001525 | ATT | 6–590 |
| IPR001525 | ATT | 7–23 |
| IPR001525 | ATT | 8–194 |
| IPR001525 | ATT | 10–194 |
| IPR018117 | Unmapped | 68–80 |
Architecture
ATT 1-590 | STR 591-636 |
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
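The architecture string can be parsed to see which residues are left unassigned. The sketch below is illustrative only: the helper names are hypothetical, and it assumes 1-based inclusive coordinates and the 645 AA protein length given for this entry.

```python
import re


def parse_architecture(text: str):
    """Parse 'ATT 1-590 | STR 591-636 |' into (label, start, end) tuples."""
    pattern = r"([A-Z]+)\s+(\d+)\s*[-\u2013]\s*(\d+)"
    return [(m.group(1), int(m.group(2)), int(m.group(3)))
            for m in re.finditer(pattern, text)]


def gaps(segments, protein_length: int):
    """Return inclusive residue ranges not covered by any segment."""
    out, pos = [], 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > pos:
            out.append((pos, start - 1))
        pos = max(pos, end + 1)
    if pos <= protein_length:
        out.append((pos, protein_length))
    return out


if __name__ == "__main__":
    segs = parse_architecture("ATT 1-590 | STR 591-636 |")
    print(segs)             # [('ATT', 1, 590), ('STR', 591, 636)]
    print(gaps(segs, 645))  # [(637, 645)]
```

Under these assumptions, the last nine residues (637 to 645) fall outside both architecture segments.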
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 223 | 223 | 0.1195 |
| Central domain | 224 | 537 | 314 | 0.8986 |
| C-terminal | 538 | 645 | 108 | 0.5639 |
Note: Constraints were applied during segmentation.
Fixed 19 C-terminal predictions appearing before Central domain
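As a consistency check on the segmentation, domain lengths can be derived from the start and end coordinates. This is a minimal sketch assuming 1-based inclusive coordinates. The function names are hypothetical and this is not the segmentation tool's own code.

```python
# Domain boundaries as listed for this entry (1-based, assumed inclusive).
DOMAINS = [
    ("N-terminal", 1, 223),
    ("Central domain", 224, 537),
    ("C-terminal", 538, 645),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in the closed interval [start, end]."""
    return end - start + 1


def tiles_protein(domains, protein_length: int) -> bool:
    """True if the domains cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in domains:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    for name, start, end in DOMAINS:
        print(name, inclusive_length(start, end))
    # N-terminal 223
    # Central domain 314
    # C-terminal 108
    print(tiles_protein(DOMAINS, 645))  # True
```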
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
- N-terminal: 1-223
- Central: 224-537
- C-terminal: 538-645
Taxonomy
| Role | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Acutalibacteraceae phage NatCom_11578 [NCBI] | 3403511 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | Acutalibacteraceae [NCBI] | 3082771 | cellular organisms > Bacteria > Bacillati > Bacillota > Clostridia > Eubacteriales |
Coding sequence (CDS)
- GenBank protein accession: XTK82899.1 [NCBI]
- GenBank nucleotide accession: PV036426 [NCBI]
- CDS location: range 27785 -> 29722, strand +
CDS
ATGGAACAGAATAAACTGACGCTGGGCAGCCTATTTGACGGCTCCGGCGGATTCCCTCTGGGCGGTCTGCTTTCCGGCATTACCCCTGTGTGGGCTTCGGAGATCGAGCCATTTCCCATCCGGGTGACCACCAAACGGATGCCGTTTATGAAGCATTACGGCGATGTATCCAAAATGGATGGCGCAGATGTAGAGCCGGTGGACATCATCACCTTCGGCTCGCCCTGCCAGGATATGAGCATCGCCGGCCGGCGGGAGGGTCTGGATGGCTCCCGCTCCAGCCTGTTCTACGAAGCCGTCCGAATCGTAAAAGAAATGAGGTGTGCGACCGATGGAAAATATCCAAGGTATATCGTCTGGGAAAACGTCCCCGGCGCGTTCAGCTCCAACAAGGGCGCGGACTTCCAGTCCGTCCTCGAAGAAATCTGCTCGGTCAAAGGATACGAGATTGATCCTGCTCGACCTGCGAGGTGGCCAGCCGCCGGGGAGATCGTGGCAGACGATTTCAGTCTCGCATGGCGGGTATTTGATGCGCAGTACTGGGGAGTCCCCCAACGCAGAAAACGTATCTACCTTGTCGCAGATTTTGCAGACGGGAGTGCCGGAAAAATACTATTTGAGTCCGAAGGCGTGTCTGGGTATACTCCGCAGGGCTTCCGCCCGTGGCAAGGAACTGCCGGAACTTTTAAGGAAAGCGCTGGAGCGTCAGGCTGTGTCTGCTTAAACGACCAGGGCGGCAGCCGCATGGCTGTGACGGAGAATGCCGCGGCAACGCTCCGGGCGGAAAACCACGGACACCCTCCCTGCGTGATGGGGGCAGCCGGTTTTTGTACCGAGCATTCCGCACAGGCAAGGGGCATTGGGTATGAGGAAGAAACTTCTCCCACCCTCCGTGCCGGGACGGTGCCGGCGGCGGTTTATGAGAACCATAGCCAGGATACCAGATACACCGGTCCACTGGAGACGGCGCCTACAGTAATGTCTACCTACGGCACGGGCGGCAACAACCAGCCCTTTGTGGTGGAAACGCCCAAGACGCTGAAGATACGCTCCGGCTGCAATGGCGGCGGCAAGGGCGCGCTGATCCAGGAGAACAAGTCCGCCACCCTCGGCTGCAACAACGACCAGACGGTTTTTGTGCCGTTCGTGAAAGGCACCCGCCCCCATTCTCCTGATGAGGGGCAGCAGTGGAAACCGTCCGATGTAGCGAATACACTGAACACCTACGATGTAGGCGAGGCCCGGTGCAATGAACTGGCGGTCAGGGTGTACGGCATCTGCTCCAAGCAGAGCCACGCCATGCTGTCGGACAATCCCCACAGCGGTTTTTATGAAGCGGACACTTCCCGGTGCCTGGACGCAAACGGCGGCAATCCCACCTGCAACCAGGGCGGCATGGCTGTGGTAGCAGTGCAGGGGTCCATGATCGGCAGGGCGGATAAGAATGGTCCCCAGGGAAGCGGTGTGAACGAGGATGTGTCTTTCACACTGGACGCTGCCGACCGCCATGCGGTGGCTTACTGTATGACCACCGGCTCTTACACTCAGGCATTGGAAGAACAATCTCCAACCTTGATGGCAAGGGATTATAAAGACCCGCCTGTGGTGAACGAGACTGAGCCGGAGTATATCGTCCGCAGACTGACGCCTACCGAGTGCGCCCGGCTGCAGGGATTCCCAGACTGGTGGTGCGATGGTCTTGGGACAGACGAGCCGTCTGAGGAGGAAATCGAGTTCTGGACAGAGGTGTTTGAGACACATCGCACCGTTCTGGGAACTTCCTCCAAGCCAAAGAGCCGGAACCAGATCATCAAGTGGCTGAAAGGCCCCCACTCCGATTCCGCAGAATACAAAATGTGGGGCAACGGTGTGGCGCTTCCCAGTGTTTACTTCGTGCTCTCCGGGATCGTGTACTATGCACAGTTTCCGGAAGGATAA
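The CDS coordinates and the protein length can be cross-checked with simple arithmetic. The sketch below assumes inclusive genomic coordinates and that the final codon is a stop codon (the CDS above ends in TAA). The function name is illustrative.

```python
def cds_summary(start: int, end: int):
    """Return (nucleotides, codons, amino acids) for an inclusive CDS range.

    Assumes the last codon is a stop codon and is not translated.
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nucleotides // 3
    return nucleotides, codons, codons - 1


if __name__ == "__main__":
    print(cds_summary(27785, 29722))  # (1938, 646, 645)
```

The result, 645 amino acids, matches the protein length reported for this entry.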
Genome Context
Tertiary structure
PDB ID
a66cd8a88c463b736f74a38200a85716aa60e31b08b22331a6052d0b8e36f743
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
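The confidence bands can be applied to per-residue pLDDT scores as follows. This is a sketch: the legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so placing them in the lower band is an assumption here. The example scores are made up for illustration and do not come from this model.

```python
from collections import Counter


def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: boundary values (90, 70, 50) fall into the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Hypothetical per-residue scores, for illustration only.
    scores = [95.2, 88.0, 64.5, 41.3, 90.0]
    print(Counter(plddt_band(s) for s in scores))
```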