Protein
View in Explore- Genbank accession
- CAB4130344.1 [GenBank]
- Protein name
- tail spike protein
- RBP type
-
TFTSPTF
- Protein sequence
-
MALKIAGVTVISDDIDLDISGASRLNSLEVSGTTVVNNLSINGLTAGLTAATGTNSSQIATTAFVQNTVAGSMSGIASNISSYPLNQLVSVNSAPTFQGTNISGFASSLNIGGNAATVTNGVYNNGGTYNVNITGNAATVTNGVYNNGGTYNVNITGNAATVTNGVYNNGGAYNITAANLLGLTLNSSSVPINPNNVTQNQIGYQNGLSVLGQSDGALYSSAYSSEWVHQIAGDFRSGQIAVRGKNSGSWQPWRIVLDSSNYNNYVPTVTGSNASGTWGINITGSAATAGSAASLTGMLTPQSMPDSGASAGTYGSNIQIPVLTVDSKGRITSITTVTGGFTGTYASDAYFNNVTIGRGAGNKSRNTALGTGALLNNTTGDGNTGVGYWSLINNTSGYNNIAVGDVALGNNSTGNNNVAVGISALNGNTTGYLNTAIGAQSLMKNTTGLNNTAVGFNTLQQNTVGNNMVAVGVGALNANTTGENNVAVGFATLSKNESGKSNTAMGYNSLYHNTSGSRNSAFGDGALMNNTDSSNTAVGYWSLINNTSGTLNTAVGDATLGNTTSGTHNTAVGAGALNGNTVGTRNVAFGYNALAKSVDNSWNTAMGIVTLSELISGAGNTAIGGGALRKAVTAGHNTAIGNGSNQSLTTGSSNTSVGSGTLSTLTTGDKNTAIGLRAGEALSSANLNTFIGANSGISVTSGNANVIIGGNNGSSIATTNNNIILSDGDGNIRATCDSTGSWNIGGERIGKNQITYLSKSCHNLMAMIMDGILYTTSGSNGSYSNYCTGRVPNGGYNKLGLDEFQPVLFPDSNVFSTRVVASGGNGHAVQWALLDDGRLYTWGLNSNGQCGLGDTITRVFPTLSASNVQEVFSHPSNGDYSVTYSRLFYRSKSGELFGCGYNGYGALGIGSTNGNVVFWTKINIPGLTSTTPMKVFNLGAHLGCTVILINGRIYVAGYNGYGQLGDGGTANVITFKDVTANWQTSTNATAVAAARSAQQQTIAAINTGITAIWNLAAAWGVTLGDGAGGTAIFKVTPEGYFSANYNVINGPASSFQPFKDAFYAAGGPYSQTYGRESTLRQQTDALNALLADQSIYPTITDIQASGGFGYYDSQGYSQSTLVILIKEASRSYVRACGNNSWGTLGDGTTTIRYTPVSPAFPATDADSIDEIAVFGGGPATVYARTKGYKLYTWGYNSFGQLGRPVTSVTYMTPGLAAENVGILHCDGLTSHTHSYYSQGLYCDVFGKLWITGYNDSSGYMGNGYTFQPTAYFTQPMLPKTEVVEEVGHYTTTSYGRSFIVRTSTGNVYAWGYNEYHNITSWLTGAAQVPVLIKLPMLNKELK
- Physico‐chemical
properties -
protein length: 1342 AA molecular weight: 138375,58430 Da isoelectric point: 6,24254 aromaticity: 0,08718 hydropathy: -0,11624
Domains
Domains [InterPro]
cd19958
STR
192–254
STR
192–254
DC_1422
STR
444–559
STR
444–559
IPR011049
STR
462–596
STR
462–596
IPR000408
STR
836–866
STR
836–866
IPR000408
STR
837–887
STR
837–887
DC_1778
STR
859–1048
STR
859–1048
1
1342
Architecture
STR 192-254 | STR 283-1048 | STR 1061-1339 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1342
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 76 | 76 | 0,3503 |
| Central domain | 77 | 295 | 220 | 0,5421 |
| C-terminal | 296 | 1342 | 1046 | 0,0960 |
Note: Constraints were applied during segmentation.
Fixed 42 C-terminal predictions appearing before Central domain
Fixed 42 C-terminal predictions appearing before Central domain
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-76
1-76
Central
77-295
77-295
C-terminal
296-1342
296-1342
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
uncultured Caudovirales phage [NCBI] |
2100421 | Uroviricota > Caudoviricetes > Peduoviridae > Maltschvirus maltsch > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
CAB4130344.1
[NCBI]
Genbank nucleotide accession
LR796237.1
[NCBI]
CDS location
range 244244 -> 248272
strand -
strand -
CDS
ATGGCGTTAAAAATTGCAGGAGTCACAGTAATAAGTGACGATATAGATTTAGACATCAGCGGTGCTTCCAGATTAAATTCACTTGAGGTGTCTGGAACCACAGTTGTTAATAACTTATCAATTAATGGCTTAACTGCTGGCTTAACCGCAGCCACTGGTACAAATAGTTCGCAGATTGCAACAACAGCATTTGTTCAAAATACAGTAGCAGGCTCAATGTCTGGTATTGCATCAAATATTTCTTCGTATCCACTCAATCAACTGGTGTCTGTTAATTCAGCACCAACATTCCAAGGTACTAATATCTCTGGATTTGCAAGTTCACTGAACATTGGTGGTAATGCAGCAACAGTAACCAATGGTGTATATAACAATGGTGGTACTTATAACGTCAACATTACTGGCAATGCAGCAACAGTAACCAATGGTGTATATAATAATGGTGGTACTTATAACGTCAACATTACTGGCAATGCAGCAACAGTAACCAATGGTGTATATAACAATGGCGGCGCATATAATATTACTGCCGCTAACTTACTGGGACTGACACTTAATTCCAGTTCTGTTCCAATCAATCCAAATAATGTAACCCAAAATCAAATCGGGTATCAAAATGGCTTGTCAGTACTTGGGCAGTCTGATGGTGCACTGTATTCATCTGCATACAGTTCGGAATGGGTACATCAAATTGCCGGTGATTTTAGATCAGGGCAAATTGCAGTTCGCGGTAAAAACAGCGGAAGCTGGCAACCCTGGCGTATAGTATTGGATAGCAGTAATTATAATAATTATGTACCCACAGTAACCGGGTCAAATGCAAGCGGTACGTGGGGCATCAACATTACTGGGTCTGCAGCAACGGCTGGTTCAGCAGCATCATTGACTGGTATGTTAACACCGCAATCAATGCCAGATTCCGGAGCCAGTGCGGGTACATATGGCAGTAATATACAAATTCCAGTATTAACCGTTGATAGTAAAGGCCGGATTACCTCTATTACCACAGTAACAGGTGGTTTCACTGGAACTTATGCAAGTGATGCATATTTCAACAATGTCACTATTGGCCGAGGTGCAGGAAATAAATCCCGCAATACTGCACTGGGCACAGGTGCGTTACTGAACAATACAACTGGTGATGGAAATACTGGAGTCGGCTACTGGTCATTAATCAATAATACATCCGGCTATAATAATATTGCAGTTGGTGACGTGGCCCTTGGTAATAATTCGACAGGTAATAATAACGTGGCAGTGGGTATTTCTGCATTGAATGGTAATACTACGGGTTATTTAAATACTGCAATTGGTGCTCAGTCATTGATGAAAAACACCACCGGTCTTAATAACACTGCCGTTGGGTTTAACACACTACAGCAAAACACTGTTGGGAATAACATGGTTGCCGTTGGTGTTGGTGCACTAAACGCAAACACAACAGGCGAAAACAATGTAGCAGTTGGCTTTGCCACATTGTCTAAAAATGAATCGGGCAAATCAAATACTGCAATGGGTTATAACTCATTGTATCATAATACCAGTGGTTCACGTAATTCAGCATTTGGTGATGGTGCATTGATGAACAACACTGACTCCAGTAATACTGCGGTTGGTTATTGGTCATTGATTAACAATACTTCTGGTACTTTGAATACCGCAGTTGGTGACGCCACCCTTGGGAATACCACCTCAGGGACACATAATACCGCAGTTGGTGCAGGTGCGTTGAATGGGAATACGGTTGGGACACGTAATGTTGCATTCGGTTATAACGCATTGGCCAAAAGTGTTGATAACAGTTGGAATACTGCAATGGGCATTGTTACCCTGTCAGAACTTATTTCTGGCGCTGGTAACACTGCAATTGGTGGGGGTGCATTGCGAAAAGCAGTTACCGCAGGCCACAACACTGCAATTGGTAATGGATCAAATCAATCATTGACAACAGGCTCTAGCAATACTTCGGTTGGATCGGGGACATTGTCTACTTTAACAACTGGGGATAAGAATACTGCAATTGGGTTACGTGCAGGAGAAGCACTGTCTTCGGCAAACCTCAATACCTTCATTGGTGCTAATTCTGGGATTTCAGTAACTTCTGGAAACGCCAATGTTATTATTGGTGGTAACAACGGGTCGTCAATAGCCACAACTAACAATAATATTATTCTCAGTGATGGTGACGGCAATATTCGTGCTACCTGCGACAGTACTGGTTCCTGGAATATTGGTGGTGAGCGCATAGGAAAAAATCAGATTACCTATTTGAGTAAGTCTTGCCATAATTTGATGGCTATGATAATGGACGGCATATTGTACACCACCAGCGGCTCAAATGGTTCCTATAGTAACTATTGTACTGGCCGGGTACCAAATGGTGGGTATAACAAATTGGGGCTTGATGAATTCCAACCAGTGTTGTTTCCTGACTCGAACGTATTTTCAACCCGGGTTGTGGCATCGGGTGGCAATGGCCATGCAGTTCAATGGGCTTTGCTAGATGATGGTAGATTATACACTTGGGGGTTGAATAGTAATGGGCAGTGTGGTTTAGGTGACACAATCACCAGAGTTTTCCCAACACTTTCTGCCTCAAACGTACAAGAGGTGTTCAGCCATCCATCTAATGGTGATTATTCTGTAACTTATTCCAGATTGTTTTATCGTAGCAAATCTGGAGAATTGTTTGGATGTGGCTATAATGGCTATGGTGCTCTTGGTATTGGTAGTACTAATGGCAACGTAGTGTTCTGGACCAAAATAAATATCCCAGGACTAACTTCAACAACACCAATGAAGGTTTTTAATTTGGGCGCCCATCTTGGGTGTACAGTTATTCTTATTAATGGTCGAATTTATGTTGCGGGATATAATGGGTACGGGCAATTGGGTGACGGCGGCACTGCAAACGTAATAACTTTCAAGGATGTTACTGCAAATTGGCAAACCTCAACAAATGCGACTGCCGTTGCGGCAGCCCGATCGGCGCAGCAGCAAACGATAGCAGCCATAAACACTGGAATTACTGCAATTTGGAATCTGGCTGCGGCATGGGGTGTGACATTAGGTGATGGTGCTGGTGGGACAGCTATATTTAAAGTAACCCCTGAAGGATATTTTTCTGCAAATTATAATGTCATAAATGGCCCAGCATCAAGTTTTCAACCGTTTAAGGATGCGTTTTATGCTGCTGGTGGCCCATACTCACAAACTTATGGTAGGGAAAGTACACTGAGACAACAAACAGACGCCCTTAACGCACTTCTAGCAGACCAGTCGATTTACCCAACCATAACTGATATACAAGCATCGGGTGGTTTTGGGTATTATGACTCCCAGGGCTATTCTCAAAGCACACTTGTTATTTTGATAAAAGAAGCAAGCCGTTCATATGTCAGGGCTTGCGGCAACAATAGCTGGGGCACATTGGGGGACGGCACAACAACAATTAGATATACCCCTGTATCCCCAGCGTTTCCAGCGACGGATGCAGATTCAATCGACGAAATAGCAGTTTTTGGTGGTGGCCCAGCTACGGTTTATGCAAGGACCAAAGGTTATAAATTATATACTTGGGGCTATAATTCGTTTGGGCAATTAGGTAGACCAGTGACATCAGTAACATACATGACCCCTGGTCTAGCTGCAGAAAACGTCGGAATATTGCATTGTGATGGGTTAACTAGTCATACACATTCATATTATTCACAGGGGTTGTATTGTGATGTTTTTGGTAAGCTCTGGATCACTGGATACAATGATTCGAGTGGTTATATGGGGAATGGCTATACTTTCCAACCCACTGCCTATTTTACACAGCCAATGTTGCCCAAAACTGAAGTTGTAGAAGAAGTGGGGCACTATACAACCACTTCGTATGGTAGGTCATTCATTGTAAGAACATCAACAGGAAACGTGTATGCCTGGGGATATAACGAGTACCACAATATAACAAGCTGGCTAACTGGTGCCGCACAAGTTCCAGTACTTATTAAGCTACCAATGTTAAATAAGGAATTAAAATGA
Genome Context
Genome Context
Tertiary structure
PDB ID
0b306832db5af0cc0f6aa1336c751e8dda1afc379648f96f47508f7f814562f5
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50