UniProt accession
A0A514TUI2 [UniProt]
Protein name
Tail spike TSP1/Gp66 N-terminal domain-containing protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
TSP
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MSRFNTGNNIPSISEEDFYDNSMALDEAMNSTDPTWRDRFNVEKPTIDAALKSAGFMPAAFDFVTGGTLQPGDRNKAVYNPAPNGDNNWYRWNGVFPKEIAANSQPNPKDENNWVLAHFRIGIVEKEALRRTYLEAGYNLVNGSFEQGGTLVNSNDVLLQERTGKVFTGPAGIVAAGTNPASGGFVDVSPFVLKTEPPLIAKTGSFSSGGRAESMKSALLSSDGYYYTPRTGVITAAPGSSPNSMWVCVGLLNGAPIDDMLNWLDGRPHLNAFIAARDSKSIRGGGVVTFPAGDYTFADEFVCVDYVKFVGEDRSKCRIFGAAGSGAGKAVVRACKSPVGSADIPEYLSYSGFSNCTINGNATWDVGLYVRHCTNESVFDNVTAQNCKKANSIFIGVFYVSAKNHVSRDARDLGAVIGKKIFSEGGLREVNASAFNNLRGNYAGLDDAYDPTTNPYAGACITIWTANSCAFDYVGAENAYGAGAIIRKGINSTIPNLYVESNGKGTAAVDKIGARIIAADFPSLIIGSLFATRQQKIYLEGSSLLQVGEIYSESFANGIFMGTGKVLLMNGHSANNYIGADQAFINNIETKRIAQFGNVPFTNFASLDSTAVIFGEAMTNVQVVMVPRVTLTTADPIVIGLSNSLSGGQVLEFGTSFTAGVPITKTFSKVSKGAGRLTHRSNYLPSTTTNFAMDVFIIQYVCDYQAIYNKWF
Physico-chemical properties
protein length: 712 AA
molecular weight: 76382,91840 Da
isoelectric point: 5,84785
aromaticity:0,10815
hydropathy:-0,09888
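
Indices such as aromaticity and hydropathy are usually derived directly from the amino acid sequence. The following is a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle value (GRAVY); the record does not state which tool or scale produced the values above, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name the scale used).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence in this record, used as a short demo input.
fragment = "MSRFNTGNNIPSISEEDFYD"
print(len(fragment), aromaticity(fragment), round(gravy(fragment), 3))
```

Passing the full 712-residue sequence instead of the fragment gives whole-protein values that can be compared with the figures listed above.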

Domains

Domains [InterPro]
DC_1559 ATT 1–133
G3DSA:2.10.10.80 ATT 59–117
DC_0018 ATT 115–199
Sequence track: A0A514TUI2, residues 1–712
Architecture
ATT 1-199 | STR 270-608
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
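
The architecture string covers only part of the 712-residue protein, so the remaining stretches correspond to the "Unmapped" category in the legend. A minimal sketch of how those gaps can be listed, assuming 1-based inclusive coordinates; the helper name `unmapped` is hypothetical and not part of any tool referenced by this record.

```python
def unmapped(segments, length):
    """Return 1-based inclusive ranges not covered by any segment."""
    gaps, pos = [], 1
    for start, end in sorted(segments):
        if start > pos:
            gaps.append((pos, start - 1))
        pos = max(pos, end + 1)
    if pos <= length:
        gaps.append((pos, length))
    return gaps


# Coordinates taken from the architecture line above.
architecture = {"ATT": (1, 199), "STR": (270, 608)}
gaps = unmapped(architecture.values(), 712)
print(gaps)
```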

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 266 266 0,9940
Central domain 267 592 326 0,9831
C-terminal 593 712 120 0,9822
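
Segment lengths follow from the inclusive start and end coordinates (end - start + 1). A short sketch that recomputes them and checks that the three segments tile the protein without gaps or overlaps; the variable names are illustrative.

```python
PROTEIN_LENGTH = 712

# (name, start, end) with 1-based inclusive coordinates from the table above.
domains = [
    ("N-terminal", 1, 266),
    ("Central domain", 267, 592),
    ("C-terminal", 593, 712),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based inclusive range."""
    return end - start + 1


lengths = {name: inclusive_length(s, e) for name, s, e in domains}

# Each segment should begin one residue after the previous one ends.
contiguous = all(
    domains[i][2] + 1 == domains[i + 1][1] for i in range(len(domains) - 1)
)
covers = domains[0][1] == 1 and domains[-1][2] == PROTEIN_LENGTH

print(lengths, contiguous, covers)
```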
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-266
Central 267-592
C-terminal 593-712

Taxonomy

  Name Taxonomy ID Lineage
Phage Aeromonas phage PS1 [NCBI] 2591406 Uroviricota > Caudoviricetes > Chimalliviridae > Ferozepurvirus PS1
Host Aeromonas hydrophila subsp. hydrophila [NCBI] 196023 Pseudomonadota > Gammaproteobacteria > Aeromonadales > Aeromonadaceae > Aeromonas > Aeromonas hydrophila subsp. hydrophila

Coding sequence (CDS)
Genbank protein accession
QDJ96679.1 [NCBI]
Genbank nucleotide accession
MN032614 [NCBI]
CDS location
range 164481 -> 166619
strand +
CDS
ATGTCAAGATTTAATACTGGGAACAATATCCCGTCGATATCAGAAGAAGATTTTTATGATAACTCCATGGCTTTAGATGAAGCTATGAATAGCACTGATCCTACATGGAGAGATCGGTTTAATGTTGAGAAACCTACGATCGACGCAGCTTTGAAATCCGCAGGATTTATGCCGGCGGCATTTGACTTTGTAACTGGTGGCACTCTTCAACCTGGAGATCGCAATAAAGCTGTTTATAATCCGGCACCTAACGGCGATAATAACTGGTATCGCTGGAATGGTGTATTTCCGAAGGAAATTGCTGCTAATAGTCAGCCTAATCCCAAAGATGAAAATAATTGGGTATTGGCACATTTTAGAATTGGAATAGTCGAAAAGGAAGCACTTCGTAGAACATATTTGGAAGCAGGTTATAATCTTGTTAACGGTAGCTTTGAACAAGGTGGGACGTTAGTAAATAGTAATGATGTGCTGCTGCAAGAGCGCACAGGGAAAGTATTCACCGGTCCTGCGGGTATAGTTGCCGCCGGAACGAATCCGGCGAGCGGCGGGTTTGTTGATGTGTCGCCATTCGTATTGAAGACTGAGCCGCCACTGATAGCAAAAACAGGTTCATTCTCATCTGGTGGGCGTGCGGAATCTATGAAATCTGCGCTTCTTTCTTCAGACGGTTACTACTACACACCGAGAACTGGTGTTATCACAGCGGCTCCAGGATCATCTCCTAATTCTATGTGGGTCTGTGTGGGTCTGCTTAACGGAGCGCCCATAGACGACATGCTAAACTGGCTTGATGGTCGCCCGCATCTAAATGCTTTCATTGCTGCTCGCGACAGCAAGTCAATTCGTGGCGGCGGGGTGGTCACTTTCCCTGCTGGCGATTACACGTTTGCTGATGAATTCGTGTGCGTAGATTACGTCAAGTTTGTGGGTGAAGATCGTAGCAAGTGTCGCATATTTGGTGCAGCGGGTTCAGGAGCGGGAAAGGCTGTAGTGAGAGCTTGCAAGTCGCCTGTCGGTTCTGCTGACATACCGGAATATCTATCTTATTCGGGGTTTAGTAACTGCACAATTAACGGCAACGCAACTTGGGATGTTGGCCTTTACGTTCGCCACTGTACAAACGAATCTGTTTTCGATAACGTTACAGCGCAGAACTGTAAAAAGGCTAACTCAATTTTTATCGGCGTCTTTTATGTTAGTGCCAAAAATCACGTAAGCCGAGACGCACGTGACTTGGGTGCTGTCATCGGTAAGAAGATTTTCAGCGAAGGCGGGCTTCGTGAGGTTAATGCTTCAGCTTTCAATAACCTCAGAGGCAATTACGCAGGACTTGACGACGCCTATGATCCGACCACAAACCCATATGCAGGTGCCTGCATAACAATATGGACCGCTAACAGTTGCGCATTTGATTATGTTGGTGCCGAGAATGCTTACGGTGCTGGTGCTATTATCCGCAAAGGGATAAACAGCACAATCCCGAACTTGTATGTCGAGTCCAATGGGAAGGGCACTGCTGCTGTGGATAAGATCGGAGCCAGGATCATTGCTGCAGATTTCCCTTCACTGATCATTGGTTCCCTCTTTGCAACAAGACAGCAGAAGATTTATCTGGAGGGGAGCTCACTGCTGCAAGTTGGCGAAATCTACTCTGAGTCTTTTGCAAATGGAATTTTCATGGGCACAGGCAAAGTGCTGTTGATGAATGGCCACAGCGCCAATAATTACATTGGGGCAGATCAAGCATTCATTAATAACATTGAAACTAAACGCATTGCTCAGTTTGGGAATGTCCCTTTCACCAACTTCGCGAGCCTAGACTCCACAGCTGTTATTTTTGGCGAGGCAATGACAAATGTCCAAGTGGTGATGGTGCCTAGAGTAACTTTAACCACGGCAGACCCGATAGTCATTGGGCTGTCAAATTCTCTATCCGGGGGGCAAGTGCTAGAATTTGGGACATCGTTTACTGCGGGAGTGCCTATTACCAAAACGTT
CAGCAAAGTTTCTAAAGGTGCTGGAAGACTTACTCACCGGTCAAATTACCTGCCATCTACAACAACAAACTTTGCAATGGACGTGTTCATTATTCAGTACGTTTGTGATTACCAGGCTATTTATAACAAGTGGTTTTAA
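
The CDS coordinates and the protein length can be cross-checked with simple arithmetic, and the ends of the CDS can be translated to confirm the reading frame. A minimal sketch, assuming the standard genetic code (for the codons used here it agrees with the bacterial and phage table 11) and inclusive genome coordinates; the function names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code in TCAG codon order; '*' marks stop codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate complete codons of a DNA string in frame 0."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive coordinate range."""
    return end - start + 1


n_nt = cds_length(164481, 166619)   # CDS location from this record
n_aa = n_nt // 3 - 1                # subtract the stop codon

cds_start = "ATGTCAAGATTTAATACTGGGAACAATATC"  # first 30 nt of the CDS above
cds_end = "AAGTGGTTTTAA"                      # last 12 nt of the CDS above
print(n_nt, n_aa, translate(cds_start), translate(cds_end))
```

The translated start matches the first ten residues of the protein sequence, and the final codon is a stop, which is consistent with the stated 712 AA length.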

Genome Context

Gene Ontology

GO ID Description Category Evidence (source)
GO:0044423 virion component Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0051701 biological process involved in interaction with host Biological Process IEA:UniProtKB-ARBA (UniProt)
GO:0019058 viral life cycle Biological Process IEA:UniProtKB-ARBA (UniProt)

Tertiary structure

PDB ID
5b69871e24806f4bb5526882d8096f140a7c44e655b11281c0f431b6b46579b0
Source ESMFold
Method ESMFold
Resolution 0,7365
Oligomeric State monomer
Model Confidence
Very high pLDDT > 90
High 90 > pLDDT > 70
Low 70 > pLDDT > 50
Very low pLDDT < 50
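
The confidence classes above map a per-residue pLDDT score to a label. A small sketch of that mapping; the legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so assigning them to the lower class is an assumption made here, and `plddt_class` is a hypothetical helper name.

```python
def plddt_class(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used in the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower class; this is
    an assumption, since the legend only gives strict inequalities.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 81.0, 63.5, 42.0):
    print(value, plddt_class(value))
```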