Protein
- Genbank accession
- UGO51777.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
- TFTSPTF
- Protein sequence
-
MGKGGGGKARTPVEDKDTIESSQMISVIDLWSEGQIYGLVDGLKSVRLDNTAVVAEDGSVNIPGVDISYNLGTEDQDYLDGFPQNASEISVGVEVKQSAPVIRTITDQRIDMLRINLYSRALFVITNDGDTKRTDLKMRVETRKGSEPWEVKARIDFIDQKSRNEFSFKAEIWDLPATPFDVRVVRETSDDQTGDFQQIQNSSFWRSYSQVINQKYRWPLTAYMGLKFDSKSFEGAIPRRNYIVRGMIVKVPTNYDPETRKYNGFWNGQFKAAYTNNPAWIVYDIIKNPRYGLGRRGINVEETEMYKAAQWCDQLVPDGRGGMEPRVTCNCYITEQRNAWDLITDIMSCFRAMPLWNGQQFVPSLDIAKDVVATYNNSNVINGTFEYSASSMEDRHSVIEVRYANKANNYEQDTVQITDDLMIEQYGWNVLKVEAFGTDTESQAYRFGSYLLETERLERKTVSFSTGAEGLRNLPGDVIAVADSRQYGRIIGGRILSVSEDRKSIELDDEVEIPNNSETLIIVIGDDRKPVELLCTNNPGKAKVLNFSATCPESLGRLSPWSLKINNSGLKLWRCVSVKENDDGTYAINCVEHVPEKNEIVDNGVKFNPPEETLYGNNLPPVENISVEAVVENPNANVRVYWDAPRTARQIRYNVRIYRSGNLVTNQNIDNPSFSFMADTAGTYRAEIRCLGSDGKLGDSVDVVFVIAEPSMPSDVSWRASNFTVTLRPIPGGLVTIGEVYEWFIGSTEQEVLAMNNNLGEAFVLNQVGLKPNTEYWFGVRAVNMIGRSAIKTVLTKTAFETESLEGLIDVALPKTDYIKEMNKDIEGLGELASLRVVDKNGGRPRVTGVYLNAGDAGNNIASVIDFVADAVSISSPDTLERWVYFDSTNRRLVLGGEIQAVSGRLKNVVIEENCVIEGKLSVANIEGYAMEGQSYEFNISNTGGSKTINYGGNAKIPVRLFGQVWARQHKNQKTRVTVNGKTINQMEVSVIVNNNGTVTTRTYTWLYTFVVDLSINQGAEIFVSAGSLDQGNSESSTYRTQFWIAPQSNGFTSN
- Physico-chemical properties
- protein length: 1055 AA; molecular weight: 117830.78600 Da; isoelectric point: 4.97741; aromaticity: 0.09194; hydropathy: -0.37924
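Two of the listed properties can be recomputed from the sequence alone. The sketch below assumes the conventional definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The page does not state which definitions it uses, so this is an assumption; molecular weight and isoelectric point are left out because they depend on parameter sets the page does not name.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KD[aa] for aa in seq) / len(seq)


# Illustrative input only: the first ten residues of the sequence above.
prefix = "MGKGGGGKAR"
print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Passing the full 1055-residue sequence to the same functions gives values that can be compared with the aromaticity and hydropathy listed above.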
Domains
Domains [InterPro]
| Accession | Type | Range |
|---|---|---|
| DC_0323 | STR | 1–1036 |
| IPR053171 | Unmapped | 3–793 |
| IPR055385 | ATT | 87–213 |
| IPR003961 | STR | 619–695 |
| IPR013783 | STR | 620–803 |
| IPR036116 | STR | 632–796 |
| IPR057587 | ATT | 712–802 |
Architecture
STR 1-86 | ATT 87-213 | STR 214-333 | ATT 334-498 | STR 499-711 | ATT 712-802 | STR 803-1036 |
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
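The architecture line above is a pipe-separated list of labelled residue ranges. A minimal sketch of parsing it and checking that consecutive segments tile the protein without gaps or overlaps follows; the function names are illustrative and not part of any database API.

```python
import re

# Architecture string copied from the entry above.
ARCH = ("STR 1-86 | ATT 87-213 | STR 214-333 | ATT 334-498 | "
        "STR 499-711 | ATT 712-802 | STR 803-1036 |")


def parse_architecture(text):
    """Return (label, start, end) tuples from a 'LABEL a-b | ...' string."""
    return [(m[1], int(m[2]), int(m[3]))
            for m in re.finditer(r"([A-Z]+)\s+(\d+)-(\d+)", text)]


def is_contiguous(segments):
    """True if every segment starts one residue after the previous one ends."""
    return all(nxt[1] == prev[2] + 1
               for prev, nxt in zip(segments, segments[1:]))


segments = parse_architecture(ARCH)
print(segments)
print("contiguous:", is_contiguous(segments))
print("residues covered:", sum(end - start + 1 for _, start, end in segments))
```

For this entry the seven segments are contiguous and cover residues 1 to 1036, so the last 19 residues of the 1055-residue protein fall outside the annotated architecture.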
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 502 | 502 | 0.8546 |
| Central domain | 503 | 917 | 416 | 0.0482 |
| C-terminal | 918 | 1055 | 137 | 0.9104 |
Note: Constraints were applied during segmentation.
Sequence started with non-N-terminal domain
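A quick consistency check on the segmentation table is to compare each listed length with the inclusive span `end - start + 1`. The sketch below copies the rows from the table as published. It assumes 1-based inclusive coordinates, which the page does not state explicitly.

```python
# (name, start, end, listed_length) copied from the segmentation table.
SEGMENTS = [
    ("N-terminal", 1, 502, 502),
    ("Central domain", 503, 917, 416),
    ("C-terminal", 918, 1055, 137),
]


def length_mismatches(rows):
    """Return (name, listed, computed) where listed != end - start + 1."""
    return [(name, listed, end - start + 1)
            for name, start, end, listed in rows
            if listed != end - start + 1]


mismatches = length_mismatches(SEGMENTS)
listed_total = sum(row[3] for row in SEGMENTS)

for name, listed, computed in mismatches:
    print(f"{name}: listed {listed}, inclusive span {computed}")
print("sum of listed lengths:", listed_total)
```

Under that assumption, the central and C-terminal rows each differ by one residue from their inclusive span, while the listed lengths still sum to 1055. The page does not say whether the boundaries or the lengths are authoritative, so the sketch only reports the difference.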
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal: 1-502
Central: 503-917
C-terminal: 918-1055
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Serratia phage vB_SmaS_Swain [NCBI] | 2902693 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | | |
Coding sequence (CDS)
Genbank protein accession: UGO51777.1 [NCBI]
Genbank nucleotide accession: OL539438.1 [NCBI]
CDS location: range 21694 -> 24861, strand +
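The CDS coordinates can be cross-checked against the protein length with simple arithmetic. This sketch assumes the range is 1-based and inclusive and that the CDS includes the stop codon, as is usual for GenBank CDS features.

```python
# Values copied from the entry above.
cds_start, cds_end = 21694, 24861
protein_length = 1055

cds_nt = cds_end - cds_start + 1      # nucleotides in the inclusive range
codons, remainder = divmod(cds_nt, 3)  # whole codons and leftover bases

print("CDS length (nt):", cds_nt)
print("codons:", codons, "remainder:", remainder)
print("matches protein + stop:", codons == protein_length + 1 and remainder == 0)
```

The range spans 3168 nt, which is 1056 codons: 1055 amino acids plus one stop codon.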
CDS
ATGGGAAAAGGTGGAGGCGGTAAAGCAAGAACGCCAGTCGAAGACAAAGACACCATCGAGTCTAGTCAGATGATATCTGTGATTGACCTTTGGTCAGAAGGTCAGATTTACGGACTTGTTGACGGACTTAAAAGTGTTCGCCTTGATAATACTGCGGTAGTTGCTGAAGACGGTTCAGTAAATATCCCCGGTGTTGATATTTCATATAACCTTGGGACGGAAGACCAAGATTATTTAGATGGCTTTCCGCAAAACGCTTCGGAAATAAGCGTTGGCGTTGAGGTAAAGCAGTCGGCTCCTGTAATTAGAACCATTACAGACCAGCGCATAGATATGCTTCGCATAAATCTCTATAGCCGTGCTTTGTTCGTAATAACCAATGACGGCGACACGAAGCGCACTGATTTAAAGATGCGAGTAGAAACAAGAAAAGGCTCTGAGCCTTGGGAAGTTAAAGCGCGCATTGATTTCATAGACCAGAAATCAAGAAACGAGTTTTCTTTCAAGGCTGAAATATGGGATTTACCGGCGACGCCATTTGACGTGCGAGTAGTTCGCGAAACGTCAGACGACCAGACCGGCGATTTCCAGCAAATACAAAACTCTTCTTTCTGGCGCTCTTACTCGCAGGTAATTAACCAGAAATATCGCTGGCCTTTAACTGCGTACATGGGTCTTAAATTCGATTCAAAATCATTTGAAGGTGCAATTCCGCGAAGAAATTATATCGTTCGTGGAATGATTGTTAAAGTTCCTACAAACTACGACCCGGAAACCAGGAAATATAACGGATTCTGGAATGGTCAGTTTAAGGCGGCTTACACAAATAACCCTGCCTGGATTGTTTACGATATTATCAAAAACCCGCGTTACGGCCTTGGTCGTCGTGGAATTAACGTTGAAGAAACAGAAATGTACAAGGCCGCGCAATGGTGTGACCAGCTTGTTCCTGATGGACGTGGTGGAATGGAGCCAAGGGTTACATGCAATTGCTATATCACTGAACAACGAAATGCCTGGGATTTAATAACAGACATAATGAGTTGCTTCCGAGCTATGCCGTTATGGAACGGTCAGCAATTTGTACCGTCTCTTGATATTGCGAAAGACGTTGTTGCTACATATAACAATTCAAACGTAATTAACGGCACGTTTGAATATAGCGCATCATCTATGGAAGACCGGCACTCAGTTATAGAGGTTCGATATGCAAATAAGGCCAATAACTATGAACAGGACACTGTTCAAATAACTGACGACCTGATGATAGAGCAGTACGGCTGGAACGTTTTAAAGGTTGAGGCATTCGGTACTGATACTGAATCACAGGCTTACCGCTTCGGCTCTTATTTGCTTGAAACTGAAAGACTGGAAAGGAAAACCGTTTCATTCTCAACCGGCGCTGAGGGTTTAAGAAACCTTCCTGGCGACGTAATAGCCGTTGCTGACTCGCGCCAATACGGGCGAATTATTGGCGGTAGGATTCTGTCTGTATCAGAAGACAGAAAAAGCATCGAACTTGATGATGAGGTTGAAATTCCAAACAACTCAGAAACTCTAATTATAGTTATCGGCGATGACAGGAAACCAGTTGAGCTTTTATGCACTAACAATCCTGGCAAGGCGAAGGTTTTGAATTTTTCCGCCACTTGCCCCGAAAGTTTGGGTCGCTTGTCTCCTTGGTCGTTAAAAATAAATAACAGCGGTCTGAAGCTTTGGCGATGCGTTAGCGTTAAGGAAAACGACGATGGGACTTACGCCATTAACTGCGTTGAGCACGTTCCTGAAAAAAACGAAATTGTAGATAACGGCGTAAAATTTAACCCTCCTGAAGAAACTTTGTATGGTAACAACCTGCCGCCAGTTGAGAATATCAGCGTTGAGGCTGTAGTAGAAAACCCTAACGCAAACGTTCGAGTTTATTGGGATGCACCAAGAACGGCGAGGCAAATTCGCTACAACGTTAGAATCTATCGCTCTGGTAATTTGGTCACAAACCA
AAATATCGATAATCCATCATTTTCTTTCATGGCGGATACGGCGGGAACATACCGCGCGGAAATTCGCTGCCTCGGCTCTGATGGGAAACTAGGTGATAGCGTTGATGTTGTTTTTGTCATTGCTGAACCCTCAATGCCGTCAGATGTTTCATGGCGCGCATCAAACTTCACGGTTACGTTAAGGCCAATTCCTGGAGGACTGGTTACTATAGGTGAGGTTTACGAGTGGTTTATAGGCTCAACCGAGCAAGAAGTTTTGGCTATGAATAACAATCTTGGCGAAGCGTTCGTTTTAAACCAGGTTGGATTAAAGCCAAATACTGAATATTGGTTCGGTGTTCGAGCTGTAAACATGATTGGCCGCTCGGCGATAAAAACGGTATTGACGAAAACGGCGTTCGAAACCGAATCACTTGAAGGTCTTATTGATGTTGCGCTGCCTAAGACTGACTACATCAAGGAAATGAACAAAGACATAGAGGGTCTTGGAGAGCTTGCGTCTCTTAGGGTTGTTGACAAGAACGGCGGCAGGCCTCGCGTAACTGGTGTTTATCTAAACGCTGGTGATGCAGGAAACAACATAGCTTCGGTGATTGATTTTGTAGCTGATGCCGTCTCAATCTCAAGTCCTGACACGCTAGAGCGCTGGGTGTACTTTGATTCAACCAACAGACGTTTGGTGCTTGGTGGTGAGATACAGGCGGTTTCTGGTCGGCTGAAAAACGTTGTCATAGAAGAAAACTGCGTCATAGAAGGAAAGTTGTCAGTGGCGAACATCGAAGGCTATGCGATGGAGGGGCAAAGCTACGAGTTCAACATAAGCAACACTGGAGGGAGTAAGACCATAAACTACGGAGGAAACGCAAAGATACCTGTTAGGCTGTTCGGCCAAGTGTGGGCTAGACAGCACAAAAACCAAAAGACTAGGGTAACGGTAAACGGAAAAACCATAAACCAAATGGAGGTATCAGTTATCGTTAACAACAACGGCACGGTGACGACCAGGACTTACACCTGGCTTTATACGTTTGTTGTTGACCTGAGCATAAACCAGGGGGCTGAAATCTTCGTCAGCGCCGGGTCGCTAGACCAGGGTAACAGCGAATCCTCAACATACAGGACGCAGTTTTGGATTGCCCCCCAATCCAATGGATTCACCTCTAACTAA
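To confirm that the CDS encodes the protein sequence listed earlier, it can be translated codon by codon. The sketch below builds the standard genetic code table from its compact 64-letter form and stops at the first stop codon. It assumes the standard table applies to this phage, and it strips whitespace first so a sequence wrapped over several lines can be pasted in directly.

```python
# Standard genetic code in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS above.
print(translate("ATGGGAAAAGGTGGAGGCGGTAAAGCAAGA"))  # MGKGGGGKAR
```

The ten residues returned match the start of the protein sequence in this entry. Translating the full CDS and comparing it with the full protein sequence completes the check.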
Genome Context
Tertiary structure
PDB ID
d9b6dbde5fbd2453040f6afb25e768989de8b7f6c44d332aeb0eb41725920d8f
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
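The confidence bands above can be applied to per-residue pLDDT scores with a small helper. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong; this sketch assumes they fall into the lower band, and the function name is illustrative.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: boundary values (90, 70, 50) are assigned to the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 82.0, 61.5, 34.8):
    print(value, plddt_band(value))
```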