Protein
View in Explore- Genbank accession
- UIS24890.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
-
TFTSPTF
- Protein sequence
-
MRLLEGQTVISGRKGGGGSPHTPVEQPDDLLSVAKLKMVIALSEGEIQGDLTAQQIFLNNTPLADDAGNYNFSNVKWEYRKGTQDQTYIQGMPEIDNEISANIEVKAASPWVRQFSNLTIDAVRIKLSLPIQYQYKDNGDMVGTVTQYAIDLSTDGSAYQTVVDGKFDGKTTSDYQRDHRIDLPRATSGWSIRVRRITPDSTSSKLINAFKVFSFAEVIDSKMRYPNTALLYIELDSSQFNGSVPKTTCKPKGKLIRVPDNYNPVTRTYSGTWTGNFKLAYSNNPAWIFYDLVLDEIYGMGGRVDASMIDKWQLYSIAAYCDEMVSNGAGGKEPRFTCNVFIQNQQDAYTVLRDLAAVFRGITFWGNDQIYVNADVPQSDVDYVYHVSNVVDGVFTYAGGSYKNRFSSCQVSWSDPLNHYSDTIEGVYDSELVERYRVNQMQLTAIGCTSQSEAHRRGRWAILSNAKDGSISFNVGLDGYIPLPAEIIGVADPFRAGRENGGRISQVGGRNITVDRPANYAVGDRLVVNLPDGTAQSRTISAISADKKTLTVSTAYRQTPVPGAVWCIDSDKLAIQYFRVTSISANDDGTFTIAGVQHDPNKYRYIDDGVRIDPPPISVTPPNVMPAPKGIVITEVDHQAQGLTVASMQVTWERVEGAIDYLAQWRKDKGDWVNIGRTSAQGFTVQGIYAGVYDVRVRAVNAVDVSSPWGYADSTTLSGKVGKPGTPVNLMASDNVVWAIDITWGFPSGAGDTAYTEIEQATTADGQNPLLLANVPYPGVSYQHGPMPAGVRRWYRARLVDRIGNKGDWTPFVAGMSNVDADDLIGSVVDDYLNSEDGKALLEPLKTSPEAILQSVLAEYGTANQQWANYGENRAGIIQAQKVAADAQSSVAQLETDVTAKFNDQEAAIQEKMTAYADASGPSAIWTLKTGVKYNGTNYDAGLAVAVTVNGTQVDTRVAVNANQFVVISGSSGSYYSPFIIKDGQVLISQGFIGKGWIENAMIGDYIQSNDYVAGRQGWRLDKSGIFENNGTGSGGRMIQRNNSIRLYDGNGVLRVAIGEY
- Physico‐chemical
properties -
protein length: 1061 AA molecular weight: 115941,07040 Da isoelectric point: 5,06904 aromaticity: 0,09614 hydropathy: -0,31433
Domains
Domains [InterPro]
IPR053171
Unmapped
3–847
Unmapped
3–847
DC_0014
STR
3–1061
STR
3–1061
IPR055385
ATT
96–220
ATT
96–220
IPR013783
STR
626–718
STR
626–718
IPR003961
STR
626–708
STR
626–708
1
1061
Architecture
STR 3-95 | ATT 96-220 | STR 221-343 | ATT 344-507 | STR 508-1061
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1061
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 508 | 508 | 0,9223 |
| Central domain | 509 | 1006 | 499 | 0,0503 |
| C-terminal | 1007 | 1061 | 54 | 0,6556 |
Note: Constraints were applied during segmentation.
Fixed 28 C-terminal predictions appearing before Central domain
Fixed 28 C-terminal predictions appearing before Central domain
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-508
1-508
Central
509-1006
509-1006
C-terminal
1007-1061
1007-1061
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Aeromonas phage pAEv1818 [NCBI] |
2908746 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host |
Aeromonas sp. [NCBI] |
647 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Aeromonadales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
UIS24890.1
[NCBI]
Genbank nucleotide accession
OL964755.1
[NCBI]
CDS location
range 2800 -> 5985
strand +
strand +
CDS
ATGAGACTACTTGAAGGGCAAACGGTCATCAGTGGCCGCAAAGGTGGCGGTGGAAGCCCACATACTCCGGTAGAGCAGCCTGACGATCTGTTGTCGGTTGCCAAACTGAAAATGGTGATTGCGTTATCGGAAGGAGAAATCCAGGGCGATCTCACGGCGCAGCAAATTTTCCTGAACAATACCCCGCTGGCGGATGATGCCGGCAATTACAACTTTTCAAACGTTAAGTGGGAGTATCGCAAAGGGACGCAGGACCAGACCTATATCCAGGGCATGCCGGAGATCGACAACGAGATTTCAGCGAATATTGAAGTTAAAGCCGCGTCACCGTGGGTGCGGCAGTTCTCCAATCTGACGATCGATGCGGTGCGCATCAAGCTCAGCCTGCCTATCCAGTACCAGTACAAAGACAACGGCGACATGGTCGGGACGGTCACGCAATATGCTATCGATCTCTCAACAGACGGCAGCGCCTACCAAACCGTCGTTGATGGGAAATTTGATGGCAAGACCACCTCAGACTATCAGCGTGATCACCGCATCGACCTACCGCGGGCAACGTCTGGTTGGTCTATCCGGGTGCGCCGCATTACGCCTGATTCGACGTCCAGCAAACTGATCAATGCCTTCAAAGTGTTCTCTTTTGCTGAGGTTATCGACAGTAAGATGCGTTACCCCAATACGGCGCTGCTGTATATCGAGCTCGATTCCAGCCAGTTTAACGGCAGCGTGCCGAAAACGACCTGCAAGCCGAAAGGCAAGTTGATCCGCGTGCCGGACAACTACAACCCGGTCACGAGAACCTATAGCGGGACGTGGACCGGTAATTTTAAGCTGGCCTACAGCAATAACCCGGCCTGGATTTTTTACGATCTAGTGCTGGATGAAATTTACGGCATGGGCGGGCGCGTTGATGCCAGCATGATCGATAAATGGCAGCTATACAGCATCGCCGCATACTGCGATGAAATGGTGTCTAACGGCGCCGGCGGGAAAGAGCCGCGCTTTACCTGCAATGTGTTCATTCAGAACCAGCAGGATGCCTATACGGTACTCCGAGACCTGGCCGCCGTGTTTCGCGGCATCACCTTCTGGGGTAACGATCAGATCTACGTGAACGCGGACGTGCCGCAAAGCGACGTCGATTACGTGTATCACGTTTCAAACGTGGTTGACGGCGTTTTCACCTATGCCGGTGGTTCGTATAAAAACCGGTTCAGTTCATGCCAGGTGTCTTGGTCAGATCCGCTTAACCACTACTCGGACACCATCGAAGGCGTCTATGATTCCGAGCTGGTCGAGCGCTACCGCGTGAATCAGATGCAGCTGACGGCAATCGGCTGTACCTCGCAGAGCGAAGCGCACCGCCGGGGCCGGTGGGCTATTTTGTCCAATGCCAAAGACGGATCGATTTCCTTCAACGTGGGGCTGGATGGTTACATCCCGCTGCCCGCGGAAATCATCGGTGTGGCTGATCCTTTCCGTGCGGGCCGGGAGAATGGTGGCCGTATCAGCCAGGTTGGCGGCCGGAATATTACCGTTGACCGTCCGGCAAACTATGCTGTTGGCGACCGCCTGGTGGTGAACCTGCCAGATGGCACTGCGCAGAGCCGCACAATCAGCGCTATTAGCGCCGACAAAAAGACGCTGACCGTTTCAACGGCTTATCGGCAAACACCTGTGCCGGGGGCGGTGTGGTGCATCGACAGTGACAAGCTGGCGATCCAGTATTTTCGCGTCACGTCTATCTCAGCAAACGATGACGGCACATTCACAATCGCCGGGGTGCAGCATGACCCCAACAAGTACCGCTATATCGATGACGGCGTGCGTATAGATCCGCCGCCGATCTCAGTCACGCCGCCAAACGTCATGCCGGCGCCGAAGGGCATCGTTATCACCGAAGTTGACCACCAGGCGCAGGGGCTGACGGTAGCATCCATGCAGGTGACCTGGGAGCGCGTGGAGGGCGCCATCGACTATCTGGCGCAGTGGCGCAAGGATAAGGGGGACTGGGTAAACATCGGGCGCACCAGTGCGCAGGGGTTTACCGTTCAGGGGATCTACGCCGGGGTTTACGATGTCCGCGTGCGCGCCGTGAATGCGGTCGACGTGTCATCGCCCTGGGGTTATGCCGACTCGACCACGCTCAGTGGTAAGGTGGGTAAACCCGGCACGCCGGTGAACCTCATGGCCAGTGATAATGTCGTATGGGCCATCGATATCACCTGGGGCTTTCCCTCCGGCGCCGGCGATACAGCCTACACCGAGATCGAGCAGGCAACTACCGCCGACGGCCAGAACCCTCTGTTGCTGGCGAACGTTCCCTATCCTGGGGTGAGCTATCAGCACGGCCCTATGCCGGCGGGTGTGCGCCGTTGGTACCGCGCGCGGCTGGTTGACCGGATCGGGAATAAGGGAGATTGGACGCCGTTTGTTGCGGGAATGTCGAATGTAGATGCCGACGACCTGATCGGCTCGGTGGTTGATGATTACCTCAATTCCGAGGACGGCAAGGCGCTGCTGGAGCCGCTGAAAACCAGCCCAGAGGCTATTTTGCAAAGCGTGCTGGCGGAATACGGCACCGCTAACCAACAATGGGCCAACTATGGCGAAAACCGGGCCGGGATAATTCAGGCACAGAAGGTGGCCGCTGATGCTCAAAGCTCAGTTGCGCAGCTGGAAACCGACGTCACCGCCAAATTCAACGATCAGGAAGCCGCCATACAGGAGAAAATGACGGCCTACGCAGACGCTTCCGGCCCTTCGGCGATCTGGACGTTGAAAACCGGCGTTAAGTACAACGGAACGAATTACGACGCCGGCCTGGCGGTGGCCGTGACCGTCAACGGCACACAGGTGGATACGCGCGTCGCCGTCAACGCTAACCAGTTCGTGGTTATCAGCGGTAGCAGCGGAAGCTATTACTCGCCGTTCATTATCAAGGATGGACAGGTGCTTATTAGTCAGGGCTTTATCGGCAAGGGTTGGATCGAAAACGCGATGATTGGCGATTATATCCAGTCGAATGATTATGTTGCTGGAAGGCAAGGATGGAGGCTTGATAAGAGTGGCATCTTTGAAAATAACGGGACGGGTAGCGGCGGAAGGATGATCCAAAGAAATAATTCAATCAGGCTATATGATGGCAATGGAGTTCTGCGTGTAGCTATAGGCGAATATTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
f629583b433a92b769dbc3deb76ea6eda2897da6931b114d88f99379d5df0c30
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50