Protein
View in Explore- Genbank accession
- DBA51858.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
-
TF
- Protein sequence
-
YERGIAPTLLSSPTLSEHDDGTVTGGNTYYYTVSAVNAIGASPLITPMVAGDSTSPANAPTSLNGSPNSLAQPVMTWTPSTDLGGGTLLSIQVQRSTDSGATWNTIATTSNTAPPYTDTTASVGVSYDYQVASINEAGIGAYSTPATIVAGVPPDAPTNLTSVINNPNPSPLTISLDWDAPQYTGTGTLTGYAVYRDGSLITTTGLTSAVDNTVPVSGSYTYEIKSVSTHGTSGFSGSSSITTPTAPDAPSITLSIVNPNPSPLTITAGITAPSNNGGSTITGYNLFHSTDDITYGSVTSPYTVSSAGTHYFQAEAVNNAGTSVRSSSYSITTPSVPTSPTSATSNIADVDNAPFDVTVSWGLPSSSGGSALTGYNVYRQTGTGAFTLVDTTTALATVDQVPTVLNQAFTYKIHAINNVGESTLFTTTTITTGDVPDAPVVTAGTVGTTSFSWTVPSSDATITGYEIYRDTVLLTTVTTTSHTDFTTINFGQSYAYDVKAVSSLGSSVLSNTIISAPETEITGMIAQGITGTGAVIDWNEPAYYQGQITSYNVYYSEITASVSTPTTLAGTTTNTYSNFAPTLDYDTTYIFGVKIISPLGNSGFSNYVTVTTSVDGSIVAFDPTTGGMAWFDIDSVNDQTVNVIEFQRETQTINGTATDTLQVAYPSWWDDMTCDVDYKFAQKTEQYVEGTDMTSQINSADANQQVIGFAFQDIDNEVIEVECAPQQSQQDDGVSAKYVMTQNNLVTGLPNIPLVTQVTNFTSGEYGTDGDFGAIDIVGLFVILVSMVGFNRVSPIVGVLISASLIFALSFFGLIAIPTVIIGVIGLVIFLAWGITRNR
- Physico‐chemical
properties -
protein length: 839 AA molecular weight: 87632,40100 Da isoelectric point: 4,05003 aromaticity: 0,09058 hydropathy: 0,04160
Domains
Domains [InterPro]
IPR050964
Unmapped
22–429
Unmapped
22–429
IPR013783
STR
54–150
STR
54–150
IPR036116
STR
54–148
STR
54–148
IPR013783
STR
55–150
STR
55–150
IPR003961
STR
57–140
STR
57–140
IPR003961
STR
59–155
STR
59–155
IPR003961
STR
529–612
STR
529–612
1
839
Architecture
STR 54-206 | ATT 207-434 | STR 435-615 | RBD 616-839
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Nitrosopumilaceae spindle-shaped virus [NCBI] |
3065433 | Viruses > unclassified viruses > unclassified archaeal viruses > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
DBA51858.1
[NCBI]
Genbank nucleotide accession
BK067785.1
[NCBI]
CDS location
range 31437 -> 33957
strand -
strand -
CDS
TTACGAAAGGGGAATTGCACCAACATTACTATCATCACCTACCCTTTCAGAACATGATGATGGAACGGTTACAGGTGGCAATACATACTATTACACCGTATCGGCAGTTAACGCAATAGGTGCTTCACCATTAATCACACCAATGGTTGCTGGGGATTCAACCTCACCAGCCAACGCACCAACGAGTTTGAATGGCTCACCAAATAGTTTGGCACAGCCAGTAATGACTTGGACACCAAGCACTGATTTGGGTGGTGGTACTTTATTATCCATACAGGTTCAACGTTCTACTGACTCAGGTGCAACATGGAATACCATAGCAACTACCAGTAATACAGCACCACCATACACTGACACAACTGCAAGTGTTGGAGTATCTTATGATTATCAAGTTGCAAGTATCAATGAAGCTGGTATCGGTGCATACTCAACTCCAGCCACCATTGTCGCAGGGGTTCCCCCAGATGCACCAACTAACTTGACCAGTGTGATTAACAATCCTAATCCTAGTCCACTAACCATTTCACTTGATTGGGATGCACCTCAGTACACTGGAACTGGAACCCTAACTGGATATGCTGTTTACCGTGACGGTTCCCTAATAACTACAACTGGATTAACCAGTGCAGTTGATAATACCGTTCCAGTTTCAGGAAGTTATACCTATGAAATTAAAAGTGTGAGCACACACGGAACCAGTGGGTTCAGTGGTTCATCAAGCATAACCACACCAACTGCACCTGATGCACCAAGTATAACACTAAGCATCGTAAACCCAAACCCTAGTCCATTAACAATTACTGCTGGAATTACAGCACCAAGTAACAACGGTGGTTCAACAATCACTGGGTATAATTTATTCCATTCAACAGATGATATCACTTATGGTAGTGTAACAAGTCCGTACACTGTAAGTTCTGCTGGCACTCATTACTTCCAAGCCGAAGCTGTGAATAACGCTGGAACATCAGTACGTTCTTCATCATACAGTATAACAACACCAAGTGTTCCAACATCACCAACAAGTGCAACCAGTAACATAGCAGACGTTGACAACGCACCATTTGATGTTACTGTCTCTTGGGGATTGCCAAGCAGTTCAGGTGGCTCGGCACTAACTGGATATAATGTTTACCGACAAACTGGCACTGGTGCTTTTACATTAGTTGATACAACAACTGCATTAGCAACAGTAGACCAAGTACCAACTGTATTGAACCAAGCATTTACCTACAAGATTCACGCAATAAACAATGTGGGTGAAAGTACATTATTCACAACCACAACAATCACAACTGGTGACGTACCTGATGCACCAGTGGTAACTGCTGGCACTGTGGGAACAACTTCATTCTCTTGGACTGTTCCAAGTTCTGATGCAACCATAACTGGATATGAGATTTATCGTGACACTGTACTCTTAACAACTGTAACCACAACAAGTCACACCGACTTTACAACAATCAACTTTGGTCAAAGCTATGCCTATGATGTTAAAGCAGTCTCATCATTGGGAAGCAGTGTGCTGTCAAACACAATAATCAGTGCACCTGAAACTGAAATCACTGGAATGATTGCACAGGGTATAACTGGAACTGGTGCAGTAATTGATTGGAATGAACCAGCATATTATCAAGGTCAAATTACTTCTTACAATGTGTACTACTCTGAGATAACTGCAAGTGTGTCCACACCAACAACACTCGCTGGTACTACAACCAATACATATTCCAACTTTGCACCAACATTAGATTATGATACTACATACATCTTTGGTGTAAAGATTATTTCGCCATTAGGCAATTCTGGATTCAGCAACTATGTCACTGTCACAACAAGTGTTGATGGAAGTATCGTGGCATTTGACCCTACTACTGGTGGTATGGCATGGTTTGATATTGATTCAGTAAATGACCAAACAGTAAATGTTATAGAATTTCAAAGAGAAACCCAAACCATTAACGGAACTGCAACTGACACATTACAAGTCGCATACCCTTCATGGTGGGATGATATGACTTGTGATGTGGATTACAAGTTCGCACAAAAGACAGAACAGTATGTTGAGGGAACTGACATGACTTCACAAATTAATTCTGCTGATGCAAACCAACAGGTTATAGGATTTGCATTTCAGGATATTGACAACGAAGTAATTGAAGTAGAATGTGCACCTCAACAAAGTCAACAAGATGATGGCGTGTCAGCAAAATATGTTATGACACAAAATAACTTAGTAACTGGACTTCCTAACATTCCTCTAGTAACACAGGTGACAAACTTTACTTCAGGTGAATACGGAACTGACGGTGACTTTGGTGCAATAGATATTGTCGGACTGTTTGTCATACTTGTTTCAATGGTCGGCTTTAACAGAGTGTCACCAATAGTGGGTGTACTAATATCTGCAAGTTTGATATTTGCACTGTCATTCTTCGGATTAATAGCAATTCCAACTGTGATAATAGGTGTAATAGGATTAGTAATATTCTTAGCATGGGGAATTACAAGGAATAGATAA
Genome Context
Genome Context
Tertiary structure
PDB ID
8410694b9e096d14096c021673674f6fd01ebc7770bd4b5b7a23f6cb77bab340
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50