UniProt accession
A0A1D8KHB8 [UniProt]
Protein name
Short tail fiber
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence GenBank
Probability 1,00
Protein sequence
MFYQERHEAKGAVIGTIMAWTGGLSSIPHGWVICDGGTLPADDFPLLAATIGDSYNMGTSSNFNGTFPSYTGLITLPDLNGRMLMDIENDYFPLTGRAADSDTDARSIMSSIVGSKKQNTQGMALTGSYTDITTDIIFQISPSDRTGYQGKITGNTILAGEGTKTVYVAPRKLGRKHITRHNHPGNVSTIRNDDPRYPGDGVVPYFPISYTLYVSAVDIDSGGDVGDVGLVMLVMVTLYSLVGLIIISKADILVILQKEVKLELL
Physico‐chemical
properties
protein length:265 AA
molecular weight: 28533,16610 Da
isoelectric point:5,13634
aromaticity:0,07925
hydropathy:0,00717

Domains

Domains [InterPro]
IPR037053
ATT
4–67
SSF88874
STR
12–85
IPR011083
ATT
15–72
A0A1D8KHB8
1 265
Architecture
ATT
STR
ATT 4-72 | STR 73-237 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage S-CAM1
[NCBI]
754037 Uroviricota > Caudoviricetes > Pantevenvirales > Anaposvirus > Anaposvirus socalone
Host Synechococcus sp.
[NCBI]
1131 cellular organisms > Bacteria > Bacillati > Cyanobacteriota/Melainabacteria group > Cyanobacteriota > Cyanophyceae
Host Synechococcus sp. WH 7803
[NCBI]
32051 Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AOV58026.1 [NCBI]
Genbank nucleotide accession
KU686195 [NCBI]
CDS location
range 30938 -> 31735
strand +
CDS
ATGTTTTATCAGGAAAGACATGAAGCAAAAGGTGCCGTTATCGGCACCATTATGGCGTGGACAGGGGGATTAAGTTCTATCCCCCATGGTTGGGTCATTTGTGATGGGGGAACATTACCTGCGGATGATTTCCCTCTGTTGGCTGCTACTATTGGTGACTCATATAACATGGGAACTAGTAGTAATTTCAACGGAACATTCCCATCATATACTGGACTGATTACTCTACCAGATCTAAATGGTAGAATGCTGATGGATATTGAGAATGACTATTTTCCTCTTACTGGAAGAGCGGCGGATAGTGATACTGACGCTAGATCTATTATGAGCTCTATCGTTGGTAGTAAGAAACAGAATACTCAAGGAATGGCACTTACTGGTAGTTATACTGACATCACGACAGACATTATTTTCCAAATTAGTCCAAGCGACAGAACTGGTTATCAAGGAAAAATTACTGGTAATACTATTCTTGCTGGTGAAGGAACAAAGACAGTATATGTTGCTCCTAGAAAACTAGGTAGAAAGCATATTACTAGACATAATCATCCTGGAAACGTCTCGACTATTAGGAATGACGACCCGAGATATCCTGGTGATGGTGTTGTTCCTTACTTTCCAATATCATATACATTATATGTGTCAGCGGTTGACATCGACAGTGGTGGTGATGTTGGTGATGTTGGGTTGGTGATGTTGGTGATGGTGACCTTATATTCTTTGGTTGGACTGATAATAATATCCAAGGCAGACATACTGGTGATCCTACAGAAAGAAGTGAAGTTGGAACTCCTGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
2a19e3d286e994b431a742b006aebc8724b9fa54b6f7f138eebec81466c4833f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,4356
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50