UniProt accession
A0A0G2SSY7 [UniProt]
Protein name
Long tail fiber proximal connector
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1.00
Protein sequence
MSNIMVEFGQDYVGAQTFSENNALTYKLTIRASNKNTQGESCVLINDKKISDQCQKGFNVWVIDNKGNLFNKMIFDFTEESGRNAFIDFMKSQDKNIICISSSDELFSSEELTKYMKSIGSASWNDFLIKKIKTVSYACVYRTDIKQIVLESIQYSDGIKEDNVLELETIFDSEYSLGITGFPGSIVYDHKEYTSVENDYKKWPTSLLNNKLSDYGLKPGDWVSLSASIFGDKELKDEGGWTRIDCRWVLGNTWKQSFYLESTLKGNYPVQSMVNPDIWESKTVYSQIPEGVDGFVIIASRYNSELGHSAVKNVAFGKSAEPIIEESDRQIGINGIRNSFIKEEEQKVGSLLSLLNLKDKSDTISSINFKEI
Physico-chemical properties
protein length: 372 AA
molecular weight: 42120.88630 Da
isoelectric point: 4.91625
aromaticity: 0.11022
hydropathy: -0.39973
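The record does not say how the aromaticity and hydropathy values were computed. A minimal sketch, assuming the conventional definitions (aromaticity as the fraction of Phe, Trp and Tyr residues; hydropathy as the mean Kyte-Doolittle score, often called GRAVY), might look like the following. The function names `aromaticity` and `gravy` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale. Assumption: the record does not name
# the scale it used; this is the commonly used one.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the protein sequence, used only to keep the
# example short; pass the full sequence to compare with the record.
fragment = "MSNIMVEFGQ"
print(len(fragment), aromaticity(fragment), round(gravy(fragment), 5))
```

Running both functions on the full 372-residue sequence should give values comparable to those listed above if the record uses the same definitions.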

Domains

Domains [InterPro]
DC_0912    STR  1–372
PS52031    LEC  22–186
IPR039477  STR  57–126
Architecture (A0A0G2SSY7, residues 1–372)
STR 1–372
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

Phage
Name: Proteus phage vB_PmiM_Pm5461 [NCBI]
Taxonomy ID: 1636250
Lineage: Uroviricota > Caudoviricetes > Pantevenvirales > Bragavirus > Bragavirus pm5461
Host
Name: Proteus mirabilis [NCBI]
Taxonomy ID: 584
Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Genbank protein accession
AKA62100.1 [NCBI]
Genbank nucleotide accession
KP890823 [NCBI]
CDS location
range 149159 -> 150277
strand +
CDS
ATGAGTAATATAATGGTTGAATTCGGCCAAGATTATGTGGGTGCTCAAACGTTCTCAGAGAACAATGCTCTCACATATAAGTTAACTATAAGGGCCAGTAATAAAAACACACAGGGTGAATCCTGTGTGTTAATTAATGATAAAAAAATATCAGATCAATGTCAAAAAGGATTTAATGTTTGGGTTATAGATAATAAAGGTAATTTATTCAACAAAATGATTTTTGATTTTACAGAAGAATCTGGAAGAAATGCCTTTATTGATTTCATGAAATCCCAGGATAAAAATATAATTTGTATTAGTTCTTCTGATGAATTATTTTCTTCGGAAGAACTCACAAAATATATGAAAAGTATAGGGTCTGCTTCATGGAATGATTTTTTAATTAAAAAGATTAAAACCGTTTCTTATGCATGTGTATATAGAACAGATATAAAGCAAATAGTACTGGAATCTATTCAGTATTCAGATGGTATAAAAGAAGATAATGTTTTAGAATTAGAAACTATTTTTGATTCTGAGTATTCTTTAGGGATAACTGGATTTCCGGGTTCTATAGTCTATGATCATAAAGAATATACATCAGTTGAAAATGACTACAAAAAATGGCCCACTAGTTTGTTGAATAATAAATTATCTGATTATGGTCTTAAGCCAGGTGATTGGGTTTCTTTAAGTGCATCTATATTTGGCGATAAAGAACTTAAAGACGAAGGTGGTTGGACAAGGATAGATTGTAGATGGGTTCTTGGCAATACATGGAAACAATCATTTTATCTTGAGAGCACACTAAAAGGAAATTACCCAGTCCAATCTATGGTTAATCCAGATATATGGGAATCTAAAACTGTTTATTCTCAGATTCCAGAAGGTGTAGATGGATTTGTCATTATAGCTTCTAGATATAATTCTGAGCTAGGACATTCTGCTGTAAAAAACGTGGCATTTGGTAAATCAGCAGAACCTATAATAGAAGAATCTGATCGACAAATAGGGATAAATGGTATCAGAAATTCATTTATAAAAGAGGAAGAACAAAAAGTAGGTTCATTATTAAGTTTATTAAATCTTAAGGACAAATCAGATACAATTTCTTCTATAAACTTTAAAGAAATCTAA
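The CDS coordinates and the protein length can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the GenBank convention) and a single terminal stop codon, which matches the `TAA` at the end of the CDS above. The helper names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a CDS given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(nt_len: int, has_stop: bool = True) -> int:
    """Number of encoded residues, not counting the stop codon."""
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_len // 3 - (1 if has_stop else 0)


n = cds_length(149159, 150277)
print(n, protein_length(n))  # prints: 1119 372
```

The result, 1119 nt and 372 residues, agrees with the protein length reported in this record.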

Genome Context


Tertiary structure

PDB ID
4cb7708b774f8fb4aa956a8394937ba21edc50dff3ad1ac697b90de8f1f42073
Source ESMFold
Method ESMFold
Resolution 0.6665
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
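The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch only: the function name `plddt_band` is hypothetical, and the legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning them to the lower band here is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:  # assumption: exactly 90 falls in "High"
        return "High"
    if score > 50:  # assumption: exactly 70 falls in "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 falls in "Very low"


for s in (95.2, 81.0, 63.5, 42.0):
    print(s, plddt_band(s))
```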