Genbank accession
ARB15326.1 [GenBank]
Protein name
PE-PGRS virulence associated protein
RBP type
TF
Evidence RBPdetect2
Probability 0,86
Protein sequence
MLEGTTYGTYMLAGDGQIDGVAPNNVNVNTGGGGGGGYYGGGGNATNSSRYWGGNGGSGYINPKYTGTITGASSALAANNTDPDYVAGVGVAGVGSTTYNNPVTNGGDGQIVFTYEIPPNLVESLTTAVPVDGTVKTYIVGADGNLVLDLWGGGGGAATMLAAGGSERGGGGGYVGGTIPVTAGQIIRFYNGRGGGGGVYTSGTATALVGTGGPGGWPDGGAGGYYAGAAPSTNGILAGAGGGSSRVYIDDQLILVAGGGGGGGDGTGTRTPGGGGGLTGGNSDIPSGQNFGATQIRGGYNSNRPTDPVTTGGLFRGGAGYVSGGSNSISATSPGGGGGGGLFGGGGSGNSTIYIGGAGGSGFIFDGLTLSKTDPFRAEVIAQMTFETGGVIDDGRQREILPVDTAPTAVTTSPKYGAYCGNYPGSGHTTMTVPAFGLQNFTIEAWFSPNSLSSGVLFAYGNSGVGGFSLHYNVNTLTLRHNGDTATDLTWADTARVANVWAHYAVVRDMAGTRVYKDGRLVMTYVNSIGTSFTATQLTLANYTGAAGNSTRFTGRIDEFRATLGACRYVKPFTPSSFAAVSTSVPTLTTITQAPQGSSGTAANNGSAKYIAGRGMGALTRLTAGTAPSGGDGQISYFIATSTVSAAGPIGTVIVSGLTDAAAGAFYPLPPVGSVVVEPYSGARVNYEVTEAVGARIKVEMWGGGGGGSSANTTLTTNGGGGGGYTVIELDLVQGDRITVQTPSGGAGGVNAGSGSAINLGGYPDGGDGYRPAFTALNCGGGGSARLWAQGNLAAVAGGGGGAAYGGGAYDFPGGAGGGNLGGPGAYDGVNAPFPNGGGTQVAGGAGTANGVNGTSLQGGHGGTVSGIANNGCGGGGGYYGGGGGGAYKSGGGGSGYVNTGLPGYRTGSTTGGSGNLPAGMSSPNYVSGIGVGSNGKGGAFTNGGNGRIVISVITPTPGNASGSIGTVNVSGLNNFGLLIGVPTGPLDTIDVVVPVGVSGQPGFAEGPLTTIGVGPAETIPRAQAIVIVPINDQTSILIEPPINAPLEVPGDGIGELDTILVSPFDSTQTAGVAFDVADLPTIILTAPEGEAVEIPPVLTSGDIGTVVVTTPEATTQIIPPVETSGAIGTITVVTVTGEASWNNNVSASGDIGTITLAAPTADAVGDDLAMGDIGTITVIAPEGVALQDAAVAADIGTISVYPIEGGQPGDAVGDIPYIQVVTPGATVNASSGDDISLYADIGTIYVLQVYGQGFWISEDNYVHALPDPLIVSITAAPQASARGDVHIVQPLPTIVVTAPVPVAAGNALADAYTGDFIILVAAPVPQTELNANVNVAMPPPIVINGNDAEASLDVTVPFSDTAVFITGPEALGLGFHGADLGPPIVVTPPQGGPEISVEIFVDPGTILVEAPRFHYIPPITVLPPEGVALDAKSAEASGDLGTITIGVPTGGYQANVAINLPLPTIFVNVPQVMVFASVAVSGDIGTITLTPPAATLTTGADAAFTLPGPIVVTAPEATATAGTAAATSGALTTITLTPPEGSVSTGAAAATSGAIGTILVSPFDGSVFISYPGNASGAIGTIVVTPPAATVSNGRNPTRYHSRGGARVRVHQETRRRGGSAIQPVPLVS
Physico‐chemical
properties
protein length:1630 AA
molecular weight: 159189,34400 Da
isoelectric point:4,34406
aromaticity:0,06687
hydropathy:0,14006

Domains

Domains [InterPro]
DC_0450
STR
124–531
PF13385
LEC
427–561
IPR013320
STR
431–561
ARB15326.1
1 1630
Architecture
STR
STR
STR 1-562 | STR 565-1605 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Caulobacter phage Ccr34
[NCBI]
1959739 Uroviricota > Caudoviricetes > Jeanschmidtviridae > Shapirovirus cbk >
Host Caulobacter crescentus
[NCBI]
155892 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Alphaproteobacteria > Caulobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ARB15326.1 [NCBI]
Genbank nucleotide accession
KY555147 [NCBI]
CDS location
range 39815 -> 44707
strand +
CDS
GTGCTCGAAGGCACCACCTATGGCACCTACATGCTGGCCGGCGACGGCCAGATCGATGGCGTCGCTCCCAACAACGTCAACGTCAACACAGGCGGCGGCGGCGGCGGCGGCTATTATGGCGGCGGCGGCAACGCCACTAACTCCAGCCGCTATTGGGGCGGCAACGGTGGCTCGGGCTACATCAATCCCAAATATACCGGTACGATCACTGGCGCGTCGAGCGCGCTTGCCGCCAACAACACCGATCCTGACTATGTCGCCGGCGTCGGTGTCGCCGGTGTCGGCTCGACGACCTATAACAACCCTGTCACCAATGGCGGTGACGGTCAGATCGTCTTCACCTACGAAATCCCGCCCAATCTGGTCGAAAGCCTGACCACGGCGGTGCCGGTCGACGGAACGGTCAAGACCTACATCGTCGGCGCTGACGGTAACCTCGTGCTCGACCTGTGGGGCGGCGGCGGCGGCGCGGCGACCATGCTGGCCGCCGGCGGCAGCGAGCGCGGCGGCGGCGGCGGTTATGTGGGGGGCACCATTCCGGTCACGGCCGGACAGATCATCCGGTTCTATAATGGTCGTGGCGGCGGCGGTGGGGTCTATACCAGCGGCACGGCCACGGCCCTGGTAGGCACGGGCGGACCTGGAGGTTGGCCGGATGGTGGGGCCGGGGGCTATTATGCCGGGGCGGCGCCGTCGACCAACGGCATCCTCGCGGGCGCCGGTGGTGGCTCTTCGCGGGTCTATATCGATGACCAACTGATCCTGGTGGCCGGCGGCGGCGGCGGCGGCGGCGATGGCACCGGCACCCGCACGCCCGGCGGGGGTGGCGGCCTGACTGGTGGCAATTCGGATATTCCCTCGGGCCAAAACTTTGGTGCCACGCAGATTCGTGGCGGTTATAACAGCAACCGACCGACCGACCCGGTCACGACCGGTGGTCTCTTCCGGGGCGGCGCGGGCTACGTCTCGGGGGGCTCGAACAGCATCAGCGCCACCTCGCCCGGCGGCGGCGGCGGCGGCGGTCTGTTCGGCGGCGGCGGCTCGGGCAACTCCACGATCTACATCGGCGGCGCTGGTGGCTCGGGCTTCATCTTTGATGGCCTGACGCTCTCGAAGACCGATCCTTTCCGCGCCGAAGTCATTGCCCAGATGACCTTCGAGACGGGTGGCGTCATCGACGACGGCCGTCAGCGCGAAATTCTGCCGGTCGACACCGCGCCGACGGCGGTCACCACCTCGCCCAAGTACGGCGCCTATTGCGGTAACTATCCCGGCAGCGGTCATACCACTATGACTGTGCCGGCCTTCGGCCTGCAAAACTTCACGATCGAGGCGTGGTTCAGCCCCAACTCGCTGTCCAGCGGCGTCCTGTTCGCCTATGGTAACAGCGGCGTCGGTGGCTTCTCCCTGCACTATAACGTCAACACCCTGACCCTGCGCCACAACGGCGACACGGCCACCGACCTGACTTGGGCCGACACCGCTCGCGTCGCCAACGTCTGGGCGCACTACGCCGTCGTCCGCGACATGGCCGGCACCCGCGTCTACAAGGACGGGCGGCTGGTGATGACCTATGTCAACTCGATCGGCACCAGCTTTACCGCCACGCAACTGACCCTGGCCAACTATACGGGGGCAGCGGGTAATAGCACGCGCTTCACCGGCCGGATCGACGAGTTCCGCGCCACCCTGGGCGCATGCCGCTACGTCAAGCCGTTCACGCCGTCGTCGTTTGCGGCGGTGTCGACTTCTGTCCCGACCCTGACGACCATCACCCAAGCCCCGCAAGGCTCGTCGGGCACCGCCGCCAACAACGGCTCGGCGAAATATATTGCCGGACGGGGCATGGGCGCCCTGACGCGTCTCACGGCTGGCACCGCTCCATCGGGCGGCGACGGTCAGATCAGTTATTTCATCGCCACCTCCACGGTCTCGGCGGCGGGGCCGATCGGCACGGTCATCGTCTCGGGTCTGACCGACGCCGCCGCCGGCGCCTTCTATCCCCTCCCGCCGGTGGGTTCGGTCGTCGTCGAGCCTTATAGTGGCGCGCGCGTAAACTACGAGGTCACCGAAGCCGTCGGCGCGCGGATCAAGGTCGAGATGTGGGGCGGCGGCGGCGGCGGCAGTTCGGCCAACACCACCCTGACCACCAACGGCGGTGGCGGCGGCGGCTACACCGTCATCGAACTCGACCTTGTTCAGGGCGATCGGATCACCGTCCAGACGCCGTCGGGCGGCGCGGGCGGCGTCAACGCCGGTAGTGGCTCGGCGATCAACCTCGGTGGTTATCCCGACGGCGGTGACGGTTATCGGCCGGCCTTCACGGCGCTCAACTGCGGTGGCGGCGGCTCGGCGCGTCTGTGGGCGCAAGGCAATCTGGCGGCCGTCGCCGGCGGCGGCGGCGGCGCGGCCTATGGCGGCGGCGCCTATGACTTCCCTGGCGGCGCGGGCGGCGGCAACCTCGGCGGCCCCGGCGCCTATGACGGCGTCAACGCCCCCTTCCCCAATGGCGGCGGCACCCAGGTCGCGGGCGGCGCGGGTACGGCCAACGGCGTCAATGGGACGTCGCTGCAAGGCGGGCACGGCGGCACCGTTTCGGGGATCGCCAACAACGGCTGCGGCGGCGGCGGCGGCTATTATGGCGGCGGCGGCGGCGGCGCCTATAAGTCGGGTGGCGGCGGCTCGGGCTACGTCAACACGGGCCTGCCGGGCTACCGCACGGGCTCCACCACGGGCGGCTCGGGCAACTTGCCGGCCGGCATGTCCTCGCCCAACTACGTTTCGGGCATTGGCGTCGGCTCGAACGGCAAGGGCGGGGCGTTCACCAACGGCGGCAACGGCCGGATCGTCATCTCGGTCATCACCCCGACGCCGGGCAACGCGTCGGGCTCGATCGGCACCGTCAACGTCTCGGGCCTGAACAATTTCGGTCTGTTGATCGGCGTTCCGACCGGGCCGCTCGACACCATCGACGTCGTTGTTCCGGTCGGCGTCTCGGGCCAGCCCGGTTTCGCCGAAGGCCCGCTGACCACCATTGGCGTCGGCCCGGCCGAGACGATTCCCCGGGCCCAGGCGATCGTCATCGTCCCGATCAACGATCAGACCTCGATCCTGATCGAACCGCCGATCAACGCGCCACTGGAGGTCCCTGGCGACGGGATCGGCGAGCTTGACACGATCCTCGTCTCGCCGTTCGATTCGACCCAGACGGCGGGTGTCGCCTTCGATGTGGCGGACCTGCCGACCATCATCCTGACCGCGCCCGAGGGCGAGGCGGTCGAGATTCCCCCGGTCTTGACGTCCGGCGATATCGGCACGGTCGTGGTCACCACGCCCGAGGCGACGACCCAGATCATTCCGCCGGTCGAGACCAGCGGGGCCATCGGCACCATCACCGTGGTGACGGTGACGGGCGAGGCGTCGTGGAATAACAACGTCTCGGCCTCGGGCGATATCGGCACGATCACCCTCGCCGCGCCGACCGCCGACGCCGTGGGCGACGATCTAGCCATGGGCGATATCGGCACGATCACGGTGATCGCGCCCGAGGGCGTGGCGCTCCAGGACGCCGCCGTCGCGGCCGACATCGGCACCATCTCGGTCTATCCGATCGAGGGTGGTCAACCCGGCGACGCGGTCGGCGACATCCCCTATATTCAGGTGGTCACGCCGGGCGCGACGGTCAACGCCTCGTCGGGCGACGACATCTCGCTCTACGCCGATATCGGCACGATCTATGTCCTTCAGGTCTACGGCCAGGGCTTCTGGATTTCCGAGGACAACTACGTCCACGCCCTGCCGGACCCGCTGATCGTCAGCATCACCGCCGCGCCGCAAGCCTCGGCGCGCGGCGACGTCCATATCGTCCAGCCGCTGCCGACCATCGTCGTCACCGCGCCGGTCCCGGTCGCGGCCGGTAATGCCCTGGCCGACGCCTATACGGGCGATTTCATCATCCTGGTCGCCGCGCCGGTTCCGCAAACGGAACTGAACGCGAACGTCAACGTGGCGATGCCGCCGCCGATCGTCATCAACGGCAATGACGCGGAAGCCTCGCTTGACGTCACCGTCCCGTTCAGCGACACGGCGGTGTTCATCACCGGCCCCGAGGCTCTGGGTCTGGGCTTCCACGGGGCGGACCTTGGTCCGCCGATCGTGGTCACGCCGCCGCAGGGCGGTCCGGAGATTTCGGTCGAAATCTTTGTCGATCCCGGCACGATCCTGGTGGAGGCCCCGCGCTTCCACTATATCCCGCCGATCACGGTTCTGCCGCCCGAAGGCGTGGCGCTCGACGCCAAGTCGGCCGAGGCCTCCGGTGATCTCGGCACCATCACCATCGGGGTCCCGACCGGCGGCTACCAAGCCAACGTCGCCATCAACCTGCCGCTGCCGACGATCTTCGTCAACGTCCCGCAGGTCATGGTCTTCGCCTCGGTCGCCGTCTCGGGCGACATCGGGACGATCACCCTCACCCCGCCGGCCGCCACCCTGACCACCGGCGCGGACGCGGCCTTCACCCTGCCCGGCCCGATCGTCGTCACCGCGCCCGAGGCGACGGCGACGGCGGGCACGGCGGCGGCGACCTCCGGCGCCCTGACCACGATCACCCTGACCCCGCCTGAGGGTTCGGTCTCGACCGGCGCGGCGGCGGCGACCTCGGGCGCGATCGGCACGATCCTCGTCTCGCCGTTCGACGGCAGCGTCTTCATCTCCTATCCGGGCAATGCGTCGGGCGCGATCGGCACGATCGTCGTCACGCCGCCGGCCGCGACCGTCTCCAACGGCCGCAACCCCACGCGCTATCACAGCAGAGGGGGAGCCCGTGTCCGCGTTCACCAAGAAACTCGCCGCCGTGGCGGAAGCGCAATTCAACCAGTTCCATTGGTATCATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
cab5ec95c9320bd5d06009e748063abcbc351ca7972be6b237b8606b06af9f3b
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,5025
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50