Protein

Genbank accession
AFU62486.1 [GenBank]
Protein name
bacterial surface protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,90
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
Protein sequence
MALYPIKSLGAVGVIADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFVSMPFDYYSAGNSFLVVGTDKKLYKLTDESLTDISRKVATVTKKASASIKIYPVVSQIVPKESTISMNFNQTKNLEVSLLPADANNTDLVWEVSNSSYGSITVDPSDSKLATLTSFEREGNLVVTISTADDSVVAQIAVNIIDGDSGIFLSQDTVTIRKGGTTTLTAVTGKTPVTWSSSNASIVSVTPNANSLTAVITANGEGNVTITADNGTKTASCEVVSIPQIDSITLSQSDVIVSRGSQYILTATLSPANAPNQNITWTSSNPNIATVSGTSTQGTVNALLAGFTEITATTEEGNRVAVCTVRVDLAGRAMRTSAMAFAAPASEPIESQEEEVVTPPESEEMVYFAEPTSGIDTSGMYEGNSFYDYSNVNDIEGFARASLFATPLSSVTLDVVSASLDVGEEIVITATASPEGDYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPWYHAVISNCAVATTHYETPQVKEFESEYFVDLPGWGEQTVVDNDGNPSVKKFNWKCERVRSFNNRLFALNMREANASGVTTNYPLRLRWSNFANENKAPTLWDDFAYDRVVSSDLASNIVGQTQALENGYAGYIDLADSNGSLIDILPLKDYLFVYTEFETYIGSPTNNTYQPLMFKKLFNDSGILAPECVVEVEGSHFVVTQNDVILHNGATKKSIASNRVKNMLINEVCLVNPLATRVHLHQDKKEVWVLYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERGPIWSDFQEITWDDPSIKELVWRKDATNFRQRVTIVGSFLRGFYQVDVGALDYFYDRLNDVVIEKPLEMRLERTGLDFDNVTNEWNQKHINRFRPQTTGSGTYTFEAGGSQFSNEYGHPHTSKTYTIGVDRHVSVRLNHPYLFYNVIDNDVNSNAAINGLTIEFAVGGRR
Physico‐chemical
properties
protein length:1005 AA
molecular weight: 110214,64500 Da
isoelectric point:4,70697
aromaticity:0,09652
hydropathy:-0,20308

Domains

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage NJ01
[NCBI]
1237159 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AFU62486.1 [NCBI]
Genbank nucleotide accession
JX867715 [NCBI]
CDS location
range 22732 -> 25749
strand +
CDS
ATGGCGCTGTATCCAATAAAGTCACTAGGTGCTGTAGGCGTTATCGCTGATCAGGCTCCGACAGACTTAGCTCCTAATGCTTTCACTAATGCTATAAATGCTCGATTTGTTGAGCAGAGAGTATTTAAGACGGGGGGCAATGCCCCTCTTTCTTATGTAGATGAAGATAAGGATTTAACCCCTCTGTCTTTCGTTTCTATGCCTTTCGATTATTATAGCGCAGGTAATAGCTTTCTTGTTGTAGGTACGGATAAGAAGTTATATAAACTGACAGATGAAAGCTTGACTGACATTAGCCGTAAGGTTGCTACAGTCACTAAAAAGGCTTCTGCCTCAATTAAGATTTATCCAGTTGTTTCTCAGATTGTTCCAAAAGAATCAACTATTTCAATGAACTTCAATCAGACTAAGAATCTAGAAGTTTCTCTTCTTCCTGCTGATGCTAACAATACCGACCTTGTTTGGGAGGTTAGTAATTCATCTTATGGAAGTATTACAGTAGACCCTAGTGATTCTAAACTTGCTACTCTAACATCTTTTGAGAGAGAGGGTAATCTTGTAGTAACTATCTCTACTGCTGATGATTCTGTAGTGGCTCAGATTGCTGTTAACATTATAGATGGTGATTCGGGAATCTTCTTGAGTCAAGACACTGTTACTATCCGTAAAGGAGGGACTACCACTCTTACAGCTGTTACTGGTAAAACTCCTGTAACTTGGTCCAGCAGCAATGCTTCTATTGTATCTGTAACCCCTAATGCTAATTCCCTAACTGCTGTTATTACTGCTAATGGTGAGGGCAATGTAACAATCACTGCTGATAACGGAACGAAAACTGCTTCTTGTGAGGTTGTTTCCATACCTCAGATTGACAGTATCACCTTAAGTCAGTCAGATGTGATAGTTAGTAGAGGTTCTCAATACATTTTAACTGCTACCCTTTCTCCTGCTAACGCCCCTAATCAAAACATTACTTGGACTTCTTCTAATCCAAATATTGCAACAGTATCAGGGACCAGTACACAAGGGACGGTCAACGCCCTACTCGCTGGCTTTACTGAGATTACGGCTACCACTGAAGAAGGTAACAGAGTTGCTGTCTGTACTGTACGAGTAGACTTAGCTGGAAGGGCGATGAGAACAAGTGCTATGGCATTTGCTGCACCTGCATCAGAACCAATTGAATCACAAGAGGAAGAAGTAGTAACTCCTCCTGAAAGTGAAGAGATGGTTTATTTTGCTGAGCCTACGTCTGGTATTGATACGTCAGGGATGTACGAAGGTAACAGCTTCTATGACTATTCTAACGTAAATGATATTGAAGGTTTTGCAAGAGCTTCTTTGTTCGCAACTCCTTTGTCATCCGTAACCTTAGATGTTGTCAGCGCTTCTCTCGATGTTGGTGAGGAAATAGTTATCACAGCTACAGCTTCCCCAGAAGGTGATTATTCTTATCAGTGGTCTGTTGATAAGACTGGTTATGTTTCTACAACTTCAGTTACTGGTAAATCTATCAAACTGGTTGCTCTTCGTAAAGGAGAGATTAATGTAACATGTACTGTCAGCCAAATGACTCAGAAGGATTACGACGCTTTTGATGACTACCCTTGGTATCACGCTGTTATCTCTAACTGTGCAGTTGCTACAACTCACTATGAAACTCCTCAGGTAAAAGAATTCGAATCAGAATACTTTGTAGACCTTCCGGGATGGGGTGAGCAAACAGTTGTTGATAATGATGGAAACCCTTCAGTCAAGAAGTTTAACTGGAAATGTGAAAGAGTAAGATCTTTTAACAACCGTCTTTTTGCTCTGAATATGAGAGAGGCTAATGCTTCTGGTGTTACCACTAACTACCCACTTCGTCTTCGTTGGTCTAACTTTGCCAATGAGAACAAAGCTCCTACACTGTGGGATGACTTTGCCTACGATCGGGTTGTGTCTTCGGACTTGGCTTCTAACATCGTAGGACAGACTCAGGCTTTAGAAAATGGATATGCTGGTTACATTGACCTAGCGGACTCTAACGGCAGCTTAATTGATATCTTACCTCTTAAAGATTACTTGTTCGTTTATACAGAGTTTGAAACGTATATTGGTTCTCCGACTAATAACACATACCAACCTCTGATGTTCAAGAAACTCTTTAATGACTCTGGTATCCTTGCTCCTGAGTGTGTGGTTGAAGTAGAAGGTAGCCACTTCGTAGTTACTCAGAACGATGTAATCCTACACAATGGTGCAACCAAGAAGTCAATTGCATCTAACCGTGTTAAGAACATGCTGATTAATGAGGTGTGTCTGGTTAACCCTCTAGCAACTCGAGTACACTTGCATCAGGATAAGAAAGAAGTTTGGGTTCTCTATGTAGGTCCGGGAGAGCCGAAAGAAAGCTTCGCTTGCACTAAAGCTGCTGTATGGAACTACGAGTTTGATACTTGGTCTTTCCGTACTATCCCTTATGCTCAGTGTATCGGTCTGGTTGATCCTCCTGTTCTAGAGAGAGGTCCAATCTGGTCTGACTTCCAAGAGATCACTTGGGATGACCCATCTATCAAAGAACTCGTTTGGAGAAAGGATGCAACAAACTTTAGACAGAGGGTTACGATAGTAGGTTCGTTCTTGAGGGGATTCTACCAAGTGGATGTAGGTGCTCTTGATTATTTCTATGACAGGTTAAATGATGTTGTTATAGAGAAGCCTCTGGAGATGAGGTTGGAGAGAACTGGTCTAGATTTTGATAATGTGACAAATGAGTGGAATCAGAAGCATATTAACAGATTCCGTCCTCAGACAACCGGCTCAGGCACTTATACCTTTGAAGCTGGAGGTAGTCAATTCTCAAATGAATATGGACACCCACACACATCTAAGACTTATACAATCGGTGTTGACAGGCATGTCTCGGTGAGACTGAACCATCCATACCTTTTCTATAATGTTATAGATAATGATGTTAACAGTAATGCTGCTATCAATGGGTTAACGATAGAGTTTGCTGTTGGCGGACGGAGATAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
cf51c3e40605c1bc47b75378ea821ce45314747dec5d14f2cdce88d2fe89260f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7730
Evidence 0,7730

Literature

No literature entries available.