Protein
- Genbank accession
- AGY47351.1 [GenBank]
- Protein name
- minor structural protein
- RBP type
-
TSP
- Protein sequence
-
MALSGTISTSVRTHWKLSVSWSATQSISNNTSTITAKMYWEAVDGYGAIYSDVSKSGSIYIDGTWYNFSGAGLARLSPNQKKLIATKSKTVKHNADGTKSFSLGGWFDPDVDLGGHQGKISLSNKTFTLNTIPRKSTMSSGGDFTAGSDRTISISRASSSFSHKLYIDIKDSGGNWVNIKSINFSKSETSKSTSFSTDEKKTIFRALNERTSAQIRYNLHTLSGSNDIGYNTYYGTANRPKLSVVNKLNGQAGSSNSVYIDQSLTIDLTRYDSEFDHKVQVICGSFTKEFNGVGYTQSWTPTASEQSTLYGILNNVISKSATVRVYTYYNGVRVGSTDYGMTYYVRSSNNKPTFTDAGIFYTDTNPTTLNITADDQYIIQGVSTLRVEIPVESKAVAVNGATMKSYSITVNGVTKNVNYSSTGTVSADFGTINSASNATVSIKAIDSRGLSTAVTKVVKIVPYAYPSVTTTAKRTNGYERTTTLTLRGGLSPIAVNGSNRNALESARYRYKLTDATTYGSWNNFTVSGFPSYSATNVSLDLDENETWDVQVEVSDSLGVTTKTISVSAGRPIMFLDAQRKAVGIGDFPSNEYELKINGRIVFGATMWASNGGGEGFGAIDLNNSDISNANGIYFADVAQNMNGEGLMFFKSGSPYGSSDPTHYDNLMVRDGVLYLNAAGSEMKLGNHLQMQNYDIRNVNHITINDPGGTEGIEWLGGSGWKIVEAPNDLSNVGGPLQFATGNSRKITFSTSGNIYLAGGTFKGESGGYTYFASAAQACLTSDAPGGASIRVHVDGAGGRVWSNDIYNRTYDKASNVYITTEGTLGRSTSASKYKVFIKKVDTERLPSKILELNPKSWYNKTAVELYADQLGTPEDDLESGEEVEEDIPFIERSYGLIAEDLVAAGLDMFVSWGKADENGQREVEGIEYDRLWVLLIPLVREQKQQIEELQERIAQLELSN
- Physico‐chemical
properties -
protein length: 960 AA molecular weight: 104314,37280 Da isoelectric point: 5,78066 aromaticity: 0,09896 hydropathy: -0,37406
Domains
Domains [InterPro]
Legend:
Pfam
SMART
CDD
TIGRFAM
HAMAP
SUPFAM
PRINTS
Gene3D
PANTHER
Other
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Bacillus phage Grass [NCBI] |
1406785 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host |
Bacillus subtilis [NCBI] |
1423 | Bacteria > Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
AGY47351.1
[NCBI]
Genbank nucleotide accession
KF669652
[NCBI]
CDS location
range 59621 -> 62503
strand +
strand +
CDS
ATGGCGTTATCAGGAACTATATCTACGTCCGTTCGTACTCACTGGAAGCTATCTGTAAGCTGGTCAGCTACCCAGAGTATATCGAACAACACGAGTACGATAACAGCAAAAATGTACTGGGAAGCTGTAGATGGTTATGGGGCAATTTACTCTGACGTATCAAAAAGTGGCTCTATTTACATTGACGGTACTTGGTACAATTTCAGCGGGGCAGGTCTAGCGAGACTTAGTCCAAACCAAAAGAAGTTAATTGCAACAAAATCTAAAACAGTAAAGCACAACGCCGATGGTACAAAGTCATTTAGTCTAGGTGGTTGGTTTGATCCTGATGTAGACCTTGGAGGGCACCAAGGGAAGATCAGCCTTTCAAACAAGACGTTTACACTAAATACAATCCCACGTAAATCTACAATGTCCTCTGGAGGAGACTTTACAGCAGGAAGCGACCGGACAATCTCAATCTCCCGGGCATCATCTTCGTTCTCCCATAAGCTGTATATCGACATTAAAGACTCTGGAGGAAACTGGGTTAATATTAAGTCCATAAACTTCTCTAAATCGGAAACATCTAAGTCTACATCTTTCTCGACTGATGAAAAGAAGACGATCTTCCGAGCTCTAAATGAGAGAACATCGGCGCAGATAAGGTATAATCTCCATACCTTAAGCGGCAGTAACGATATTGGATATAATACCTACTATGGGACTGCAAATAGACCTAAGCTGAGTGTTGTCAACAAGCTTAATGGGCAGGCTGGCAGCTCTAATAGTGTATATATCGACCAGAGTTTAACAATTGACTTGACTCGATATGACAGCGAGTTCGATCATAAGGTGCAGGTCATCTGCGGATCGTTTACCAAAGAGTTTAATGGTGTAGGATACACACAGAGCTGGACTCCTACAGCGAGCGAACAATCTACACTATATGGTATACTAAATAATGTAATCAGCAAATCAGCAACCGTGAGAGTGTATACATACTATAATGGGGTTAGAGTAGGGTCAACAGACTACGGGATGACGTATTATGTTCGGTCTAGTAACAATAAACCTACGTTTACAGATGCAGGTATATTCTATACTGACACAAACCCGACTACATTAAATATTACAGCAGACGATCAATACATAATTCAAGGAGTATCTACTTTGAGAGTAGAAATACCTGTCGAATCTAAGGCAGTAGCTGTTAACGGTGCCACAATGAAATCATATTCTATCACAGTTAACGGTGTAACTAAAAACGTAAACTATTCTTCGACAGGAACTGTATCAGCAGACTTTGGTACAATCAACTCGGCTTCTAATGCAACAGTGAGTATCAAGGCTATTGACAGTCGAGGGTTGAGTACAGCAGTGACGAAGGTAGTGAAGATTGTTCCTTATGCGTATCCTTCTGTTACTACTACAGCTAAACGTACAAACGGGTATGAAAGGACCACAACTTTGACCCTCCGAGGGGGCTTATCTCCTATCGCAGTCAACGGTTCTAACAGAAACGCCCTAGAGTCCGCAAGGTACCGTTACAAGTTGACAGATGCTACCACTTACGGAAGCTGGAATAACTTTACAGTATCAGGATTTCCGTCATACTCTGCAACAAATGTATCTCTAGACTTAGATGAAAATGAAACATGGGATGTGCAGGTAGAAGTATCCGACTCTTTAGGGGTCACAACGAAAACAATATCTGTATCCGCAGGTAGACCAATTATGTTCCTAGATGCCCAGCGAAAGGCAGTAGGTATCGGAGACTTCCCTTCCAATGAGTATGAACTGAAGATCAACGGTCGGATCGTATTTGGAGCGACTATGTGGGCTTCTAACGGAGGCGGAGAGGGCTTCGGAGCTATCGACCTTAATAACTCCGACATCTCTAACGCTAACGGCATTTATTTTGCAGACGTAGCTCAAAACATGAATGGTGAAGGCTTAATGTTCTTCAAGTCAGGTTCTCCCTACGGTTCTAGTGATCCTACTCATTATGACAACCTGATGGTTCGAGATGGTGTTTTGTATCTAAACGCCGCAGGGTCAGAGATGAAGTTAGGTAACCACCTACAAATGCAAAACTATGACATTCGAAATGTTAACCATATCACTATCAATGACCCCGGAGGTACCGAAGGTATTGAGTGGTTAGGTGGCAGTGGATGGAAGATTGTAGAGGCACCAAATGACTTGTCCAATGTTGGAGGACCCCTACAGTTTGCTACAGGTAACTCTCGAAAAATCACCTTCTCTACATCAGGTAATATCTATCTTGCTGGAGGTACCTTTAAAGGGGAGTCAGGAGGATACACGTACTTTGCAAGTGCTGCACAAGCCTGCCTCACCAGTGATGCTCCCGGAGGTGCTTCAATACGAGTACATGTAGACGGTGCAGGAGGTCGTGTATGGTCAAATGACATTTATAACCGAACCTACGATAAAGCGTCTAATGTTTATATCACAACAGAGGGTACTCTTGGAAGGTCTACCTCTGCTAGTAAGTACAAAGTTTTTATAAAGAAGGTAGACACTGAGCGCTTACCTAGTAAGATTCTTGAGCTTAATCCTAAGTCTTGGTATAACAAGACCGCAGTAGAGCTGTACGCTGATCAACTGGGAACACCGGAAGATGATCTAGAATCAGGAGAGGAAGTAGAGGAAGATATTCCGTTCATTGAAAGAAGTTACGGTTTAATTGCGGAGGACCTTGTAGCCGCAGGATTAGATATGTTCGTTAGTTGGGGTAAAGCAGACGAAAACGGGCAAAGAGAGGTTGAAGGGATTGAGTACGATAGACTATGGGTGCTCCTAATCCCCTTAGTTAGAGAACAGAAACAACAGATTGAAGAACTACAAGAAAGGATAGCTCAGTTAGAGCTATCCAATTAG
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
ff8340d71579ba9e2772d19397297e49dd498dac07ff56fb1b289280cf1e563c
Literature
No literature entries available.