Ubiquitin-like prokaryotic MoaD as a fusion tag for expression of heterologous proteins in Escherichia coli
© Yuan et al.; licensee BioMed Central Ltd. 2014
Received: 11 October 2013
Accepted: 17 January 2014
Published: 21 January 2014
Eukaryotic ubiquitin and SUMO are frequently used as tags to enhance the fusion protein expression in microbial host. They increase the solubility and stability, and protect the peptides from proteolytic degradation due to their stable and highly conserved structures. Few of prokaryotic ubiquitin-like proteins was used as fusion tags except ThiS, which enhances the fusion expression, however, reduces the solubility and stability of the expressed peptides in E. coli. Hence, we investigated if MoaD, a conserved small sulfur carrier in prokaryotes with the similar structure of ubiquitin, could also be used as fusion tag in heterologous expression in E. coli.
Fusion of MoaD to either end of EGFP enhanced the expression yield of EGFP with a similar efficacy of ThiS. However, the major parts of the fusion proteins were expressed in the aggregated form, which was associated with the retarded folding of EGFP, similar to ThiS fusions. Fusion of MoaD to insulin chain A or B did not boost their expression as efficiently as ThiS tag did, probably due to a less efficient aggregation of products. Interestingly, fusion of MoaD to the murine ribonuclease inhibitor enhanced protein expression by completely protecting the protein from intracellular degradation in contrast to ThiS fusion, which enhanced degradation of this unstable protein when expressed in E. coli.
Prokaryotic ubiquitin-like protein MoaD can act as a fusion tag to promote the fusion expression with varying mechanisms, which enriches the arsenal of fusion tags in the category of insoluble expression.
KeywordsMoaD Prokaryote Ubiquitin Expression Degradation Protein folding
Fusion expression is a common strategy in the production of recombinant proteins. Various fusion tags are used to enhance the total and soluble expression yield in E. coli. Fusion tags show differing efficiency in enhancing the expression, solubility and stability of recombinant proteins [1–5]. The stable or conserved structure of the fusion tag is speculated as a determinant of its fusion properties .
Ubiquitin (Ub) and SUMO are stable, highly conserved small proteins expressed in all eukaryotic cells. They are frequently used as tags to enhance the fusion expression by increasing the solubility and stability of the expressed peptides and protecting the peptides from proteolytic degradation in prokaryotic host [6–8]. MoaD and ThiS, components of prokaryotic sulfur transfer systems, are also highly conserved small proteins found in prokaryotic cells. They display a high degree of structural similarity although sharing limited sequence similarities to Ub, and interact with correlating enzymes in similar ways as Ub [9–12]. They are Ub-like proteins (Ubl) and have been suggested as prokaryotic antecedents of Ub. Prokaryotic ThiS was also tried as a fusion tag in expression of heterologous proteins in E. coli. It was able to induce aggregation of fusion proteins due to a slowdown of refolding, and enhance the expression of some targets more significantly than its eukaryotic counterpart Ub. But it promoted the degradation of unstable target fusion.
In this report, we observed the effect of fusion of MoaD, a small prokaryotic ubl containing 81 amino acid residues, on the expression of several targets in E. coli. MoaD showed the similar properties to prokaryotic ubl ThiS in enhancing the recombinant expression and promoting the aggregation of fusion proteins through slowdown of target folding. Contrary to ThiS fusion, MoaD fusion conferred a complete protection of the murine ribonuclease inhibitor (mRI) from intracellular degradation.
MoaD fusion enhances the expression of EGFP
The fluorescence of cells expressing EGFP with or without fusions was measured after IPTG induction. The intensity of fluorescence increased steadily at 37°C. The fluorescence was normalized to the measured OD600, since a slight but significant difference in OD600 was noticed among the cells, although they grew at the similar rate (Figure 1B). The normalized intensity of fluorescence thus was in direct proportion to the emitted fluorescence of single cell in average. The fluorescence of the cells bearing EGFP alone reached to a plateau after 1 h induction at 37°C (Figure 1B). N-terminal MoaD fusion gave a continued increase in normalized fluorescence which was much higher than EGFP alone after 2 h induction at 37°C. N-terminal ThiS fusion emitted much lower fluorescence. The cells bearing C-terminal fusion of MoaD had significantly lower fluorescence than cells bearing EGFP alone. C-terminal fusion of ThiS also caused significantly lower fluorescence than EGFP alone, consisting to the previous report .
Since the accumulation of non-native EGFPs in inclusion bodies could have reduced the measured fluorescence, we did similar experiments with lower concentration of inducer and under lower cultured temperature in expecting to improve the soluble expression and reduce the inclusion body formation. Indeed, higher fluorescence was reached for all the recombinants (Figure 1C). Nevertheless, much less fluorescence had been observed in both N-terminal MoaD fusion and N-terminal ThiS fusion than EGFP alone. Interestingly, even less fluorescence was observed in the C-terminal MoaD fusion under the same conditions. Whereas the C-terminal ThiS fusion produced an identical fluorescence as did EGFP alone at room temperature. The discrepancy in growing fluorescence may reflect the difference in relative amount of soluble active proteins and insoluble fluorescent folding intermediates, as previously identified .
MoaD fusion promotes the aggregation of EGFP
These results suggest that fusion of EGFP with MoaD at either N- or C-terminus enhances the expression of the fusion protein which presents mostly as aggregated inclusion bodies in a similar way as fusions with ThiS.
MoaD fusion retards the refolding of EGFP
MoaD is inefficient in enhancing the expression of insulin A and B chains
MoaD fusion protects murine Ribonuclease Inhibitor from degradation
It seemed that breakdown occurred more frequently at C-terminus of mRI [13, 14]. A mutated mRI, with a merely single mutation which changed Glu340 to a stop coden at C-terminal part in mRI, was fused to MoaD or ThiS. The prematurely stopped and thus slightly shortened products of both fusions were efficiently overexpressed in full length in TG1 and BL21 (DE3) pLysS. Only faintly stained smaller fragments were observed for ThiS fusion (Figure 6C), which was different from that of native mRI in Figure 6A and B. It suggested that target itself determined its degradability in vivo when fused to ThiS. The MoaD fusion product remained intact without any degradation. This proved again that MoaD fusion protected the target from degradation.
A variety of fusion tags are used to increase expression yields and change solubility and native folding . However, it is still not clear how fusion tags enhance protein expression. Ub was reported to exert chaperoning effects on fusion proteins, thus increase expression of proteins in E. coli and yeast [7, 16]. Indeed, Ub has a highly stable structure and is the fastest folding protein known . Thus, it may serve to stabilize and promote proper folding of the fusion target. SUMO, structurally similar to Ub, also promotes folding and structural stability of fusion proteins, and leads to their enhanced expression [18–20].
Prokaryotic MoaD and ThiS share the structure of Ub-like domain with their eukaryotic counterparts Ub and SUMO. Our current results indicate that MoaD confers the enhanced expression of EGFP, either in N-terminal fusion or in C-terminal fusion. In contrast to eukaryotic counterparts Ub and SUMO, MoaD fusion at both N- and C-terminus reduces the soluble protein expression rather than enhances the solubility. This is the same case for ThiS fusion. Instead of promoting its proper folding in refolding experiments, MoaD hinders the native EGFP folding, in a similar way to ThiS. This was not expected because they are stable small proteins with a similar conserved structure as Ub or SUMO. The slowdown of folding, rather than the fast expression of fusion protein, should be a primary factor to drive the expressed fusion protein to inclusion bodies. This was proved by the fact that the most of EGFP with MoaD fusion at C-terminus are aggregated in inclusion bodies as detected by confocal microscopy. Actually, the fast expression of fusion protein should not occur under the leaky expression condition.
Enhancing the production of inclusion bodies is one of the fusion strategies although fewer tags are reported for this purpose . Inclusion bodies protect the products from proteolyses, and usually lead to a higher expression yield. In this study, MoaD did not drive the satisfied overexpression of insulin A and B chains in comparison to the ThiS fusions because of a decreased ability of MoaD in driving the protein aggregation of insulin chains.
Different fusion systems have given variable results of expression [1, 3, 4], and no single fusion tag is ideal for every protein target. On the expression of unstable mRI, MoaD fusion affords complete protection of the product from intracellular degradation. Ub- and SUMO-fusion of mRI, although leading expression products to inclusion bodies, show a moderate intracellular degradation of products . This difference may be attributed to the rapid aggregation of MoaD fusion product to inclusion bodies that afford protection from proteolytic degradation. On the other hand, ThiS fusion, expected to enhance the rapid aggregation due to the sluggish of folding in a similar way as MoaD fusion, led to the significant degradation of mRI. An active ThiS-directed degradation was anticipated for this unusual observation .
This work shows that MoaD, as a fusion tag, is able to promote expression of some target proteins as expected from its structural similarity to Ub and SUMO. While unexpectedly, MoaD enhances the aggregation of fusion proteins due to a slowdown of refolding, the same as the typical prokaryotic ubl ThiS. Furthermore, MoaD fusion provides an advantage over ThiS in protecting the unstable target from degradation. Hence, in addition to ThiS, MoaD enriches the arsenal of fusion tags in the category of insoluble expression. Expression in inclusion bodies may be required specifically in cases where protein production is toxic to the host cell. It is feasible for the practical use of MoaD as a fusion tag in the expression of some target proteins.
Biochemicals were purchased from Sigma (St. Louis, MO). Ni-IDA agarose affinity resin was from Vigorous Biotechnology (Beijing, China). Oligonucleotides were from Invitrogen (Shanghai, China). All restriction enzymes and T4 DNA ligase were from TaKaRa (Dalian, China). Pfu DNA Polymerase and LA Taq DNA Polymerase were from Vigorous Biotechnology (Beijing, China).
E. coli TG1 cells were used for cloning, maintenance and propagation of plasmids. TG1 and BL21 (DE3) pLysS cells were used as host for protein expression studies. E. coli cells were cultivated in Luria broth under appropriate selective conditions.
Construction of expression vectors
Primers used in this study
Primer sequence/featured site*
PCR product: coding protein
1: ThiS up
ATAagatctATGCAGATCCTGTTTAACGATC /Bgl II
primers 1 + 2: ThiS.
2: ThiS down
ATAgaattcAACCCCCTGCAATAACC /EcoR I
3: ThiS down
ATAggatccCCCTGCAATAACCTGAAAAAG /BamH I
primers 1 + 3: ThiS for fusion to N-terminus of targets
4: MoaD up
ATAagatctATTAAAGTTCTTTTTTTCGCCCAG / Bgl II
primers 4 + 5: MoaD for fusion to N-terminus of targets
5: MoaD down
ATAggatccTCCGGTTACCGGCGGG / BamH I
6: MoaD down
ATActcgagTTAACCTCCGGTTACCGGCGGG /Xho I
primers 4 + 6: MoaD for fusion to C-terminus of targets
7: A up
ATAagatctATGGGCATTGTGGAACAGTGCTGCAC /Bgl II
primers 7 + 8: insulin chain A
8: A down
ATActcgagTTAGTTGCAATAGTTTTCCAGCTG /Xho I
primers 4 + 8: MoaD fusion to A
9: B up
ATAagatctATGTTTGTGAACCAGCATCTGTG /Bgl II
primers 9 + 10: insulin chain B
10: B down
ATActcgagTTAGGTTTTCGGGGTATAAAAAAAG /Xho I
primers 4 + 10: MoaD fusion to B
11: EGFP up
ATAggatccATGGTGAGCAAGGGCGAGGAGCTG /BamH I
primers 11 + 12: EGFP
primers 11 + 6: MoaD fusion to C-terminus of EGFP
TATGGCTGATTATGATCAGT /universal vector primer
primers 11 + 12: EGFP for fusion at C-turminus
13: EGFP down
ATActcgagTCACTTGTACAGCTCGTCCATG /Xho I
primers 11 + 13: EGFP for fusion at N-turminus
primers 1 + 13: ThiS fusion to N-terminal EGFP
primers 4 + 13: MoaD fusion to N-terminal EGFP
14: RNH up
ATAagatctATGAGTCTTGACATCCAGTGTGAGC /Bgl II
primers 14 + 15: mRI
15: RNH down
ATAgtcgacTCAGGAAATGATCCTCAGGGAAGG /Sal I
primers 1 + 15: ThiS fusion to ΔmRI
primers 7 + 15: MoaD fusion to mRI or ΔmRI
MoaD and ThiS genes were amplified from genomic DNA of E. coli strain TG1. Insulin genes for chain A and B were synthesized as described Yuan et al.. Gene fusions were made by restricted fragment ligation. A cDNA of mRNH coding mRI (with 456 amino acid residues)  and its PCR amplified spontaneous mutant (coding truncated product ΔmRI due to Glu340 was mutated to a stop code) were used for gene fusions.
Strains and plasmids used in this study
Strain or plasmid
Source or reference
E. coli strain
B strain with Lon protease deficiency, contains pLysS plasmid expressing T7 lysozyme
EGFP with his-tag at N-terminus, with extra 22 vector sequences.
Previous study 
ThiS fused to N-terminus of EGFP with his-tag at N-terminus.
ThiS fused to C-terminus of EGFP with his-tag at N-terminus.
Previous study 
MoaD fused to N-terminus of EGFP with his-tag at N-terminus.
MoaD fused to C-terminus of EGFP with his-tag at N-terminus.
ThiS fused Insulin A chain with his-tag at N-terminus.
Previous study 
ThiS fused Insulin B with his-tag at N-terminus.
Previous study 
MoaD fused Insulin A chain with his-tag at N-terminus.
MoaD fused Insulin B with his-tag at N-terminus.
ThiS fused mRI with his-tag at N-terminus.
Previous study 
MoaD fused mRI with his-tag at N-terminus.
ThiS fused ΔmRI with his-tag at N-terminus.
MoaD fused ΔmRI with his-tag at N-terminus.
Expression and purification of recombinant proteins
The culture of E. coli was grown overnight and subcultured at 1:100 into Luria broth at 37°C. When the cell growth of the culture reached a mid-log phase, protein expression was induced by adding Isopropyl β-D-1-thiogalactopyranoside (IPTG) to a final concentration of 1 mM, with a further 4 h growth at 37°C, or otherwise indicated. Cells were harvested, resuspended in PBS containing 1% Triton X-100, subjected to cycles of freezing and thawing, and then disrupted using sonication. The soluble protein fraction was separated from insoluble one by centrifugation at 4°C (15 min at 14,000 g). Soluble fraction of His-tagged recombinant proteins were purified by nickel-affinity chromatography under native conditions based on the supplier’s instructions.
Electrophoresis and Western blot
The samples of whole cells or the purified protein fractions were mixed with Laemmli buffer, either heated in boiling water bath for 5 min or not heated, and analyzed by SDS-PAGE, as described by Laemmli , using a 5% stacking gel and a 10% to 15% separating gel run in a Mini-Protean II electrophoresis system (BioRad, Hercules, CA, USA). The gels were stained with Coomassie Blue, or electroblotted onto nitrocellulose or PVDF membranes. His-tagged fusions were detected by immunoblot using anti-His antibody and goat anti-mouse HRP labelled antibody (CoWin Biotech, Beijing, China). Chemiluminescence was recorded using the reagents according to supplier’s protocol (CoWin Biotech, Beijing, China).
Fluorescence determination of EGFP
The fluorescence of purified soluble EGFPs was measured with excitation wavelength at 488 nm and emission wavelength at 509 nm using EnSpire Multimode Reader (Perkin-Elmer, Waltham, MA, USA). For E. coli expressing recombinant EGFP proteins, cultured media containing live whole cells was aliquoted and the fluorescence was measured promptly, the same way as purified proteins. The bacteria concentration of the same sample was also measured as absorbance at 600 nm.
E. coli in LB-medium that expressing recombinant EGFP proteins were dropped onto a slide and sealed with a coverslip. Images were recorded with a confocal laser scanning microscope (Leica TCS SP2, Leica Laser-technik, Heidelberg, Germany) either in phase contrast mode or fluorescence mode (wavelengths at 488 nm for excitation, and 500–560 nm for detection).
Denaturation and refolding of EGFP
Purified ThiS- or MoaD-tagged EGFP and EGFP without fusion were denatured in PBS containing 8 M urea and 5 mM DTT for 5 min in a boiling water bath. Urea-denatured samples were renatured at room temperature by 10-fold dilution into PBS with 5 mM DTT. Fluorescence recovery was monitored with an interval of 5 s for 50 min. Data were fitted with Sigma Plot (Systat Software, San Jose, CA, USA) and kinetics for fast and slow refolding phases obtained as described . Final refolding was measured at 15 h. The percentage of refolding was calculated on the basis of the final constant amount of fluorescence, corresponding to the amount of fluorescence before denaturation.
The results were derived from three to four independent experiments. The Student’s t-test for paired samples was used to calculate the p values. The statistical analyses were performed using SPSS 13.0 (IBM SPSS, Armonk, NY, USA), and p values less than 0.05 were considered statistically significant.
We are grateful to Professor Zhuo-wei Hu of this Institute for critical reading and amendment of the manuscript. This work was supported by grants from China National Science & Technology Major Project “New Drug Innovation and Production Program” (General platform construction, No: 2012ZX09301002-001-002 and 2012ZX09301002-002-006).
- Hammarström M, Hellgren N, van Den Berg S, Berglund H, Härd T: Rapid screening for improved solubility of small human proteins produced as fusion proteins in Escherichia coli. Protein Sci. 2002, 11: 313-321.View Article
- Braun P, Hu Y, Shen B, Halleck A, Koundinya M, Harlow E, LaBaer J: Proteome-scale purification of human proteins from bacteria. Proc Natl Acad Sci USA. 2002, 99: 2654-2659. 10.1073/pnas.042684199.View Article
- Shih YP, Kung WM, Chen JC, Yeh CH, Wang AH, Wang TF: High-throughput screening of soluble recombinant proteins. Protein Sci. 2002, 11: 1714-1719.View Article
- Dyson MR, Shadbolt SP, Vincent KJ, Perera RL, McCafferty J: Production of soluble mammalian proteins in Escherichia coli: identification of protein features that correlate with successful expression. BMC Biotechnol. 2004, 4: 32-10.1186/1472-6750-4-32.View Article
- De Marco V, Stier G, Blandin S, de Marco A: The solubility and stability of recombinant proteins are increased by their fusion to NusA. Biochem Biophys Res Commun. 2004, 322: 766-771. 10.1016/j.bbrc.2004.07.189.View Article
- Butt TR, Edavettal SC, Hall JP, Mattern MR: SUMO fusion technology for difficult-to-express proteins. Protein Express Purif. 2005, 43: 1-9. 10.1016/j.pep.2005.03.016.View Article
- Butt TR, Jonnalagadda S, Monia BP, Sternberg EJ, Marsh JA, Stadel JM, Ecker DJ, Crooke ST: Ubiquitin fusion augments the yield of cloned gene products in Escherichia coli. Proc Natl Acad Sci USA. 1989, 86: 2540-2544. 10.1073/pnas.86.8.2540.View Article
- Baker RT, Smith SA, Marano R, McKee J, Board PG: Protein expression using cotranslational fusion and cleavage of ubiquitin. Mutagenesis of the glutathione-binding site of human Pi class glutathione S-transferase. J Biol Chem. 1994, 269: 25381-25386.
- Rudolph MJ, Wuebbens MM, Rajagopalan KV, Schindelin H: Crystal structure of molybdopterin synthase and its evolutionary relationship to ubiquitin activation. Nat Struct Biol. 2001, 8: 42-46. 10.1038/83034.View Article
- Duda DM, Walden H, Sfondouris J, Schulman BA: Structural analysis of Escherichia coli ThiF. J Mol Biol. 2005, 349: 774-786. 10.1016/j.jmb.2005.04.011.View Article
- Lake MW, Wuebbens MM, Rajagopalan KV, Schindelin H: Mechanism of ubiquitin activation revealed by the structure of a bacterial MoeB-MoaD complex. Nature. 2001, 414: 325-329. 10.1038/35104586.View Article
- Lehmann C, Begley TP, Ealick SE: Structure of the Escherichia coli ThiS-ThiF complex, a key component of the sulfur transfer system in thiamin biosynthesis. Biochemistry. 2006, 45: 11-19. 10.1021/bi051502y.View Article
- Yuan S, Xu J, Ge Y, Yan Z, Du G, Wang N: Prokaryotic ubiquitin-like ThiS fusion enhances the heterologous protein overexpression and aggregation in Escherichia coli. PLoS One. 2013, 8: e62529-10.1371/journal.pone.0062529.View Article
- Ge Y, Sun J, Yan Z, Wang N: Efficient soluble expression and oxidative stability of recombinant mouse ribonuclease inhibitor. China Biotechnol. 2010, 30: 17-23.
- Young CL, Britton ZT, Robinson AS: Recombinant protein expression and purification: a comprehensive review of affinity tags and microbial applications. Biotechnol J. 2012, 7: 620-634. 10.1002/biot.201100155.View Article
- Ecker DJ, Stadel JM, Butt TR, Marsh JA, Monia BP, Powers DA, Gorman JA, Clark PE, Warren F, Shatzman A, Crooke ST: Increasing gene expression in yeast by fusion to ubiquitin. J Biol Chem. 1989, 264: 7715-7719.
- Jackson SE: Ubiquitin: a small protein folding paradigm. Org Biomol Chem. 2006, 4: 1845-1853. 10.1039/b600829c.View Article
- Marblestone JG, Edavettal SC, Lim Y, Lim P, Zuo X, Butt TR: Comparison of SUMO fusion technology with traditional gene fusion systems: enhanced expression and solubility with SUMO. Protein Sci. 2006, 15: 182-189. 10.1110/ps.051812706.View Article
- Wang Z, Li H, Guan W, Ling H, Wang Z, Mu T, Shuler FD, Fang X: Human SUMO fusion systems enhance protein expression and solubility. Protein Expr Purif. 2010, 73: 203-208. 10.1016/j.pep.2010.05.001.View Article
- Panavas T, Sanders C, Butt TR: SUMO fusion technology for enhanced protein production in prokaryotic and eukaryotic expression systems. Methods Mol Biol. 2009, 497: 303-317. 10.1007/978-1-59745-566-4_20.View Article
- Sambrook J, Russell D: Molecular Cloning: A Laboratory Manual. 2001, New York: Cold Spring Harbor Laboratory Press, 3
- Laemmli UK: Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature. 1970, 227: 680-685. 10.1038/227680a0.View Article
- Steiner T, Hess P, Bae JH, Wiltschi B, Moroder L, Budisa N: Synthetic biology of proteins: tuning GFPs folding and stability with fluoroproline. PLoS One. 2008, 3: e1680-10.1371/journal.pone.0001680.View Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.