Skip to main content
  • Methodology article
  • Open access
  • Published:

A new method to customize protein expression vectors for fast, efficient and background free parallel cloning

Abstract

Background

Expression and purification of correctly folded proteins typically require screening of different parameters such as protein variants, solubility enhancing tags or expression hosts. Parallel vector series that cover all variations are available, but not without compromise. We have established a fast, efficient and absolutely background free cloning approach that can be applied to any selected vector.

Results

Here we describe a method to tailor selected expression vectors for parallel Sequence and Ligation Independent Cloning. SLIC cloning enables precise and sequence independent engineering and is based on joining vector and insert with 15–25 bp homologies on both DNA ends by homologous recombination. We modified expression vectors based on pET, pFastBac and pTT backbones for parallel PCR-based cloning and screening in E.coli, insect cells and HEK293E cells, respectively. We introduced the toxic ccdB gene under control of a strong constitutive promoter for counterselection of insert less vector. In contrast to DpnI treatment commonly used to reduce vector background, ccdB used in our vector series is 100% efficient in killing parental vector carrying cells and reduces vector background to zero. In addition, the 3’ end of ccdB functions as a primer binding site common to all vectors. The second shared primer binding site is provided by a HRV 3C protease cleavage site located downstream of purification and solubility enhancing tags for tag removal. We have so far generated more than 30 different parallel expression vectors, and successfully cloned and expressed more than 250 genes with this vector series. There is no size restriction for gene insertion, clone efficiency is > 95% with clone numbers up to 200. The procedure is simple, fast, efficient and cost-effective. All expression vectors showed efficient expression of eGFP and different target proteins requested to be produced and purified at our Core Facility services.

Conclusion

This new expression vector series allows efficient and cost-effective parallel cloning and thus screening of different protein constructs, tags and expression hosts.

Background

As central Core Facility Labs at the Max-Planck Institute of Biochemistry and the EMBL we provide in-house services for recombinant protein production. The proteins we are asked to produce are from various sources and protein families and are used for crystallization, immunization, biochemical, biophysical or biological studies. We perfom in-depth protein analysis to ensure that delivered proteins are properly folded. However, in many cases, this is a fairly challenging task to achieve, despite the many options that can improve proper protein folding such as construct design, solubility enhancing fusion tags, expression conditions, expression hosts and improved protein purification protocols. The constant challenge is to identify a successful combination of parameters with a minimum of resources. Apart from unbiased HTP approaches [1] or targeted selection [2, 3], it still remains a time consuming trial and error process in most non-automated protein labs. We had initially focused our screening efforts on constructs and solubility tags in E. coli as a first choice. Eukaryotic hosts were used if suggested by the literature or previous experience, or in case E. coli screening had failed. As this happened frequently we decided to implement parallel testing of constructs, solubility tags and expression hosts altogether. In order to handle all these different expression constructs, an efficient parallel cloning method was required. In the past few years, a number of powerful combinatorial cloning methods have been introduced. Gateway technology (Life Technologies) opened the area of combinatorial cloning more than a decade ago. More recent additions to the list are type II restriction enzymes (Stargate, IBA, Germany), Golden Gate Shuffling [4], RF cloning [5] and Sequence and Ligation Independent Cloning SLIC [6]. For recombinant protein expression, several commercial (Novagen, IBA, Life Technologies) and non-commercial [7] parallel vector series are available. However, we had already established different expression systems with their respective vectors: The E.coli pETM vector series have different tags followed by a protease cleavage site and are based on the same pET backbone [8]. Differences in expression levels are exclusively based on different tags, not on backbones differences such as copy number, spacer sequences or others. Expression uses the powerful T7 promoter system at low vector copy number and plasmids confer Kanamycin resistence instead of Ampicillin, which is crucial for plasmid stability [9], particularly in high cell density fermentations. Transient transfection of mammalian cells is another efficient and fast method for protein production. The HEK293 (human embryonic kidney) cell line is widely used due to high transfection efficiency and suspension growth in serum-free media. A variety of HEK293 cells are currently available with significant differences in productivity [10]. The Epstein-Barr virus nuclear antigen 1 (EBNA1) in HEK293E cell line interacts with oriP on pTT vectors which increases plasmid persistence and protein expression levels [11]. Therefore we preferred to customize pETM, pFastBac and pTT vectors for parallel cloning rather than adapt to a new system. The method must be sequence and restriction-site independent to allow for incorporation of any DNA fragment into any of the vectors. It must be directional and precise and most important, it must be fast, simple, efficient and cost-effective. SLIC cloning enables sequence independent, precise cloning with minimal or no changes in the amino acid sequence of the target protein and is based on homologous recombination of vector and insert with 15–25 bp homologies on both DNA ends. The reaction is enhanced using either T4 DNA polymerase, recA protein or incomplete PCR products [6]. In order to adapt vectors for SLIC, they need to share a common stretch of nucleotides at both ends of the linearized plasmid. We have chosen the HRV 3C protease [12] cleavage site and the 3’ end of the toxic ccdB gene to serve as primer binding sites common to all of the new vectors (Figure 1). The HRV 3C recognition site is located downstream of the N-terminal purification or solubility enhancing tag and can be used for tag removal. The CcdB protein inhibits bacterial growth by selective inhibition of E. coli DNA gyrase and can be neutralized by the antitoxin ccdA. The ccdB technology was developed by Delphi Genetics [13] and is licenced for use in Gateway vectors as well (Life Technologies). In the SLIC strategy presented here, ccdB introduced into the vector series was designed for strong constitutive expression in order to suppress growth in non-resistant cells at 100% efficiency. The vector is used as a PCR template for the amplification of the linear vector fragment, where ccdB is deleted. The ccdB gene on the template thus prevents the carry-over of the original vector during purification and SLIC reaction by preventing growth of colonies not containing the gene of interest. We developed pET, pFastBac and pTT parallel cloning vectors, named pCoofy1-x and present protein expression data in each of the respective host organisms.

Figure 1
figure 1

Principle of parallel SLIC cloning with negative ccdB selection. The vector is PCR linearized with LP1 forward and LP2 reverse primer. The LP1 primer corresponds to PreScission protease site (3C) for tag removal. The LP2 primer is either located at the C-terminus of ccdB or corresponds to a C-terminal tag. In both cases the ccdB gene is deleted upon PCR amplification thereby allowing counterselection of parental empty vector in ccdB sensitive cells. The Gene of Interest (GOI) is PCR amplified with primers composed of 5’ and 3’ gene specific sequences plus 15 bp – 25 bp extensions complementary to LP1 and LP2 vector primers, respectively.

Results and discussion

Vector design and cloning strategy

In order to drive strong constitutive ccdB expression from pCoofy vectors, we used the promoter of the major outer membrane lipoprotein OmpA, which is one of the strongest promoters in E. coli[14]. We inserted the respective llp5 promoter variant and a Shine-Dalgarno sequence upstream of the ccdB coding sequence (Figure 2A) to ensure translation. pPCRScript-LPP5-ccdB (Figure 2B) and all pCoofy ccdB derivatives show 100% killing efficiency when transformed into non-resistant cells. Occasionally, we observed the occurrence of ccdB inactivation during plasmid propagation in ccdB Survival™ cells (Life Technologies) under high selective pressure such as plasmids with high copy number. Therefore killing activity has to be verified for every single batch of vector DNA. However, since we use the vectors only as templates for PCR linearization the amount of DNA used for a single PCR is very low and usually sufficient for multiple SLIC reactions. Typically, 1 ng plasmid DNA is needed for a single SLIC reaction. We first cloned Llp5-ccdB into pETM14, pETM22, pETM33 and pETM44 to generate pCoofy1, 2, 3 and 4 (Figure 3A and Table 1). The parallel SLIC cloning procedure was established using eGFP as gene of interest according to the strategy illustrated in Figure 1A. pCoofy vectors were PCR linearized with 3C - LP1 forward and ccdB - LP2 reverse primer (Table 2). The LP1 primer corresponds to the HRV 3C protease site , the LP2 primer is located at the C-terminus of ccdB in order to delete the gene upon PCR amplification. eGFP was PCR amplified with primers composed of gene specific sequences plus 20 bp and 25 bp extensions complementary to LP1 and LP2 vector primers, respectively (Figure 3B).

Figure 2
figure 2

ccdB cassette used for pCoofy vectors. (A) The Llp5 variant of the strong E.coli major outer membrane lipoprotein promoter was introduced upstream of the coding sequence to drive constitutive ccdB transcription. -35 and -10 regulatory sequences are highlighted. A Shine-Dalgarno sequence was also added to ensure ccdB translation. The original start codon was converted to ATT (Ile, indicated in bold) for technical reasons. (B) The ccdB cassette was delivered by Sloning in pPCRSript (sequence available on request) and tested for toxicity by transformation into sensitive E.coli cells.

Figure 3
figure 3

Representative maps of parallel ccdB vectors for protein expression in E. coli , Baculovirus and HEK293E. For each of these hosts, one example of the pET (E. coli) (A), pFastBac (Baculovirus) (E) and pTT (HEK293E) (D) ccdB vector series is shown. All three backbones share common LP1 (3C) and LP2 (ccdB) primer binding sites for parallel cloning. (B) Primer design is illustrated for the universal PreScission 3C-ccdB primer pair of the pCoofy vector series. Gene specific primer sequences are fused to sequence overhangs of 20–24 nucleotides which are complementary to the corresponding vector amplification primer (see Table 2). (C) 2ndgeneration ccdB cassette including C-tags The ccdB cassette as shown in Figure 2A starting at TTGACA (−35 region) was fused to a row of C-terminal tags each separated by a stop codon. LP1 (3C) sequence was added upstream of the ccdB coding sequence. Restriction sites at both ends were also added for subsequent cloning into different vector backbones. The complete cassette was synthesized by GeneArt (now part of Life Technologies) with concomitant codon optimization of the tags for eukaryotic expression.

Table 1 Current list of pCoofys
Table 2 Primer for vector amplification and complementary primer extensions for gene of interest

When the SLIC reaction was carried out with insert and vector at a molar ratio of 1:3 without any treatment prior to transformation into chemocompetent OmniMAX™ 2 T1R cells, cloning efficiency was below 70%. Addition of recA raised overall cloning efficiency to > 95%. T4 DNA Polymerase treatment of vector and insert [6] was equally efficient, but due to simplicity we continued with the recA protocol. We tested several other variations to the basic protocol, however none of these further improved cloning efficiency: number of PCR cycles, PCR without extension step, extended LP1 and LP2 primer length for vector and insert PCR amplification, amount of vector and insert, molar ratio of vector and insert, 5 min 95°C denaturation of vector and insert mix followed by slow renaturation at 22° (data not shown). E.coli cells used for transformation may have an impact on quantity and quality of recombination events and should be tested first. At the Max-Planck Institute we use chemocompetent OmniMAX™ 2 T1R cells with a typical transformation efficiency of 107/μg pUC plasmid DNA.

Vector list and cloning statistics

The list of E.coli pCoofy vectors was extended by modifying additional pETM vectors or by introducing His10, OneStrep, S or Halo tags from templates listed in Table 1. All N- terminal tags are followed by the HRV 3C recognition site Leu-Phe-Gln/Gly-Pro. Specific cleavage occurs between Gln and Gly, with Gly-Pro remaining at the N terminus of the target protein. In order to express proteins that have to retain their native N-terminus after tag removal we generated ccdB versions of pET28M-Sumo1 and pET28M-Sumo3 vectors. The SUMO (Small Ubiquitin-like Modifier) tag is recognized and removed by SUMO protease in a structure specific manner to yield the target protein with its native N terminus [15]. Cloning a target gene into pCoofy5 and pCoofy6 requires the corresponding Sumo-LP1 primer for vector and insert PCR amplification (Table 2). We also generated E.coli vectors with either a C-terminal His10 or OneStrep tag (Table 1), which require the corresponding LP2 vector and insert primer for SLIC cloning (Figure 1, Table 2). Moreover, constructs without any N-terminal tag can be generated using LP1 tagless primers for the appropriate backbone. N-tagless primer were validated and used for pET and pFastBac backbones so far (Table 2). In order to further increase C-tag variations but at the same time reduce the number of vectors to be generated we designed a 2nd generation ccdB cassette (Figure 3C). Llp5-ccdB is followed by a row of C-terminal tags all separated by a stop codon. Depending on the LP2 primer used for vector and insert PCR linearization, either no tag, the His10, S, OneStrep, CBP, HPC4 or Æ54CPD54 self cleaving tag is fused to the C-terminus of the protein. Except for the Æ54CPD54 self cleaving tag, C-terminal tags lack a protease cleavage site and cannot be removed. At this time we have cloned this ccdB - C- tag cassette into pTT5 (Figure 3D) and validated eGFP and target gene expression in mammalian cells. pCoofy derivatives of pFastBac1 are currently available with His6, His6GST and His6MBP N-terminal tags (Figure 3E, Table 1).

We have effectively cloned more than 250 inserts into all different vectors of the pCoofy series. For all constructs, the DNA sequence of the translated gene fusion was controlled. We did not sequence the vector backbone of recombinant constructs, as we have never observed any compromised vector function. Insert sizes ranged between 150 and 3939 bp, with a majority in the range of 500–1000 bp. The number of clones per SLIC reaction varied with an average of about 20 throughout the entire distribution of insert sizes. As we never observed any background, we were not concerned if clone numbers were low, as the clone was correct in almost all cases (Figure 4).

Figure 4
figure 4

Cloning statistic. Typically, three colonies per transformation were selected for plasmid miniprep and sequencing reaction. All of the recombinant constructs listed here were approved to be correct by sequence alignments.

Expression in E.coli

Prior to use for requested target proteins, every new vector was validated with eGFP for cloning and small scale expression in the respective host. In case the expression level was unexpectedly low, we removed the vector from the list. For example a His10Trx-eGFP construct was expressed at about 20% of total protein in E.coli total cell lysate. When the double tag was switched to TrxHis10-eGFP the expression level increased to more than 70% (data not shown). Figure 5A shows a comparison of expression level and solubility of eGFP fused to several purification and solubility enhancing tags in E.coli. Trx, MBP and NusA protein fusions show the highest solubility as reported previously [8] and also the highest expression level at up to 80% of total cellular protein. Most interesting, His6GST-eGFP expressed at a high level but at low solubility. This result corroborates our previous observation, that His6GST expressed from the original pETM33 vector was insoluble (data not shown). Most of the E.coli expression data for requested target proteins at our Protein Production Service were collected for the first pCoofy vectors 1–4 corresponding to N-terminal His6, His6Trx, His6GST and His6MBP tags. In agreement with the eGFP expression data, MBP had a major impact on protein solubility (Figure 5B). This is also exemplified by E.coli expression of two Pil protein mutants in pCoofy1, 2, 3, and 4 (Figure 5C): expression levels range between 50% - and 80%, of target protein in E.coli total cell lysate with lower or no expression of the His6GST fusion protein. In the course of the project, only the His6MBP-Pil fusion proteins were soluble also after tag removal with HRV 3C protease (data not shown).

Figure 5
figure 5

Expression in E.coli. (A) eGFP was SLIC cloned into different E.coli pCoofy vectors using standard LP1 (3C) and LP2 (ccdB) primers and expressed in BL21 Rosetta (DE3) at 24°C. 2ml cultures were harvested and lyzed in PBS + protease inhibitors at consistent cell density / buffer ratio. eGFP expression level in total lysate was monitored on Agilent BioAnalyzer P80 Chips; eGFP fluorescence in cleared lysate was recorded on a LightCyclerII instrument. (B) Expression results for different target proteins cloned in pCoofy1 (His6), pCoofy2 (His6Trx), pCoofy3 (His6GST) and pCoofy4 (His6MBP) plus pCoofy35 (MBP only). Proteins are classified as soluble, insoluble or not expressed. “Soluble” proteins could be enriched by IMAC and were further purified with or without tag. Proteins classified as “insoluble” were not soluble in standard IMAC buffers (buffer A). These constructs were either subject to refolding, buffer screens or discontinued. Proteins classified as “not expressed” cover constructs, that were expressed at low or even undetectable level in Western Blot analysis and were thus discontinued. (C) Pil1 (UniProt: P53252). SAQ and SFK mutants were SLIC cloned into pCoofy1 (His6), pCoofy2 (His6Trx), pCoofy3 (His6GST) and pCoofy4 (His6MBP) using standard LP1 (3C) and LP2 (ccdB) primers and expressed in BL21 Rosetta (DE3) at 30°C. oD3 samples (0,1 ml culture of oD600 of 3 is lyzed in 50μl sample buffer) were loaded on Agilent BioAnalyzer P80 Chips. Expression levels indicated are based on relative peak quantification.

Expression in insect cells

pCoofy vectors allow for parallel screening in E.coli and insect and/or mammalian cells which has increased protein production throughput in our facilities substantially. Here we show two examples, Vasp and ODC, where parallel cloning allowed us to switch from the E.coli expression host to the Baculovirus expression system easily without much delay. In the case of Topoisomerase 1, Baculovirus expression of two parallel constructs improved project progress.

Expression of GST-Vasp in E.coli was described previously [16]. When we expressed GST-Vasp in E.coli we observed partial proteolytical degradation and also coaggregation of degradation products with full-length protein. Instead of investing time in optimizing bacterial expression, we cloned Vasp into pFastBac derivatives pCoofy28 (His6GST) and pCoofy29 (His6MBP) and expressed both constructs in SF9 cells without any signs of degradation (Figure 6A). Purified full-length His6GST-Vasp was shown to be biologically active (data not shown). Ornithin Decarboxylase (ODC) was properly folded when expressed in E.coli , but for co-expression purposes it had to be purified from insect cells. We therefore cloned the ODC gene into pCoofy27 (His6) and expressed the ODC protein in High Five cells (Figure 6B). Again, SLIC cloning enabled rapid change of expression host. Expression of GST-Top1 in insect cells was described previously [17]. We cloned Top1 in pCoofy28 (His6GST) and pCoofy29 (His6MBP) in parallel and tested expression in High Five cells. His6MBP-Top1 showed much higher expression level than His6GST-Top1 and was purified to homogeneity in enzymatically active form. (data not shown) (Figure 6C).

Figure 6
figure 6

Baculovirus expression in insect cells. pFastBac derivatives pCoofy27 (His6), pCoofy28 (His6GST) or pCoofy29 (His6MBP) were transformed into DH10Bac E.coli cells to generate recombinant Baculovirus (Bac-to-Bac®, Life Technologies). HighFive or SF9 cells were either infected with P2 virus stock or Baculo Infected Insect Cells BIIC at dilutions indicated. Cells were harvested after 72h and lyzed in SDS Lämmli buffer. Total lysate was loaded on 4–12% NuPAGE gels (Life Technologies). ø cells infected with non-recombinant Baculovirus. (A) Expression of His6GST-Vasp (lanes 1 to 5) and His6MBP-Vasp (lanes 6 to 10) (UniProt: P50552) in HighFive cells after infection with P2 virus stock; 48 h expression , 2000 fold virus dilution (lanes 1 and 6); 72 h expression at 20.000 (lanes 2 and 7), 2000 (lanes 3 and 8), 200 (lanes 4 and 9) and 40-fold (lanes 5 and 10) virus dilution, respectively; coomassie stained (B) Expression of His6-ODC (NCBI RefSeq NP_002530) in HighFive cells after infection with BIIC; lanes 1–4 BIIC at 1:500, 1:1000, 1:2000, 1:4000 dilution respectively; coomassie stained (C) Expression of Topoisomerase I (UniProt P04786) fused to His6GST-topisomerase I (lanes 1 to 4) and His6MBP (lanes 5 to 7) in High Five cells. Cells were infected with BIIC at 1:500, 1:1000, 1:2000 (lanes 1 to 3 and 5 to 7), and 1:4000 (lane 4) respectively. ø1 cells infected with insertless virus; ø2 uninfected H5 cells, ø3 uninfected SF9 cells, detected by Western Blot analysis using His-Probe™ reagent (ThermoScientific, Braunschweig, Germany).

Expression in HEK293E cells

Protein purification from HEK293E cells in our hands is not very efficient using standard immobilized metal affinity purification. We therfore have introduced alternative C-terminal purification tags into pTT5 (Figure 3C). The ccdB - C - tag cassette into pTT5 increases plasmid size by 1350 bp. In order to analyze if this has an impact on transient gene expression in HEK293E cells, we compared expression levels of both intracellular eGFP and secreted CD40 ligand protein [18]. Both proteins were expressed from the original pTT vectors [2] and their respective pCoofy derivatives. Transient transfection of both genes show comparable levels of eGFP in the total cell lysate and CD40 ligand in the culture supernatant when either expressed from pTT or pCoofy (Figure 7A).

Figure 7
figure 7

Expression in HEK293E cells. (A) pTT5-eGFP and pTT22-CD40L (GenBank NT_039706) constructs were generated by restriction enzyme cloning. pCoofy40L-eGFP was generated using standard LP1 (3C) and LP2 (ccdB) primers. Coofy40-CD40L was generated using pTT22-CD40L as template for PCR amplification of the insert. Gene specific LP1 primer sequence included vector VEGF signal sequence from pTT22. Recombinant plasmids were transfected into HEK293E cells and expressed for 72h. Total lysates of untransfected cells, pTT5-eGFP (lane 1) pCoofy40-eGFP (lane 2), cell culture supernatants of untransfected cells, pTT5- CD40L (lane 3) and pCoofy40-CD40L (lane 4) detected by Coomassie, silver staining and Western Blot HisProbe™-HRP detection. (B) eGFP was SLIC cloned into pTT derivative Coofy40 using standard LP1 (3C) primer in combination with either His10, OneStrep, S, CBP, HPC4 LP2 primer to introduce a C-terminal tag. Recombinant plasmids were transfected into HEK293E cells end expressed for 72h. Total lysates of untransfected cells, pCoofy40-eGFP fused to C-terminal S-tag (lane 1), CBP (lane 2), HPC (lane 3), OneStrep (lane 4) and His10 (lane 5). Eluates from S-Protein Agarose (lane 6), Calmodulin Affinity resin (lane 7), His-Select washed with 50 column volumes (CV) 20mM imidazole buffer (lane 8), His-Select washed with 50 CV 20mM imidazole buffer, 20 CV 50mM imidazole buffer and 20 CV 80mM imidazole buffer (lane 9), Calmodulin Affinity resin (lane 10, same sample as lane 7), Anti-Protein C Affinity matrix (lane 11) and Strep-Trap™ (lane 12). Yields estimated for His10 and OneStrep based on Bradford staining was 8 mg / L HEK culture and 3 mg /L HEK culture respectively.

In order to test these alternative purification tags we fused them to eGFP, transiently expressed in HEK293E cells and purified with the respective affinity resin, except Æ54CPD54, which is specifically activated by inositol hexakisphosphate (InsP6) present in eukaryotic cells [19]. Comparison of C-terminal S-tag, His10, HPC, CBP and OneStrep showed best expression levels for both S and His10 tag. Protein yield was lowest for S-tag, HPC and CBP. Best yield and purity were obtained for eGFP-One Strep and eGFP-His10 when washed stringently with 50mM and 80mM imidazole (Figure 7B). In summary, we have shown effective protein expression and purification from pCoofy40 vectors that can now be included in our parallel cloning strategy.

Conclusions

We have developed a method that allow one to tailor any given expression vector for efficient, fast, robust and cost-effective parallel cloning. High cloning efficiency is guaranteed via strong constitutive ccdB expression that, in contrast to DpnI digestion, is 100% efficient in counterselection of parental insert-less vector. The procedure is very robust and has been easily implemented in research groups in-house or externally. We have generated more than 30 parallel vectors for expression in bacteria, insect and HEK293 cells with different purification and solubility enhancing tags that we consider to be helpful in our workflow. This list of pCoofys has fundamentally increased our throughput and success rate in protein purification. We are constantly expanding the list of vectors and have also integrated the ccdB-C-tag cassette into Baculovirus, Pichia pastoris and Hansenula polymorpha vectors which still need to be validated.

With the strategy presented here it is straightforward to assemble any tag combination of interest for any selected application. Moreover, with the use of SLIC, as many as five inserts can be assembled in one reaction simultaneously with great efficiency [20]. Thus a modular combination of any vector element such as purification tag, signal sequences, antibiotic resistance etc. would be possible. The Llp-ccdB counterselection gene presented in this work could also enhance cloning efficiency of other cloning methods as RF or others.

Methods

Vector construction

Molecular biology methods were based on standard protocols. E.coli chemocompetent ccdB survival cells and OmniMAX™ 2 T1R cells (Life Technologies, Darmstadt, Germany) were used for propagating ccdB plasmids and for transforming cloning reactions, respectively. PCR primers were ordered at Metabion (Martinsried, Germany). PCR was performed in 50 μl reaction mixes using high fidelity Phusion polymerase (NEB, Frankfurt, Germany). PCR products were analyzed on agarose gels and purified with High Pure PCR cleanup kit (Roche, Mannheim, Germany). Plasmid DNA was prepared using NucleoBond® or NucleoSpin® Plasmid Kits (Macherey Nagel, Düren, Germany). pETM vectors were provided by the EMBL [8], pTT by Yves Durocher [11] and pFastBac was purchased at Life Technologies. Synthetic ccdB DNA containing the promoter, Shine-Dalgarno and the coding sequence of ccdB was synthesized by Sloning BioTechnology (now part of Morphosys). The 2nd generation cassette was synthesized by GeneArt® (Regensburg, Germany, now Life Technologies).

pCoofy E.coli expression vectors were generated by ligating the Llp5-ccdB HindIII restriction enzyme fragment from pPCRSript-ccdB into HindIII linearized pETM vectors. Additional tags, that were not present in the EMBL vector series, were added to the list: His6 was extended to His10 with the use of primer extensions; the OneStrep tag was PCR amplified from pPSG-IBA103-eGFP (IBA, Göttingen, Germany), the S tag was amplified from pET29 (Novagen, Darmstadt, Germany) and the Halo tag was amplified from pFN18a (Promega, Mannheim, Germany) template DNA. The pCoofy transient insect expression vector was derived from pIEX1. Vector was digested with XcmI and NotI and ligated to llp5-ccdB, that was PCR linearized with XcmI / NotI primer extensions. The pCoofy Baculovirus expression vectors were derived from pFastBac1. The sequence spanning N tag - 3C - Llp5-ccdB was PCR amplified using pCoofy1, pCoofy3 and pCoofy4 as template DNA to generate pCoofy27, pCoofy28, pCoofy29, respectively. PCR fragments were extended by RsrII /XhoI restriction sites and ligated into pFastBac1 linearized with RsrII /XhoI. The pCoofy HEK expression vector was derived from pTT5. The vector was linearized with EcoRI / NotI and ligated to the ccdB - C - tag cassette, which was PCR amplified with EcoRI / NotI primer extensions. Prior to use, the integrity of all vectors was verified by DNA sequencing. ccdB toxicity is a prerequisite for efficient counterselection and was verified for each new vector and for each individual vector preparation by transformation into ccdB non-resistant OmniMAX™ 2 T1R cells. Functionality in cloning and expression was controlled for each new vector by eGFP SLIC cloning and small scale test expression in the appropriate host.

SLIC cloning

All vectors were PCR linearized with their corresponding LP1 and LP2 primer combination, purified and stored at −20°C ready-to-use. Briefly, a 50 μl reaction mix containing 25 ng vector DNA, 50 pmol of each primer 0,4mM dNTP Mix, 1 Unit Phusion Polymerase and 1x Phusion Polymerase buffer was used in a 3-step PCR reaction: 1) 98°C for 3 min, 1 cycle; 2) 98°C for 30 sec followed by 72°C for 90 sec; 30 cycles; 3) 72°C for 10 min, 1 cycle. The PCR product is purified using High Pure PCR Cleanup Kit (Roche, Germany). The insert is PCR linearized with primers composed of gene specific sequences adjusted to a Tm of 56°C - 60°C plus 15 to 25 bp extensions complementary to LP1 and LP2 vector primer. PCR conditions need to be optimized for the respective gene. Typically 50 ng template DNA, 50 pmol of each primer 0,4mM dNTP Mix, 1 Unit Phusion Polymerase and 1x Phusion Polymerase buffer was used in a 3-step PCR reaction: : 1) 96°C for 5 min, 1 cycle; 2) 96°C for 30 sec followed by 50°C for 30 sec followed by 72°C for about 1min/kb; 30 cycles; 3) 72°C for 10 min, 1 cycle. The combination of LP1 and LP2 extensions has to be appropriate for the desired vector. The main vector series uses 3C-LP1 and ccdB-LP2. Alternative LP1 primers are used for N-terminal Sumo1, Sumo3 or no N tag at all. ccdB - LP2 is used in case no C-tag is introduced. It is important to include a stop codon in the insert LP2 primer, otherwise translation will proceed two codons (Arginine – Histidine) further downstream until the first stop codon in the LP2 primer extension is encountered. Diverse LP2 primers enable the introduction of certain C-terminal tags (Table 2). For expression in HEK cells a Kozak CCACC sequence should be added upstream the first ATG start codon in the LP1 gene specific primer sequence. A standard PCR program over 30 cycles including the final extension step was used. In the SLIC reaction, 100 ng of vector is mixed with insert at a molar ratio of 1:3. 1 μl recA protein (2 μg/ml stock) plus 1 μl recA buffer (10x) (NEB, Germany) was added to a final reaction volume of 10 μl, incubated for 30 min at 37°C and transformed into chemocompetent OmniMAX™ 2 T1R cells with slightly extended incubation times: 30 min on ice, 1 min heat shock at 42°C, 5 Typically, three clones per construct were selected and sequenced prior to use in expression.

Protein expression in E.coli

Expression plasmids were transformed into E.coli BL21 (DE3) Rosetta expression strains. Precultures were grown overnight in yeast extract supplemented with 3,5% glycerol (YG) at 30°C, 130 rpm. Expression cultures were either inoculated with colonies from agar plate or with preculture to a final oD600 of 0,004 and grown at 2 ml volume in 24 deepwell plates (Whatman, GE Healthcare, Germany) covered with airpore tape sheets (Qiagen, Hilden, Germany). Cultures were grown in autoinduction medium [21] at 24 or 30°C for 24h or at 37°C overnight. 18°C expression cultures in YG medium were first grown at 37°C, induced with 0,5 – 1mM IPTG and shifted to 18°C for over night expression. Cultures were pelleted, resuspended in 200 μl buffer A (50 mM Na-Phosphate pH 8,0, 500 mM NaCl, 10% glycerol, 10 mM imidazole) including protease inhibitors AEBSF-HCl (1 mM), Aprotinin (2 μg/ml), Leupeptin (1 μg/ml) Pepstatin (1 μg/ml) and lyzed mechanically in a cooled Tissue Lyzer (Retsch, Hann, Germany) with glass beads (Sigma, Munich, Germany) at 30 Hz for 5 min. Cell extracts were centrifuged for 30 min at 14.000 rpm at 4°C.

Protein expression in insect cells

HighFive and SF9 suspension cultures were grown in ExCell405 and ExCell420 medium, respectively (Sigma, Munich, Germany) at 27°C in 50 mm Unitron shakers (Infors, Bottmingen, Switzerland). 2 ml test expression cultures were shaken in 25 ml polystyrene screw cap tubes (Sarstedt, Nümbrecht, Germany) at 120 rpm. Cell viability and cell size were monitored on a Vi-CELL® instrument (Beckman Coulter, Krefeld, Germany).

Baculovirus expression was performed according to the Bac-to-Bac® protocol (Life Technologies, Darmstadt, Germany). Bacmid transfected SF9 cells were typically harvested after 4–5 days, at maximum cell size and onset of cell lysis. The titer of this first virus stock was determined with the SF9 easy titer cell line [22]. Virus was either amplified in 2 subsequent steps to generate P1 and P2 virus stock or used to generate Baculo Infected Insect Cells (BIIC) as described previously [23]. For test expression, High Five and SF9 cells were infected at 1 × 106 cells / ml with virus stock or BIIC at different dilutions, typically in the range of 1:1000 – 1:10.000 and harvested after different time points. Viability and cell size was recorded for every expression culture.

Protein expression in HEK293E cells

HEK293E suspension cells were grown in serum free Freestyle medium (Life Technologies, Darmstadt, Germany) supplemented with G418 to 1 μg/ml and Pluronic to 0,1% at 37°C in 50 mm Unitron CO2 (5%) shakers (Infors, Bottmingen, Switzerland) Transient transfection was performed as described [24]. Briefly, 10 ml suspension cultures in 125 ml Erlenmeyer flasks were adjusted to a cell density of 1x106 cells/ml. 1 μg plasmid DNA per ml culture was diluted in 100 μl PBS and mixed with 2 μl PEI (25 kDa linear, 1mg/ml, Polysciences). After 15 min incubation at room temperature the mix was added dropwise to the cells. For secreted proteins, A25 trypton (OrganoTechnie, La Courneuve, France) was added to a final concentration of 0,5% 24 h after transfection. Cells were harvested after 72, lyzed in respective binding buffers including protease inhibitors AEBSF-HCl (1 mM), Aprotinin (2 μg/ml), Leupeptin (1 μg/ml) Pepstatin (1 μg/ml) and lyzed mechanically in a Dounce homogenizer. Cell extracts were centrifuged for 30 min at 14.000 rpm at 4°C. Depending on the C-terminal tag fused to the protein, lysate supernatants were loaded on the following affinity resins: His-Select (Sigma, Munich, Germany); Calmodulin (Stratagene, Agilent Technologies, Germany), S-Protein (Novagen, Darmstadt, Germany), Anti-Protein C Affinity Matrix (Roche, Mannheim, Germany) and StrepTrap™ HP (GE Healthcare, Germany). Protein binding, washing and elution was performed according to the manufacturers instructions.

Protein detection

All proteins samples were analyzed on 4–12% SDS-PAGE gradient gels (NuPAGE in MES buffer system, Life Technologies, Darmstadt, Germany), visualized by Coomassie staining, Silver staining or on P80 and P230 BioAnalyzer2100 Chips (Agilent, Böblingen, Germany). For western blot analysis, proteins were blotted on Immobilon-P PVDF (Millipore, Schwalbach, Germany) and visualized with HisProbe™-HRP (Thermo Scientific, Munich, Germany).

Abbreviations

PCR:

Polymerase chain reaction

SLIC:

Sequence and Ligation Independent Cloning

Bp:

Basepairs

HRV:

Human rhinovirus

GFP:

Green fluorescent protein

HTTP:

High throughput

RF cloning:

Restriction free cloning

OmpA:

Outer membrane lipoprotein A

SUMO:

Small Ubiquitin-like Modifier

GOI:

Gene of interest

References

  1. Vincentelli R, Cimino A, Geerlof A, Kubo A, Satou Y, Cambillau C: High-throughput protein expression screening and purification in Escherichia coli. Methods. 2011, 55 (1): 65-72. 10.1016/j.ymeth.2011.08.010.

    Article  CAS  Google Scholar 

  2. An Y, Yumerefendi H, Mas PJ, Chesneau A, Hart DJ: ORF-selector ESPRIT: a second generation library screen for soluble protein expression employing precise open reading frame selection. J Struct Biol. 2011, 175 (2): 189-197. 10.1016/j.jsb.2011.04.004.

    Article  CAS  Google Scholar 

  3. Cabantous S, Waldo GS: In vivo and in vitro protein solubility assays using split GFP. Nat Methods. 2006, 3 (10): 845-854. 10.1038/nmeth932.

    Article  CAS  Google Scholar 

  4. Engler C, Gruetzner R, Kandzia R, Marillonnet S: Golden Gate Shuffling: A One-Pot DNA Shuffling Method Based on Type IIs Restriction Enzymes. PlosOne. 2009, 4 (5): e5553-

    Article  Google Scholar 

  5. Unger T, Jacobovitch Y, Dantes A, Bernheim R, Peleg Y: Applications of the Restriction Free (RF) cloning procedure for molecular manipulations and protein expression. J Struct Biol. 2010, 172 (1): 34-44. 10.1016/j.jsb.2010.06.016.

    Article  CAS  Google Scholar 

  6. Li MZ, Elledge SJ: Harnessing homologous recombination in vitro to generate recombinant DNA via SLIC. Nat Methods. 2007, 4 (3): 251-256. 10.1038/nmeth1010.

    Article  CAS  Google Scholar 

  7. Berrow NS, Alderton D, Sainsbury S, Nettleship J, Assenberg R, Rahman N, Stuart DI, Owens RJ: A versatile ligation-independent cloning method suitable for high-throughput expression screening applications. Nucleic Acids Res. 2007, 35 (6): e45-10.1093/nar/gkm047.

    Article  Google Scholar 

  8. Dümmler A, Lawrence AM, de Marco A: Simplified screening for the detection of soluble fusion constructs expressed in E. coli using a modular set of vectors. Microb Cell Fact. 2005, 4: 34-10.1186/1475-2859-4-34.

    Article  Google Scholar 

  9. Lancini G, Parenti F: Antibiotics. 1982, New York: Springer

    Book  Google Scholar 

  10. Geisse S: Reflections on more than 10 years of TGE approaches. Protein Expr Purif. 2009, 64: 99-107. 10.1016/j.pep.2008.10.017.

    Article  CAS  Google Scholar 

  11. Durocher Y, Perret S, Kamen A: High-level and high-throughput recombinant protein production by transient transfection of suspension-growing human 293-EBNA1 cells. Nucleic Acids Res. 2002, 30 (2): E9-10.1093/nar/30.2.e9.

    Article  Google Scholar 

  12. Carr S, Miller J, Leary SE, Bennett AM, Ho A, Williamson ED: Expression of a recombinant form of the V antigen of Yersinia pestis, using three different expression systems. Vaccine. 1999, 18: 153-159. 10.1016/S0264-410X(99)00214-5.

    Article  CAS  Google Scholar 

  13. Bernard P, Gabant P, Bahassi EM, Couturier M: Positive-selection vectors using the F plasmid ccdB killer gene. Gene. 1994, 148 (1): 71-74. 10.1016/0378-1119(94)90235-6.

    Article  CAS  Google Scholar 

  14. Inouye S, Inouye M: Up-promoter mutations in the lpp gene of Escherichia coli. Nucl. Acids Res. 1985, 13 (9): 3101-3110. 10.1093/nar/13.9.3101.

    Article  CAS  Google Scholar 

  15. Reverter D, Lima CD: A basis for SUMO protease specificity provided by analysis of human Senp2 and a Senp2-SUMO complex. Structure. 2004, 12 (8): 1519-1531. 10.1016/j.str.2004.05.023.

    Article  CAS  Google Scholar 

  16. Breitsprecher D, Kiesewetter AK, Linkner J, Faix J: Analysis of actin assembly by in vitro TIRF microscopy. Methods Mol Biol. 2009, 571: 401-415. 10.1007/978-1-60761-198-1_27.

    Article  CAS  Google Scholar 

  17. Chen Z, Yang H, Pavletich NP: Mechanism of homologous recombination from the RecA-ssDNA/dsDNA structures. Nature. 2008, 453 (7194): 489-4. 10.1038/nature06971.

    Article  CAS  Google Scholar 

  18. Kilunc MO, Mukundan L, Yolcu ES, Singh NP, Suttles J, Shirwan H: Generation of a multimeric form of CD40L with potent immunostimulatory activity using streptavidin as a chaperon. Exp Mol Pathol. 2006, 80: 252-261. 10.1016/j.yexmp.2005.12.004.

    Article  Google Scholar 

  19. Shen A, Lupardu PJ, Morell M, Ponder EL, Sadaghiani AM, Garcia KC Bogyo M: Simplified, Enhanced Protein Purification Using an Inducible, Autoprocessing Enzyme Tag. PlosOne. 2009, 4 (12): e8119-

    Article  Google Scholar 

  20. Li MZ, Elledge SJ: SLIC a method for sequence- and ligation-independent cloning. Methods Mol Biol. 2012, 852: 51-59. 10.1007/978-1-61779-564-0_5.

    Article  CAS  Google Scholar 

  21. Studier FW: Protein production by auto-induction in high density shaking cultures. Protein Expr Purif. 2005, 41 (1): 207-234. 10.1016/j.pep.2005.01.016.

    Article  CAS  Google Scholar 

  22. Hopkins R, Esposito D: A rapid method for titrating baculovirus stocks using the Sf-9 Easy Titer cell line. Biotechniques. 2009, 47 (3): 785-788. 10.2144/000113238.

    Article  CAS  Google Scholar 

  23. Wasilko DJ, Lee SE: TIPS: titerless infected-cells preservation and scale-up. Bioprocess J. 2006, 5: 29-32.

    Google Scholar 

  24. Tom R, Bisson L, Durocher Y: Transient expression in HEK293-EBNA1 cells. Expression systems. Edited by: Dyson MR, Durocher Y. 2007, UK: Scion Publishing Ltd, 212-218.

    Google Scholar 

Download references

Acknowledgements

We thankfully acknowledge Kirill Alexandrow and Christian Ostermeier for discussion of strategic concepts, Peggy Stolt-Bergner for critically reading the manuscript, Natasza Ziolkowska, Julian Stingele, Andreas Schweitzer, Marc Schmidt-Supprian and Alvaro Crevenna for providing PIL, Top1, ODC, CD40L and Vasp constructs, respectively.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sabine Suppmann.

Additional information

Competing interests

The authors declare that they have no financial and no non-financial competing interests.

Authors’ contributions

JS constructed all pCoofys except pCoofy27,28 and 29, SLIC cloned inserts and performed insect cell expressions. HB constructed pCoofy27, 28, 29 and pETM28-Sumo1 and Sumo3. CS SLIC cloned inserts and performed bacterial expressions. SS designed the cloning strategy and vectors, performed bacterial, insect and mammalian expression experiments and wrote the manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Scholz, J., Besir, H., Strasser, C. et al. A new method to customize protein expression vectors for fast, efficient and background free parallel cloning. BMC Biotechnol 13, 12 (2013). https://doi.org/10.1186/1472-6750-13-12

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1472-6750-13-12

Keywords