Skip to main content
  • Methodology article
  • Open access
  • Published:

Directed PCR-free engineering of highly repetitive DNA sequences



Highly repetitive nucleotide sequences are commonly found in nature e.g. in telomeres, microsatellite DNA, polyadenine (poly(A)) tails of eukaryotic messenger RNA as well as in several inherited human disorders linked to trinucleotide repeat expansions in the genome. Therefore, studying repetitive sequences is of biological, biotechnological and medical relevance. However, cloning of such repetitive DNA sequences is challenging because specific PCR-based amplification is hampered by the lack of unique primer binding sites resulting in unspecific products.


For the PCR-free generation of repetitive DNA sequences we used antiparallel oligonucleotides flanked by restriction sites of Type IIS endonucleases. The arrangement of recognition sites allowed for stepwise and seamless elongation of repetitive sequences. This facilitated the assembly of repetitive DNA segments and open reading frames encoding polypeptides with periodic amino acid sequences of any desired length. By this strategy we cloned a series of polyglutamine encoding sequences as well as highly repetitive polyadenine tracts. Such repetitive sequences can be used for diverse biotechnological applications. As an example, the polyglutamine sequences were expressed as His6-SUMO fusion proteins in Escherichia coli cells to study their aggregation behavior in vitro. The His6-SUMO moiety enabled affinity purification of the polyglutamine proteins, increased their solubility, and allowed controlled induction of the aggregation process. We successfully purified the fusions proteins and provide an example for their applicability in filter retardation assays.


Our seamless cloning strategy is PCR-free and allows the directed and efficient generation of highly repetitive DNA sequences of defined lengths by simple standard cloning procedures.


Expansions of DNA repeat sequences are associated with many inherited neurodegenerative diseases [13]. One of the best-studied examples for a trinucleotide expansion disease is the neurological disorder Huntington's chorea, where the accumulation of CAG triplets within the first exon of the gene encoding the Huntingtin (Htt) protein leads to an elongated polyglutamine (Poly-Q) stretch in the polypeptide. It has been shown that more than 36 consecutive glutamine residues are pathogenic as they promote Htt aggregation into amyloid-like fibrils [4]. The translation product of the first exon of the htt gene was previously used to study the aggregation behavior of Poly-Q proteins in vitro [5]. In order to investigate the influence of the length of the Poly-Q stretch on aggregation kinetics, we wanted to clone reporter constructs containing defined numbers of glutamine residues. We developed a PCR-free cloning strategy allowing us to create repetitive DNA sequences encoding glutamine stretches of defined length. These sequences were used to generate improved constructs for filter retardation assays to study Huntingtin aggregation in vitro.

Several methods have been described for cloning long DNA repeat tracts. However, most of them include PCR-based amplification steps, generate imperfect repeats, or result in a pool of clones that differ in the number of repeats [613]. As a consequence, additional effort is required to identify and isolate clones with the desired length of nucleotide repeats. Therefore, we developed a simple multi-cycle cloning strategy using synthetic oligonucleotides and Type IIS restriction endonucleases to engineer highly repetitive DNA fragments. Our approach is PCR-free and generates exclusively clones carrying the desired length of repeat sequences in each step. Moreover, it is not only suitable to multiply the length of existing repetitive sequences but also to vary the number of inserted repeats in every elongation cycle. Finally, we can easily recombine constructs of the same or different repeat lengths to accelerate the construction of the desired number of repeats.

Results and discussion

Type IIS restriction endonucleases cut DNA at defined distances outside their recognition site [14]. Thus, these enzymes allow the generation of single-stranded DNA overhangs of any desired sequence. The arrangement of inward facing Type IIS recognition sites at the ends of DNA fragments facilitates their fusion without incorporation of extraneous nucleotides. Therefore, such enzymes are widely used in DNA shuffling or seamless cloning strategies to construct synthetic genes and artificial protein polymers [1519]. Based on this approach, we designed double-stranded oligonucleotides containing a defined number of central glutamine-encoding CAG and CAA triplets (Q-block) arranged in a non-regular fashion (Figure 1A). We decided on this approach because it has been shown previously that interruption of perfect CAG repeats by CAA triplets improves the stability of Poly-Q encoding sequences in E. coli [12]. The central triplet repeats were flanked by non-repetitive sequences that have two important functions: First, they ensure the specific annealing of the oligonucleotides, and secondly, they comprise two inward directed Type IIS restriction sites (BsaI and BsmBI) and a Type IIP restriction site (SacI). The Type IIS sites were designed to result in cuts within the repetitive sequence and to produce compatible DNA overhangs allowing for seamless elongation of the Poly-Q encoding sequence (Figure 1A and 1B). In our strategy, the BsaI and SacI restriction sites of the Q-block oligonucleotides are used to elongate a pre-existing Q-block on a plasmid digested with BsmBI and SacI (Figure 1C, lower panel). Importantly, the inward facing BsmBI site on the last introduced oligonucleotide remains intact upon ligation and is available for further insertions. Therefore, seamless elongation is based on the re-introduction of a unique Type IIS site in each elongation step (Figure 1B and 1C).

Figure 1
figure 1

Cloning strategy of Poly-Q encoding sequences. (A) Annealed oligonucleotides used for cloning of the Poly-Q encoding regions. The recognition sequences and the corresponding cleavage sites of the restriction endonucleases are indicated by different colors. The orientation of the two inward facing Type IIS restriction sites (BsaI and BsmBI) results in cleavage within the Poly-Q encoding sequence and allows seamless elongation of the Poly-Q stretch. (B) Detailed representation of the elongation cycles. Digestion of the double-stranded oligonucleotides (BsaI/SacI) and the vector (BsmBI/SacI) resulted in compatible overhangs allowing for seamless elongation. Used restriction sites are underlined. (C) The annealed oligonucleotides were initially subcloned using BsaI and SacI and subsequently inserted into pMK1 via NdeI and SacI. For further elongation BsaI/SacI digested oligonucleotides were ligated into pMK1-Qn digested with BsmBI and SacI. Thereby the number of Q-encoding nucleotides was increased by nine and a new BsmBI site was introduced allowing subsequent rounds of elongation.

The antiparallel Q-block oligonucleotides described above were commercially synthesized, annealed, and subcloned via the BsaI and SacI restriction sites into a vector containing a unique BsaI site (Figure 1B and 1C). The following elongation steps of the Poly-Q encoding sequence only require the double-stranded oligonucleotides treated with the appropriate enzymes as described above and a cloning vector without BsmBI sites. Hence, we performed the elongation cycles in the plasmid pMK1 from which all BsmBI sites were removed by site-directed mutagenesis. The initial introduction of the subcloned Q-block fragment into pMK1 resulted in the plasmid pMK1-Q11 (Figure 1C, upper panel). As a consequence, this vector now contains a single BsmBI site downstream of the glutamine-encoding triplets, which could be used for the further elongation of the Q-block (Figure 1C). Next, pMK1-Q11 was digested with the enzymes BsmBI and SacI. The resulting cohesive ends were compatible with the overhangs of the double-stranded Q-block oligonucleotides cut by BsaI and SacI (Figure 1C). Ligation of the insert with pMK1-Q11 gave rise to pMK1-Q20. Thereby a unique BsmBI site was re-introduced downstream of the elongated Poly-Q encoding region (Figure 1C). As the cohesive ends of the annealed and digested oligonucleotides are only compatible with the doubly-cut plasmid, multiple insertions were efficiently prevented. Using this strategy the directed elongation of the Q-block was achieved by repeated cycles of digestion and ligation.

Our approach has several additional advantages beyond providing a strategy for the directed elongation of repetitive DNA sequences. An important aspect is that the number of introduced nucleotides can be adjusted precisely in each round by defining the length of the Q-block of the synthetic oligonucleotides (Figure 1A). Furthermore, the generation of long repetitive sequences can be accelerated by simple modifications (Figure 2). For example, pMK1-Q20 was digested with BsaI and SacI and the resulting Q-block fragment was ligated with the pMK1-Q20 backbone cleaved with BsmBI and SacI. This slight adaptation of the protocol enabled the formation of pMK1-Q38 and demonstrates that the number of Q-encoding triplets could be rapidly increased in a single step. This allowed the fast generation of a pool of constructs with different lengths of the repetitive sequences that can be easily recombined (Figure 3A). Figure 3B shows the restriction digestion analysis of the constructed pMK1-Qn plasmids carrying inserts encoding the indicated Poly-Q stretches.

Figure 2
figure 2

Generation of long repetitive sequences can be accelerated. The elongation of the Q-encoding region was accelerated by a simple modification of the cloning strategy. For example, the Poly-Q encoding region of the plasmid pMK1-Q20 can be excised with BsaI and SacI and ligated into the same vector digested with BsmBI/SacI. Thereby, the number of Q-encoding repeats can be almost doubled in a single cloning step.

Figure 3
figure 3

Overview of the seamless cloning of repetitive sequences for the production of Poly-Q proteins. (A) Genealogy of the pMK1-vectors. A pool of different pMK1 vectors was generated by elongation cycles using double-stranded oligonucleotides or Poly-Q encoding DNA fragments derived from pre-existing pMK1-Qn vectors. (B) Repetitive sequences of different lengths cloned by the described strategy in (A). pMK1-Qn plasmids encoding indicated Poly-Q stretches were digested with EcoRI (cuts 77 bp upstream of BsaI) and SacI. The inserts were separated on a 2% agarose gel and visualized by ethidium bromide (EtBr) staining.

Our approach is also suitable to increase the length of repetitive sequences by multiple insertions as described previously [13]. Thereby, simultaneous digestion of the annealed oligonucleotides or pMK1-Qn plasmids with both Type IIS restriction enzymes would result in Q-block fragments with compatible overhangs (Figure 1A). This would allow for ligation of multiple Q-blocks in a single step.

Importantly, our technique is not limited to generate Poly-Q encoding sequences but can also be used for other nucleotide repeats. One example of highly repetitive nucleotide sequences are the poly(A) tails attached to the 3' ends of eukaryotic mRNAs by poly(A)-polymerases. These poly(A) extensions are up to 250 nucleotides in length and they are important for mRNA stability and efficient translation initiation [20]. As polyadenylation is lacking in conventional in vitro transcription reactions, poly(A) fragments are usually fused to the 3' untranslated regions of the DNA templates. Nevertheless, controlling the length of the poly(A) sequence is difficult to achieve by conventional cloning strategies. We have successfully applied our novel approach for the directed elongation of repetitive DNA sequences to create poly(A) stretches of defined length in DNA templates for the in vitro production of polyadenylated mRNA (Figure 4). Therefore, we used A-block oligonucleotides instead of Q-blocks and applied the same basic strategy as described above. In this case we used two oligonucleotide pairs (A31 and A32) to generate a stretch of 57 adenines and an inward facing BsmBI site in a single initiation step (Figure 4A and 4B). The newly introduced BsmBI site was used to elongate the poly(A) region with the oligonucleotide pair A32 (Figure 4A, lower panel). A restriction digestion analysis of the initiation and elongation steps of poly(A) cloning is shown in Figure 4C.

Figure 4
figure 4

Cloning of poly(A) constructs. (A) Our basic strategy was also successfully applied to the cloning of adenine repeat sequences. Here two antiparallel oligonucleotide pairs that differ in their restriction sites and length of the poly(A) region, A31 and A32, were used for the initiation step. A31, A32, and pMK0 were digested with HindIII/BsaI, BsaI/XhoI, and HindIII/XhoI, respectively, and ligated in a single reaction to obtain pMK0-A57. For elongation the BsaI/XhoI digested oligonucleotide A32 was inserted into pMK0-A57 cut with BsmBI and XhoI. The elongation principle is the same as described above for the Poly-Q constructs. Used restriction sites are underlined. (B) Antiparallel oligonucleotide pairs used for cloning of the poly(A) sequences. The recognition sequences and the corresponding cleavage sites of the restriction endonucleases are indicated by different colors. The inward facing BsaI sites on A31 and A32 were designed in a way to allow the seamless fusion of the oligonucleotides via their A-blocks. (C) Initiation and elongation step of poly(A) cloning. pMK0-An plasmids containing the indicated number of adenines or the empty pMK0 vector were digested with SacI (cuts 199 bp upstream of poly(A)) and XhoI. The inserts were separated on a 10% polyacrylamide (PAA) gel and visualized by ethidium bromide (EtBr) staining.

Theoretically, the basic strategy can also be expanded to establish a modular system of double-stranded DNA oligonucleotides or PCR-amplified fragments carrying different sequences that can be fused in a directed fashion and in any desired order. This is possible as long as the recognition sites of the Type IIS enzymes are designed to produce compatible ends (Figure 5). Moreover, other combinations of Type IIS and Type IIP restriction endonucleases can be used instead of the ones chosen in this study.

Figure 5
figure 5

Strategy to combine different repetitive sequences in a modular fashion. Antiparallel oligonucleotide pairs with different repetitive sequences (orange and blue) can be fused on one plasmid if their Type IIS recognition sites (IIS) are designed in a way that they create compatible overhangs. Moreover, our strategy is not restricted to specific endonucleases. The use of any combination of one Type IIP (IIP) and two different compatible Type IIS enzymes is possible. Upon insertion of the first annealed and digested oligonucleotide pair (orange) via IIS and IIP, the resulting vector can be digested with IIS' and IIP to allow the incorporation of a second double-stranded oligonucleotide pair (blue).

Our initial intention was to generate Poly-Q constructs for in vitro experiments. So far, we cloned repetitive DNA sequences of different lengths encoding Poly-Q stretches between 11 and 218 residues into the pMK1 vector (Figure 3A). We next adopted these constructs to meet our specific purposes. In our final constructs the 17 N-terminal amino acids of Htt (N17) flank the different Poly-Q regions and a FLAG-tag for immunodetection is located at the C-terminal end. In order to insert the various Q-blocks between the N17 and FLAG-tag encoding sequences on the expression vector (pLANA), the plasmid was linearized with the BsaI enzyme. The glutamine encoding sequences were excised from the pMK1 plasmids with BsaI and BsmBI resulting in compatible ends that allowed the seamless insertion of Q-blocks of different lengths behind the N17 motif (Figure 6A).

Figure 6
figure 6

Cloning and analysis of the final Poly-Q constructs. (A) The final constructs encode an N-terminal His6-tag (pink) and a SUMO-domain (blue) followed by the 17 N-terminal amino acids of Huntingtin (N17, orange). The different Poly-Q encoding sequences (red) and a FLAG-tag (yellow) are located at the 3' end. Expression is controlled by a T7 promoter (brown). For cloning of the final constructs the Poly-Q encoding repeats of different lengths were excised from of the pMK1-Qn vectors using BsaI and BsmBI. The resulting fragments were inserted into pLANA, which was linearized with BsaI. The restriction sites were arranged in a way that allowed the seamless ligation of the Poly-Q encoding sequences behind N17 into pLANA. Our final constructs still contain a BsaI site behind the Poly-Q encoding region allowing further nucleotide insertions. Used restriction sites are underlined. (B) Three different constructs were successfully produced in E. coli and purified. The proteins consisting of His6-SUMO-N17-Qn-FLAG were applied to SDS-PAGE and visualized by immunoblotting against the FLAG-tag. (C) Purified His6-SUMO-N17-Q47-FLAG fusion proteins were treated with Ulp1 for 1 min and analyzed by SDS-PAGE and α-FLAG immunoblotting. (D) Filter retardation assay using His6-SUMO-N17-Q47-FLAG fusion proteins. Membrane bound Poly-Q fibrils were visualized by α-FLAG immunodetection.

To study the aggregation kinetics of Poly-Q proteins in vitro, we applied a widely used filter retardation assay, which allows the detection and quantification of small amounts of Poly-Q containing aggregates [5]. This assay is based on the characteristic of Poly-Q fibrils to be insoluble in solutions containing 2% sodium dodecyl sulfate (SDS). Such fibrils are specifically retained on a cellulose-acetate filter, whereas the soluble monomeric species are denatured by SDS and filtered through the membrane. The captured aggregates can then be visualized by immunodetection. For such experiments fusion constructs were commonly used carrying a N-terminal glutathione S-transferase (GST) domain fused to the first exon of htt via a factor Xa cleavage site [5]. In these constructs the globular GST domain prevents aggregation of the monomers. However, aggregation can be induced by release of the GST moiety upon proteolytic factor Xa cleavage. Based on this principle we designed a new construct in which the first exon of the htt gene was fused to a SUMO domain (Saccharomyces cerevisiae Smt3) carrying an N-terminal His6-tag for affinity purification (Figure 6A). Importantly, the His6-SUMO domain keeps the purified Poly-Q containing fusion proteins soluble and allows for the controlled induction of aggregation by treatment with the Ulp1 protease (SUMO protease from S. cerevisiae), which cleaves behind the two conserved C-terminal glycine residues of folded SUMO [21].

Using the His6-SUMO fusion strategy the Poly-Q proteins can be easily affinity purified under denaturing conditions using silica-based Ni2+-matrices (Figure 6B). As Poly-Q proteins often tend to form insoluble inclusions when expressed in E. coli, we purified our constructs under denaturing conditions using 6 M guanidine hydrochloride and refolded them on the affinity matrix by slowly decreasing the concentration of the denaturant. The efficiency of the refolding process was monitored by digestion of the purified and soluble Poly-Q fusion proteins with Ulp1 (Figure 6C). Importantly, this protease does not recognize a specific amino acid sequence but rather the native structure of the SUMO-domain. Under our experimental conditions the purified fusion proteins were almost completely digested by Ulp1 within one minute, indicating that refolding of the constructs was successful. Therefore, we performed the filter retardation assay with the Poly-Q fusion proteins and induced aggregation by Ulp1 mediated cleavage. Figure 6D exemplifies the assay with our construct containing 47 glutamine residues. In the control reaction, where Ulp1 was omitted, no aggregation was observed over six hours beyond marginal background signals. By contrast, significant amounts of SDS-resistant Poly-Q fibrils were formed after one hour upon induction of SUMO cleavage. Accordingly, release of His6-SUMO from the Poly-Q constructs occurs much faster than fibril formation (Figure 6C and 6D). Thus, our His6-SUMO fusion strategy provides a useful tool to study the kinetics of Poly-Q aggregation in vitro.


As mentioned above, several other seamless cloning strategies were described previously, indicating the demand for such methods. However, our approach has several advantages compared to existing methods. Here we used synthetic oligonucleotides to overcome the limitations of PCR-based amplification of repetitive DNA sequences and combined this strategy with seamless cloning using Type IIS restriction enzymes. The usefulness of this method was demonstrated by means of two frequently occurring problems: the cloning of defined stretches of Poly-Q encoding sequences and the generation of polyadenine sequences for in vitro transcription of polyadenylated mRNA. We chose the latter application because we wanted to demonstrate that the technique is suitable for generating fully homogenous nucleotide sequences of defined length. Another important advantage is that our method is directed. This means that in each elongation cycle, the number of nucleotides added to the repetitive sequence can be determined precisely by simply designing the oligonucleotides accordingly. Importantly, a single product is obtained in each step. Other methods often generate a mixture of products due to multiple insertions. Consequently, products of the right length or sequence have to be selected or purified via agarose gels, which makes the procedure more complicated, time-consuming, and error-prone. As we did not observe multiple insertions and only rarely religations of the vector, this is not necessary using our technique. In addition, the protocol presented here is straightforward. One round of elongation can be done within one day, including restriction digestion of the vector, ligation, and transformation. Bacterial growth and sequencing requires approximately two further days. We obtained approximately 100-300 clones per transformation and in most cases it was sufficient to sequence a single clone. Moreover, combining pre-existing constructs can rapidly increase the length of the repetitive sequence (Figure 2 and 3). For example, once the method was established, the pMK1-constructs could be generated within three weeks including sequencing (Figure 3A). We did not encounter a limitation of the number of repeats for our purposes.

In general, our method is cheap because only the synthetic oligonucleotides have to be purchased for each application. We usually ordered 0.02 μmol of commercially synthesized, HPLC-purified oligonucleotides, respectively. The costs depend therefore on the list price per nucleotide, purity, scale, and length of the oligonucleotides. Once annealed and digested, the oligonucleotides can be used for all subsequent elongation rounds and do not have to be re-ordered. All other materials (e.g. restriction enzymes, agarose gels, competent cells, DNA ligase) are standard equipment for molecular cloning. It should be mentioned that in principle the strategy could be used also to assemble non-repetitive sequences. In this case, the DNA fragments could be amplified even by PCR with primers carrying the Type IIS and Type IIP restriction sites in the 5' overhang regions. Finally, it should be also possible to fuse different repetitive sequences in a directed and seamless manner (Figure 5).

In summary, our seamless cloning strategy is PCR-free and allows the directed and efficient generation of highly repetitive DNA sequences of defined lengths by simple standard cloning techniques. Besides their applications in basic research, as exemplified here, repetitive nucleotide sequences become increasingly important in biopolymer technology. As artificial proteins with unique physical properties such as elastomeric polypeptides and synthetic silk fibers often contain multiple amino acid repeats, new strategies are required that improve the cloning of synthetic genes for the production of protein-based polymeric materials in biological systems [22, 23]. The method presented here is both, cheap and fast, and can be easily adapted to produce any desired DNA sequences for a wide range of applications.


Cloning Methods

All cloning steps were performed according to standard protocols [24]. Restriction enzymes were named according to [25]. HPLC-purified synthetic Q-block (5'-ccagtgGGTCTCaCAGCAACAGCAACAGCAGCAGCAACAGCAGcagagacgGAG CTCgatc-3' and 5'-gatcGAGCTCcgtctctgCTGCTGTTGCTGCTGCTGTTGCTGTT GCTGtGAGACCcactgg-3') and poly(A) oligonucleotides A31 (5'-gccgAAGCTT aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaGAGACCcactgg-3' and 5'-ccagtgGGTCTCtttttttttttttttttttttttttt tttttAAGCTTcggc-3') and A32 (5'-ccagtgGGTCTCAaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaAGAGACG acagCTCGAGgccg-3' and 5'-cggcCTCGAGctgtCGTCTCTttttttttttttttttttttttttttttttTGAGACCcactgg-3') were obtained from Thermo Fisher Scientific. For annealing 10 pmol of the antiparallel oligonucleotides were mixed in a final volume of 100 μl. The reaction mixture was heated up to 95°C for 5 min, incubated for 10 min at 55°C, and finally cooled down to room temperature. The annealed oligonucleotides were digested with the respective restriction endonucleases (NEB) according to the manufacturers protocol. Next, the DNA was precipitated by addition of 1/10 volume 3 M sodium acetate pH 5.2, 2.5 volumes of 100% (v/v) ethanol (p.a.) and incubation at -20°C for at least 12 h. The DNA was recovered by centrifugation at 16000 × g for 30 min at 4°C and the DNA pellets were washed with 70% (v/v) ethanol. The DNA pellets were dried at room temperature and subsequently resuspended in 100 μl sterile water. Aliquots of the double-stranded digested oligonucleotides were stored at -20°C and used for the subsequent cloning cycles to elongate the repetitive sequences. Between 0.1 μl and 5 μl of the digested oligonucleotides were ligated with 30-70 ng linearized and dephosphorylated plasmid DNA and T4 DNA ligase (Fermentas) in a 20 μl reaction mixture according the manufacturers instructions. The ligations were carried out at 23°C for 2 h and transformed into 100 μl RbCl2-competent E. coli cells (strain: DH5αZ1). Plasmid preparation, gel extraction, and DNA purification kits were obtained from Qiagen. All DNA fragments originating from plasmid DNA were separated by agarose gel electrophoresis and extracted using the QIAquick Gel Extraction Kit (Qiagen). All plasmids generated during this study were verified by DNA sequencing (GATC, Germany).

To generate the poly(A) constructs the protocol was adapted due to the lower melting temperature of the poly(A) oligonucleotides. The antiparallel oligonucleotides were mixed as described above, heated to 95°C, and annealed by a continuous temperature gradient of 0.1°C per second to 4°C. To avoid any denaturation of poly(A) stretches the annealed oligonucleotides as well as vectors containing poly(A) regions were digested with the Type IIS restriction enzymes overnight at 37°C. Restriction enzymes were removed by phenol/chloroform extraction according to standard protocols and the DNA was recovered by ethanol precipitation as described above. The resulting poly(A) constructs were analyzed by restriction digestions and the lengths of the poly(A) stretches up to 100 base pairs could be determined precisely by sequencing (GATC, Germany).

Expression and purification of Poly-Q proteins

The pLANA-Qn constructs (Figure 6A) were transformed into the E. coli strain BL21(DE3). A two liter LB-culture was inoculated with 20 ml of a stationary overnight culture and grown to an OD600 nm of 0.6-0.8 at 30°C. Expression was induced by the addition of 1 mM isopropylthio β-D-1-galactopyranoside (IPTG). After 5 h cells were harvested by centrifugation and flash-frozen in liquid nitrogen. The cell pellets were resuspended in lysis buffer (30 mM HEPES-KOH pH7.4, 500 mM potassium acetate, 5 mM magnesium chloride 5% (v/v) glycerol, 1 mM ß-mercaptoethanol, 1 mM phenylmethylsulphonyl fluoride (PMSF), 1 × protease inhibitor cocktail (Roche)) and lysed by French press. After centrifugation at 30000 × g for 30 min at 4°C the pellet was resuspended in denaturation buffer 1 (6 M guanidine hydrochloride (GdnHCl), 30 mM HEPES-KOH pH 7.4) and incubated with 2.5 g Ni-IDA silica matrix (Protino; Macherey-Nagel) for 30 min at 4°C while rotating. The matrix was washed twice with 40 ml denaturation buffer 1, twice with 40 ml denaturation buffer 2 (4 M GdnHCl, 30 mM HEPES-KOH pH 7.4), twice with 40 ml denaturation buffer 3 (2 M GdnHCl, 30 mM HEPES-KOH pH 7.4), twice with 40 ml high salt buffer (30 mM HEPES-KOH pH 7.4, 1 M potassium acetate, 5 mM magnesium chloride, 5% (v/v) glycerol, 1 mM ß-mercaptoethanol), and finally four times with with 40 ml low salt buffer (30 mM HEPES-KOH pH 7.4, 50 mM potassium acetate, 5 mM magnesium chloride, 5% (v/v) glycerol, 1 mM ß-mercaptoethanol). Proteins were eluted with four times 10 ml elution buffer (30 mM HEPES-KOH pH 7,4, 50 mM potassium acetate, 5 mM magnesium chloride, 5% (v/v) glycerol, 1 mM ß-mercaptoethanol, 250 mM imidazole-HCl pH 8.0). The purest fractions were dialyzed against 5 l low salt buffer for 3 h. To avoid aggregation the purified proteins were diluted, frozen in liquid nitrogen, and stored at -80°C. The purified proteins were analyzed by SDS-PAGE and Coomassie staining as well as immunoblotting using FLAG antibodies (ANTI-FLAG, Sigma).

Filter retardation assay

The Q-containing fusion proteins were thawed on ice and diluted with low salt buffer (30 mM HEPES-KOH pH 7.4, 50 mM potassium acetate, 5 mM magnesium chloride, 1 mM ß- mercaptoethanol, 10% (v/v) glycerol) to a final concentration of 0.02 mg/ml. Insoluble material was removed by centrifugation at 16000 × g for 5 min at 4°C. The supernatant was divided into two samples, which were directly used for the filter retardation assay. The SUMO protease was added to one of the samples to a final concentration of 10 μg per mg substrate protein. The control reaction contained BSA instead of protease. The reactions were incubated at 22°C in a final volume of 350 μl. At the indicated time points 50 μl samples (1 μg fusion protein) were taken and mixed 1:1 with stop solution (4% (w/v) SDS, 100 mM dithiothreitol) followed by incubation at 95°C for 5 min. Under these conditions monomeric Poly-Q proteins are denatured whereas Poly-Q aggregates remain intact. Subsequently, the samples were applied to a Dot-blot filtration unit and filtered through a cellulose-acetate membrane (0.2 μm pore size) equilibrated with 2% (w/v) SDS in TBS (100 mM Tris-HCl pH 8.0, 150 mM sodium chloride). After washing with 0.1% (w/v) SDS in TBS the membrane was blocked in TBS-T (100 mM Tris-HCl pH 8.0, 150 mM sodium chloride, 0.05% (v/v) Tween-20) containing 5% (w/v) non-fat dried milk. The aggregates were detected by immunoblotting using primary FLAG-antibodies and alkaline-phosphatase coupled to anti-rabbit IgG (1:10000, Sigma).


  1. Scherzinger E, Lurz R, Turmaine M, Mangiarini L, Hollenbach B, Hasenbank R, Bates GP, Davies SW, Lehrach H, Wanker EE: Huntingtin-encoded polyglutamine expansions form amyloid-like protein aggregates in vitro and in vivo. Cell. 1997, 90 (3): 549-558. 10.1016/S0092-8674(00)80514-0.

    Article  CAS  Google Scholar 

  2. Lee JE, Cooper TA: Pathogenic mechanisms of myotonic dystrophy. Biochem Soc Trans. 2009, 37 (Pt 6): 1281-1286.

    Article  CAS  Google Scholar 

  3. Muragaki Y, Mundlos S, Upton J, Olsen BR: Altered growth and branching patterns in synpolydactyly caused by mutations in HOXD13. Science. 1996, 272 (5261): 548-551. 10.1126/science.272.5261.548.

    Article  CAS  Google Scholar 

  4. Borrell-Pages M, Zala D, Humbert S, Saudou F: Huntington's disease: from huntingtin function and dysfunction to therapeutic strategies. Cell Mol Life Sci. 2006, 63 (22): 2642-2660. 10.1007/s00018-006-6242-0.

    Article  CAS  Google Scholar 

  5. Wanker EE, Scherzinger E, Heiser V, Sittler A, Eickhoff H, Lehrach H: Membrane filter assay for detection of amyloid-like polyglutamine-containing protein aggregates. Methods Enzymol. 1999, 309: 375-386.

    Article  CAS  Google Scholar 

  6. Takahashi N, Sasagawa N, Suzuki K, Ishiura S: Synthesis of long trinucleotide repeats in vitro. Neurosci Lett. 1999, 262 (1): 45-48. 10.1016/S0304-3940(99)00031-2.

    Article  CAS  Google Scholar 

  7. Ordway JM, Detloff PJ: In vitro synthesis and cloning of long CAG repeats. Biotechniques. 1996, 21 (4): 609-610, 612.

    CAS  Google Scholar 

  8. Matsuura T, Ashizawa T: Polymerase chain reaction amplification of expanded ATTCT repeat in spinocerebellar ataxia type 10. Ann Neurol. 2002, 51 (2): 271-272. 10.1002/ana.10049.

    Article  Google Scholar 

  9. Laccone F, Maiwald R, Bingemann S: A fast polymerase chain reaction-mediated strategy for introducing repeat expansions into CAG-repeat containing genes. Hum Mutat. 1999, 13 (6): 497-502. 10.1002/(SICI)1098-1004(1999)13:6<497::AID-HUMU10>3.0.CO;2-6.

    Article  CAS  Google Scholar 

  10. Peters MF, Ross CA: Preparation of human cDNas encoding expanded polyglutamine repeats. Neurosci Lett. 1999, 275 (2): 129-132. 10.1016/S0304-3940(99)00758-2.

    Article  CAS  Google Scholar 

  11. Michalik A, Kazantsev A, Van Broeckhoven C: Method to introduce stable, expanded, polyglutamine-encoding CAG/CAA trinucleotide repeats into CAG repeat-containing genes. Biotechniques. 2001, 31 (2): 250-252, 254.

    CAS  Google Scholar 

  12. Dorsman JC, Bremmer-Bout M, Pepers B, van Ommen GJ, Den Dunnen JT: Interruption of perfect CAG repeats by CAA triplets improves the stability of glutamine-encoding repeat sequences. Biotechniques. 2002, 33 (5): 976-978.

    CAS  Google Scholar 

  13. Kim SH, Cai L, Pytlos MJ, Edwards SF, Sinden RR: Generation of long tracts of disease-associated DNA repeats. Biotechniques. 2005, 38 (2): 247-253. 10.2144/05382RR01.

    Article  Google Scholar 

  14. Szybalski W, Kim SC, Hasan N, Podhajska AJ: Class-IIS restriction enzymes--a review. Gene. 1991, 100: 13-26.

    Article  CAS  Google Scholar 

  15. Engler C, Gruetzner R, Kandzia R, Marillonnet S: Golden gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes. PLoS One. 2009, 4 (5): e5553-10.1371/journal.pone.0005553.

    Article  Google Scholar 

  16. Lewis RV, Hinman M, Kothakota S, Fournier MJ: Expression and purification of a spider silk protein: a new strategy for producing repetitive proteins. Protein Expr Purif. 1996, 7 (4): 400-406. 10.1006/prep.1996.0060.

    Article  CAS  Google Scholar 

  17. Petka WA, Harden JL, McGrath KP, Wirtz D, Tirrell DA: Reversible hydrogels from self-assembling artificial proteins. Science. 1998, 281 (5375): 389-392.

    Article  CAS  Google Scholar 

  18. Goeden-Wood NL, Conticello VP, Muller SJ, Keasling JD: Improved assembly of multimeric genes for the biosynthetic production of protein polymers. Biomacromolecules. 2002, 3 (4): 874-879. 10.1021/bm0255342.

    Article  CAS  Google Scholar 

  19. Lu Q: Seamless cloning and gene fusion. Trends Biotechnol. 2005, 23 (4): 199-207. 10.1016/j.tibtech.2005.02.008.

    Article  CAS  Google Scholar 

  20. Scorilas A: Polyadenylate polymerase (PAP) and 3' end pre-mRNA processing: function, assays, and association with disease. Crit Rev Clin Lab Sci. 2002, 39 (3): 193-224. 10.1080/10408360290795510.

    Article  CAS  Google Scholar 

  21. Panavas T, Sanders C, Butt TR: SUMO fusion technology for enhanced protein production in prokaryotic and eukaryotic expression systems. Methods Mol Biol. 2009, 497: 303-317. 10.1007/978-1-59745-566-4_20.

    Article  CAS  Google Scholar 

  22. Prince JT, McGrath KP, DiGirolamo CM, Kaplan DL: Construction, cloning, and expression of synthetic genes encoding spider dragline silk. Biochemistry. 1995, 34 (34): 10879-10885. 10.1021/bi00034a022.

    Article  CAS  Google Scholar 

  23. Betre H, Setton LA, Meyer DE, Chilkoti A: Characterization of a genetically engineered elastin-like polypeptide for cartilaginous tissue repair. Biomacromolecules. 2002, 3 (5): 910-916. 10.1021/bm0255037.

    Article  CAS  Google Scholar 

  24. Sambrook J, Russell DW: Molecular cloning: a laboratory manual. 2001, Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press, 3

    Google Scholar 

  25. Roberts RJ, Belfort M, Bestor T, Bhagwat AS, Bickle TA, Bitinaite J, Blumenthal RM, Degtyarev S, Dryden DT, Dybvig K, et al: A nomenclature for restriction enzymes, DNA methyltransferases, homing endonucleases and their genes. Nucleic Acids Res. 2003, 31 (7): 1805-1812. 10.1093/nar/gkg274.

    Article  CAS  Google Scholar 

Download references


We thank Janine Kirstein-Miles, Rainer Nikolay, and Christina Schlatterer for proofreading the manuscript. This work was supported by fellowships of the Konstanz Research School Chemical Biology to M. K., A. S. and S. P. and a grant of the DFG (DE-783) to E. D.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Elke Deuerling.

Additional information

Authors' contributions

AS cloned the Poly-Q and poly(A) constructs, performed in vitro aggregation assays, and wrote the paper; SP developed the cloning strategy, designed the oligonucleotides, contributed in cloning of the poly(A) constructs, and wrote the paper; MK performed site-directed mutagenesis and contributed in cloning of the poly(A) constructs; ED coordinated the project and assisted in manuscript writing. All authors read and approved the final manuscript.

Annika Scior, Steffen Preissler contributed equally to this work.

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Scior, A., Preissler, S., Koch, M. et al. Directed PCR-free engineering of highly repetitive DNA sequences. BMC Biotechnol 11, 87 (2011).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: