Skip to main content
  • Research article
  • Open access
  • Published:

A modeling study by response surface methodology and artificial neural network on culture parameters optimization for thermostable lipase production from a newly isolated thermophilic Geobacillus sp. strain ARM



Thermostable bacterial lipases occupy a place of prominence among biocatalysts owing to their novel, multifold applications and resistance to high temperature and other operational conditions. The capability of lipases to catalyze a variety of novel reactions in both aqueous and nonaqueous media presents a fascinating field for research, creating interest to isolate novel lipase producers and optimize lipase production. The most important stages in a biological process are modeling and optimization to improve a system and increase the efficiency of the process without increasing the cost.


Different production media were tested for lipase production by a newly isolated thermophilic Geobacillus sp. strain ARM (DSM 21496 = NCIMB 41583). The maximum production was obtained in the presence of peptone and yeast extract as organic nitrogen sources, olive oil as carbon source and lipase production inducer, sodium and calcium as metal ions, and gum arabic as emulsifier and lipase production inducer. The best models for optimization of culture parameters were achieved by multilayer full feedforward incremental back propagation network and modified response surface model using backward elimination, where the optimum condition was: growth temperature (52.3°C), medium volume (50 ml), inoculum size (1%), agitation rate (static condition), incubation period (24 h) and initial pH (5.8). The experimental lipase activity was 0.47 Uml-1 at optimum condition (4.7-fold increase), which compared well to the maximum predicted values by ANN (0.47 Uml-1) and RSM (0.476 Uml-1), whereas R2 and AAD were determined as 0.989 and 0.059% for ANN, and 0.95 and 0.078% for RSM respectively.


Lipase production is the result of a synergistic combination of effective parameters interactions. These parameters are in equilibrium and the change of one parameter can be compensated by changes of other parameters to give the same results. Though both RSM and ANN models provided good quality predictions in this study, yet the ANN showed a clear superiority over RSM for both data fitting and estimation capabilities. On the other hand, ANN has the disadvantage of requiring large amounts of training data in comparison with RSM. This problem was solved by using statistical experimental design, to reduce the number of experiments.


Today, lipases (EC, triacylglycerol acylhydrolases) stand amongst the most important biocatalysts. They carry out novel reactions in both aqueous and nonaqueous media. Lipases are used to hydrolyze ester bonds of a variety of nonpolar substrates at high activity, chemo-, region- and stereo-selectivity. Moreover, they are used to catalyze the reverse reactions (such as esterification [1] and transesterification [2]) in nonpolar solvents [3] and [4].

Among lipases of different sources, microbial thermostable lipases are highly advantageous for biotechnological applications, since they can be produced at low cost and exhibit improved stability [3]. Thus, various thermostable lipase-producing microorganisms have been isolated from diverse habitats [57].

Bacterial lipases are mostly extracellular and their production greatly influenced by nutritional and physico-chemical factors, such as nitrogen and carbon sources, metal ions, initial pH, temperature, medium volume, agitation rate, incubation period, inoculum size and aeration [8] and [9].

The most important stages in a biological process are modeling and optimization to improve a system and increase the efficiency of the process without increasing the cost [10]. The classical optimization method (single variable optimization) is not only time-consuming and tedious but also does not depict the complete effects of the parameters in the process and ignores the combined interactions between physicochemical parameters. This method can also lead to misinterpretation of results [10] and [11]. In contrast, response surface methodology (RSM) is an empirical modeling system for developing, improving, and optimizing of complex processes [12] and [5]. RSM assesses the relationships between the response(s) and the independent variables [13], and defines the effect of the independent variables, alone or in combination, in the processes.

Although RSM has so many advantages, and has successfully been applied to study and optimize the enzymatic processes [14] and [15], and enzyme production from microorganisms [16] and [17], it is hard to say that it is applicable to all optimization and modeling studies [1820]. The past decade has seen a host of data analysis tools based on biological phenomena develop into well-established modeling techniques, such as artificial intelligence and evolutionary computing. Artificial neural networks (ANNs) are now the most popular artificial learning tool in biotechnology, with a wide applications range included optimization of bioprocesses [21] and enzyme production from microorganisms [22].

Indeed an ANN is a massively interconnected network structure consisting of many simple processing elements capable of performing parallel computation for data processing. The fundamental processing element of ANNs (the artificial neuron) simulates the basic functions of biological neurons [18] and [23].

In this work, after finding the best composition of production medium among the best previously published and modified media, the optimization of physical factors for extracellular thermostable lipase production from a newly isolated Geobacillus sp. strain ARM (DSM 21496 = NCIMB 41583) was carried out using RSM and ANN.

Results and discussion

Effect of various production media on lipase production

The production of lipases is mostly inducer-dependent [24] and different media have different stimulation effects on lipase production [9] based on the physiological and biochemical pathways of the bacterium. In order to select the best lipase production medium, the ability of bacterium to produce lipase was tested in eight different liquid media (Figure 1). Lipase activity in medium A1 was significantly higher than other production media, which is composed of peptone and yeast extract as organic nitrogen sources, olive oil as carbon source and lipase production inducer, sodium and calcium as metal ions, and gum arabic as emulsifier and lipase production inducer.

Figure 1
figure 1

Lipase activity in different compositions of production media.

Generally, microorganisms provide high yields of lipase when organic nitrogen sources are used, such as peptone and yeast extract, which have been used for lipase production by various thermophilic Bacillus sp. [25, 26] and [27]. Yeast extract is one of the most important nitrogen sources for high level lipase production by different microorganisms [28]. Besides this role, yeast extract supplies vitamins and trace elements for the growth of bacteria and increases their lipase production [29].

High levels of lipase production were reported from various thermophilic Bacillus sp. in the presence of olive oil as carbon source in the culture medium [6, 27, 30] and [28]. Most published experimental data have shown that lipid carbon sources (especially natural oils) stimulate lipase production [9, 31] and [32], whereas carbon sources that are easily broken down and used by bacteria play an inhibitory role [30, 33] and [34]. Different microorganisms have different requirement for metal ions. Calcium ions play essential roles for many microbial species. They are important in maintaining cell wall rigidity, stabilizing oligomeric proteins and covalently bounding protein peptidoglycan complexes in the outer membrane [35]. Lipase production by various Bacillus sp. was stimulated in the presence of Ca2+ alone [26] and [36] or in combination with other ions such as Mg2+, and Fe2+ [37].

On the other hand, highly branched, helically configurated, non-metabolizable polysaccharides such as gum arabic are able to enhance the lipase production. This might probably be due to the emulsification of culture media containing oil to increase the lipid surface (interfacial area between oil and water) for lipase action, detachment of lipase from the oil surface, and from binding sites at the outer membrane of Gram-negative bacteria [30, 38] and [39].

As a result, A1 production medium was chosen as the medium to be used in the further optimization of lipase production.

Analyzing and modeling

The central composite rotary design (CCRD) along with the observed responses is shown in Table 1.

Table 1 Experimental design used in RSM and ANN studies by using six independent variables showing observed values of lipase activity

Response surface methodology

Fitting the data to various models (linear, two factorial, quadratic and cubic) and their subsequent ANOVA showed that all models were unable to explain the effects of physical factors on the lipase production. To overcome of this problem, we used backward elimination strategy followed by hierarchical terms addition to find the best model. Backward elimination started with all of the predictors in the model. The variable that was least significant (with the largest P-value) was removed and the model was refitted. Each subsequent step removed the least significant variable in the model until all remaining variables had individual P-values smaller than 0.05 [40]. Finally, modified cubic equation (equation 1) and its subsequent ANOVA (Table 2) showed a quite suitable model to optimize the lipase production. Indeed, the modified model was a quadratic model with one eliminated (V.Ag) and one additional (T.Ag.t) terms.

Table 2 ANOVA for joint test

Lipase activity (U ml-1) = 4.41 - 0.06 T - 0.01 V - 0.32 IS - 0.02 Ag - 0.07 t + 0.11 pH - 1.5E-4 T2 + 6.9E-7 V2 - 1.1E-3 IS2 - 1.3E-6 Ag2 + 1.7E-5 t2 - 0.01 pH2 + 9.5E-5 T.V + 3E-3 T.IS + 3.8E-4 T.Ag + 1.3E-3 T.t + 7.2E-4 T.pH + 8.8E-4 V.IS + 3.7E-5 V.t + 5.6E-4 V.pH - 2.3E-5 IS.Ag + 1.1E-3 IS.t + 3.3E-3 IS.pH + 5.5E-4 Ag.t + 1.8E-4 Ag.pH - 1.7E-3 t.pH - 9.7E-6 T.Ag.t

where T is temperature, V is medium volume, IS is inoculum size, Ag is agitation rate, t is incubation period and pH is initial medium pH.

The computed model F-value of 1176.88 implies the model is significant and there is only a 0.01% chance that a "model F-value" this large could occur due to noise. The 'lack of fit F-value" of 0.18 implies the lack of fit is not significant relative to the pure error. There is a 69.32% chance that a "lack of fit F-value" this large could occur due to noise. Non-significant lack of fit shows the model is significant. On the other hand, the pure error is very low, indicating good reproducibility of the data obtained. With very small "model P-value" (< 0.0001) and large "lack of fit P-value" (0.6932) from the analysis of ANOVA and a suitable coefficient of determination (R2 = 0.9998) and adjusted coefficient of determination (R2adjusted = 0.999), the modified cubic polynomial model was highly significant and sufficient to represent the actual relationship between the response and the significant variables (Table 2).

Artificial neural network

Effect of architecture and topology on neural network performance

The selection of an optimal neural-network architecture and topology is of critical importance for a successful application. Several neural-network architectures and topologies were tested for the estimation and prediction of lipase production. Table 3 summarizes the top five ANN models.

Table 3 The effect of different neural network architecture and topologies on coefficient of determination, R2, and absolute average deviation, AAD, in the estimation of lipase production obtained in the training and testing of neural networks

Effect of learning algorithm and transfer function

Training a neural network model essentially means selecting one model from the set of allowed models that minimizes the cost criterion. We have tested different learning algorithms for training neural network models. All accepted models (RMSE < 0.0001, R = 1 and DC = 1) have shown that incremental back propagation (IBP) was the most suitable learning algorithm for prediction of lipase production (Table 3).

The type of transfer function employed affects the neural network's learning rate and is instrumental in its performance. In the present work, among all employed transfer functions for hidden and output layers, accepted models were produced by linear function for output layer and Gaussian function or hyperbolic tangent (Tanh) for hidden layer that between them, the best models have been obtained by Gaussian function.

Optimal number of hidden neurons

Although it is important to select the optimal number of hidden neurons carefully, depending on the type and complexity of the task, this usually has to be done by trial and error. An increase in the number of hidden neurons up to a point usually results in a better learning performance. Too few hidden neurons limit the ability of the neural network to model the process, and too many may allow too much freedom for the weights to adjust and, thus, to result in learning the noise present in the database used in training [41]. We tested the effect of number of hidden neurons on the goodness of fit. The results of testing with the two sample experiments, evaluated statistically on the basis of the coefficient of determination (R2), are shown in Figure 2. In both examined cases, the optimum number of hidden neurons was 16, with an obvious overfitting when too many hidden neurons were used. Then the 6-16-1 topology was chosen as the best topology for estimation of lipase production.

Figure 2
figure 2

Optimal number of hidden neurons. Estimation of lipase production with neural networks of varying number of hidden neurons, tested with two example cases: incremental back propagation multilayer full feedforward (blue diamond) and multilayer normal feedforward incremental back propagation (pink square) with Gaussian transfer functions.

Artificial neural network analysis of lipase production

The best ANN chosen in the present work was a multilayer full feedforward incremental back propagation network with Gaussian transfer function (Table 3, C21) that consisted of a 6-16-1 topology. The optimized values of network for learning rate and momentum were 0.15 and 0.8, respectively. The learning was completed in RMSE = 9.99E-5, R = 1 and DC = 1. In the case of training data set, the coefficient of determination (R2) and absolute average deviation (AAD) were 1 and 0.1%, respectively, whereas for the testing data set, R2 was 1 and AAD was 0.231% (Table 4) and for validating data sets R2 and AAD were 0.989 and 0.059%, respectively (Table 5). Comparison of predicted and experimental values in training, testing and validating data sets, not only revealed capability of ANN in prediction of known data responses (the data that have been used for training) but also showed the ability of generalization for unknown data (the data that have not been used for training) and implying that empirical models derived from ANN can be used to adequately describe the relationship between the input factors and lipase production.

Table 4 Actual and predicted lipase activity by ANN and RSM models along with absolute deviation, R2 and AAD
Table 5 Solution of optimum condition

Comparison of RSM and ANN predicted values

The predicted output values of RSM and ANN are shown in Table 4. Though both models preformed well and offered stable responses, yet the ANN based approach was better in both data fitting and estimation capabilities in comparison to the RSM.

Main effects and interaction between parameters

The optimum level of each variable and the effect of their interactions on lipase production as a function of two variables were studied by plotting three dimensional response surface curves (while keeping the other variables at optimum point).

ANOVA analysis (Table 2) and three dimensional plots (Figure 3) reveal that growth temperature, medium volume, inoculum size and incubation period had significant effects on lipase production. ANOVA analysis shows that although pH was not a significant parameter (P value > 0.05), it had important and significant interactions with other parameters, hence it has been used to develop the model. On the other hand, among the different interactions, interaction between agitation rate and growth volume, did not show significant effect on lipase production (P value > 0.05). Figure 3B depicts that lipase activity effectively increased with a decrease in growth volume but agitation rate did not show significant effect on lipase production. On the other hand, ANOVA analysis and Figure 3D reveal, that agitation is one of the most important parameters for lipase production. As a conclusion, though both agitation rate and growth volume parameters are significant, yet their interaction is not a significant parameter for lipase production. Hence modification of model via the removal of this interaction using backward elimination strategy improved the model (Equation 1).

Figure 3
figure 3

Three dimensional plot showing the effect of: (A) growth temperature, inoculum size; (B) agitation rate, medium volume; (C) initial pH, agitation rate; and (D) initial pH, incubation period, and their mutual effect on the lipase production. Other variables are constant: growth temperature (52.3°C), medium volume (50 ml), inoculum size (1%), agitation rate (static condition), incubation period (24 h) and initial pH (5.8).

Figure 3A represents the three dimensional plot as function of temperature and inoculum size on lipase activity. Maximum lipase activity of 0.47 Uml-1 was obtained at the 52.3°C and 1.0% inoculum size. Further increase or decrease in the temperature, and increase in the inoculum size led to the decrease in the enzyme production. Generally, the optimum temperature for lipase production corresponds with the growth temperature of the respective microorganism [26]. It has been also proven that temperature regulate enzyme synthesis at mRNA transcription and probably translation levels. For extracellular enzymes, temperature influences their secretion, possibly by changing the physical properties of the cell membrane [42]. On the other hand, though higher temperature causes higher reaction rates and higher solubility of substrate and products, yet oxygen solubility is usually decreased.

At a suitable inoculum size, the nutrient and oxygen levels are enough for sufficient growth of bacteria and therefore, enhance the lipase production. If the inoculum size is too small, insufficient number of bacteria will lead to reduced amount of secreted lipase. High inoculum size can result in the lack of oxygen and nutrient depletion in the culture media [42] and [43].

Figures 3B and 3C depict the medium volume-agitation rate and initial pH-agitation rate interactions respectively. These plots reveal that the lipase activity increased with a decrease in culture volume and agitation rate. The maximum lipase activity was obtained at 50 ml culture volume and moderately acidic pH (5.8), under static condition. Similarly, static condition had resulted in comparatively high lipase production for Syncephalastrum racemosum [44]Pseudomonas sp. strain S5 [45] and Pseudomonas aeruginosa [46]. Generally, suitable agitation lead to sufficient supply of dissolved oxygen in the media [47]. Nutrient uptake by bacteria also will be increased [48], but the degree of aeration appears to be critical in some cases since shallow layer (static) cultures (moderate aeration) produced much more lipase than shake cultures (high aeration) [45].

The medium volume may have a great effect on the enzyme production. Although a larger medium volume initially contains more oxygen, nutrients and space for growth of bacteria, the void in the container and subsequently oxygenation of the medium will be decreased. On the other hand, it seems that ratio of surface area to volume (A/V) is important for lipase production where higher ratio cause higher oxygenation and lipase production [49].

The combined effect of initial pH and incubation time on lipase production is shown in Figure 3D. According to the plot, a moderately acidic initial pH (5.8) caused maximum lipase production after 24 h of cultivation. The activity was decreased remarkably as the incubation period changed. pH plays an important role in all the biological processes. The initial pH of the growth medium is important for lipase production [8]. Most bacteria prefer neutral initial pH for the best growth and lipase production, such as thermophilic Bacillus sp. strains L2 and 398 [50] and [51]. Maximum lipase activity at higher initial pH by various thermophilic Bacillus sp. has also been reported [25] and [32]. In contrast, Ertugrul et al. [52] have reported a moderately acidic pH (6.0) as the optimum initial pH for lipase production by Bacillus sp. The molecular electric charges and consequently molecular interactions and functions are directly related to media pH, thus any changes in medium pH affects many biological functions such as enzymatic processes, signaling pathways and transportations of various components across the cytoplasmic membrane and cell wall [53]. Therefore, medium pH is very important in nutrients absorption and growth of bacteria, stimulation of enzyme production via signaling pathways and release of extracellular enzymes (based on proteolytic mechanism of signal peptidases that has been explained by Paetzel et al. [54]).

Lipases are produced throughout bacterial growth, with peak production being obtained by late exponential growth phase [55]. Therefore, the optimum incubation time is based on duration of log phase that is influenced by environmental conditions as well as by characteristics of the organism itself.

Different optimum conditions for maximum lipase production by various thermophilic Bacillus sp. were reported [25, 32, 50] and [51]. Strain differences and synergistic effects with other factors present in the medium might be responsible for differences in the obtained results. Although no conclusive picture has been emerged so far from the large amount of experimental data concerning the physiology of lipase biosynthesis and its regulation, most of published experimental data seem to support the following inference. At the end of log phase, when one of the essential nutrients of the culture medium is used up or some waste product of organism builds up in the medium to an inhibitory level, microorganisms try to solve the problem and continue the growth. One response to this problem is the production of extracellular hydrolytic enzymes such as lipases, proteases and amylases. In other words, limitation of growth can be an inducer for the production of some enzymes. On the other hand, Table 6 shows that lipase production is the result of a synergistic combination of effective parameters interactions. These parameters are in equilibrium. It means that a change of one parameter can be compensated by changes of other parameters to give same results.

Table 6 Effect of different combinations of parameters on lipase production

Finally, Figure 4 shows the importance percentage of effective parameters on the lipase production. Inoculum size of 18.15% is the most important factor on the lipase production, incubation period of 17.01%, agitation rate of 16.78%, growth temperature and medium volume of 16.46% and 16.44% respectively, and pH of 15.19% are subsequent degrees of importance.

Figure 4
figure 4

Importance of effective parameters on lipase production.

Optimization of reaction

The optimal conditions for lipase production were predicted as presented in Table 5 along with their predicted and actual values. Among the various optimum conditions, the highest lipase activity (0.47 Uml-1; 4.7-fold increase) was obtained at following conditions, growth temperature (52.3°C), medium volume (50 ml), inoculum size (1%), agitation rate (static condition), incubation period (24 h) and initial pH (5.8). Attention to R2 and AAD values between actual and estimated responses demonstrated a higher prediction accuracy of ANN compared to RSM.


In this work, different production media were tested for lipase production by a newly isolated thermophilic Geobacillus sp. strain ARM (DSM 21496 = NCIMB 41583). The maximum production was obtained in presence of peptone and yeast extract as organic nitrogen sources, olive oil as carbon source and lipase production inducer, sodium and calcium as metal ions, and gum arabic as emulsifier and lipase production inducer. On the other hand, culture parameters optimization and estimation of lipase production using RSM and ANN methods were successfully carried out. The best models were achieved by multilayer full feedforward incremental back propagation network and modified response surface model using backward elimination, where the optimum condition was: growth temperature (52.3°C), medium volume (50 ml), inoculum size (1%), agitation rate (static condition), incubation period (24 h) and initial pH (5.8). The experimental lipase activity was 0.47 Uml-1 at optimum condition (4.7-fold increase), which compared well to maximum predicted values by ANN (0.47 Uml-1) and RSM (0.476 Uml-1), whereas R2 and AAD were determined as 0.989 and 0.059% for ANN, and 0.95 and 0.078% for RSM respectively. Though the modified response surface model was comparable to ANN to provide good quality predictions for the six independent variables in terms of the lipase production, yet the ANN showed a clear superiority over RSM as a modeling technique for data sets showing nonlinear relationships.

On the other hand, ANN has the disadvantage of requiring large amounts of training data in comparison with RSM [56]. This problem was solved by using statistical experimental design, to reduce the number of experiments. Some of other researchers also have employed this strategy. Manohar and Divakar [21] employed a five variable parametric study for ANN analysis. They used 13 different combinations for training of network. Central composite design (CCD) was used for extracellular protease production (14 different combinations) [22] and modeling the growth of a bacterium (25 different combinations) [60]. Bas and Boyaci [18] employed face-centered design (FCD) and modified face-centered design (MFCD) for ANN study (13 different combinations for training).

As a conclusion lipase production is the result of a synergistic combination of effective parameters interactions. These parameters are in equilibrium and the change of one parameter can be compensated by changes of other parameters to give the same results. In addition, ANN can be a very powerful and flexible tool for modeling of the optimization process.


Bacterial strain

The bacterial strain used in this study was isolated from contaminated soil with oil from Selangor, Malaysia and identified as Geobacillus sp. strain ARM via 16S rDNA analysis [GenBank:EF025325] and deposited in DSMZ, Germany (DSM 21496) and NCIMB, UK (NCIMB 41583). This strain was preserved in sterile 16% (v/v) glycerol in Tryptic Soy Broth (TSB) at -80°C.

Composition of lipase production medium

In order to select the best lipase production medium, eight different media were tested. The composition of the media was (% w/v): M1: peptone (3), yeast extract (1), NaCl (0.5), olive oil (1% v/v) [57]; A1 (modified M1): M1+ CaCl2.2H2O (0.05) + gum arabic (1); A2 (modified M1): A1+ MgSO4.7H2O (0.01), FeCl3.6H2O (0.004); GYP: glucose (2), yeast extract (1), peptone (1), CH3COONa· 3H2O (1), MgSO4·7H2O (0.03), MnSO4 (0.01), KCl (0.05), olive oil (2% v/v) [57]; M3: nutrient broth (0.325), gum arabic (1), CaCl2·2H2O (0.05), Tween 80 (1% v/v), olive oil (1% v/v) [57]; M5: nutrient broth (0.8), triolein (1% v/v) [23]; TYEM: tryptone (0.6), yeast extract (0.2), CaCl2.2H2O (0.02), MgSO4.7H2O (0.01), FeCl3.6H2O (0.04), olive oil (1.5% v/v) [30]; MTYEM (modified TYEM): tryptone (0.6), yeast extract (0.2), CaCl2.2H2O (0.02), MgSO4.7H2O (0.01), FeCl3.6H2O (0.04), gum arabic (1), olive oil (1.5% v/v).

The media were sterilized for 15 min at 121°C after pH adjustment to 7.0. Bacterial inoculum (2% v/v; Ab600 = 0.5 of overnight culture in TSB) was then inoculated into 50 ml production medium and incubated by agitation under 150 rpm, for 48 h at 60°C. The cell free supernatant was obtained by centrifugation at 12,000 g, 4°C for 15 min prior to lipase assay.

Lipase activity assay

Determination of liberated free fatty acid was measured by colorimetric assay [58] using olive oil as substrate. The enzymatic reaction was performed in a water bath shaker for 30 min at 50°C under 200 rpm agitation. One unit of lipase activity was defined as 1.0 μmol of free fatty acid liberated min-1 and reported as Uml-1.

Experimental design

A five-level-six-factor central composite rotary design (CCRD) was employed in this study, requiring 33 experiments [59]. The variables and their levels selected for the lipase production optimization were: growth temperature (45 – 65°C); medium volume (50 – 200 ml); inoculum size (1 – 5%); agitation rate (0 – 200 rpm); incubation period (24 – 72 h) and initial pH (5 – 9). The experimental data [40 points include CCRD design (Table 1) and optimization data (Table 5)] was divided into three sets: training set, testing set and validating set. All tests were performed in triplicate.

Response surface methodology analysis

The CCRD design experimental data was used for model fitting in RSM to find the best polynomial equation. This data was analyzed using Design Expert version 6.06 (Stat Ease Inc. Minneapolis, USA) and then interpreted. Three main analytical steps: analysis of variance (ANOVA), a regression analysis and the plotting of response surface were performed to establish an optimum condition for the lipase production. Then, the predicted values obtained from RSM model, were compared with actual values for testing the model. Finally, the experimental values of predicted optimal conditions (Table 5) were used as validating set and were compared with predicted values.

Artificial neural network analysis

A commercial ANN software, NeuralPower version 2.5 (CPC-X Software) was used throughout the study. Multilayer normal feedforward and multilayer full feedforward neural networks were used to predict the lipase activity. Networks were trained by different learning algorithms (incremental back propagation, IBP; batch back propagation, BBP; quickprob, QP; genetic algorithm, GA; and Levenberg-Marquardt algorithm, LM). The ANN architecture consisted of an input layer with six neurons, an output layer with one neuron, and a hidden layer. To determine the optimal network topology, only one hidden layer was used and the number of neurons in this layer and the transfer functions of hidden and output layers (sigmoid, hyperbolic tangent function, Gaussian, linear, threshold linear and bipolar linear) were iteratively determined by developing several networks. Each ANN was trained until the network root of mean square error (RMSE) was lower than 0.0001, average correlation coefficient (R) and average determination coefficient (DC) were equal to 1. Other ANN parameters were chosen as the default values of the software. In the beginning, weights were initialized with random values and adjusted through a training process in order to minimize network error.

The CCRD design experimental data was divided into training and testing sets. For training, 25 points were used (Tables 1 and 4). One strategy for finding the best model is to summarize the data, it is well established that in ANN modeling, the replicates at center point do not improve the prediction capability of the network because of the similar inputs [10]. Hence, we improved our model by using mean of center points instead of 5 center points (Tables 1 and 4, italic numbers). To test the network, 4 remaining points were used (Tables 1 and 4, bold numbers). On the other hand, experimental values of predicted optimal conditions (Table 5) were used as validating set.

Verification of estimated data

To test the estimation capabilities of the techniques, the estimated responses obtained from RSM and ANNs were compared with the observed responses. The coefficient of determination (R2) and absolute average deviation (AAD) were determined and these values were used together to compare ANNs to each other for finding the best ANN model, and the best ANN model with RSM. The AAD and R2 are calculated by equations 2 and 3, respectively.

AAD = { [ i = 1 p ( | y i , exp y i , cal | / y i , exp ) ] / p } × 100 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeeyqaeKaeeyqaeKaeeiraqKaeyypa0ZaaiWaaeaadaWadaqaamaaqahabaWaaeWaaeaadaabdaqaaiabbMha5naaBaaaleaacqqGPbqAcqGGSaalcyGGLbqzcqGG4baEcqGGWbaCaeqaaOGaeyOeI0IaeeyEaK3aaSbaaSqaaiabbMgaPjabcYcaSiabbogaJjabbggaHjabbYgaSbqabaaakiaawEa7caGLiWoacqGGVaWlcqqG5bqEdaWgaaWcbaGaeeyAaKMaeiilaWIagiyzauMaeiiEaGNaeiiCaahabeaaaOGaayjkaiaawMcaaaWcbaGaeeyAaKMaeyypa0JaeGymaedabaGaeeiCaahaniabggHiLdaakiaawUfacaGLDbaacqGGVaWlcqqGWbaCaiaawUhacaGL9baacqGHxdaTcqaIXaqmcqaIWaamcqaIWaamaaa@6132@

where yi,exp and yi,cal are the experimental and calculated responses, respectively, and p is the number of the experimental run.

R 2 = 1 Σ i = 1 -n ( model prediction i experimental value i ) 2 Σ i = 1 -n ( average experimental value experimental value i ) 2 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeeOuai1aaWbaaSqabeaacqaIYaGmaaGccqGH9aqpcqaIXaqmcqGHsisljuaGdaWcaaqaamaalaaabaGaeu4OdmfabaGaeeyAaKMaeyypa0JaeGymaeJaeeyla0IaeeOBa4gaaiabcIcaOiabb2gaTjabb+gaVjabbsgaKjabbwgaLjabbYgaSjabbccaGiabbchaWjabbkhaYjabbwgaLjabbsgaKjabbMgaPjabbogaJjabbsha0jabbMgaPjabb+gaVjabb6gaUnaaBaaabaGaeeyAaKgabeaacqGHsislcqqGLbqzcqqG4baEcqqGWbaCcqqGLbqzcqqGYbGCcqqGPbqAcqqGTbqBcqqGLbqzcqqGUbGBcqqG0baDcqqGHbqycqqGSbaBcqqGGaaicqqG2bGDcqqGHbqycqqGSbaBcqqG1bqDcqqGLbqzdaWgaaqaaiabbMgaPbqabaGaeiykaKYaaWbaaeqabaGaeGOmaidaaaqaamaalaaabaGaeu4OdmfabaGaeeyAaKMaeyypa0JaeGymaeJaeeyla0IaeeOBa4gaaiabcIcaOiabbggaHjabbAha2jabbwgaLjabbkhaYjabbggaHjabbEgaNjabbwgaLjabbccaGiabbwgaLjabbIha4jabbchaWjabbwgaLjabbkhaYjabbMgaPjabb2gaTjabbwgaLjabb6gaUjabbsha0jabbggaHjabbYgaSjabbccaGiabbAha2jabbggaHjabbYgaSjabbwha1jabbwgaLjabgkHiTiabbwgaLjabbIha4jabbchaWjabbwgaLjabbkhaYjabbMgaPjabb2gaTjabbwgaLjabb6gaUjabbsha0jabbggaHjabbYgaSjabbccaGiabbAha2jabbggaHjabbYgaSjabbwha1jabbwgaLnaaBaaabaGaeeyAaKgabeaacqGGPaqkdaahaaqabeaacqaIYaGmaaaaaaaa@B3B8@

where n is the number of experimental data.

R2 is a measure of the amount of the reduction in the variability of response obtained by using the repressor variables in the model. Because R2 alone is not a measure of the model's accuracy, it is necessary to use absolute average deviation (AAD) analysis, which is a direct method for describing the deviations. Evaluation of R2 and AAD values together would be better to check the accuracy of the model. R2 must be close to 1.0 and the AAD between the predicted and observed data must be as small as possible. The acceptable values of R2 and AAD values mean that the model equation defines the true behavior of the system and it can be used for interpolation in the experimental domain [10].


  1. Gunawan ER, Basri M, Rahman MBA, Salleh AB, Rahman RNZA: Study on response surface methodology (RSM) of lipase-catalyzed synthesis of palm-based wax ester. Enzyme Microb Technol. 2005, 37: 739-744. 10.1016/j.enzmictec.2005.04.010.

    Article  CAS  Google Scholar 

  2. Osório NM, Ferreira-Dias S, Gusmão JH, da Fonseca MMR: Response surface modelling of the production of ω-3 polyunsaturated fatty acids-enriched fats by a commercial immobilized lipase. J Mol Catal B: Enzym. 2001, 11: 677-686. 10.1016/S1381-1177(00)00156-9.

    Article  Google Scholar 

  3. Hasan F, Shah AA, Hameed A: Industrial applications of microbial lipases. Enzyme and Microb Technol. 2006, 39: 235-251. 10.1016/j.enzmictec.2005.10.016.

    Article  CAS  Google Scholar 

  4. Svendsen A: Enzyme Functionality, Design, Engineering, and Screening. 2004, New York: Marcel Dekker, Inc

    Google Scholar 

  5. Sztajer H, Maliszewska I, Wieczorek J: Production of exogenous lipases by bacteria, fungi, and actinomycetes. Enzyme Microb Technol. 1988, 10: 492-497. 10.1016/0141-0229(88)90027-0.

    Article  CAS  Google Scholar 

  6. Eltaweel MA, Rahman RNZRA, Salleh AB, Basri M: An organic solvent-stable lipase from Bacillus sp. strain 42. Ann Microbiol. 2005, 55: 187-192.

    CAS  Google Scholar 

  7. DeFlaun MF, Fredrickson JK, Dong H, Pfiffner SM, Onstott TC, Balkwill DL, Streger SH, Stackebrandt E, Knoessen S, van Heerden E: Isolation and characterization of a Geobacillus thermoleovorans strain from an ultra-deep South African gold mine. Syst Appl Microbiol. 2007, 30: 152-164. 10.1016/j.syapm.2006.04.003.

    Article  CAS  Google Scholar 

  8. Rathi P, Sapna B, Sexena R, Gupta R: A hyperthermostable, alkaline lipase from Pseudomonas sp. with the property of thermal activation. Biotechnol Lett. 2000, 22: 495-498. 10.1023/A:1005604617440.

    Article  CAS  Google Scholar 

  9. He YQ, Tan TW: Use of response surface methodology to optimize culture medium for production of lipase with Candida sp. J Mol Catal B: Enzym. 2006, 43: 99-125. 10.1016/j.molcatb.2006.02.018.

    Article  Google Scholar 

  10. Baş D, Boyaci IH: Modeling and Optimization I: Usability of response surface methodology. J Food Eng. 2007, 78: 836-845. 10.1016/j.jfoodeng.2005.11.024.

    Article  Google Scholar 

  11. Abdel-Fattah YR, Saeed HM, Gohar YM, El-Baz MA: Improved production of Pseudomonas aeruginosa uricase by optimization of process parameters through statistical experimental designs. Process Biochem. 2005, 40: 1707-1714. 10.1016/j.procbio.2004.06.048.

    Article  CAS  Google Scholar 

  12. Manohar B, Divakar S: Applications of surface plots and statistical design to selected lipase-catalyzed esterification reactions. Process Biochem. 2004, 39: 847-853. 10.1016/S0032-9592(03)00192-4.

    Article  CAS  Google Scholar 

  13. Chen CS, Liu KJ, Lou YH, Shieh CJ: Optimization of kojic acid monolaurate synthesis PS From Pseudomonas cepacia. J Sci Food Agric. 2002, 82: 601-605. 10.1002/jsfa.1083.

    Article  CAS  Google Scholar 

  14. Soo EL, Salleh AB, Basri M, Rahman RNZA, Kamaruddin K: Response surface methodological study on lipase-catalyzed synthesis of amino acid surfactants. Process Biochem. 2004, 39: 1511-1518. 10.1016/S0032-9592(03)00279-6.

    Article  CAS  Google Scholar 

  15. Basri M, Rahman RNZRA, Ebrahimpour A, Salleh AB, Gunawan ER, Rahman MBA: Comparison of estimation capabilities of response surface methodology (RSM) with artificial neural network (ANN) in lipase- catalyzed synthesis of palm-based wax ester. BMC Biotechnol. 2007, 7: 53-10.1186/1472-6750-7-53.

    Article  Google Scholar 

  16. Gaur R, Gupta A, Khare SK: Lipase from solvent tolerant Pseudomonas aeruginosa strain: Production optimization by response surface methodology and application. Bioresource Technol. 2008, 99: 4796-4802. 10.1016/j.biortech.2007.09.053.

    Article  Google Scholar 

  17. Teng Y, Xu Y: Culture condition improvement for whole-cell lipase production in submerged fermentation by Rhizopus chinensis using statistical method. Bioresource Technol. 2008, 99: 3900-3907. 10.1016/j.biortech.2007.07.057.

    Article  CAS  Google Scholar 

  18. Baş D, Boyacı İ H: Modeling and optimization II: Comparison of estimation capabilities of response surface methodology with artificial neural networks in a biochemical reaction. J Food Eng. 2007, 78: 846-854. 10.1016/j.jfoodeng.2005.11.025.

    Article  Google Scholar 

  19. Beg QK, Saxena RK, Gupta R: Kinetic constants determination for an alkaline protease from Bacillus mojavensis using response surface methodology. Biotechnol Bioeng. 2002, 78: 289-295. 10.1002/bit.10203.

    Article  CAS  Google Scholar 

  20. Senanayake SPJN, Shahidi F: Lipase-catalyzed incorporation of docosahexaenoic acid (DHA) into borage oil: optimization using response surface methodology. Food Chem. 2002, 77: 115-123. 10.1016/S0308-8146(01)00311-9.

    Article  Google Scholar 

  21. Manohar B, Divakar S: An artificial neural network analysis of porcine pancreas lipase catalysed esterification of anthranilic acid with methanol. Process Biochem. 2005, 40: 3372-3376. 10.1016/j.procbio.2005.03.045.

    Article  CAS  Google Scholar 

  22. Dutta JR, Dutta PK, Banerjee R: Optimization of culture parameters for extracellular protease production from a newly isolated Pseudomonas sp. using response surface and artificial neural network models. Process Biochem. 2004, 39: 2193-2198. 10.1016/j.procbio.2003.11.009.

    Article  CAS  Google Scholar 

  23. Neural networks. []

  24. Lotti M, Monticelli S, Montesinos JL, Brocca S, Valero F, Lafuente J: Physiological control on the expression and secretion of Candida rugosa lipase. Chem Phys Lipids. 1998, 93: 143-148. 10.1016/S0009-3084(98)00038-3.

    Article  CAS  Google Scholar 

  25. Ghanem EH, Al-Sayeed HA, Saleh KM: An alkalophilic thermostable lipase produced by a new isolate of Bacillus alcalophilus. World J Microb Biot. 2000, 16: 459-464. 10.1023/A:1008947620734.

    Article  CAS  Google Scholar 

  26. Sharma R, Soni SK, Vohra RM, Jolly RS, Gupta LK, Gupta JK: Production of extracellular alkaline lipase from a Bacillus sp. RSJ1 and its application in ester hydrolysis. Ind J Microbiol. 2002, 42: 49-54.

    Google Scholar 

  27. Sugihara A, Tani T, Tominaga Y: Purification and characterization of a novel thermostable lipase from Bacillus sp. J Biochem. 1991, 109: 211-215.

    CAS  Google Scholar 

  28. Bora L, Kalita MC: Production and Optimization of Thermostable lipase from a Thermophilic Bacillus sp LBN 4. The Internet J Microbiol. 2007, 4 (1):

  29. Gupta N, Sahai V, Gupta R: Alkaline lipase from a novel strain Burkholderia multivorans : Statistical medium optimization and production in a bioreactor. Process Biochem. 2007, 42: 518-526. 10.1016/j.procbio.2006.10.006.

    Article  CAS  Google Scholar 

  30. Lee DW, Koh YS, Kim KJ, Kim BC, Choi HJ, Kim DS, Suhartono MT, Pyun YR: Isolation and characterization of a thermophilic lipase from Bacillus thermoleovorans ID-1. FEMS Microbiol Lett. 1999, 179: 393-400. 10.1111/j.1574-6968.1999.tb08754.x.

    Article  CAS  Google Scholar 

  31. Kaushik R, Saran S, Isar J, Saxena RK: Statistical optimization of medium components and growth conditions by response surface methodology to enhance lipase production by Aspergillus carneus. J Mol Catal B: Enzym. 2006, 40: 121-126. 10.1016/j.molcatb.2006.02.019.

    Article  CAS  Google Scholar 

  32. Abdel-Fattah YR: Optimization of thermostable lipase production from a thermophilic Geobacillus sp. using Box-Behnken experimental design. Biotechnol Lett. 2002, 24: 1217-1222. 10.1023/A:1016167416712.

    Article  CAS  Google Scholar 

  33. Mates A, Sudakevitz D: Production oflipase by Staphylococcus aureus under various growth conditions. J Appl Bacteriol. 1973, 36: 219-226.

    Article  CAS  Google Scholar 

  34. Gowland P, Kernick M, Sundaram TK: Thermophilic bacterial isolates producing lipase. FEMS Microbiol Lett. 1987, 48: 339-43. 10.1111/j.1574-6968.1987.tb02621.x.

    Article  CAS  Google Scholar 

  35. Macció D, Fabra A, Castro S: Acidity and calcium interaction affect the growth of Bradyrhizobium sp. and attachment to peanut roots. Soil Biol Biochem. 2002, 34: 201-208. 10.1016/S0038-0717(01)00174-2.

    Article  Google Scholar 

  36. Alkan H, Baysal Z, Uyar F, Dogru M: Production of lipase by a newly isolated Bacillus coagulans under solid-state fermentation using melon wastes. Appl Biochem Biotechnol. 2007, 136: 183-92. 10.1007/BF02686016.

    Article  CAS  Google Scholar 

  37. Janssen PH, Monk CR, Morgan HW: A thermophilic, lipolytic Bacillus sp., and continuous assay of its p-nitrophenyl-palmitate esterase activity. FEMS Microbiol Lett. 1994, 120: 195-200. 10.1111/j.1574-6968.1994.tb07030.x.

    Article  CAS  Google Scholar 

  38. Winkler UK, Stuckman M: Glycogen, hyaluronate, and some other polysaccharides greatly enhance the formation of exolipase by Serratia marcescens. J Bacteriol. 1979, 138: 663-679.

    CAS  Google Scholar 

  39. Nishio T, Chikano T, Kamimura M: Purification and some properties of lipase produced by Pseudomonas fragi 22.39B. Agric Biol Chem. 1987, 51: 181-187.

    Article  CAS  Google Scholar 

  40. Simplifying a Multiple Regression Equation. []

  41. Linko S, Zhu YH, Linko P: Applying neural networks as software sensors for enzyme engineering. Trends Biotechnol. 1999, 17: 155-162. 10.1016/S0167-7799(98)01299-2.

    Article  CAS  Google Scholar 

  42. Rahman RNZA, Lee PG, Basri M, Salleh AB: Physical factors affecting the production of organic solvent-tolerant protease by Pseudomonas aeruginosa strain K. Bioresource Technol. 2005, 96: 429-436. 10.1016/j.biortech.2004.06.012.

    Article  CAS  Google Scholar 

  43. Shafee N, Aris SN, Rahman RNZA, Basri M, Salleh AB: Optimization of Environmental and Nutritional Conditions for the Production of Alkaline Protease by a Newly Isolated Bacterium Bacillus cereus Strain 146. J Appl Sci Res. 2005, 1: 1-8.

    Google Scholar 

  44. Chopra AK, Chander H: Factors affecting lipase production in Syncephalastrum racemosum. J Appl Bacteriol. 1983, 54: 163-169.

    Article  CAS  Google Scholar 

  45. Rahman RNZRA, Baharum SN, Salleh AB, Basri M: S5 Lipase: An Organic Solvent Tolerant Enzyme. J Microbiol. 2006, 44: 583-590.

    Google Scholar 

  46. Nadkarni SR: Studies on bacterial lipase. II. Study of the characteristics of partially purified lipase from Pseudomonas aeruginosa. Enzymologia. 1971, 40: 286-301.

    CAS  Google Scholar 

  47. Kumar CG, Takagi H: microbial alkaline proteases: from a bioindustrial viewpoint. Biotechnol Adv. 1999, 17: 561-94. 10.1016/S0734-9750(99)00027-0.

    Article  CAS  Google Scholar 

  48. Beg QK, Sahai V, Gupta R: Statistical media optimization and alkaline protease production from Bacillus mojavensis in bioreactor. Process Biochem. 2003, 39: 203-209. 10.1016/S0032-9592(03)00064-5.

    Article  CAS  Google Scholar 

  49. Woolley P, Petersen SB: Lipases: their Structure, Biochemistry and Application. 1994, UK: Cambridge University Press, 77-94.

    Google Scholar 

  50. Shariff FM, Leow TC, Mukred AD, Salleh AB, Basri M, Rahman RNZRA: Production of L2 lipase by Bacillus sp. strain L2: nutritional and physical factors. J Basic Micro. 2007, 47: 406-412. 10.1002/jobm.200610275.

    Article  CAS  Google Scholar 

  51. Kim EK, Sung MH, Kim HM, Oh TK: Occurrence of thermostable lipase in thermophilic Bacillus sp. strain 398. Biosci Biotechnol Biochem. 1994, 58: 961-962.

    Article  CAS  Google Scholar 

  52. Etruğul S, Dönmez G, Takaç SS: Isolation of lipase producing Bacillus sp. from olive mill wastewater and improving its enzyme activity. J Hazard Mater. 2007, 149 (3): 720-724. 10.1016/j.jhazmat.2007.04.034.

    Article  Google Scholar 

  53. Moon SH, Parulekar SJ: A parametric study of protease production on batch and fed-batch cultures of Bacillus firmus. Biotechnol Bioeng. 1991, 37: 467-483. 10.1002/bit.260370509.

    Article  CAS  Google Scholar 

  54. Paetzel M, Karla A, Strynadka NCJ, Dalbey RE: Signal Peptidases. Chem Rev. 2002, 102: 4549-4579. 10.1021/cr010166y.

    Article  CAS  Google Scholar 

  55. Gupta R, Gupta N, Rathi P: Bacterial lipases: an overview of production, purification and biochemical properties. Appl Microbiol Biochem. 2004, 64: 763-781. 10.1007/s00253-004-1568-8.

    Article  CAS  Google Scholar 

  56. Myers RH, Montgomery DC: Response surface methodology: Process and product optimization using designed experiments. 1995, New York: John Wiley & Sons, Inc

    Google Scholar 

  57. Chin JH, Rahman RNZA, Salleh AB, Basri M: A newly isolated organic solvent tolerant Bacillus sphaericus 205y producing organic solvent-stable lipase. Biochem Eng J. 2003, 15: 147-151. 10.1016/S1369-703X(02)00185-7.

    Article  Google Scholar 

  58. Kwon DY, Rhee JS: A Simple and Rapid Colorimetric Method for Determination of Free Fatty Acids for Lipase Assay. J Am Oil Chem Soc. 1986, 63: 89-92. 10.1007/BF02676129.

    Article  CAS  Google Scholar 

  59. Cohran WG, Cox GM: Experimental Design. 2002, New York: Wiley

    Google Scholar 

  60. García-Gimeno RM, Hervás-Martinez C, Rodriguez-Perez R, Zurera-Cosano G: Modelling the growth of Leuconostoc mesenteroides by artificial neural networks. Int J Food Microbiol. 2005, 105: 317-332. 10.1016/j.ijfoodmicro.2005.04.013.

    Article  Google Scholar 

Download references


The financial support by Universiti Putra Malaysia is gratefully acknowledged.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Abu Bakar Salleh.

Additional information

Authors' contributions

ABS, RNZRAR and MB conceived the idea of the study and experimental design. AE and DCHE performed the experiments described in this paper. AE conceived the RSM and ANN design and analysis, compared the estimation capabilities of the RSM with ANN and drafted the manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Ebrahimpour, A., Rahman, R.N.Z.R.A., Ean Ch'ng, D.H. et al. A modeling study by response surface methodology and artificial neural network on culture parameters optimization for thermostable lipase production from a newly isolated thermophilic Geobacillus sp. strain ARM. BMC Biotechnol 8, 96 (2008).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: