 Methodology article
 Open Access
Soft-sensing modeling based on ABC-MLSSVM inversion for marine low-temperature alkaline protease MP fermentation process
BMC Biotechnology volume 20, Article number: 9 (2020)
Abstract
Background
To address the nonlinearity, multiple parameters, strong coupling and difficulty of direct online measurement of key biological parameters in the marine low-temperature protease fermentation process, a soft-sensing modeling method based on artificial bee colony (ABC) and multiple least squares support vector machine (MLSSVM) inversion is proposed.
Methods
First, based on the material balance and the characteristics of the fermentation process, a dynamic “grey-box” model of the fed-batch fermentation process of marine protease is established. The inverse model is constructed by analyzing the existence of the inverse system and introducing characteristic information of the fermentation process. Then, the inverse model is identified offline using MLSSVM. Meanwhile, to reduce the model error, the ABC algorithm is used to correct the inverse model. Finally, the corrected inverse model is connected in series with the marine alkaline protease MP fermentation process to form a composite pseudo-linear system, so that real-time online prediction of key biological parameters in the fermentation process can be realized.
Results
Taking the alkaline protease MP fermentation process as an example, the simulation results demonstrate that the soft-sensing modeling method can predict the key biological parameters of the fermentation process online in real time, and that it has higher accuracy and generalization ability than the traditional support vector machine soft-sensing method.
Conclusions
The research provides a new method for soft-sensing modeling of key biological parameters in the fermentation process, which can be extended to soft-sensing modeling of general nonlinear systems.
Background
Marine alkaline protease MP is a fermentation enzyme that adapts well to low-temperature environments [1]. In addition to tolerating a wide pH range, MP is highly active at room temperature and becomes less active as the temperature decreases [2, 3]. This emerging industrial enzyme brings new vitality and opportunities to fields such as medicine, food, the enzyme industry and national defense, and greatly broadens the application range of protease [4]. The marine low-temperature alkaline protease MP fermentation process is a dynamic process with high nonlinearity and strong coupling. Like general nonlinear systems, fermentation has time-varying, correlated and uncertain parameters [5]. Therefore, it is very difficult to measure the key biological parameters of the fermentation process directly [6]. Currently, these parameters can only be obtained through regular sampling, offline analysis and lab tests. This approach has poor real-time performance and increases the exposure of the samples to bacteria, hindering the advanced control of fermentation. This calls for a strategy that acquires the state of the key biological parameters of the fermentation process in a timely manner. Without this state information, it is impossible to achieve the dynamic and optimal control of the fermentation process that is needed to boost the biomass density and the productivity of the enzyme.
The inverse system method provides a good solution to the soft-sensing modelling of the fermentation process. This method has a strict theoretical basis and a clear physical meaning. Coupled with learning algorithms (e.g. neural network (NN) and support vector machine (SVM)), the inverse system method can complete the soft-sensing of nonlinear systems that are difficult to model accurately [7, 8]. The inverse system method therefore greatly facilitates the soft-sensing modelling of highly nonlinear systems in engineering practice. However, it faces two problems in the soft-sensing of the marine low-temperature alkaline protease MP fermentation process. On the one hand, the mathematical model of the controlled object and the system parameters of the model must be known before the inverse system method can be used. Neither is easy to obtain from the highly nonlinear and strongly coupled fermentation process. On the other hand, the inverse system of the original system must be established beforehand. In other words, a mathematical expression that describes the inverse system must be derived in advance [8]. To solve these problems, Ref. [9] proposes a neural network inverse system method, which integrates intelligent control with the inverse system method and approximates the inverse system with a neural network. The method was successfully applied to the soft-sensing of erythromycin fermentation, creating a “gray-box” model of the fermentation process. Nevertheless, the “gray-box” model is a simplified model based on the Monod equation, which ignores many important nonlinear factors in the actual process of erythromycin fermentation.
Besides, the neural network, which rests on asymptotic theory, is based on the unrealistic assumption that the number of samples is infinite. In practice the number of samples is often limited; for a strongly coupled, large-lag, complex nonlinear system such as the marine alkaline protease MP fermentation process, it is extremely difficult to obtain accurate sample data. Therefore, developing an inverse soft-sensing method that works with small samples, suits the marine low-temperature alkaline protease MP fermentation process and is easy to implement in engineering has become a key problem to be solved urgently.
Considering the limited number of samples in actual fermentation, this paper attempts to design an easy-to-use inverse system method for soft-sensing modeling of the marine alkaline protease MP fermentation process. Firstly, a “gray-box” dynamic model was established for the marine low-temperature alkaline protease MP fermentation process, according to the material balance and the features of that process. Secondly, the existence of the inverse system was analyzed, and the design of the extended inverse model was introduced. Thirdly, the offline identification of MLSSVM and the online optimization of ABC were combined to develop the extended inverse model based on ABC-MLSSVM, and the extended inverse model was connected in series after the primary fermentation process, serving as the soft-sensing model that predicts key biological parameters online in real time. Fourthly, the effectiveness of the soft-sensing modelling method was verified through a simulation of the MP fermentation in the lab; the simulation results show that the method can effectively predict the key biological parameters of the marine low-temperature alkaline protease MP fermentation process online, and that it outperforms the traditional least squares support vector machine (LSSVM) soft-sensing modeling method in prediction accuracy.
Methods
Dynamic model of fermentation process
Taking the fermentation process as the object, this paper assumes that both the cell concentration and the protease concentration are zero. The fermentation states (the concentration of each substance) were taken as the dependent variables of the differential equations, while time t was taken as the independent variable. Then, the “grey-box” dynamic model could be described by the material balance equations of the various substances (mycelia, restrictive substrate, protease, oxygen, H+, etc.) [10], as shown in Eq. (1):
The grey-box dynamic model is constructed as follows:
Volume change equilibrium equation
During the fermentation process, culture medium should be added at a rational rate to supplement the nutrients and increase the protease yield. The culture medium mainly consists of a carbon source, a nitrogen source, an inorganic salt, a growth factor and an enzyme-producing promoter. Based on preliminary experiments, this paper selects maize flour hydrolysate as the carbon source, soybean meal hydrolysate as the nitrogen source, ammonium sulfate ((NH_{4})_{2}SO_{4}) as the inorganic salt, malt extract as the growth factor and Polysorbate 80 (Tween-80) as the surfactant. The volume (V) of the fermentation broth changes with the addition of the nutrients and the enzyme-producing promoter. The equilibrium equation is as follows:
where: V is the fermentation broth volume, and f_{mh}, f_{s}, f_{a}, f_{m} and f_{tw} are respectively the flow rates of maize flour hydrolysate, soyabean cake meal hydrolysate, (NH_{4})_{2}SO_{4}, malt extract and the enzyme-producing promoter (Tween-80).
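The display equation itself is not reproduced in this text. From the variable definitions above, and under the assumption (not stated in the source) that evaporation and sampling losses are neglected, a volume balance consistent with them would read:

```latex
\frac{\mathrm{d}V}{\mathrm{d}t} = f_{mh} + f_{s} + f_{a} + f_{m} + f_{tw}
```

Each feed stream contributes its flow rate directly to the rate of change of the broth volume.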
Cell growth kinetics equation
Previous studies have found that the growth of the enzyme-producing strain has a maximum concentration, i.e. a saturation point, which can be reached if the initial sugar concentration is at a suitable level. The time to reach the saturation point varies with the initial sugar concentration. The higher the initial sugar concentration, the slower the cells grow; that is, the substrate concentration inhibits cell growth. Considering the deviation of the Monod-equation-based description, the logistic equation was employed to depict the growth law of the cells, taking into account the volume change during fed-batch fermentation. The growth kinetics model of the cells is as follows:
where: μ is the specific growth rate of the cells, X is the cell concentration.
Substrate consumption equation
The substrate consumption of the marine low-temperature alkaline protease MP fermentation was modelled based on the material balance. The effect of the added carbon source (maize flour hydrolysate) was considered in the model, because the carbon source, as the only restrictive substrate, is consumed rapidly in large quantities. The model is expressed as follows:
where: S is the substrate concentration, ν is the specific consumption rate of substrate (h^{−1}), S_{mh} is the feed concentration of maize flour hydrolysate.
Protease synthesis kinetics
The enzyme production of this fermentation is of the partially growth-coupled type (MP is an extracellular enzyme whose synthesis is regulated by many mechanisms). A high substrate concentration clearly inhibits the secretion of protease, whereas maintaining a low carbon source concentration benefits the secretion of protease MP. At the same time, the hydrolysis of alkaline protease MP also has a certain effect on protease MP itself. On this basis, Tween-80, the growth factor and the hydrolysis rate, all of which influence the fermentation process, are introduced into the protease synthesis kinetics. The model is expressed as follows:
where: E is the protease content (%), ρ is the specific growth rate of protease, K is the hydrolysis constant of protease, and K_{m}, K_{tw} are inhibition constants.
Dissolved oxygen concentration (DO) variation model
The MP fermentation is aerobic, i.e. oxygen is involved in cell growth and protease synthesis. DO must be controlled in a suitable range. In fact, the oxygen demand constantly changes through the cell growth, because cell concentration and cell respiration intensity change from stage to stage. Based on the varying oxygen demands, the DO in the fermentation broth must be regulated in real time. According to the aerobic features of MP fermentation and the effect of bioreactor size on DO level in culture medium, the oxygen volumetric mass transfer coefficient of the bioreactor was introduced to the DO concentration equilibrium equation. The equilibrium equation is as follows:
where: C_{L} is the DO concentration (mol/L), K_{La} is oxygen volume mass transfer coefficient in bioreactor (s^{−1}), \( {C}_L^{\ast } \) is saturation concentration of oxygen dissolved in fermentation liquid phase (mol/L), and η is the specific consumption rate of oxygen.
pH dynamic change model
During the fermentation, the enzyme-producing strain favors an alkaline environment, with an optimal pH range of 9.0~10.0. Any change of pH in the fermentation broth has a large impact on the fermentation of MP. Excessively high or low pH values slow down the strain growth and the formation of protease, weakening the enzyme activity. Therefore, the pH of the fermentation broth was kept within the optimal range throughout the fermentation process by regulating the flow of nutrients. The pH equilibrium equation of the fermentation broth is expressed as:
where: [H]^{+} is the hydrogen ion concentration in the fermentation broth (used to characterize the pH of the fermentation broth), f_{mh}, f_{s}, f_{m} and f_{tw} are respectively the flow rates of maize flour hydrolysate, soyabean cake meal hydrolysate, malt extract and Tween-80, S_{mh}, S_{s}, S_{m}, S_{tw} are respectively the liquid concentrations of maize flour hydrolysate, soyabean cake meal hydrolysate, malt extract and Tween-80, and γ is the specific consumption rate of [H]^{+}.
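Because the model state is the hydrogen ion concentration while the operating range is quoted as a pH, the conversion between the two is worth making explicit. The sketch below uses the standard definition pH = −log10([H]^{+}) with [H]^{+} in mol/L; the function names are illustrative and do not come from the paper.

```python
import math


def ph_from_h(h_conc_mol_per_l: float) -> float:
    """Return the pH for a hydrogen ion concentration given in mol/L."""
    if h_conc_mol_per_l <= 0.0:
        raise ValueError("hydrogen ion concentration must be positive")
    return -math.log10(h_conc_mol_per_l)


def h_from_ph(ph: float) -> float:
    """Return the hydrogen ion concentration in mol/L for a given pH."""
    return 10.0 ** (-ph)


def in_optimal_range(h_conc_mol_per_l: float, low: float = 9.0, high: float = 10.0) -> bool:
    """Check whether a concentration lies in the pH 9.0 to 10.0 window quoted in the text."""
    return low <= ph_from_h(h_conc_mol_per_l) <= high
```

With this convention, the optimal pH window of 9.0 to 10.0 corresponds to hydrogen ion concentrations between roughly 1e-10 and 1e-9 mol/L.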
Through the above analysis, the “gray-box” dynamic model of the marine low-temperature alkaline protease MP fermentation process can be expressed as:
where: x = [x_{1}, x_{2}, x_{3}, x_{4}, x_{5}, x_{6}]^{T} = [X, S, E, C_{L}, [H]^{+}, V]^{T} is the state vector, u = [u_{1}, u_{2}, u_{3}, u_{4}, u_{5}]^{T} = [f_{mh}, f_{s}, f_{a}, f_{m}, f_{tw}]^{T} is the input vector, μ(x), ν(x), ρ(x), η(x), γ(x) are analytical functions of the state variables x, and S_{i} (i = 1, 2, ⋯, 9) are all nonzero constants that represent respectively the liquid feeding concentration of maize flour hydrolysate, the hydrolysis constant of protease, the inhibition constant, the gas saturated oxygen concentration, \( C_L^{\ast} K_{La} \), and the liquid feeding concentrations of soyabean cake meal hydrolysate, (NH_{4})_{2}SO_{4} and Tween-80.
Reversibility analysis
The marine low-temperature alkaline protease MP fermentation process is shown in Fig. 1. u_{1}~u_{5} are the input parameters and x_{1}~x_{6} are the six process parameters of the fermentation process. The process parameters x_{4}, x_{5}, x_{6} can be measured directly, while x_{1}, x_{2}, x_{3} are key parameters that are difficult to measure directly online (that is, the variables that need to be predicted).
In order to predict the key biological parameters x_{1}, x_{2}, x_{3}, which cannot be measured directly, a virtual subsystem was assumed to exist in the marine low-temperature alkaline protease MP fermentation process, comprising the three inputs x_{1}, x_{2}, x_{3} that cannot be measured directly, the three directly measurable outputs x_{4}, x_{5}, x_{6} and the five variables u_{1}~u_{5}. This virtual subsystem is regarded as a “virtual sensor” [11] of the marine low-temperature alkaline protease MP fermentation process. The soft-sensing of x_{1}, x_{2}, x_{3} can be realized through the following steps: solve the inverse model of the virtual subsystem; take the model as a dynamic compensator in series with the virtual subsystem, forming a composite system [12]; reproduce the inputs of the “virtual sensor” based on the outputs of the composite system, as shown in Fig. 2.
In order to achieve the soft-sensing of the key biological parameters x_{1}, x_{2}, x_{3}, the reversibility of the “virtual sensor” must be analyzed and the inverse system model must be solved.
Lemma. The necessary and sufficient condition for the system Σ to be reversible in some neighborhood of the point (x_{0}, u_{0}) is that the system satisfies \( \operatorname{rank}\left(\partial \mathbf{Z}_m^{\mathrm{T}}/\partial \hat{\mathbf{x}}^{\mathrm{T}}\right) = r_m = l \), where l is the number of variables that cannot be measured directly.
The reversibility of the “virtual sensor” is analyzed with the Interactor algorithm: the directly measurable variables z = [z_{1}, z_{2}, z_{3}] = [x_{4}, x_{5}, x_{6}] are differentiated with respect to time to obtain the derivatives of all orders \( \dot{z}_i, \ddot{z}_i, \cdots, z_i^{(k_i)} \) (i = 1, 2, 3), and the independent derivative information is then selected to form the vector Z_{m}, as shown in Eq. (9):
where:
According to Eq. (9), \( \partial \ddot{z}_2/\partial x_i = \partial g_1(\mathbf{x})/\partial x_i \), \( i = 1, 2, 3 \).
m = 3 is obtained, and the Jacobian matrix \( J = \partial \left(\ddot{z}_1, \dot{z}_1, \dot{z}_2\right)^{T}/\partial \left(x_1, x_2, x_3\right) \) is further solved:
Applying elementary row transformations to Eq. (11) yields:
where: \( {g}_3\left(\mathbf{x},\mathbf{u}\right)=\left[\left(\frac{\partial \gamma }{\partial {x}_1}{x}_1+\gamma \right)\frac{\partial \eta }{\partial {x}_3}\right]/\frac{\partial \gamma }{\partial {x}_3}\frac{\partial \eta }{\partial {x}_1}{x}_1\eta \);\( {g}_4\left(\mathbf{x},\mathbf{u}\right)=\left({x}_1\frac{\partial \gamma }{\partial {x}_2}\frac{\partial \eta }{\partial {x}_3}\right)/\frac{\partial \gamma }{\partial {x}_3}\frac{\partial \eta }{\partial {x}_2}{x}_2; \)\( {g}_5\left(\mathbf{x},\mathbf{u}\right)=\frac{\partial {g}_1\left(\mathbf{x},\mathbf{u}\right)}{\partial {x}_1} \)\( \left[\left(\frac{\partial v}{\partial {x}_1}{x}_1+\gamma \right)\frac{\partial {g}_1\left(\mathbf{x},\mathbf{u}\right)}{\partial {x}_3}\right]/\frac{\partial v}{\partial {x}_3} \); \( {g}_6\left(\mathbf{x},\mathbf{u}\right)=\frac{\partial {g}_1\left(\mathbf{x},\mathbf{u}\right)}{\partial {x}_2} \)\( \left(\frac{\partial \gamma }{\partial {x}_2}\frac{\partial {g}_1\left(\mathbf{x},\mathbf{u}\right)}{\partial {x}_3}\right)/\frac{\partial \gamma }{\partial {x}_3} \);
If \( \det\left(\tilde{J}\right) = g_7(\mathbf{x},\mathbf{u}) \cdot g_4(\mathbf{x},\mathbf{u}) \cdot \frac{\partial \gamma}{\partial x_3} x_1 \not\equiv 0 \) in the entire real vector space, then \( \operatorname{rank}(\mathbf{J}) = \operatorname{rank}\left(\partial \mathbf{Z}_m^{\mathrm{T}}/\partial \hat{\mathbf{x}}^{\mathrm{T}}\right) = \operatorname{rank}\left(\partial \left(\ddot{z}_1, \dot{z}_1, \dot{z}_2\right)/\partial \left(x_1, x_2, x_3\right)\right) = 3 \), and by the Lemma the system meets the reversibility condition, that is, the system is globally reversible. However, it is quite difficult to guarantee that \( \det\left(\tilde{J}\right) \) is nonzero everywhere in the entire real vector space R.
Considering this and the current operating state of the marine low-temperature alkaline protease MP fermentation process, it is assumed that \( \det\left(\tilde{J}\right) \not\equiv 0 \) holds within a small work area of the fermentation process in the real vector space R, so that the reversibility condition of the “virtual sensor” is satisfied there. Then, an inverse soft-sensing model is constructed based on ABC-MLSSVM. The assumption is verified against actual analysis results.
Suppose the system satisfies the reversibility condition in the work area. Then, the inverse system of the virtual sensor of the marine low-temperature alkaline protease MP fermentation process is established based on the inverse function theorem, using Eqs. (8)~(9):
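The rank condition of the Lemma can also be checked numerically over a set of operating points. The sketch below is only an illustration of that idea: it approximates the Jacobian by central differences and tests for full column rank. The map `toy_map` is a hypothetical stand-in for the derivative vector Z_{m} as a function of (x_{1}, x_{2}, x_{3}); it is not the fermentation model, and all function names are assumptions.

```python
import numpy as np


def numerical_jacobian(fun, x, eps=1e-6):
    """Central-difference Jacobian of fun at x (rows: outputs, columns: inputs)."""
    x = np.asarray(x, dtype=float)
    f0 = np.asarray(fun(x), dtype=float)
    jac = np.zeros((f0.size, x.size))
    for j in range(x.size):
        step = np.zeros_like(x)
        step[j] = eps * max(1.0, abs(x[j]))
        diff = np.asarray(fun(x + step)) - np.asarray(fun(x - step))
        jac[:, j] = diff / (2.0 * step[j])
    return jac


def locally_invertible(fun, points, tol=1e-8):
    """True if the Jacobian has full column rank at every supplied point."""
    for p in points:
        jac = numerical_jacobian(fun, p)
        if np.linalg.matrix_rank(jac, tol=tol) < len(p):
            return False
    return True


def toy_map(x):
    """Hypothetical smooth map of three unmeasured states (illustration only)."""
    x1, x2, x3 = x
    return np.array([x1 * x2, x2 + x3, x1 * x3])


# The determinant of the toy Jacobian is x1 * (x2 + x3), so the map is
# invertible near points with x1 != 0 and x2 + x3 != 0, and singular at x1 = 0.
work_area = [(1.0, 2.0, 3.0), (2.0, 1.0, 1.0)]
print(locally_invertible(toy_map, work_area))
```

Restricting the check to points sampled from the work area mirrors the local assumption made in the text: nothing is claimed about points outside that region.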
The “gray-box” model is obtained through lab research based on the material balance of fed-batch fermentation. The model ignores the influence of many factors and merely approximates the actual kinetics [13]. The model has several constraints: (1) the temperature, fermenter pressure and agitation speed are constant in the fermentation process; (2) the broth and substrate concentrations are not affected by fermentation heat. The inverse system model Eq. (13) of the “virtual sensor” is established based on the gray-box model under the above constraints. The established model therefore fails to reflect the influence of several key factors in the fermentation process, including but not limited to the fermentation temperature, in-tank pressure, agitation speed and air flow rate. As a result, the soft-sensing prediction based on Eq. (13) will have a large error, undermining the subsequent optimization control.
To overcome this defect, four process parameters, namely the fermentation temperature (W_{t}), the in-tank pressure (P_{t}), the agitation speed (S_{a}) and the air flow rate (F_{a}), are added to the soft-sensing model based on Eq. (13). The structure of the extended inverse model for soft-sensing can be described as:
The addition of these key parameters provides the extended inverse model for soft-sensing with more characteristic information about the fermentation process, which greatly improves the adaptability and anti-jamming ability of the model.
Although the inverse soft-sensing model of the marine low-temperature alkaline protease MP fermentation process has been constructed, Eq. (14) shows that the extended inverse model is difficult to solve analytically, despite the possible existence of a solution. The LSSVM offers a way around this problem, thanks to its strong ability to approximate nonlinear functions.
Improved MLSSVM
The traditional LSSVM is designed for multi-input single-output (MISO) systems. It cannot be directly applied to identify multi-input multi-output (MIMO) systems [14], such as the extended inverse model of the marine alkaline protease MP fermentation process. Thus, the MLSSVM was proposed to build the extended inverse model of the marine alkaline protease MP fermentation process (MIMO).
LSSVM was proposed by Suykens, who replaced the inequality constraints of the SVM [15, 16] with equality constraints and thereby converted the convex quadratic programming problem into a linear system of equations; it is often used for model decomposition problems and function prediction. Its modeling principle is as follows:
Given l training samples {(x_{i}, y_{i}) | i = 1, 2, ..., l}, where x_{i} ∈ R^{n} is an input and y_{i} ∈ R is the corresponding output, the optimization problem for regression LSSVM is as follows:
where: w is the weight vector, b is the bias, φ(x_{i}) is a mapping to a high-dimensional space, ξ_{i} is the relaxation factor (error), γ is the regularization parameter.
To transform the single-output optimization problem into a multi-output one, the quadratic loss function of the error (\( \boldsymbol{\xi}_i \boldsymbol{\xi}_i^{\mathrm{T}} \), ξ ∈ R^{1 × n}) is introduced in this paper to replace the squared relaxation factor (\( \xi_i^2 \)) in optimization problem (15):
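The optimization problem itself is not reproduced in this text. For reference, the standard single-output LSSVM regression problem, written with the symbols defined above, has the following form; it is given here as the textbook formulation and may differ in notation from the paper's Eq. (15).

```latex
\min_{\mathbf{w},\, b,\, \xi}\; J(\mathbf{w}, \xi)
  = \frac{1}{2}\,\mathbf{w}^{\mathrm{T}}\mathbf{w}
  + \frac{\gamma}{2}\sum_{i=1}^{l}\xi_i^{2}
\qquad \text{s.t.}\quad
y_i = \mathbf{w}^{\mathrm{T}}\varphi(\mathbf{x}_i) + b + \xi_i,
\quad i = 1, 2, \ldots, l
```

The equality constraints are what turn the dual problem into a set of linear equations rather than a quadratic program.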
where: ξ ∈ R^{1 × n}, n is the number of output variables, φ_{i}(x) = [φ_{i}(x_{1}), ⋯, φ_{i}(x_{l})].
Lagrange function is used to solve the above optimization problems:
where: a_{i} ∈ R^{m × l} is a Lagrange multiplier, m is the number of input vectors.
According to the KKT conditions, the problem is transformed into the following linear equations:
From the above equations, \( {\mathbf{w}}_i={\mathbf{a}}_i{\varphi}_i^{\mathrm{T}}\left(\mathbf{x}\right) \) and ξ_{i} = a_{i} can be easily obtained, and then they can be substituted into the last term of Eq. (18):
Thus, for the above optimization problem (18), the estimation function is written as:
where: K(x_{i}, x) is the kernel function satisfying the Mercer condition. In this paper, the kernel function is the Gaussian radial basis function.
Considering that the matrix \( \left[\begin{array}{cc} 0 & \boldsymbol{\gamma}^{\mathrm{T}} \\ \boldsymbol{\gamma} & K_i\left(\mathbf{x}_i, \mathbf{x}\right) + I \end{array}\right] \) is nonsingular, Eq. (20) can be converted to Eq. (21) by a simple transformation as follows:
Then, MLSSVM approximation is expressed as:
The MLSSVM identification of the extended inverse model for soft-sensing depends heavily on the selection of the kernel function parameter σ and the regularization parameter γ. If σ is too small, the kernel becomes too localized and the training effect is undermined; if σ is too large, there is a high risk of undertraining. If γ is too small, the training error increases and the learning machine has stronger generalization ability; if γ is too large, the training error decreases and the learning machine has weaker generalization ability. Traditionally, these parameters are selected empirically through trial and error, a process that is inaccurate and time-consuming. To ensure the prediction precision of the extended inverse model, this paper adopts the ABC algorithm to optimize and fine-tune the parameter combination (σ, γ).
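To make the roles of σ and γ concrete, the following is a minimal sketch of LSSVM regression with several outputs, using the standard dual linear system and a Gaussian kernel of the assumed form exp(−‖x − x'‖² / (2σ²)). It is not the paper's MLSSVM derivation: for brevity it shares one (γ, σ) pair across all outputs, whereas per-output parameters could be obtained by fitting one column at a time. All function names are illustrative.

```python
import numpy as np


def rbf_kernel(a, b, sigma):
    """Gaussian kernel matrix between rows of a (m, n) and rows of b (l, n)."""
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / (2.0 * sigma ** 2))


def fit_lssvm(x, y, gamma, sigma):
    """Solve the LSSVM dual system; x is (l, n_in), y is (l, n_out).

    Returns alpha with shape (l, n_out) and the bias b with shape (n_out,).
    """
    l = x.shape[0]
    system = np.zeros((l + 1, l + 1))
    system[0, 1:] = 1.0
    system[1:, 0] = 1.0
    # The regularization parameter enters as a ridge term 1/gamma on the diagonal.
    system[1:, 1:] = rbf_kernel(x, x, sigma) + np.eye(l) / gamma
    rhs = np.vstack([np.zeros((1, y.shape[1])), y])
    solution = np.linalg.solve(system, rhs)
    return solution[1:], solution[0]


def predict_lssvm(x_train, alpha, b, sigma, x_new):
    """Evaluate the fitted model at the rows of x_new."""
    return rbf_kernel(x_new, x_train, sigma) @ alpha + b
```

In this form a small σ makes the kernel value between two points one unit apart practically zero, so each training sample only influences its immediate neighborhood, while a large γ shrinks the ridge term 1/γ and pulls the fit closer to the training data.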
ABC optimization algorithm
In recent years, research on intelligent optimization algorithms and their application to model parameter optimization has been very active and has achieved encouraging results [15, 17, 18]. Inspired by the foraging behavior of bees, the ABC is an intelligent optimization algorithm that has been successfully applied to the optimization of model parameters [19, 20]. The algorithm needs no problem-specific information; it only compares the quality of candidate solutions. In the ABC, three types of bees perform local searches, and the best food source is updated iteratively to approach the global optimal solution. The ABC therefore converges fast and resists becoming trapped in local optima, providing an effective way to solve multidimensional engineering problems. Many numerical examples have shown that the ABC has better optimization and convergence performance than differential evolution (DE), the genetic algorithm (GA) and particle swarm optimization (PSO) [21, 22]. For these reasons, this paper selects the ABC algorithm to optimize the two key performance parameters of MLSSVM.
In the ABC, the colony consists of three groups of bees: leading bees, following bees and scout bees. A leading bee whose food source has been abandoned becomes a scout. The leading bees search for high-quality food sources, the following bees watch the dances of the leading bees and choose a source depending on the dances, and the scout bees search for new food sources randomly. The number of leading bees and the number of following bees both equal the number of food sources, S_{N}. Let x_{ij} (i = 1, 2, ⋯, S_{N}, j = 1, 2, ⋯, D) be the locations of the food sources, where D is the number of optimization parameters. After initialization, the leading bees start to search for food sources iteratively. In each iteration, a leading bee remembers the new food source if it has a higher nectar amount than the old one. The following bees choose a food source, go to that source, choose a neighbor and evaluate its nectar amount. Then, abandoned food sources are determined and replaced with the new food sources discovered by scout bees. Finally, the best food source found so far is registered.
The leading bees search for new solutions based on their current location, which can be described as follows:
where: k ∈ {1, 2, ⋯, S_{N}} and j ∈ {1, 2, ⋯, D} are randomly selected, with k ≠ i, and ϕ_{ij} is a random number in [−1, 1].
The selection probability of each individual is calculated as follows:
where: f(x_{i}) is the fitness value of each individual and S_{N} is the number of food sources.
If the update of the solution x_{i} fails, the solution cannot be improved further and needs to be replaced by a new solution generated with the following formula:
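The three update formulas are not reproduced in this text. In the standard ABC formulation, and using the symbols defined above, they take the following forms; the candidate v_{ij}, the bounds x_{j}^{min} and x_{j}^{max}, and the uniform random number rand(0, 1) are notation assumed here rather than taken from the paper.

```latex
% Neighborhood search of a leading (or following) bee
v_{ij} = x_{ij} + \phi_{ij}\,\bigl(x_{ij} - x_{kj}\bigr)

% Probability that a following bee selects food source i
P_i = \frac{f(x_i)}{\sum_{n=1}^{S_N} f(x_n)}

% Random re-initialization of an abandoned source by a scout bee
x_{ij} = x_j^{\min} + \mathrm{rand}(0, 1)\,\bigl(x_j^{\max} - x_j^{\min}\bigr)
```

The first rule perturbs one coordinate relative to a randomly chosen peer, the second biases the following bees toward fitter sources, and the third restores diversity when a source stops improving.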
When the ABC algorithm is used to optimize the MLSSVM parameters, the relevant parameters and the fitness function must be set, including the initialization of the control parameters of the ABC algorithm. The detailed flow chart of ABC-MLSSVM is shown in Fig. 3. The specific parameters are set as follows:
 1)
Initialize the parameters of the ABC algorithm: the number of food sources S_{N} is 20, the maximum number of searches (limit) is 50, and the number of termination cycles MCN is 100.
 2)
The performance parameters (γ, σ) of MLSSVM represent the location of a food source, D is set to 6, and the search range of the MLSSVM parameters is set to [0.01, 1000].
 3)
Set the fitness function of the ABC algorithm. The purpose of optimizing MLSSVM is to reduce the prediction error, so the fitness function is taken as \( F\left(x_i\right) = \frac{1}{MSE_i} \), where MSE_{i} represents the mean square error of the MLSSVM for the ith solution.
Because the ABC algorithm needs only this fitness value and no other problem-specific information, it is used in this paper to optimize the performance parameters (σ, γ) of MLSSVM, in order to obtain a more accurate inverse model.
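The settings listed above can be put together in a compact ABC loop. The sketch below is a generic, simplified implementation and not the authors' code: it minimizes a cost that plays the role of MSE, uses the fitness 1 / MSE for the following bees, and takes S_{N} = 20, limit = 50 and MCN = 100 as defaults. The function `toy_cost` is a hypothetical stand-in for the validation error of an MLSSVM with parameters (γ, σ); the demonstration uses two parameters instead of D = 6 to stay short.

```python
import numpy as np


def abc_minimize(cost, lower, upper, dim, sn=20, limit=50, mcn=100, seed=0):
    """Minimise a non-negative cost over [lower, upper]^dim with a basic ABC loop."""
    rng = np.random.default_rng(seed)
    foods = rng.uniform(lower, upper, size=(sn, dim))   # food source locations
    costs = np.array([cost(f) for f in foods])
    trials = np.zeros(sn, dtype=int)                    # failed updates per source

    def try_neighbour(i):
        # Perturb one random coordinate relative to a random peer k != i.
        k = rng.choice([n for n in range(sn) if n != i])
        j = rng.integers(dim)
        cand = foods[i].copy()
        cand[j] += rng.uniform(-1.0, 1.0) * (foods[i, j] - foods[k, j])
        cand = np.clip(cand, lower, upper)
        c = cost(cand)
        if c < costs[i]:                                # greedy selection
            foods[i], costs[i], trials[i] = cand, c, 0
        else:
            trials[i] += 1

    best = int(np.argmin(costs))
    best_x, best_cost = foods[best].copy(), float(costs[best])
    for _ in range(mcn):
        for i in range(sn):                             # leading bees
            try_neighbour(i)
        fitness = 1.0 / (costs + 1e-12)                 # F = 1 / MSE
        probs = fitness / fitness.sum()
        for i in rng.choice(sn, size=sn, p=probs):      # following bees
            try_neighbour(i)
        best = int(np.argmin(costs))                    # memorise before scouting
        if costs[best] < best_cost:
            best_x, best_cost = foods[best].copy(), float(costs[best])
        worn = int(np.argmax(trials))                   # scout bee
        if trials[worn] > limit:
            foods[worn] = rng.uniform(lower, upper, size=dim)
            costs[worn] = cost(foods[worn])
            trials[worn] = 0
    return best_x, best_cost


def toy_cost(params):
    """Hypothetical error surface with its minimum at gamma = 10, sigma = 1."""
    gamma, sigma = params
    return (np.log10(gamma) - 1.0) ** 2 + np.log10(sigma) ** 2


best_params, best_error = abc_minimize(toy_cost, 0.01, 1000.0, dim=2)
print(best_params, best_error)
```

In an actual application the cost function would train the MLSSVM with the candidate parameters and return its prediction error on held-out data, which makes each evaluation far more expensive than in this toy example.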
Inverse model identification based on ABC-MLSSVM
The order of each input and its derivatives in the ABC-MLSSVM inverse soft-sensing model is determined according to Eq. (14), and φ_{4}, φ_{5}, φ_{6} in Eq. (14) are obtained through MLSSVM offline identification and ABC online optimization. The inverse soft-sensing model developed in this way can realize the soft-sensing of x_{1}, x_{2}, x_{3}; the kernel function of MLSSVM is the Gaussian radial basis function. The identification process of the inverse soft-sensing model is as follows:
 1)
Fermentation data acquisition. Provided that the sampling theorem is satisfied, appropriate excitation signals are applied within the working area of the marine low-temperature alkaline protease MP fermentation process, and the input variable u, the directly measurable parameters {x_{4}, x_{5}, x_{6}} and the process parameters {W_{t}, P_{t}, S_{a}, F_{a}} are collected to obtain the original data sample set {u_{1}, u_{2}, u_{3}, u_{4}, u_{5}, x_{4}, x_{5}, x_{6}, W_{t}, P_{t}, S_{a}, F_{a}}. The variables {x_{1}, x_{2}, x_{3}}, which cannot be measured directly, are obtained by offline assay analysis in the laboratory.
 2)
Data preprocessing. Through suitable processing (such as digital filtering and improving measurement redundancy), bad data caused by working conditions, manual operation or environmental impact can be removed, and the reliability of the sample data can be improved. At the same time, in order to calculate the required derivatives accurately, according to the structure of the extended inverse model determined by Eq. (14), the five-point derivation method is adopted to obtain the derivatives \( \left\{\dot{x}_4, \ddot{x}_4, \dot{x}_5, \dot{\mathbf{u}}\right\} \) of {x_{4}, x_{5}, u}, and an interpolation method is used to process {x_{1}, x_{2}, x_{3}} so that they stay synchronized with the data measured in real time. Finally, the data sample sets {x_{1}, x_{2}, x_{3}} and \( \left\{x_4, x_5, x_6, \dot{x}_4, \ddot{x}_4, \dot{x}_5, \mathbf{u}, \dot{\mathbf{u}}, W_t, P_t, S_a, F_a\right\} \) are obtained. The former, that is, the key biological parameters, is used as the output of the inverse soft-sensing model, and the latter is its input.
 3)
Offline training and online correction. Using the input and output sample data, the MLSSVM is trained offline, the corresponding initial parameters are determined by cross-validation, and the initial extended inverse model is established. Then, according to the deviation between the offline assay values from the actual fermentation process and the output of the inverse soft-sensing model, the ABC algorithm is used to optimize and adjust the performance parameters of the MLSSVM, so that the initial extended inverse model can be corrected online. Figure 4 shows the online correction scheme of the inverse soft-sensing model of the fermentation process.
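The preprocessing in step 2 can be sketched as follows. The code uses the standard five-point central stencils for the first and second derivatives on a uniform time grid and, as an assumption, linear interpolation to bring the sparse offline assay values onto the online sampling grid (the text does not specify which interpolation method is used). Function names are illustrative.

```python
import numpy as np


def five_point_first(y, h):
    """First derivative by the five-point central stencil; edges are left as NaN."""
    y = np.asarray(y, dtype=float)
    d = np.full_like(y, np.nan)
    d[2:-2] = (y[:-4] - 8.0 * y[1:-3] + 8.0 * y[3:-1] - y[4:]) / (12.0 * h)
    return d


def five_point_second(y, h):
    """Second derivative by the five-point central stencil; edges are left as NaN."""
    y = np.asarray(y, dtype=float)
    d = np.full_like(y, np.nan)
    d[2:-2] = (-y[:-4] + 16.0 * y[1:-3] - 30.0 * y[2:-2]
               + 16.0 * y[3:-1] - y[4:]) / (12.0 * h ** 2)
    return d


def resample_offline(t_offline, values, t_online):
    """Linearly interpolate sparse offline assay values onto the online time grid."""
    return np.interp(t_online, t_offline, values)


# Example: a smooth test signal on a 5 min grid (time in hours).
h = 5.0 / 60.0
t = np.arange(0.0, 10.0, h)
x4 = np.exp(-0.1 * t)
x4_dot = five_point_first(x4, h)
x4_ddot = five_point_second(x4, h)
```

The two samples at each end of a record have no complete stencil and are returned as NaN here; in practice they would be dropped or handled with one-sided formulas before training.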
Results
The low-temperature alkaline protease MP fermentation process is taken as the object of experimental verification. The flow chart of the marine low-temperature alkaline protease MP fermentation process is shown in Fig. 5. In order to bring the experiment closer to the production process, the experimental scheme is designed as follows:
 1)
The high-yield low-temperature alkaline protease strain YS-80, isolated from water samples of the Yellow Sea (Huang Hai), China, is selected as the strain; it has a short fermentation period, high protease yield, good enzyme stability, safety and reliability, and no toxicity. It is fermented in a 1 m^{3} fermenter according to the technological requirements of the marine alkaline protease MP fermentation. After the culture medium is added to the fermentation tank and sterilized with high-temperature steam, the tank is cooled, the fermentation strain is inoculated in a certain proportion, and feeding is carried out in appropriate amounts at the right time.
 2)
For each batch, the fermentation period T is set to 90 h and the sampling period t to 5 min. The fermentation temperature is controlled at about 28 °C, the pH at about 9.5, the tank pressure at 0.04 MPa, the stirring speed at 250 r/min, the dissolved oxygen between 45% and 75%, and the ventilation volume at 1000 L/h. The variables {x_{1}, x_{2}, x_{3}}, which cannot be measured directly, are obtained by offline laboratory assay after regular sampling (an appropriate amount of fermentation broth is taken through the sampling port every 4 h). Among them, X is obtained by the cell dry weight method: a certain amount of fermentation broth is centrifuged at 3000 r/min for 5 min in a centrifuge tube, the supernatant is discarded, and the precipitate is washed twice with distilled water, dried at 105 °C to constant weight and weighed. S is measured using an SBA-40A glucose analyzer, and P is determined by an automatic scanning spectrophotometer.
 3)
Only 10 batches of sample data are considered, in order to test how well the ABC-MLSSVM inverse soft-sensing model identifies the process from small samples. To enhance the differences among batches, the initial conditions and the feeding strategy of each nutrient solution are set differently for each batch. The first six batches of fermentation data are used as training samples to train the extended inverse model of the fermentation process offline. The seventh and eighth batches are used to correct the initial extended inverse model online, and the ninth and 10th batches are used to verify the effectiveness and prediction accuracy of the extended inverse model.
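The data handling implied by this scheme can be sketched as follows. The offline assays (every 4 h) must be aligned with the 5 min online sampling grid, and the ten batches are divided into training, correction and verification sets. The sketch assumes piecewise-linear interpolation, which the text does not specify; the assay values and the names `interpolate_linear`, `t_offline` and `x1_offline` are hypothetical.

```python
def interpolate_linear(t_known, y_known, t_query):
    """Piecewise-linear interpolation.

    t_known must be strictly increasing; t_query must be ascending and lie
    inside [t_known[0], t_known[-1]].
    """
    out, j = [], 0
    for t in t_query:
        # Move to the segment [t_known[j], t_known[j + 1]] that contains t.
        while j < len(t_known) - 2 and t > t_known[j + 1]:
            j += 1
        t0, t1 = t_known[j], t_known[j + 1]
        w = (t - t0) / (t1 - t0)
        out.append((1.0 - w) * y_known[j] + w * y_known[j + 1])
    return out


# Hypothetical offline assays of one variable, taken every 4 h.
t_offline = [0.0, 4.0, 8.0, 12.0]      # hours
x1_offline = [0.5, 1.5, 3.5, 6.0]      # made-up values, not from the paper
# Online grid with the 5 min sampling period, expressed in hours.
t_online = [k * 5 / 60 for k in range(12 * 12 + 1)]
x1_online = interpolate_linear(t_offline, x1_offline, t_online)

# Batch roles as described in the text: 6 training, 2 correction, 2 verification.
batches = list(range(1, 11))
train, correct, verify = batches[:6], batches[6:8], batches[8:]
```

With a 90 h batch and a 5 min sampling period, each batch spans 90 × 60 / 5 = 1080 sampling intervals, so most of the synchronized values of {x_{1}, x_{2}, x_{3}} are interpolated rather than measured.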
To test the performance of the ABC-MLSSVM inverse soft-sensing modeling method, it is compared with the traditional LSSVM soft-sensing modeling method, and the relative errors of the prediction results of the two methods are calculated. The initial performance parameters of the MLSSVM are σ^{2} = [1.0, 1.0, 1.0] and γ = [10, 10, 10]; after online optimization by the ABC algorithm, they are γ = [10.1, 6.3, 8.2] and σ^{2} = [0.532, 1.613, 0.479].
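To make the role of the ABC algorithm in tuning (γ, σ^{2}) concrete, the following is a minimal sketch of a basic artificial bee colony search with employed, onlooker and scout phases. It is not the authors' implementation. The cost function `stand_in_cost` is a hypothetical smooth surface standing in for the deviation between the model output and the offline assay values, and the bounds, colony size and `limit` value are illustrative assumptions.

```python
import random


def abc_minimize(cost, bounds, n_sources=10, limit=20, n_cycles=200, seed=0):
    """Basic artificial bee colony search for a non-negative cost function."""
    rng = random.Random(seed)
    dim = len(bounds)

    def random_source():
        return [rng.uniform(lo, hi) for lo, hi in bounds]

    sources = [random_source() for _ in range(n_sources)]
    costs = [cost(s) for s in sources]
    trials = [0] * n_sources
    i0 = min(range(n_sources), key=costs.__getitem__)
    best, best_cost = list(sources[i0]), costs[i0]

    def try_neighbour(i):
        # Perturb one coordinate relative to a randomly chosen other source.
        k = rng.choice([m for m in range(n_sources) if m != i])
        j = rng.randrange(dim)
        cand = list(sources[i])
        cand[j] += rng.uniform(-1.0, 1.0) * (sources[i][j] - sources[k][j])
        lo, hi = bounds[j]
        cand[j] = min(max(cand[j], lo), hi)
        c = cost(cand)
        if c < costs[i]:                      # greedy selection
            sources[i], costs[i], trials[i] = cand, c, 0
        else:
            trials[i] += 1

    for _ in range(n_cycles):
        for i in range(n_sources):            # employed bee phase
            try_neighbour(i)
        fitness = [1.0 / (1.0 + c) for c in costs]
        for _ in range(n_sources):            # onlooker bee phase
            i = rng.choices(range(n_sources), weights=fitness)[0]
            try_neighbour(i)
        i = min(range(n_sources), key=costs.__getitem__)
        if costs[i] < best_cost:              # memorize the best source so far
            best, best_cost = list(sources[i]), costs[i]
        for i in range(n_sources):            # scout bee phase
            if trials[i] > limit:
                sources[i] = random_source()
                costs[i] = cost(sources[i])
                trials[i] = 0
    return best, best_cost


def stand_in_cost(params):
    # Hypothetical error surface with its minimum at gamma = 8.0, sigma2 = 0.5.
    gamma, sigma2 = params
    return (gamma - 8.0) ** 2 / 100.0 + (sigma2 - 0.5) ** 2


bounds = [(0.1, 100.0), (0.01, 10.0)]         # assumed search ranges
best, best_cost = abc_minimize(stand_in_cost, bounds)
```

In the setting described above, the cost would instead be evaluated by rerunning the MLSSVM with the candidate parameters on the correction batches, and one (γ, σ^{2}) pair would be tuned for each of the three outputs.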
Figure 6 compares the soft-sensing results of the key biological parameters for the ninth batch of fermentation, and Fig. 7 shows the corresponding relative error curves (in Figs. 6 and 7, protease content is characterized by relative enzyme activity). Table 1 lists the mean relative error (MRE) of the soft-sensing results of the two methods.
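The error measures used in the comparison can be computed as in the sketch below. It assumes the usual definitions, with MRE as the mean of |predicted − measured| / |measured| and RMSE as the square root of the mean squared deviation; the paper does not restate these formulas here, and the sample values are hypothetical.

```python
import math


def mean_relative_error(predicted, measured):
    """Mean of |p - m| / |m| over all samples (measured values must be non-zero)."""
    return sum(abs(p - m) / abs(m) for p, m in zip(predicted, measured)) / len(measured)


def root_mean_square_error(predicted, measured):
    """Square root of the mean squared deviation."""
    return math.sqrt(sum((p - m) ** 2 for p, m in zip(predicted, measured)) / len(measured))


# Hypothetical offline assay values and soft-sensor predictions.
measured = [2.0, 4.0, 5.0]
predicted = [2.2, 3.8, 5.0]
mre = mean_relative_error(predicted, measured)
rmse = root_mean_square_error(predicted, measured)
```

MRE is scale-free, which makes it convenient for comparing variables with different units, whereas RMSE is expressed in the units of the variable itself.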
Discussion
As can be seen from Figs. 6 and 7 and Table 1, compared with the traditional LSSVM soft-sensing method, the online estimation results of the ABC-MLSSVM inverse soft-sensing method are closer to the offline assay values, especially in the prediction of cell concentration. This supports the reversibility assumption of the “virtual sensor”. During the logarithmic growth period and stable growth period (20 h–60 h) of marine low-temperature alkaline protease MP fermentation, the average RMSE (root-mean-square error) values of cell concentration, substrate concentration and relative enzyme activity are 0.146, 0.127 and 0.185, respectively, when the MLSSVM method is used. When the ABC-MLSSVM inversion method is adopted, the soft-sensing RMSE values of the three results are 0.0645, 0.0538 and 0.0712. This indicates that the ABC-MLSSVM inverse system method is effective and credible, and that it can greatly improve the soft-sensing precision of key biological parameters in the low-temperature alkaline protease MP fermentation process, satisfactorily meeting the expected accuracy requirements.
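As a quick check on the RMSE values quoted above, the relative reduction for each variable follows directly from the reported numbers (no additional data are assumed):

\[ \frac{0.146-0.0645}{0.146}\approx 55.8\%,\qquad \frac{0.127-0.0538}{0.127}\approx 57.6\%,\qquad \frac{0.185-0.0712}{0.185}\approx 61.5\% \]

for cell concentration, substrate concentration and relative enzyme activity, respectively. The RMSE is therefore reduced by more than half for all three parameters.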
Conclusion
To solve the problem that the key biological parameters of marine low-temperature alkaline protease MP cannot be measured directly online during fed-batch fermentation, a soft-sensing modeling method based on ABC-MLSSVM inversion is proposed for the fermentation process by combining the inverse system method with the least squares support vector machine. First, a “grey-box” model of the marine low-temperature alkaline protease MP fermentation process is established based on the material balance. Then, the reversibility of the nonlinear model is analyzed using the inverse system method, and the extended inverse model is constructed by coupling MLSSVM system identification with ABC optimization. Finally, the extended inverse model is connected in series with the original fermentation system to form a composite pseudolinear system, which supports the online prediction of key biological parameters in the fermentation process. The simulation results show the rationality of the system dynamic model and the validity of the ABC-MLSSVM inverse soft-sensing method for predicting the key biological parameters of the marine low-temperature alkaline protease MP fermentation process.
The proposed model offers a feasible theoretical method for the soft-sensing of key biological parameters of the marine alkaline protease MP fermentation process. The model achieves a good identification effect from a small amount of input/output data, which removes the need for an exact kinetic model of the fermentation process. The soft-sensing of key parameters is achieved by connecting the inverse system with the original system to form a composite system. With clear physical meaning, the ABC-MLSSVM inversion soft-sensing method effectively overcomes the bottleneck of the traditional inverse system method, namely the difficulty of obtaining an accurate model, and enables the soft-sensing of general nonlinear reversible systems. The proposed model has a wide scope of application and lays a basis for nonlinear soft-sensing modelling of MIMO systems.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Abbreviations
 ABC:

Artificial bee colony
 CL:

Dissolved oxygen concentration
 E:

Relative enzyme activity
 LSSVM:

Least squares support vector machine
 MLSSVM:

Multiple least squares support vector machine
 S:

Substrate concentration
 V:

Fermentation broth volume
 P:

Protease content
 X:

Cell concentration
References
 1.
Bose A, Chawdhary V, Keharia H, Subramanian RB. Production and characterization of a solventtolerant protease from a novel marine isolate bacillus tequilensis p15. Ann Microbiol. 2014;64(1):343–54.
 2.
Wang YJ, Zhang MP, Su M, Wang YJ, Xie HG, Chen CG. The chemical characteristics of marine lowtemperature alkaline protease (ys80122). Transact Oceanol Limnol. 2004;1:8–15.
 3.
Cui HX, Yang MY, Wang LP, Xian CJ. Identification of a new marine bacterial strain sd8 and optimization of its culture conditions for producing alkaline protease. PLoS One. 2015;10(12):e0146067.
 4.
Zhu MH. Research progress on the medical field of marine proteinases. Shandong Chemical Industry. 2016;45(1):60–3.
 5.
Wan Q, Lu ZH, Gao H. Screening of low temperature alkaline proteaseproducing strain and its enzymeproducing condition. J Microbiol. 2002;22(5):16–8.
 6.
Ding SH, Wang YH, Sun LN. Soft sensor of biological parameters in the marine protease fermentation process. In: Proceedings of the 33rd Chinese control conference; 2014. p. 3620–4.
 7.
Wu RB, Li CW. Constructive inverse system method for general nonlinear systems. Control Theory Appl. 2003;20(3):345–50.
 8.
Wang WC, Dai XZ. An interactor algorithm for invertibility in general nonlinear system. Proc 5th IEEE World Congress Intell Control Automation. 2004;1:59–63.
 9.
Dai X, Wang W, Ding Y, Sun Z. “Assumed inherent sensor” inversion based ann dynamic softsensing method and its application in erythromycin fermentation process. Comput Chem Eng. 2006;30(8):1203–25.
 10.
Shi ZP, Pan F. Analysis,control and detection technology of fermentation process. Beijing: Chemical Industry Press; 2010. p. 41–2.
 11.
Yu S, Liu GH, Mei CL, Cheng JX. VIP optimal neural network inverse system soft sensing method in biofermentation process. J Nanjing Univ Sci Technol. 2015;39(4):447–51.
 12.
Sun YK, Zhang Y, Huang YH, Sun XT. Soft senor of penicillin fermentation based on inverse system. Control Eng China. 2011;18(1):46–50.
 13.
Wang FW, Duan SJ, Tong LJ, Sun XZ. Identification of producing lowtemperature alkaline protease strains from Undaria Pinnatifida and fermentation conditions and enzymatic properties. J Chin Institute Food Sci Technol. 2017;17(2):85–91.
 14.
Li C, Wang SL, Zhang XM. Dynamic soft sensor modeling based on multiple least squares support vector machines. In: Processings of the 7th world congress on intelligent control & automation; 2008. p. 4315–9.
 15.
Wang SE, Han ZZ, Liu FC, Tang YG. Nonlinear system identification using least squares support vector machine tuned by an adaptive particle swarm optimization. Int J Mach Learn Cybern. 2015;6(6):981–92.
 16.
Si L, Wang B, Liu XH, Tan C, Liu Z, Xu J. Identification of shearer cutting patterns using vibration signal based on a least squares support vector machine with an improved fruit fly optimization algorithm. Sensor. 2016;16(1):90.
 17.
Yuan JL, Xie J, Huang M, Fan HM, Feng EM, Xiu ZL. Robust optimal control problem with multiple characteristic time points in the objective for a batch nonlinear timevarying process using parallel global optimization. Optim Eng. 2020. https://doi.org/10.1007/s1108101909472z.
 18.
Wang L, Yuan JL, Wu CZ, Wang XY. Practical algorithm for stochastic optimal control problem about microbial fermentation in batch culture. Opt Lett. 2019;13(3):527–41.
 19.
Karaboga D, Gorkemli B, Ozturk C, Karaboga N. A comprehensive survey: artificial bee colony (ABC) algorithm and applications. Artif Intell Rev. 2014;42(1):21–57.
 20.
Karaboga D, Basturk B. A powerful and efficient algorithm for numerical function optimization: artificial bee colony (ABC) algorithm. J Glob Optim. 2007;39(3):459–71.
 21.
Karaboga D, Akay B. A modified artificial bee colony algorithm for constrained optimization problem. Appl Soft Comput. 2011;11(3):3021–31.
 22.
Suman S, Shankar C. Parametric optimization of some nontraditional machining processes using artificial bee colony algorithm. Eng Appl Artif Intel. 2011;24(6):946–57.
Acknowledgements
We thank the Jiangsu Province superior discipline of agricultural engineering at Jiangsu University and the Yellow Sea Fisheries Research Institute of the Chinese Academy of Fishery Sciences for their technical support of our experiment. We also thank the anonymous reviewers for their helpful comments on the manuscript.
About the authors
Bo Wang obtained his doctoral degree from Jiangsu University in 2010. He is now a professor at Jiangsu University, and his research interests include soft measurement and control of biochemical reaction processes.
Meifang Yu is currently pursuing an M.S. degree in Control Engineering at Jiangsu University. Her main research interests include modeling, control and optimization.
Xianglin Zhu obtained his B.Sc. degree in 1984 from Hu Nan Institute of Engineering. He is now a professor and postgraduate supervisor at Jiangsu University, and his research direction is intelligent control of industrial processes.
Li Zhu obtained her M.S. degree in 1999 from Jiangsu University. She is now a lecturer at Jiangsu University, and her research direction is automatic control of fermentation.
Funding
This work is supported by the Natural Science Fund for Colleges and Universities in Jiangsu Province (Grant 18KJB510004) and the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).
Author information
Contributions
All authors discussed and agreed upon the content, and contributed to the development and revision of the draft manuscripts. BW proposed the idea of soft-sensing modeling based on ABC-MLSSVM inversion for the marine low-temperature alkaline protease MP fed-batch fermentation process. MY and XZ worked on the data analysis and the simulation of the method in the paper. LZ was responsible for correction and inspection.
Corresponding author
Correspondence to Bo Wang.
Ethics declarations
Ethics approval and consent to participate
This study does not involve any human or animal testing.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Wang, B., Yu, M., Zhu, X. et al. Soft-sensing modeling based on ABC-MLSSVM inversion for marine low-temperature alkaline protease MP fermentation process. BMC Biotechnol 20, 9 (2020). https://doi.org/10.1186/s128960200603x
Keywords
 Marine alkaline protease MP
 Material balance
 Inverse system
 Support vector machine
 Soft-sensing