CN105654203A

CN105654203A - Cucumber whole-course photosynthetic rate predicting model based on support vector machine, and establishing method

Info

Publication number: CN105654203A
Application number: CN201511027646.XA
Authority: CN
Inventors: 张海辉; 王智永; 胡瑾; 陶彦蓉; 辛萍萍; 张斯威; 张珍
Original assignee: Northwest A&F University
Current assignee: Northwest A&F University
Priority date: 2015-12-31
Filing date: 2015-12-31
Publication date: 2016-06-08

Abstract

The invention relates to a cucumber whole-course photosynthetic rate predicting model based on a support vector machine. Photosynthetic rate test data of cucumber seedlings is obtained by utilizing a multi-factor nested experiment; model training is carried out by adopting a LM training method; a cucumber whole-course photosynthetic rate model fused with multiple growing periods is established; the cucumber whole-course photosynthetic rate model is subjected to comparison verification with a single-growing-period photosynthetic rate model and a whole-growing-period cucumber photosynthetic rate model respectively by adopting an abnormal check manner; the result shows that the whole-course photosynthetic rate model, which is established by adding the growing period as the one-dimensional input quantity, can effectively cross a local flat region, have obvious superiority, and satisfy training requirements that the error is less than 0.0001; the determination coefficient of a predicted value and a practically measured value of the model is 0.993; the error is less than 6.253%; both the training effect and the model-fitting degree of the model are superior to the mixed-growing period model; the precision of the model is similar to that of the single-growing-period photosynthetic rate model; and thus, the model disclosed by the invention can provide theoretical basis and technical support for light environment adjustment and control of facility crops.

Description

A prediction model and establishment method of whole-range photosynthetic rate of cucumber based on support vector machine

技术领域 technical field

本发明属于智能农业技术领域，特别涉及一种基于支持向量机的黄瓜全程光合速率预测模型及建立方法。 The invention belongs to the technical field of intelligent agriculture, and in particular relates to a support vector machine-based prediction model and establishment method of the whole-process photosynthetic rate of cucumber.

背景技术 Background technique

黄瓜是我国栽培的主要蔬菜之一，黄瓜的品质和产量与其进行光合作用的能力密不可分。光合速率与叶绿素含量、温度、CO₂浓度、光照强度、相对湿度等多个因子有着显著关系。其中，叶绿体是绿色植物进行光合作用的基础细胞器，而叶绿素是叶绿体的基本组成物质，在植物光合作用中至关重要，其含量是植物光合作用能力、营养状况和生长态势的重要指示因子，温度影响作物体内Rubisco活化酶的活性、气孔导度，CO₂浓度直接影响作物进行暗反应速率和干物质的积累，光照强度是光合作用的直接动力与推动力量，相对湿度影响叶片气孔导度等，且各因子间存在相互影响。因此，综合考虑多个因子影响、建立多因子耦合的全程光合速率预测模型，对优化黄瓜光环境具有重要作用。 Cucumber is one of the main vegetables cultivated in my country. The quality and yield of cucumber are inseparable from its photosynthetic ability. Photosynthetic rate has a significant relationship with multiple factors such as chlorophyll content, temperature, _CO2 concentration, light intensity, and relative humidity. Among them, chloroplast is the basic organelle for photosynthesis of green plants, and chlorophyll is the basic component of chloroplast, which is very important in plant photosynthesis, and its content is an important indicator of plant photosynthesis ability, nutritional status and growth status. Temperature It affects the activity of Rubisco activating enzyme and stomatal conductance in crops. The concentration of CO ₂ directly affects the dark reaction rate of crops and the accumulation of dry matter. The intensity of light is the direct driving force and driving force of photosynthesis. Relative humidity affects the stomatal conductance of leaves, etc. And there are interactions among the factors. Therefore, comprehensive consideration of the influence of multiple factors and the establishment of a multi-factor coupling full-range photosynthetic rate prediction model play an important role in optimizing the cucumber light environment.

国外的很多相关学者和研究机构，已经通过对植物光合作用的深入研究，并以此为基础建立了大量的有关温室内部环境控制模型及植物生长的模型。70年代，Charles-Edwards提出植物光合作用生理模型是建立叶片光合模型的初始模型之一，其中生理模型包括包括光呼吸作用、暗呼吸作用及氧效应。在此相关研究基础上，相关学者建立了多种光合模型，其中包括直角双曲模型、非直角双曲模型和指数关系等模型，但其模型参数不易获取，给模型的应用带来一定困难。基于上述生理模型，ZipiaoYe等提出了基于电子输运的光合速率模型等提出了光合速率稳态模型，J.Z.XU等进行了不同氮素下光合作用模型的研究，Y.LANG等利用不同的叶片，进行了光合速率模型的相关研究和探索。所以，选取性能良好的植物光合模型以及确定较为准确的相关参数对于调控植物生长的环境和和作物培育显得更有必要，但是目前国内在日光温室作物净光合的模型还需不断改进。 Many foreign scholars and research institutions have conducted in-depth research on plant photosynthesis, and based on this, they have established a large number of models for controlling the internal environment of greenhouses and for plant growth. In the 1970s, Charles-Edwards proposed that the physiological model of plant photosynthesis is one of the initial models for establishing leaf photosynthetic models, and the physiological model includes photorespiration, dark respiration and oxygen effects. On the basis of this related research, relevant scholars have established a variety of photosynthetic models, including Cartesian hyperbolic models, non-Cartesian hyperbolic models, and exponential relationship models, but the model parameters are not easy to obtain, which brings certain difficulties to the application of the model. Based on the above physiological model, ZipiaoYe et al. proposed a photosynthetic rate model based on electron transport, etc. proposed a steady-state model of photosynthetic rate, J.Z.XU et al. conducted a study on the photosynthetic model under different nitrogen, Y.LANG et al. used different leaves, Carried out related research and exploration of photosynthetic rate model. Therefore, it is more necessary to select a plant photosynthesis model with good performance and determine more accurate related parameters to regulate the environment of plant growth and crop cultivation. However, the current model of net photosynthesis of crops in solar greenhouses in China still needs continuous improvement.

近年来，众多学者在建立光合速率模型方面已进行了相关研究，上述研究均考虑了不同环境因子之间的关联，但存在拟合度较低、拟合公式复杂、误差较大等不足。而神经网络具有非线性映射和自适应学习能力等优点，适宜拟合和预测非线性复杂系统模型，因此基于神经网络的光合速率建模已成为研究热点。近期出现了基于Hopfield网络光合速率模型、基于BP神经网络的温室番茄叶片气孔导度模型、基于WSN的番茄开花期单叶净光合作用速率预测模型，上述研究从不同角度将神经网络应用于光合速率建模，但均未考虑不同生长期对作物的影响，尚未建立起全程的黄瓜光合速率预测模型，且存在训练过程较慢，训练误差相差较大的不足。 In recent years, many scholars have carried out relevant research on the establishment of photosynthetic rate models. The above studies have considered the relationship between different environmental factors, but there are deficiencies such as low fitting degree, complex fitting formula, and large error. The neural network has the advantages of nonlinear mapping and adaptive learning capabilities, and is suitable for fitting and predicting nonlinear complex system models. Therefore, the modeling of photosynthetic rate based on neural networks has become a research hotspot. Recently, the photosynthetic rate model based on Hopfield network, the stomatal conductance model of greenhouse tomato leaves based on BP neural network, and the net photosynthetic rate prediction model of single leaf in tomato flowering period based on WSN have appeared. The above studies have applied neural networks to photosynthetic rate from different angles. Modeling, but the impact of different growth periods on crops has not been considered, and a full-range cucumber photosynthetic rate prediction model has not been established, and there are shortcomings in the slow training process and large differences in training errors.

支持向量机是基于统计学习理论框架下的一种新的通用机器学习方法。可以解决样本空间中的高度非线性分类和回归等问题,是一种处理非线性分类和非线性回归的有效方法。光合速率预测包含大量的非线性因素。传统的统计预测方法,在建立预测模型时要求因子与预测对象间存在显著的线性相关,且因子间要求线性相关达最小。而引起光合速率变化诸多因子的复杂性和非线性,决定了预测因子与预测对象间的非线性相关,因而传统的模型预测方法很难解决本质是非线性关系的预测问题，支持向量机为植物光合速率预测提供了一种可行的有效途径。 Support vector machine is a new general machine learning method based on the framework of statistical learning theory. It can solve the highly nonlinear classification and regression problems in the sample space, and is an effective method to deal with nonlinear classification and nonlinear regression. Photosynthetic rate prediction contains a large number of non-linear factors. Traditional statistical forecasting methods require significant linear correlations between factors and forecasted objects when establishing forecasting models, and the minimum linear correlation between factors is required. The complexity and nonlinearity of many factors that cause photosynthetic rate changes determine the nonlinear correlation between predictors and predictors. Therefore, traditional model prediction methods are difficult to solve the prediction problem that is essentially a nonlinear relationship. Rate prediction provides a feasible and efficient way.

发明内容 Contents of the invention

为了克服上述现有技术的缺点，本发明的目的在于提供一种基于支持向量机的黄瓜全程光合速率预测模型，设计多因子嵌套试验，将数据归一化处理后采用支持向量机建模，分别建立仅针对幼苗期、开花结果期、全生长期和加入生长期作为一维输入因子加以区分的黄瓜光合速率预测模型，通过对比验证建立起全程的黄瓜光合速率预测模型，为设施农业的光环境调控建立基础。 In order to overcome the above-mentioned shortcoming of the prior art, the object of the present invention is to provide a kind of cucumber full-range photosynthetic rate prediction model based on support vector machine, design multi-factor nested experiment, adopt support vector machine modeling after data normalization process, A cucumber photosynthetic rate prediction model that only distinguishes the seedling stage, flowering and fruiting stage, full growth stage, and added growth stage as one-dimensional input factors was established respectively, and a whole-process cucumber photosynthetic rate prediction model was established through comparison and verification. Establish the basis for environmental regulation.

为了实现上述目的，本发明采用的技术方案是： In order to achieve the above object, the technical scheme adopted in the present invention is:

一种基于支持向量机的黄瓜全程光合速率预测模型，模型公式为：其中，输出f(x)表示预测的光合速率，输入信号X'＝(X₁'X₂'…X₅')^T，X₁'、X₂'、X₃'、X₄'、X₅'分别为温度、CO₂浓度、光照强度、相对湿度和叶绿素含量，w为权值向量，b为偏置，Φ(x)为非线性映射函数，l为训练集样本对{(x_i,y_i),i＝1,2,3,…,l}中的训练样本个数，x_i是第i训练样本的输入列向量，y_i为对应的输出值，y_i∈R，是i×d维实数域，d是列向量维数，a_i和a_i ^*为下式的最优解： A prediction model of whole-process photosynthetic rate of cucumber based on support vector machine, the model formula is: Among them, the output f(x) represents the predicted photosynthetic rate, the input signal X'=(X ₁ 'X ₂ '…X ₅ ') ^T , X ₁ ', X ₂ ', X ₃ ', X ₄ ', X ₅ 'respectively temperature, CO ₂ concentration, light intensity, relative humidity and chlorophyll content, w is the weight vector, b is the bias, Φ(x) is the nonlinear mapping function, l is the training set sample pair {( _xi , y _i ), the number of training samples in i=1,2,3,...,l}, x _i is the input column vector of the i-th training sample, y _i is the corresponding output value, y _i ∈ R, is the i×d-dimensional real number field, d is the column vector dimension, a _i and a _i ^* are the optimal solutions of the following formula:

为核函数，σ为宽度参数，ε为中止训练误差，c为惩罚因子。 is the kernel function, σ is the width parameter, ε is the abort training error, and c is the penalty factor.

所述基于支持向量机的黄瓜全程光合速率预测模型的建立方法，包括如下步骤： The establishment method of the whole-range photosynthetic rate prediction model of cucumber based on support vector machine comprises the steps:

步骤1，获取实验数据，过程如下： Step 1, obtain experimental data, the process is as follows:

采用营养钵育苗，待黄瓜幼苗长成二叶一心，选择长势均匀、茎横径在0.6～0.8cm之间、株高10cm以内的黄瓜幼苗进行试验，选取茁壮的黄瓜幼苗150株作为试验样本，待黄瓜处于开花结果期，选取开花节位距龙头约50厘米的植株150株作为开花结果期的试验样本； Use a nutrient pot to grow seedlings. When the cucumber seedlings grow into two leaves and one heart, select cucumber seedlings with uniform growth, a stem diameter between 0.6 and 0.8 cm, and a plant height of less than 10 cm for the test. Select 150 strong cucumber seedlings as test samples. Treat that the cucumber is in the flowering and fruiting period, choose 150 plants whose flowering nodes are about 50 centimeters away from the faucet as the test sample of the flowering and fruiting period;

测定净光合速率，过程中利用控温模块设定16、20、24、28、32℃共5个温度梯度；利用CO₂注入模块设定二氧化碳体积比为300、600、900、1200、1500μL/L共5个梯度；利用LED光源模块获得0、20、50、100、200、300、500、700、1000、1200、1500μmol/(m²·s)共11个光子通量密度梯度，以嵌套方式共进行275组试验，每组试验在随机选取的3株植株上做重复测试，试验中记录叶室相对湿度，并记录被测叶片叶绿素含量，从而形成以叶绿素含量、温度、CO₂浓度、光照强度、相对湿度为输入，净光合速率为输出的1650组实验数据，即幼苗期825组，开花结果期825组； Measure the net photosynthetic rate. During the process, use the temperature control module to set ₅ temperature gradients of 16, 20, 24, 28, and 32°C; There are 5 gradients in L; a total of 11 photon flux density gradients of 0, 20, 50, 100, 200, 300, 500, 700, 1000, 1200, 1500μmol/(m ² s) were obtained by using the LED light source module, to embedding A total of 275 groups of experiments were carried out in this way, and each group of experiments was repeatedly tested on 3 randomly selected plants. During the experiment, the relative humidity of the leaf chamber was recorded, and the chlorophyll content of the measured leaves was recorded, so as to form a model based on chlorophyll content, temperature, and _CO2 concentration. , light intensity and relative humidity are input, and net photosynthetic rate is output 1650 sets of experimental data, namely 825 sets at the seedling stage and 825 sets at the flowering and fruiting stage;

步骤2，建立模型 Step 2, build a model

在步骤1得到的实验数据中随机选取训练样本及测试样本，然后选取支持向量机核函数类型为径向基函数，用网格法确定支持向量机惩罚因子c和径向基核函数参数g的寻优区间，利用遗传算法对参数c和g基于所述寻优区间进一步寻优直至达到最大迭代次数，输出参数c和g的寻优结果，构建基于支持向量机的黄瓜全程光合速率预测模型。 Randomly select training samples and test samples from the experimental data obtained in step 1, then select the support vector machine kernel function type as radial basis function, and use the grid method to determine the support vector machine penalty factor c and radial basis kernel function parameter g In the optimization interval, the genetic algorithm is used to further optimize the parameters c and g based on the optimization interval until the maximum number of iterations is reached, the optimization results of the parameters c and g are output, and the whole-process photosynthetic rate prediction model of cucumber based on the support vector machine is constructed.

所述步骤2中， In the step 2,

将实验数据归一化处理到[0.2,0.9]区间得到全组样本数据，按4:1比例随机选取样本编号构建训练样本集、测试样本集。 The experimental data was normalized to the [0.2,0.9] interval to obtain the whole set of sample data, and the sample number was randomly selected according to the ratio of 4:1 to construct the training sample set and test sample set.

所述步骤2中， In the step 2,

惩罚因子c和径向基核函数参数g的寻优区间表示为lgc∈[2,3]，lgg∈[0,1]。 The optimization interval of penalty factor c and radial basis kernel function parameter g is expressed as lgc∈[2,3], lgg∈[0,1].

所述步骤2中， In the step 2,

采用遗传算法对惩罚因子c和径向基核函数参数g的寻优区间进一步寻优，过程为：设定遗传算法参数，个体数目NIND＝100，最大遗传代数MAXGEN＝50，代沟GGAP＝0.95，交叉概率和变异概率为0.8、0.25，c和g作为变量，分别用10位二进制表示；定义目标函数为模型均方误差值MSE；遗传算子操作：选择、交叉、变异；计算子代目标值函数并重插子代到父代得到新种群；达到最大迭代次数输出c和g的寻优结果。 The genetic algorithm is used to further optimize the optimization interval of the penalty factor c and the radial basis kernel function parameter g. The process is as follows: set the genetic algorithm parameters, the number of individuals NIND = 100, the maximum genetic algebra MAXGEN = 50, the generation gap GGAP = 0.95, The crossover probability and mutation probability are 0.8, 0.25, c and g are used as variables, respectively represented by 10-bit binary; define the objective function as the model mean square error value MSE; genetic operator operations: selection, crossover, mutation; calculate the target value of offspring function and reinsert the child to the parent to obtain a new population; reach the maximum number of iterations and output the optimization results of c and g.

所述步骤2中， In the step 2,

构建的模型为：其中，输入信号X'＝(X₁'X₂'…X₅')^T，X₁'、X₂'、X₃'、X₄'、X₅'分别为温度、CO₂浓度、光照强度、相对湿度和叶绿素含量，w为权值向量，b为偏置，Φ(x)为非线性映射函数。 The built model is: Among them, the input signal X'=(X ₁ 'X ₂ '…X ₅ ') ^T , X ₁ ', X ₂ ', X ₃ ', X ₄ ', X ₅ ' are temperature, CO ₂ concentration, light intensity , relative humidity and chlorophyll content, w is a weight vector, b is a bias, and Φ(x) is a nonlinear mapping function.

在获得基于支持向量机的黄瓜全程光合速率预测模型后，输入样本数据进行光合效率预测，如果达到精度要求，则输出预测结果，否则返回到遗传算法步骤，对c和g基于所述寻优区间进一步寻优。 After obtaining the full-range photosynthetic rate prediction model of cucumber based on support vector machine, input sample data for photosynthetic efficiency prediction, if the accuracy requirement is met, then output the prediction result, otherwise return to the genetic algorithm step, based on the optimization interval for c and g further optimization.

与现有技术相比，本发明的有益效果是： Compared with prior art, the beneficial effect of the present invention is:

1)基于支持向量机算法构建模型。支持向量机拓扑结构由支持向量决定，并应用核函数将非线性变换映射到高维空间内的线性变换，既保证模型具有良好的泛化能力，又解决了“维数灾难”的问题。避免了传统神经网络需要试凑确定网络结构的问题。另外，模型验证结果表明，在针对小样本的回归拟合问题，SVM模型的预测效果要明显优于BP神经网络，且有效避免BP神经网络局部最小问题。 1) Build a model based on the support vector machine algorithm. The topology of the support vector machine is determined by the support vectors, and the kernel function is used to map the nonlinear transformation to the linear transformation in the high-dimensional space, which not only ensures that the model has good generalization ability, but also solves the problem of "curse of dimensionality". It avoids the problem that the traditional neural network needs to try and determine the network structure. In addition, the model verification results show that the prediction effect of the SVM model is significantly better than that of the BP neural network in the regression fitting problem of small samples, and the local minimum problem of the BP neural network can be effectively avoided.

2)网格法与遗传算法结合的模型参数寻优。传统支持向量机参数确定多采用经验法或K-fold交叉验证法，由于样本数据类型不同，参数值选取存在较大差异，因此往往会造成模型误差较大或寻优计算时间过长，本文通过网格法预训练确定模型参数寻优范围，进而运用遗传算法对上述范围进行寻优。经验证，随机选取相同样本数据下，通过本方法构建模型实测值与模拟值均方误差为0.000409，决定系数为0.9915，而采用K-fold交叉验证法构建模型实测值与模拟值均方误差为0.000860，决定系数为0.9769，预测精度得到有效提高。 2) Optimization of model parameters by combining grid method and genetic algorithm. The traditional support vector machine parameter determination mostly adopts the empirical method or K-fold cross-validation method. Due to the different types of sample data, there are large differences in the selection of parameter values, which often leads to large model errors or long optimization calculation time. The grid method pre-training determines the optimization range of the model parameters, and then uses the genetic algorithm to optimize the above range. After verification, under the same sample data randomly selected, the mean square error between the measured value and the simulated value of the model constructed by this method is 0.000409, and the coefficient of determination is 0.9915, while the mean square error between the measured value and the simulated value of the model constructed by the K-fold cross-validation method is 0.000860, the coefficient of determination is 0.9769, and the prediction accuracy is effectively improved.

3)模型融合时间序列，综合考虑不同生长阶段植物光合速率变化，通过加入一维生长期变量，有效区分了黄瓜幼苗期与开花结果期的光合速率值在不同条件下的差异，构建全程光合速率预测模型。预测效果显示，预测值与实测值决定系数为0.993，最大误差为3.571，相对误差小于6.253％，预测精度高于混合生长期的预测模型。 3) The model integrates time series, comprehensively considers the changes in plant photosynthetic rate in different growth stages, and effectively distinguishes the difference in photosynthetic rate values between the seedling stage and the flowering and fruiting stage of cucumber under different conditions by adding one-dimensional growth stage variables, and constructs the whole photosynthetic rate. predictive model. The prediction results show that the coefficient of determination between the predicted value and the measured value is 0.993, the maximum error is 3.571, and the relative error is less than 6.253%. The prediction accuracy is higher than that of the prediction model in the mixed growth period.

本发明提出的全程光合速率预测模型可为黄瓜光环境调控提供理论依据，可扩展应用于不同作物的光合优化调控模型建立，以提高温室作物的光合能力。 The full-range photosynthetic rate prediction model proposed by the invention can provide a theoretical basis for the regulation and control of the cucumber light environment, and can be expanded and applied to the establishment of photosynthetic optimization regulation models for different crops to improve the photosynthetic capacity of greenhouse crops.

附图说明 Description of drawings

图1是本发明基于支持向量机算法流程图。 Fig. 1 is a flow chart of the present invention based on a support vector machine algorithm.

图2是本发明不同核函数类型模型预测效果图，其中图2(a)表示线性核函数预测值，图2(b)表示多项式核函数预测值，图2(c)表示径向基核函数预测值，图2(d)表示sigmoid核函数预测值。 Fig. 2 is the prediction effect figure of different kernel function type models of the present invention, wherein Fig. 2 (a) represents linear kernel function prediction value, Fig. 2 (b) represents polynomial kernel function prediction value, Fig. 2 (c) represents radial basis kernel function Predicted value, Figure 2(d) shows the predicted value of sigmoid kernel function.

图3是本发明基于网格法确定模型参数寻优范围，其中图3(a)表示训练集样本均方误差随c、g的变化，其中图3(b)表示测试集样本均方误差随c、g的变化。 Fig. 3 is that the present invention determines model parameter optimization range based on the grid method, wherein Fig. 3 (a) represents the change of the mean square error of the training set sample with c, g, and wherein Fig. 3 (b) represents the variation of the mean square error of the test set sample with c, g changes.

图4是本发明基于遗传算法优化模型参数流程图。 Fig. 4 is a flowchart of optimizing model parameters based on genetic algorithm in the present invention.

图5是本发明基于遗传算法优化模型参数进化过程图。 Fig. 5 is a diagram of the evolution process of the parameters of the optimization model based on the genetic algorithm in the present invention.

图6是本发明模型验证中光合速率实测值与模拟值之间的相关性示意图。其中，图6(a)是植物幼苗期光合速率模型验证中光合速率实测值与模拟值之间的相关性示意图；图6(b)是植物开花结果期光合速率模型验证中光合速率实测值与模拟值之间的相关性示意图；图6(c)是混合生长全期植物光合速率模型验证中光合速率实测值与模拟值之间的相关性示意图；图6(d)是融合生长全期光合速率模型验证中光合速率实测值与模拟值之间的相关性示意图。 Fig. 6 is a schematic diagram of the correlation between the measured value and the simulated value of the photosynthetic rate in the model verification of the present invention. Among them, Figure 6 (a) is a schematic diagram of the correlation between the measured photosynthetic rate and the simulated value in the photosynthetic rate model verification of plant seedling stage; Figure 6 (b) is the photosynthetic rate measured value and Schematic diagram of the correlation between the simulated values; Figure 6(c) is a schematic diagram of the correlation between the measured photosynthetic rate and the simulated value in the model verification of the photosynthetic rate of plants in the mixed growth period; Figure 6(d) is the photosynthetic rate in the mixed growth period Schematic diagram of the correlation between the measured and simulated values of photosynthetic rate in rate model validation.

具体实施方式 detailed description

下面结合附图和实施例详细说明本发明的实施方式。 The implementation of the present invention will be described in detail below in conjunction with the drawings and examples.

本发明一种基于神经网络的黄瓜全程光合速率预测模型的建立过程如下： The establishment process of a kind of whole-range photosynthetic rate prediction model based on neural network of cucumber of the present invention is as follows:

1、试验材料与方法 1. Test materials and methods

本试验于2014年4月至7月在西北农林科技大学科研温室进行。供试黄瓜品种为“长春密刺”，在培养皿中将已经浸胀的种子进行催芽，待要萌发时进行低温处理，在50孔(540mm280mm50mm)穴盘内采用营养钵育苗。育苗基质为农业育苗专用基质。幼苗培育期间，保持水肥充足，待黄瓜幼苗长成二叶一心，选择长势均匀、茎横径在0.6～0.8cm之间、株高10cm以内的黄瓜幼苗进行试验。选取茁壮的黄瓜幼苗150株作为试验样本。试验期间，进行正常的田间栽培管理，不喷施任何农药和激素，待黄瓜处于开花结果期，选取开花节位距龙头约50厘米的植株150株作为开花结果期的试验样本。 This experiment was carried out in the scientific research greenhouse of Northwest A&F University from April to July 2014. The cucumber variety to be tested is "Changchun Mici". The swollen seeds were germinated in a petri dish, and treated at low temperature when they were about to germinate. Seedlings were raised in a 50-hole (540mm280mm50mm) plug tray with a nutrient pot. The seedling raising substrate is a special substrate for agricultural seedling raising. During seedling cultivation, keep sufficient water and fertilizer, wait for cucumber seedlings to grow into two leaves and one heart, choose cucumber seedlings with uniform growth, stem diameter between 0.6 ~ 0.8 cm, and plant height within 10 cm for the test. 150 strong cucumber seedlings were selected as test samples. During the test period, normal field cultivation management was carried out without spraying any pesticides and hormones. When the cucumbers were in the flowering and fruiting stage, 150 plants whose flowering nodes were about 50 cm away from the faucet were selected as test samples for the flowering and fruiting stage.

采用美国LI-COR公司生产的Li-6400XT型便携式光合仪测定净光合速率，在试验过程中采用光合仪选配的多个子模块按需控制叶片周围的温度、CO2浓度、光照强度等参数。其中，利用控温模块设定16、20、24、28、32℃共5个温度梯度；利用CO₂注入模块设定二氧化碳体积比为300、600、900、1200、1500μL/L共5个梯度；利用LED光源模块获得0、20、50、100、200、300、500、700、1000、1200、1500μmol/(m²·s)共11个光子通量密度(Photofluxdensity,PFD)梯度，以嵌套方式共进行275组试验，每组试验在随机选取的3株植株上做重复测试，试验中记录叶室相对湿度，并采用日本柯尼卡公司的SPAD-502Plus型叶绿素仪记录被测叶片叶绿素含量，从而形成以叶绿素含量、温度、CO₂浓度、光照强度、相对湿度为输入，净光合速率为输出的1650组实验数据，即幼苗期825组，开花结果期825组。 The Li-6400XT portable photosynthetic instrument produced by American LI-COR Company was used to measure the net photosynthetic rate. During the test, multiple sub-modules selected by the photosynthetic instrument were used to control the temperature, CO2 concentration, light intensity and other parameters around the leaves as needed. Among them, use the temperature control module to set a total of 5 temperature gradients of 16, 20, 24, 28, and 32°C; use the _CO injection module to set a total of 5 gradients for the volume ratio of carbon dioxide to 300, 600, 900, 1200, and 1500 μL/L ;A total of 11 photon flux density (Photofluxdensity, PFD) gradients of 0, 20, 50, 100, 200, 300, 500, 700, 1000, 1200, 1500 μmol/(m ² s) were obtained by using the LED light source module, and embedded A total of 275 sets of tests were carried out in total, and each set of tests was repeated on 3 randomly selected plants. During the test, the relative humidity of the leaf chamber was recorded, and the chlorophyll of the measured leaves was recorded by the SPAD-502Plus chlorophyll meter of Japan Konica Company. Content, thus forming 1650 sets of experimental data with chlorophyll content, temperature, _CO2 concentration, light intensity, and relative humidity as input, and net photosynthetic rate as output, that is, 825 sets at the seedling stage and 825 sets at the flowering and fruiting stage.

2、模型建立方法 2. Model building method

2.1支持向量机回归基本理论 2.1 Basic theory of support vector machine regression

SVM起初是用于解决线性可分情况下两类样本的分类问题(SVC)，其核心思想是找到一个最优分类超平面，使两类样本的分类间隔最大化。SVM应用于回归拟合分析时，其基本思想不再是寻找一个最优分类平面使得两类样本分开，而是寻找一个最优分类面使得所有训练样本离该最优分类平面的误差最小。 SVM was originally used to solve the classification problem of two types of samples in the case of linear separability (SVC). Its core idea is to find an optimal classification hyperplane to maximize the classification interval of two types of samples. When SVM is applied to regression fitting analysis, its basic idea is no longer to find an optimal classification plane to separate the two types of samples, but to find an optimal classification plane to minimize the error of all training samples from the optimal classification plane.

不失一般性，设含有l个训练样本的训练集样本对为{(x_i,y_i),i＝1,2,3,…,l}，其中，是第i个训练样本的输入列向量，d是列向量维数，是i×d维实数域，y_i∈R，为对应的输出值。 Without loss of generality, let the training set sample pair containing l training samples be {( _xi ,y _i ),i=1,2,3,…,l}, where, is the input column vector of the i-th training sample, d is the dimension of the column vector, is the i×d-dimensional real number field, y _i ∈ R is the corresponding output value.

设在高维特征空间中建立的线性回归函数为 Let the linear regression function established in the high-dimensional feature space be

f(x)＝wΦ(x)+b(2-1) f(x)=wΦ(x)+b(2-1)

其中x为输入向量，w为权值向量，b为偏置，Φ(x)为非线性映射函数。 Where x is the input vector, w is the weight vector, b is the bias, and Φ(x) is the nonlinear mapping function.

定义ε线性不敏感损失函数 Define the ε linear insensitive loss function

其中f(x)为回归函数返回的预测值；y为对应的真实值，即表示若预测值与真实值之间的差别小于等于ε，则损失等于0。 Among them, f(x) is the predicted value returned by the regression function; y is the corresponding real value, which means that if the difference between the predicted value and the real value is less than or equal to ε, the loss is equal to 0.

对于线性回归问题，问题变为寻求一个最优超平面，使得在给定精度(ε≥0)条件下可以无误差地拟合y，即所有样本点到最优超平面的距离都不大于ε；考虑到允许误差的情况，可引入松弛变量ξ_i，ξ_i ^*≥0其寻优问题转化相应的二次规划问题为： For linear regression problems, the problem becomes to find an optimal hyperplane so that y can be fitted without error under the condition of a given accuracy (ε≥0), that is, the distance from all sample points to the optimal hyperplane is not greater than ε ; Considering the allowable error, the slack variable ξ _i can be introduced, and the optimization problem of ξ _i ^* ≥ 0 can be transformed into the corresponding quadratic programming problem as:

其中c为惩罚因子，c越大表示对训练误差大于ε的样本惩罚越大，ε规定了回归函数的误差要求，ε越小表示回归函数的误差越小。 Among them, c is the penalty factor. The larger c means the greater the penalty for the sample whose training error is greater than ε. ε specifies the error requirement of the regression function. The smaller ε means the smaller the error of the regression function.

2-3式求解问题可转化为对偶问题： The solution problem of formula 2-3 can be transformed into a dual problem:

其中a_i，a_i ^*为(2-4)式最优解。 Among them, a _i and a _i ^* are the optimal solutions of formula (2-4).

求解上述问题求解可得最优回归函数为： Solving the above problems to solve the optimal regression function can be obtained as:

其中K(x_i,x_j)＝Φ(x_i)Φ(x_j)为核函数。 Where K( _xi ,x _j )=Φ( _xi )Φ(x _j ) is the kernel function.

2.2支持向量机核函数选取 2.2 Support vector machine kernel function selection

SVM的核函数是将非线性可分样本转换到线性可分的特征空间，不同的核函数选择会使SVM模型产生的分类超平面不同，产生较大的差异性，对SVM模型的性能有直接的影响。因此，核函数的选取是影响SVM预测模型的关键。常用的核函数有：线性函数、多项式函数、径向基函数、sigmoid函数等。本fa.m分别针对以上四种核函数进行模型预训练，并选用均方误差和决定系数两项作为评价指标。模型预测结果如表1，图2所示。 The kernel function of SVM is to convert nonlinearly separable samples into linearly separable feature space. Different selection of kernel function will make the classification hyperplane produced by SVM model different, resulting in large differences, which have a direct impact on the performance of SVM model. Impact. Therefore, the selection of kernel function is the key to affect the SVM prediction model. Commonly used kernel functions are: linear function, polynomial function, radial basis function, sigmoid function, etc. This fa.m performs model pre-training for the above four kernel functions, and uses the mean square error and the coefficient of determination as evaluation indicators. The prediction results of the model are shown in Table 1 and Figure 2.

表1核函数对模型性能的影响 Table 1 Effect of kernel function on model performance

Table1Theimpactontheperformanceofthemodelkernel Table 1 The impact on the performance of the model kernel

由表1可知，径向基函数和多项式核函数相比于线性函数和sigmoid函数具有较小的经验误差，但多项式函数泛化能力较差，且与径向基核函数相比，多项式函数需要确定的参数跟多，由图2可知，径向基核函数实测值与模拟值拟合效果最好。因此，综合考虑训练效果和复杂程度，选取径向基函数作为本文SVM光合速率预测模型的核函数。 It can be seen from Table 1 that the radial basis function and the polynomial kernel function have smaller empirical errors than the linear function and the sigmoid function, but the generalization ability of the polynomial function is poor, and compared with the radial basis function, the polynomial function requires There are many parameters to be determined, and it can be seen from Figure 2 that the radial basis kernel function has the best fitting effect between the measured value and the simulated value. Therefore, considering the training effect and complexity, the radial basis function is selected as the kernel function of the SVM photosynthetic rate prediction model in this paper.

2.3支持向量机核参数选取 2.3 Support vector machine kernel parameter selection

SVM作为一种基于统计学理论的预测模型，采用其进行预测的难点在于对模型参数的选择。预测人往往根据经验，通过反复试验来选择合适的参数。这并不能保证模型能收敛到全局最小，预测结果也自然没法保证最优。对于没有相关理论基础的实际操作人员来讲，选择最优参数更是难上加难，这也限制了SVM模型的应用。支持向量机参数包括惩罚因子c、径向基核函数参数g、阶数p，中止训练误差ε等。 SVM is a forecasting model based on statistical theory, the difficulty of using it for forecasting lies in the selection of model parameters. Forecasters often select appropriate parameters based on experience and trial and error. This does not guarantee that the model can converge to the global minimum, and the prediction results are naturally not guaranteed to be optimal. For actual operators who do not have relevant theoretical foundations, it is even more difficult to select the optimal parameters, which also limits the application of the SVM model. Support vector machine parameters include penalty factor c, radial basis kernel function parameter g, order p, abort training error ε, etc.

其中罚因子c是一个由用户去指定的系数，表示对分错的点加入多少的惩罚，当c很大的时候，分错的点就会更少，但是过拟合的情况可能会比较严重，当c很小的时候，分错的点可能会很多，由此得到的模型会不太正确。 Among them, the penalty factor c is a coefficient specified by the user, indicating how much penalty to add to the wrong points. When c is large, the wrong points will be less, but the overfitting situation may be more serious. , when c is small, there may be many misclassified points, and the resulting model will not be correct.

核参数g选取的不同，函数的形态会发生相应的变化，进而引起SVM模型的变化。 Depending on the selection of the kernel parameter g, the shape of the function will change accordingly, which in turn will cause changes in the SVM model.

本发明采用网格法对参数范围进行粗选取。选取径向基函数作为模型核函数，选用均方误差MSE作为评价指标，分别对惩罚因子c和核函数参数g进行寻优，寻优范围lgc∈[-5,5]，,lgg∈[-5,5]，寻优结果确定模型参数c,g的范围为lgc∈[2,3]，,lgg∈[0,1]，如图3所示。 The present invention adopts the grid method to roughly select the parameter range. The radial basis function is selected as the model kernel function, the mean square error MSE is selected as the evaluation index, and the penalty factor c and the kernel function parameter g are respectively optimized, and the optimization range is lgc∈[-5,5], lgg∈[- 5,5], the optimization results determine the range of model parameters c and g as lgc∈[2,3], lgg∈[0,1], as shown in Figure 3.

基于上述范围利用遗传算法进行寻优，确定c,g参数具体值，具体步骤如图4所示，遗传进化过程如图5所示。 Based on the above range, the genetic algorithm is used to optimize and determine the specific values of c and g parameters. The specific steps are shown in Figure 4, and the genetic evolution process is shown in Figure 5.

2.4基于支持向量机的光合速率模型构建 2.4 Construction of photosynthetic rate model based on support vector machine

针对黄瓜的生长期不同采用同样的建模方法共建立四种模型，分别为仅针对黄瓜幼苗期的预测模型、仅针对黄瓜开花结果期的预测模型、黄瓜全程的光合速率预测模型和将生长期的不同作为一维输入建立黄瓜全程的预测模型。输入信号为X'＝(X₁'X₂'…X₅')^T，X₁'、X₂'、X₃'、X₄'、X₅'分别为温度、CO₂浓度、光照强度、相对湿度和叶绿素含量，第四种模型加入生长期作为一维输入，输出信号均用T_o表示网络计算得到的光合速率，每组对应实测光合速率均为教师信号T_d。通过支持向量机训练法建立全程黄瓜幼苗光合速率模型T_d'(X')。 The same modeling method was used to establish four models according to the different growth stages of cucumbers, namely, the prediction model only for the cucumber seedling stage, the prediction model only for the flowering and fruiting stage of cucumber, the photosynthetic rate prediction model for the whole cucumber and the growth stage. The difference is used as a one-dimensional input to establish a forecast model for the whole process of cucumber. The input signal is X'=(X ₁ 'X ₂ '…X ₅ ') ^T , X ₁ ', X ₂ ', X ₃ ', X ₄ ', X ₅ ' are temperature, CO ₂ concentration, light intensity, For relative humidity and chlorophyll content, the fourth model adds the growth period as a one-dimensional input, and the output signal uses T _o to represent the photosynthetic rate calculated by the network, and the corresponding measured photosynthetic rate of each group is the teacher signal T _d . The photosynthetic rate model T _d '(X') of cucumber seedlings was established by the support vector machine training method.

3模型训练性能分析 3 Model Training Performance Analysis

基于上述试验样本集，采用支持向量机算法进行训练，得到四种模型，幼苗期建立的黄瓜预测模型，训练误差为0.000446，决定系数为0.9883； Based on the above test sample set, the support vector machine algorithm was used for training, and four models were obtained. The cucumber prediction model established at the seedling stage had a training error of 0.000446 and a coefficient of determination of 0.9883;

开花生长期建立的黄瓜预测模型，训练误差为0.000387，决定系数为0.9900；混合两种生长期的全程黄瓜预测模型，训练误差为0.0022，决定系数为0.9419；加入生长期作为一维自变量因子，建立融合两种生长期的黄瓜预测模型，训练误差为0.00021128，决定系数为0.9943。对比分析训练结果可以发现，分阶段训练模型均达到了期望的误差水平，且训练集实测值与模拟值具有良好的相关系；混合两种生长期的全程黄瓜预测模型，训练误差较大，训练集实测值与模拟值具有良好的相关系较差，而加入生长期作为一维自变量因子的融合预测模型，训练误差和决定系数均优于分阶段模型，模型性能达到最优化。 The cucumber prediction model established during the flowering growth period has a training error of 0.000387 and a coefficient of determination of 0.9900; the whole-process cucumber prediction model that mixes two growth periods has a training error of 0.0022 and a coefficient of determination of 0.9419; adding the growth period as a one-dimensional independent variable factor, A cucumber forecasting model that combines two growth periods was established, with a training error of 0.00021128 and a coefficient of determination of 0.9943. Comparing and analyzing the training results, it can be found that the staged training model has reached the expected error level, and the measured value of the training set has a good correlation with the simulated value; the whole cucumber prediction model mixed with two growth periods has a large training error, and the training The measured value and the simulated value have a good correlation, but the fusion prediction model that adds the growth period as a one-dimensional independent variable factor, the training error and the coefficient of determination are better than the staged model, and the model performance is optimized.

基于上述结果，加入生长期作为一维因子建立的模型效果显著，可以为光环境调控提供理论基础和技术支持，简化了光环境设备的操作。 Based on the above results, the model established by adding the growth period as a one-dimensional factor has a significant effect, which can provide a theoretical basis and technical support for light environment regulation, and simplify the operation of light environment equipment.

4模型验证结果分析 4 Analysis of model verification results

用多因子嵌套试验获得的试验样本集共1650个两小组，将样本分为训练集和测试集，其中700用于模型训练，剩余175组用于构成测试集，约占总样本的20％，采用异校验方法进行模型验证，得到光合速率实测值与预测值相关性分析如图所示。从图6中可以发现，图6a中SVM机模型实测值和预测值相关性分析的决定系数是0.988，直线斜率是0.984，截距是0.2066，最大预测误差1.4672，图6b中支SVM模型实测值和预测值相关性分析的决定系数是0.986，直线斜率是0.9864，截距是0.3202，最大预测误差2.7186，图6c中SVM模型实测值和预测值相关性分析的决定系数是0.884，直线斜率是0.9661，截距是0.7343，最大预测误差12.55，图6d中SVM模型实测值和预测值相关性分析的决定系数是0.993，直线斜率是0.9923，截距是0.05523，最大预测误差3.575，考虑生长期建立模型的线性度明显更高，拟合程度更好。 The test sample set obtained by multi-factor nested experiment consists of 1650 two groups, and the samples are divided into training set and test set, of which 700 are used for model training, and the remaining 175 groups are used to form the test set, accounting for about 20% of the total samples , using the difference calibration method to verify the model, and obtain the correlation analysis between the measured value and the predicted value of photosynthetic rate, as shown in the figure. It can be seen from Figure 6 that the coefficient of determination of the correlation analysis between the measured value and the predicted value of the SVM machine model in Figure 6a is 0.988, the slope of the line is 0.984, the intercept is 0.2066, and the maximum prediction error is 1.4672. The measured value of the SVM model in Figure 6b is The coefficient of determination of the correlation analysis with the predicted value is 0.986, the slope of the line is 0.9864, the intercept is 0.3202, and the maximum prediction error is 2.7186. The coefficient of determination of the correlation analysis between the measured value and the predicted value of the SVM model in Figure 6c is 0.884, and the slope of the line is 0.9661 , the intercept is 0.7343, the maximum prediction error is 12.55, the coefficient of determination of the correlation analysis between the measured value and the predicted value of the SVM model in Figure 6d is 0.993, the slope of the straight line is 0.9923, the intercept is 0.05523, and the maximum prediction error is 3.575, considering the growth period to establish the model The linearity is significantly higher and the fitting degree is better.

对试验结果进行误差分析可知，考虑生长期建立的全程光合速率预测模型的实测值与模拟值最大相对误差小于±6.253％，表明本文所建立的模型可进行全生长期的光合速率模型预测，有良好的精度。 The error analysis of the test results shows that the maximum relative error between the measured value and the simulated value of the whole photosynthetic rate prediction model established in consideration of the growth period is less than ±6.253%, indicating that the model established in this paper can predict the photosynthetic rate model in the whole growth period. good precision.

Claims

1. a whole-range photosynthetic rate forecasting model of cucumber based on support vector machine, is characterized in that, model formula is:

f (x) = w Φ (x) + b = Σ_{i = 1}^{l} (a_{i} - {a_{i}}^{*}) &Center Dot; \exp (- \frac{| | x - x_{i} | |}{2 σ^{2}}) + b,

Among them, the output f(x) represents the predicted photosynthetic rate, the input signal X'=(X ₁ 'X ₂ '…X ₅ ') ^T , X ₁ ', X ₂ ', X ₃ ', X ₄ ', X ₅ 'respectively temperature, CO ₂ concentration, light intensity, relative humidity and chlorophyll content, w is the weight vector, b is the bias, Φ(x) is the nonlinear mapping function, l is the training set sample pair {( _xi , y _i ), the number of training samples in i=1,2,3,...,l}, x _i is the input column vector of the i-th training sample, y _i is the corresponding output value, y _i ∈ R, is the i×d-dimensional real number field, d is the column vector dimension, a _i and a _i ^* are the optimal solutions of the following formula:

\{\begin{matrix} \underset{α α,, {α α}^{* *}}{max max} [[- - \frac{11}{22} {Σ Σ}_{i i = = 11}^{l l} {Σ Σ}_{j j = = 11}^{l l} (({α α}_{i i} - - {α α}_{i i}^{* *})) (({α α}_{j j} - - {α α}_{j j}^{* *})) K K (({x x}_{i i},, {x x}_{j j})) - - {Σ Σ}_{i i = = 11}^{l l} (({α α}_{i i} + + {α α}_{i i}^{* *})) ϵ ϵ + + {Σ Σ}_{i i = = 11}^{l l} (({α α}_{i i} - - {α α}_{i i}^{* *})) {y the y}_{i i}]] \\ s the s . . t t . . \{\begin{matrix} {Σ Σ}_{i i = = 11}^{l l} (({α α}_{i i} - - {α α}_{i i}^{* *})) = = 00 \\ 00 \leq \leq {α α}_{i i} \leq \leq c c \\ 00 \leq \leq {α α}_{i i}^{* *} \leq \leq c c \end{matrix} \end{matrix}

is the kernel function, σ is the width parameter, ε is the abort training error, and c is the penalty factor.

2. the establishment method of the cucumber full-range photosynthetic rate prediction model based on support vector machine described in claim 1, is characterized in that, comprises the steps:

Step 1, obtain experimental data, the process is as follows:

Use a nutrient pot to grow seedlings. When the cucumber seedlings grow into two leaves and one heart, select cucumber seedlings with uniform growth, a stem diameter between 0.6 and 0.8 cm, and a plant height of less than 10 cm for the test. Select 150 strong cucumber seedlings as test samples. Treat that the cucumber is in the flowering and fruiting period, choose 150 plants whose flowering nodes are about 50 centimeters away from the faucet as the test sample of the flowering and fruiting period;

Measure the net photosynthetic rate. During the process, use the temperature control module to set ₅ temperature gradients of 16, 20, 24, 28, and 32°C; There are 5 gradients in L; a total of 11 photon flux density gradients of 0, 20, 50, 100, 200, 300, 500, 700, 1000, 1200, 1500μmol/(m ² s) were obtained by using the LED light source module, to embedding A total of 275 groups of experiments were carried out in this way, and each group of experiments was repeatedly tested on 3 randomly selected plants. During the experiment, the relative humidity of the leaf chamber was recorded, and the chlorophyll content of the measured leaves was recorded, so as to form a model based on chlorophyll content, temperature, and _CO2 concentration. , light intensity and relative humidity are input, and net photosynthetic rate is output 1650 sets of experimental data, namely 825 sets at the seedling stage and 825 sets at the flowering and fruiting stage;

Step 2, build a model

Randomly select training samples and test samples from the experimental data obtained in step 1, then select the support vector machine kernel function type as radial basis function, and use the grid method to determine the support vector machine penalty factor c and radial basis kernel function parameter g In the optimization interval, the genetic algorithm is used to further optimize the parameters c and g based on the optimization interval until the maximum number of iterations is reached, the optimization results of the parameters c and g are output, and the whole-process photosynthetic rate prediction model of cucumber based on the support vector machine is constructed.

3. according to claim 2 based on the cucumber full-range photosynthetic rate prediction model based on support vector machine, the method for establishing, is characterized in that, in described step 2,

The experimental data was normalized to the [0.2,0.9] interval to obtain the whole set of sample data, and the sample number was randomly selected according to the ratio of 4:1 to construct the training sample set and test sample set.

4. according to the described establishment method of the whole-range photosynthetic rate prediction model of cucumber based on support vector machine of claim 2, it is characterized in that, in described step 2,

The optimization interval of penalty factor c and radial basis kernel function parameter g is expressed as lgc∈[2,3], lgg∈[0,1].

5. according to the described establishment method of the cucumber full-range photosynthetic rate prediction model based on support vector machine of claim 2, it is characterized in that, in described step 2,

The genetic algorithm is used to further optimize the optimization interval of the penalty factor c and the radial basis kernel function parameter g. The process is as follows: set the genetic algorithm parameters, the number of individuals NIND = 100, the maximum genetic algebra MAXGEN = 50, the generation gap GGAP = 0.95, The crossover probability and mutation probability are 0.8, 0.25, c and g are used as variables, respectively represented by 10-bit binary; define the objective function as the model mean square error value MSE; genetic operator operations: selection, crossover, mutation; calculate the target value of offspring Function and reinsert the child to the parent to obtain a new population; reach the maximum number of iterations and output the optimization results of c and g.

6. according to claim 2 based on the cucumber full-range photosynthetic rate prediction model based on support vector machine, the method for establishing, is characterized in that, in described step 2,

The built model is:

f (x) = w Φ (x) + b = Σ_{i = 1}^{l} (a_{i} - {a_{i}}^{*}) \cdot \exp (- \frac{| | x - x_{i} | |}{2 σ^{2}}) + b,

Among them, the input signal X'=(X ₁ 'X ₂ '…X ₅ ') ^T , X ₁ ', X ₂ ', X ₃ ', X ₄ ', X ₅ ' are temperature, CO ₂ concentration, light intensity , relative humidity and chlorophyll content, w is a weight vector, b is a bias, and Φ(x) is a nonlinear mapping function.

7. according to claim 2 based on the whole-range photosynthetic rate prediction model of cucumber based on support vector machine, the establishment method is characterized in that, after obtaining the whole-range photosynthetic rate prediction model based on support vector machine, input sample data carries out photosynthetic efficiency prediction , if the accuracy requirement is met, output the prediction result, otherwise return to the genetic algorithm step, and further optimize c and g based on the optimization interval.