[go: up one dir, main page]

CN111504944A - Statistical monitoring method of citric acid fermentation liquefied clear liquid based on near infrared spectrum - Google Patents

Statistical monitoring method of citric acid fermentation liquefied clear liquid based on near infrared spectrum Download PDF

Info

Publication number
CN111504944A
CN111504944A CN202010490148.3A CN202010490148A CN111504944A CN 111504944 A CN111504944 A CN 111504944A CN 202010490148 A CN202010490148 A CN 202010490148A CN 111504944 A CN111504944 A CN 111504944A
Authority
CN
China
Prior art keywords
data
citric acid
near infrared
acid fermentation
monitoring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010490148.3A
Other languages
Chinese (zh)
Inventor
赵忠盖
郝超
苗茂栋
金赛
孙福新
胡志杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Guoxin Xielian Energy Co ltd
Jiangnan University
Original Assignee
Jiangsu Guoxin Xielian Energy Co ltd
Jiangnan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Guoxin Xielian Energy Co ltd, Jiangnan University filed Critical Jiangsu Guoxin Xielian Energy Co ltd
Priority to CN202010490148.3A priority Critical patent/CN111504944A/en
Publication of CN111504944A publication Critical patent/CN111504944A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/17Systems in which incident light is modified in accordance with the properties of the material investigated
    • G01N21/25Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
    • G01N21/31Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
    • G01N21/35Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
    • G01N21/359Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light using near infrared light

Landscapes

  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Investigating Or Analysing Materials By Optical Means (AREA)

Abstract

The invention relates to a statistical monitoring method of citric acid fermentation liquefied clear liquid based on near infrared spectrum, which comprises the steps of collecting and preprocessing input and output data, dividing a data set into two parts of marked data and unmarked data, establishing a semi-supervised PP L S model and calculating a monitoring index.

Description

基于近红外光谱的柠檬酸发酵液化清液的统计监控方法Statistical monitoring method of citric acid fermentation liquefaction liquid based on near-infrared spectroscopy

技术领域technical field

本发明涉及柠檬酸发酵领域,具体涉及一种基于近红外光谱的柠檬酸发酵液化清液的统计监控方法。The invention relates to the field of citric acid fermentation, in particular to a near-infrared spectrum-based statistical monitoring method for citric acid fermentation liquefaction liquid.

背景技术Background technique

柠檬酸发酵液化清液是以玉米为原材料的产物:首先将玉米粉碎得到玉米粉,然后加水稀释成玉米浆,将糖水和工艺水调制得到投料水,并用NaOH调节,将玉米浆与投料水在配料罐中混合,加入淀粉酶后得到料液,进入一次喷射器,然后料液分两路,一路进入种子混液罐,一路进入二次喷射器,料液加入淀粉酶得到混液,并用废酸调节pH,用于固液分离,混液经过冷却器冷却后通过板框进行分离,经板框压率后得到滤渣和清液。液化清液对于发酵培养基的制备非常重要,直接决定培养基的质量,进而影响柠檬酸发酵过程的得率。因此,对柠檬酸发酵液化清液的生产过程进行监控对整个柠檬酸发酵过程至关重要。总糖和总氮是液化清液的关键工艺参数,通常采用人工取样、离线分析的方法获得这两个参数的值,并通过上下限判断的方法确定液化清液的运行状态。The citric acid fermentation liquefaction liquid is the product of corn as the raw material: firstly, the corn is pulverized to obtain corn flour, and then diluted with water into corn steep liquor, the sugar water and process water are prepared to obtain feeding water, and NaOH is used to adjust, and the corn steep liquor and feeding water are mixed together. Mixing in the batching tank, adding amylase to get the material liquid, which enters the primary ejector, and then the material liquid is divided into two paths, one of which enters the seed mixing tank, and the other enters the secondary ejector. pH is used for solid-liquid separation. The mixed liquid is cooled by the cooler and then separated by the plate and frame, and the filter residue and clear liquid are obtained after the pressure ratio of the plate and frame. The liquefied supernatant is very important for the preparation of the fermentation medium, which directly determines the quality of the medium, which in turn affects the yield of the citric acid fermentation process. Therefore, monitoring the production process of citric acid fermentation liquefied serum is crucial to the whole citric acid fermentation process. Total sugar and total nitrogen are the key process parameters of liquefied clear liquid. Usually, manual sampling and offline analysis are used to obtain the values of these two parameters, and the operation state of liquefied clear liquid is determined by the method of upper and lower limit judgment.

近红外光谱从分子振动层面收集了大量过程信息,具有非破坏性、分析快、效率高等特点,受到了广泛的关注,并在农业、石油、医药和环境等领域逐渐普及,目前生物发酵过程中开始引入近红外光谱仪进行关键参数的检测。但是,在建立近红外光谱的校正模型时,通常需要过程变量与质量变量的采样率一致,而在柠檬酸液化清液过程中,近红外光谱分析快,而总糖和总氮的离线取样周期长。以往的方法多是采用降采样率的方式,按照总糖和总氮的采样时间,选择不同的光谱数据,使两者的采样率一致,建立总糖和总氮的模型,但是丢失了大量近红外光谱的信息,会降低建模的准确性。在确定出校正模型后,通常根据该模型估计出总糖和总氮值后,然后采用上下限的方法对液化清液进行监控,属于一种事后报警的监控方式,必然导致大量的浪费。Near-infrared spectroscopy collects a large amount of process information from the molecular vibration level. It has the characteristics of non-destructiveness, fast analysis and high efficiency. It has received extensive attention and has gradually become popular in the fields of agriculture, petroleum, medicine and the environment. Begin to introduce near-infrared spectrometer to detect key parameters. However, when building a calibration model for NIR spectroscopy, the sampling rate of process variables and quality variables is usually required to be consistent, while in the citric acid liquefaction serum process, NIR spectroscopy analysis is fast, and the offline sampling cycle of total sugar and total nitrogen is long. In the past, most of the methods used the method of downsampling rate. According to the sampling time of total sugar and total nitrogen, different spectral data were selected to make the sampling rate of the two consistent, and the model of total sugar and total nitrogen was established, but a large amount of near-term data was lost. Infrared spectral information will reduce the accuracy of the modeling. After the calibration model is determined, the total sugar and total nitrogen values are usually estimated according to the model, and then the upper and lower limit methods are used to monitor the liquefied clear liquid.

发明内容SUMMARY OF THE INVENTION

本发明要解决的技术问题是提供一种基于近红外光谱的柠檬酸发酵液化清液的统计监控方法,主要解决以下两个方面的问题:总糖和总氮的近红外光谱半监督PPLS建模方法。总糖和总氮的检测周期很大,滞后时间长,而近红外光谱能够实现实时在线测量。因此,为了充分利用近红外光谱的数据,则必须采用较多的近红外光谱样本建立模型,造成了部分近红外光谱样本没有对应的总糖和总氮样本,给建立模型带来了困难。不同于以往降采样率的建模方法,半监督PPLS方法保留了未标记数据的光谱信息,可以提高模型的准确性。基于NIR的柠檬酸发酵液化清液事前预警方法。在长期的运行中,人工经验的方法能够得到总糖和总氮的上下限,因此,常规方法将NIR作为总糖和总氮的检测设备,当通过PPLS模型得到总糖和总氮的值后,判断这些值是否在其上下限内,如果不在,则认为液化清液出现了故障。本发明引入统计监控的思想,取代上下限的监控方法,能够提供一种事前预警,从而解决事后报警的问题。通过解决以上两个方面的问题,本发明能够根据液化过程关键参数的统计分布规律判断过程运行状态,实现事前预警,对企业提高生产效益,降低生产成本,并获得理想的产品质量具有重要意义。The technical problem to be solved by the present invention is to provide a statistical monitoring method of citric acid fermentation liquefied clear liquid based on near-infrared spectroscopy, which mainly solves the following two problems: near-infrared spectroscopy semi-supervised PPLS modeling of total sugar and total nitrogen method. The detection period of total sugar and total nitrogen is large and the lag time is long, while near-infrared spectroscopy can realize real-time online measurement. Therefore, in order to make full use of the near-infrared spectral data, it is necessary to use more near-infrared spectral samples to build the model, resulting in some near-infrared spectral samples without corresponding total sugar and total nitrogen samples, which brings difficulties to the establishment of the model. Different from previous modeling methods with downsampling rate, the semi-supervised PPLS method preserves the spectral information of unlabeled data, which can improve the accuracy of the model. A pre-warning method for citric acid fermentation liquefied serum based on NIR. In the long-term operation, the upper and lower limits of total sugar and total nitrogen can be obtained by the method of artificial experience. Therefore, the conventional method uses NIR as the detection equipment of total sugar and total nitrogen. When the values of total sugar and total nitrogen are obtained through the PPLS model , to judge whether these values are within their upper and lower limits, if not, it is considered that the liquefied serum is faulty. The present invention introduces the idea of statistical monitoring, replaces the monitoring method of upper and lower limits, and can provide a kind of pre-warning, thereby solving the problem of post-alarming. By solving the problems in the above two aspects, the present invention can judge the process operation state according to the statistical distribution law of key parameters of the liquefaction process, realize advance warning, and is of great significance for enterprises to improve production efficiency, reduce production costs, and obtain ideal product quality.

为了解决上述技术问题,本发明提供了一种基于近红外光谱的柠檬酸发酵液化清液的统计监控方法,包括:输入输出数据采集与预处理、将数据集分为有标记数据和未标记数据两部分、建立半监督PPLS模型、计算监控指标。In order to solve the above-mentioned technical problems, the present invention provides a method for statistical monitoring of citric acid fermentation liquefied clear liquid based on near-infrared spectroscopy, including: input and output data collection and preprocessing, and dividing the data set into labeled data and unlabeled data Two parts, establishing the semi-supervised PPLS model and calculating the monitoring indicators.

在其中一个实施例中,具体步骤如下:In one embodiment, the specific steps are as follows:

步骤一:光谱数据采集及预处理Step 1: Spectral data collection and preprocessing

1)样本采集:采集正常操作工艺下及异常情况下柠檬酸发酵液化清液不同日期不同批次的样品,并对样品标号,获得样品集;1) Sample collection: collect samples of different days and different batches of citric acid fermentation liquefied clear liquid under normal operating process and under abnormal conditions, and label the samples to obtain a sample set;

2)柠檬酸发酵液化清液近红外光谱采集和含糖量测定:测得部分样品集的含糖量,用近红外光谱仪测得所有样品的近红外光谱;2) Near-infrared spectrum collection and sugar content measurement of citric acid fermentation liquefaction clear liquid: measure the sugar content of some sample sets, and measure the near-infrared spectrum of all samples with a near-infrared spectrometer;

3)清液光谱预处理及波段选择:用多元散射校正的方法对光谱进行预处理,对预处理之后的光谱进行波段选择,得到光谱的有效信息数据;3) Clear liquid spectrum preprocessing and band selection: Preprocess the spectrum with the method of multiple scattering correction, and select the wavelength band of the preprocessed spectrum to obtain the effective information data of the spectrum;

步骤二:数据分类Step 2: Data Classification

4)对建模数据集中的数据重新组合:将建模数据中的数据分为两组,一组为有标签数据,另一组为无标签数据,两组数据量相等;4) Recombine the data in the modeling data set: divide the data in the modeling data into two groups, one group is labeled data, the other group is unlabeled data, and the two groups of data are of equal amount;

5)建模集与预测集划分:将样品集分为两部分,一部分为建模数据集,包含一部分有标签数据和一部分无标签数据,用于获取监控指标所需参数,:另一部分为预测集,同样包含一部分有标签数据和一部分无标签数据,用于测试模型的监控效果;5) Division of modeling set and prediction set: Divide the sample set into two parts, one part is the modeling data set, which contains part of the labeled data and part of the unlabeled data, which are used to obtain the parameters required for monitoring indicators; the other part is the prediction set, which also contains a part of labeled data and a part of unlabeled data, which is used to test the monitoring effect of the model;

步骤三:建立半监督概率偏最小二乘模型Step 3: Establish a semi-supervised probabilistic partial least squares model

6)建立模型:将步骤5中重新排列过的数据进行半监督PPLS建模,得到相关参数值;6) Model establishment: semi-supervised PPLS modeling is performed on the rearranged data in step 5 to obtain relevant parameter values;

假设输入变量和输出变量分别为:

Figure BDA0002520758920000031
N<K为样本采样数,Dx,Dy分别为过程变量及质量变量的个数,半监督PPLS的模型可表示为:半监督PPLS的模型可表示为:Suppose the input and output variables are:
Figure BDA0002520758920000031
N<K is the number of samples, Dx and Dy are the number of process variables and quality variables, respectively. The model of semi-supervised PPLS can be expressed as: The model of semi-supervised PPLS can be expressed as:

xn=Ptnxn x n =Pt nxn

yn=Ctnyn y n =Ct nyn

把完整数据集S分为标记数据S1和未标记数据S2,处理后的S1和S2均为样本数一致的数据,可以把完整数据集S分为标记数据S1和未标记数据S2,处理后的S1和S2均为样本数一致的数据;Divide the complete data set S into labeled data S 1 and unlabeled data S 2 , the processed S 1 and S 2 are data with the same number of samples, and the complete data set S can be divided into labeled data S 1 and unlabeled data S 2 For data S 2 , the processed S 1 and S 2 are data with the same number of samples;

步骤四:建立监控指标Step 4: Establish monitoring indicators

7)建立监控指标:根据估计的总糖和总氮,确定正常运行下总糖和总氮的正态分布规律,,定义出总糖和总氮正常运行下的统计分布范围;7) Establish monitoring indicators: According to the estimated total sugar and total nitrogen, determine the normal distribution law of total sugar and total nitrogen under normal operation, and define the statistical distribution range of total sugar and total nitrogen under normal operation;

8)将在线检测的近红外光谱代入到步骤6建立的校正模型中,得到总糖和总氮的估计值,然后判断该估计之是否满足正常的正态分布,如果满足则过程运行正常,如果超出阈值,则判断过程运行出现故障。8) Substitute the near-infrared spectrum of the online detection into the calibration model established in step 6 to obtain the estimated values of total sugar and total nitrogen, and then judge whether the estimated value satisfies the normal normal distribution. If so, the process is running normally. If the threshold is exceeded, it is judged that the process operation is faulty.

在其中一个实施例中,用联合区间概率偏最小二乘方法对预处理之后的光谱进行波段选择。In one of the embodiments, a joint interval probability partial least squares method is used to perform band selection on the preprocessed spectrum.

在其中一个实施例中,用费林法测得部分样品集的含糖量。In one embodiment, the Fehling method is used to determine the sugar content of a portion of the sample set.

在其中一个实施例中,定义出总糖和总氮正常运行下的统计分布范围中采用“6σ”标准。In one embodiment, the "6σ" criterion is used to define the statistical distribution range of total sugar and total nitrogen under normal operation.

在其中一个实施例中,“把完整数据集S分为标记数据S1和未标记数据S2,处理后的S1和S2均为样本数一致的数据,可以把完整数据集S分为标记数据S1和未标记数据S2,处理后的S1和S2均为样本数一致的数据”In one of the embodiments, "the complete data set S is divided into labeled data S 1 and unlabeled data S 2 , and the processed S 1 and S 2 are both data with the same number of samples, and the complete data set S can be divided into For the labeled data S 1 and the unlabeled data S 2 , the processed S 1 and S 2 are data with the same number of samples”

具体地,可以表示为:Specifically, it can be expressed as:

S=S1∪S2={(xn,yn)|n=1,,,N}∪{xk|k=N+1,,,K}S=S 1 ∪ S 2 ={(x n ,y n )|n=1,,,N}∪{x k |k=N+1,,,K}

完整的似然函数可以表示为:The complete likelihood function can be expressed as:

Figure BDA0002520758920000041
Figure BDA0002520758920000041

待估计参数有:

Figure BDA0002520758920000042
The parameters to be estimated are:
Figure BDA0002520758920000042

在其中一个实施例中,用EM算法求取参数;In one of the embodiments, the EM algorithm is used to obtain the parameters;

参数初始化:

Figure BDA0002520758920000043
Parameter initialization:
Figure BDA0002520758920000043

计算E步:

Figure BDA0002520758920000044
Calculate E-step:
Figure BDA0002520758920000044

计算M步:Θk+1=arg maxQ(Θ,Θk)Compute M steps: Θ k+1 =arg maxQ(Θ,Θ k )

当|L(Θk+1)-L(Θk)|>10-5,k=k+1,跳转到计算E步。When |L(Θ k+1 )-L(Θ k )|>10 -5 , k=k+1, jump to the calculation step E.

基于同样的发明构思,本申请还提供一种计算机设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现任一项所述方法的步骤。Based on the same inventive concept, the present application also provides a computer device, including a memory, a processor, and a computer program stored in the memory and running on the processor, the processor implements any one of the above when executing the program. steps of the method described.

基于同样的发明构思,本申请还提供一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现任一项所述方法的步骤。Based on the same inventive concept, the present application also provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements the steps of any one of the methods.

基于同样的发明构思,本申请还提供一种处理器,所述处理器用于运行程序,其中,所述程序运行时执行任一项所述的方法。Based on the same inventive concept, the present application also provides a processor for running a program, wherein the program executes any one of the methods when the program runs.

本发明的有益效果:Beneficial effects of the present invention:

柠檬酸液化清液过程主要采用人工取样分析总糖和总氮的值,然后根据运行经验判断过程的运行状态,是一种事后报警,只有到产品质量发生变化后,才能给出报警,造成大量的浪费,并影响后续的柠檬酸发酵生产。本发明引入近红外光谱对柠檬酸液化清液过程进行检测,并由此实现了统计监控。The process of liquefying the clear liquid of citric acid mainly adopts manual sampling to analyze the values of total sugar and total nitrogen, and then judges the operation status of the process according to the operation experience. waste and affect subsequent citric acid fermentation production. The present invention introduces near-infrared spectroscopy to detect the citric acid liquefaction process, thereby realizing statistical monitoring.

相比于近红外中常规的PLS建模和PPLS建模,本发明采用的半监督PPLS建模方法。该方法考虑了建模过程中,近红外光谱以及总糖和总氮的采集周期不一样的问题,能够尽可能地利用近红外光谱的信息。Compared with the conventional PLS modeling and PPLS modeling in the near infrared, the semi-supervised PPLS modeling method adopted in the present invention is adopted. This method takes into account the different acquisition periods of near-infrared spectra and total sugar and total nitrogen in the modeling process, and can utilize the information of near-infrared spectra as much as possible.

另一方面,相比于根据人工经验的上下限监控方法,本发明在近红外光谱检测的基础上,引入了统计过程的监控思想,不再依赖人工经验,并且能够实现事前预警,取代上下限监控方法的事后报警,为生产提供有益指导。On the other hand, compared with the upper and lower limit monitoring method based on manual experience, the present invention introduces the monitoring idea of statistical process on the basis of near-infrared spectrum detection, no longer relies on manual experience, and can realize advance warning, replacing the upper and lower limits The post-event alarm of the monitoring method provides useful guidance for production.

附图说明Description of drawings

图1是本发明基于近红外光谱的柠檬酸发酵液化清液的统计监控方法中的基于半监督PPLS的单变量统计监控总糖示意图。Fig. 1 is the schematic diagram of the univariate statistical monitoring total sugar based on semi-supervised PPLS in the statistical monitoring method of citric acid fermentation liquefaction supernatant based on near-infrared spectroscopy of the present invention.

图2是本发明基于近红外光谱的柠檬酸发酵液化清液的统计监控方法中的基于半监督PPLS的单变量统计监控总氮示意图。2 is a schematic diagram of the univariate statistical monitoring of total nitrogen based on semi-supervised PPLS in the statistical monitoring method of citric acid fermentation liquefaction supernatant based on near-infrared spectroscopy of the present invention.

具体实施方式Detailed ways

下面结合附图和具体实施例对本发明作进一步说明,以使本领域的技术人员可以更好地理解本发明并能予以实施,但所举实施例不作为对本发明的限定。The present invention will be further described below with reference to the accompanying drawings and specific embodiments, so that those skilled in the art can better understand the present invention and implement it, but the embodiments are not intended to limit the present invention.

一种基于近红外光谱的柠檬酸发酵液化清液的半监督PPLS监控方法,分为四个部分,分别为:输入输出数据采集与预处理、将数据集分为有标记数据和未标记数据两部分、建立半监督PPLS模型、计算监控指标。具体步骤如下:A near-infrared spectroscopy-based semi-supervised PPLS monitoring method for citric acid fermentation liquefaction supernatant, which is divided into four parts: input and output data acquisition and preprocessing, and dividing the data set into labeled data and unlabeled data. Part, establish a semi-supervised PPLS model, and calculate the monitoring indicators. Specific steps are as follows:

步骤一:光谱数据采集及预处理Step 1: Spectral data collection and preprocessing

1)样本采集:采集正常操作工艺下及异常情况下柠檬酸发酵液化清液不同日期不同批次的样品,并对样品标号,获得样品集。1) Sample collection: Collect samples of citric acid fermentation liquefaction liquid from different batches on different dates and under normal operating processes and under abnormal conditions, and label the samples to obtain a sample set.

2)柠檬酸发酵液化清液近红外光谱采集和含糖量测定:用费林法测得部分样品集的含糖量,用近红外光谱仪测得所有样品的近红外光谱。2) Near-infrared spectrum collection and sugar content determination of citric acid fermentation liquefied clear liquid: the sugar content of some sample sets was measured by Fehling method, and the near-infrared spectrum of all samples was measured by near-infrared spectrometer.

3)清液光谱预处理及波段选择:用多元散射校正的方法对光谱进行预处理,用联合区间概率偏最小二乘方法对预处理之后的光谱进行波段选择,得到光谱的有效信息数据。3) Clear liquid spectrum preprocessing and band selection: Preprocess the spectrum by the method of multiple scattering correction, and select the wavelength band of the preprocessed spectrum by the combined interval probability partial least square method to obtain the effective information data of the spectrum.

步骤二:数据分类Step 2: Data Classification

4)对建模数据集中的数据重新组合:将建模数据中的数据分为两组,一组为有标签数据,另一组为无标签数据,两组数据量相等。4) Recombine the data in the modeling data set: Divide the data in the modeling data into two groups, one group is labeled data, the other group is unlabeled data, and the two groups of data are of equal amount.

5)建模集与预测集划分:将样品集分为两部分,一部分为建模数据集,包含一部分有标签数据和一部分无标签数据,用于获取监控指标所需参数,:另一部分为预测集,同样包含一部分有标签数据和一部分无标签数据,用于测试模型的监控效果。5) Division of modeling set and prediction set: Divide the sample set into two parts, one part is the modeling data set, which contains part of the labeled data and part of the unlabeled data, which are used to obtain the parameters required for monitoring indicators; the other part is the prediction The set also contains a part of labeled data and a part of unlabeled data, which is used to test the monitoring effect of the model.

步骤三:建立半监督概率偏最小二乘模型Step 3: Establish a semi-supervised probabilistic partial least squares model

6)建立模型:将步骤5中重新排列过的数据进行半监督PPLS建模,得到相关参数值。6) Model establishment: conduct semi-supervised PPLS modeling on the rearranged data in step 5 to obtain relevant parameter values.

假设输入变量和输出变量分别为:

Figure BDA0002520758920000071
N<K为样本采样数,Dx,Dy分别为过程变量及质量变量的个数,半监督PPLS的模型可表示为:半监督PPLS的模型可表示为:Suppose the input and output variables are:
Figure BDA0002520758920000071
N<K is the number of samples, Dx and Dy are the number of process variables and quality variables, respectively. The model of semi-supervised PPLS can be expressed as: The model of semi-supervised PPLS can be expressed as:

xn=Ptnxn x n =Pt nxn

yn=Ctnyn y n =Ct nyn

把完整数据集S分为标记数据S1和未标记数据S2,处理后的S1和S2均为样本数一致的数据,可以把完整数据集S分为标记数据S1和未标记数据S2,处理后的S1和S2均为样本数一致的数据,具体地,可以表示为:Divide the complete data set S into labeled data S 1 and unlabeled data S 2 , the processed S 1 and S 2 are data with the same number of samples, and the complete data set S can be divided into labeled data S 1 and unlabeled data S 2 For the data S 2 , the processed S 1 and S 2 are data with the same number of samples, specifically, it can be expressed as:

S=S1∪S2={(xn,yn)|n=1,,,N}∪{xk|k=N+1,,,K}S=S 1 ∪ S 2 ={(x n ,y n )|n=1,,,N}∪{x k |k=N+1,,,K}

完整的似然函数可以表示为:The complete likelihood function can be expressed as:

Figure BDA0002520758920000072
Figure BDA0002520758920000072

待估计参数有:

Figure BDA0002520758920000073
用EM算法求取参数。The parameters to be estimated are:
Figure BDA0002520758920000073
Use the EM algorithm to find the parameters.

参数初始化:

Figure BDA0002520758920000074
Parameter initialization:
Figure BDA0002520758920000074

计算E步:

Figure BDA0002520758920000075
Calculate E-step:
Figure BDA0002520758920000075

计算M步:Θk+1=arg maxQ(Θ,Θk)Compute M steps: Θ k+1 =arg maxQ(Θ,Θ k )

当|L(Θk+1)-L(Θk)|>10-5,k=k+1,跳转到计算E步。When |L(Θ k+1 )-L(Θ k )|>10 -5 , k=k+1, jump to the calculation step E.

步骤四:建立监控指标Step 4: Establish monitoring indicators

7)建立监控指标:根据估计的总糖和总氮,确定正常运行下总糖和总氮的正态分布规律,并按照“6σ”标准,定义出总糖和总氮正常运行下的统计分布范围。7) Establish monitoring indicators: According to the estimated total sugar and total nitrogen, determine the normal distribution law of total sugar and total nitrogen under normal operation, and define the statistical distribution of total sugar and total nitrogen under normal operation according to the "6σ" standard scope.

8)将在线检测的近红外光谱代入到步骤6建立的校正模型中,得到总糖和总氮的估计值,然后判断该估计之是否满足正常的正态分布,如果满足则过程运行正常,如果超出阈值,则判断过程运行出现故障。8) Substitute the near-infrared spectrum of the online detection into the calibration model established in step 6 to obtain the estimated values of total sugar and total nitrogen, and then judge whether the estimated value satisfies the normal normal distribution. If so, the process is running normally. If the threshold is exceeded, it is judged that the process operation is faulty.

本发明将近红外光谱技术引入到柠檬酸发酵液化过程的监控中,基于半监督PPLS方法,实现了从分子振动的层面对生产运行信息的收集,同时应用统计监控理念,对估计的生产信息进行处理,评估运行状态。The present invention introduces the near-infrared spectroscopy technology into the monitoring of the citric acid fermentation and liquefaction process, and based on the semi-supervised PPLS method, realizes the collection of production operation information from the level of molecular vibration, and simultaneously applies the statistical monitoring concept to process the estimated production information , to evaluate the operating status.

正常情况下,总糖和总氮符合正态分布。该实施例首先通过正常的总糖和总氮得到其分布规律。如果总糖和总氮满足的分布为N(μ,Λ),则统计过程监控中通过判断样本的总糖和总氮与该正态分布的马氏距离指标判断状态,该指标为Index=(x-μ)TΛ-1(x-μ)。在单变量的统计过程监控中,为了方便,对于新的测试样本,也可以计算“6σ”标准,即在置信度为99.99966%的情况下,正常的质量变量区间为(μ-6Λ1/2,μ+6Λ1/2)。Normally, total sugar and total nitrogen conform to a normal distribution. This example first obtains its distribution by normal total sugar and total nitrogen. If the satisfied distribution of total sugar and total nitrogen is N(μ, Λ), in the statistical process monitoring, the state is judged by judging the Mahalanobis distance index between the total sugar and total nitrogen of the sample and the normal distribution, and the index is Index=( x-μ) T Λ -1 (x-μ). In univariate statistical process monitoring, for convenience, for new test samples, the "6σ" criterion can also be calculated, that is, when the confidence level is 99.99966%, the normal quality variable interval is (μ-6Λ 1/2 , μ+6Λ 1/2 ).

下面介绍本发明的一个具体应用场景:A specific application scenario of the present invention is introduced below:

图中,前15个为正常样本,后15个为异常样本。总糖的基于半监督PPLS的单变量统计监控结果如图1所示,第19-30个样本为超过控制限的异常样本,即检测出12个异常样本,漏掉3个异常样本;总氮的基于半监督PPLS的单变量统计监控结果如图2所示,第17、18、22、23、29、30个样本为超过控制限的异常样本,即检测出6个异常样本,漏掉9个异常样本,正常样本全部检测正常。综合总糖和总氮的监控结果,15个正常样本检测结果全部正确,15个异常样本检测出14个,说明本专利提出的监控方法能够有效地判断过程的运行状态。In the figure, the first 15 are normal samples, and the last 15 are abnormal samples. The univariate statistical monitoring results of total sugar based on semi-supervised PPLS are shown in Figure 1. The 19th to 30th samples are abnormal samples that exceed the control limit, that is, 12 abnormal samples are detected and 3 abnormal samples are missed; The univariate statistical monitoring results based on semi-supervised PPLS are shown in Figure 2. The 17th, 18th, 22nd, 23rd, 29th, and 30th samples are abnormal samples that exceed the control limit, that is, 6 abnormal samples are detected and 9 are missed. Each abnormal sample is normal, and all normal samples are normal. Combining the monitoring results of total sugar and total nitrogen, the detection results of 15 normal samples are all correct, and 14 of 15 abnormal samples are detected, indicating that the monitoring method proposed in this patent can effectively judge the operation state of the process.

以上对本发明提供的基于近红外光谱的柠檬酸发酵液化清液的统计监控方法做了详细的描述,还有以下几点需要说明:The statistical monitoring method of the citric acid fermentation liquefaction liquid based on near-infrared spectroscopy provided by the present invention has been described in detail above, and the following points need to be explained:

1.提出一种柠檬酸液化清液的统计过程监控方法,实现事前预警,取代传统的事后报警,能够做到在故障影响产品质量前给出预警。1. A statistical process monitoring method of citric acid liquefaction clear liquid is proposed, which realizes pre-warning, replaces the traditional post-alarming, and can give early warning before failure affects product quality.

2.基于第一点技术关键点,引入近红外光谱对液化清液进行检测,并提出一种半监督PPLS的近红光谱建模方法,能够充分利用大量没有对应总糖和总氮分析值的光谱数据,提高了建模的精度。2. Based on the first technical key point, near-infrared spectroscopy was introduced to detect the liquefied serum, and a near-infrared spectroscopy modeling method of semi-supervised PPLS was proposed, which can make full use of a large number of non-corresponding total sugar and total nitrogen analysis values. Spectral data, improving the accuracy of modeling.

3.基于第一点技术关键点和第二点技术关键点,将统计监控方法和近红外光谱技术结合,分析液化清液过程正常情况下总糖和总氮的统计分布,并判断近红外光谱检测的总糖和总氮是否满足正常的分布,从而达到监控液化清液过程的目的。3. Based on the first technical key point and the second technical key point, the statistical monitoring method and the near-infrared spectroscopy technology are combined to analyze the statistical distribution of total sugar and total nitrogen in the liquefaction process under normal conditions, and judge the near-infrared spectrum Whether the detected total sugar and total nitrogen meet the normal distribution, so as to achieve the purpose of monitoring the liquefaction process.

以上所述实施例仅是为充分说明本发明而所举的较佳的实施例,本发明的保护范围不限于此。本技术领域的技术人员在本发明基础上所作的等同替代或变换,均在本发明的保护范围之内。本发明的保护范围以权利要求书为准。The above-mentioned embodiments are only preferred embodiments for fully illustrating the present invention, and the protection scope of the present invention is not limited thereto. Equivalent substitutions or transformations made by those skilled in the art on the basis of the present invention are all within the protection scope of the present invention. The protection scope of the present invention is subject to the claims.

Claims (10)

1. A statistical monitoring method for citric acid fermentation liquefied clear liquid based on near infrared spectrum is characterized by comprising the steps of collecting and preprocessing input and output data, dividing a data set into marked data and unmarked data, establishing a semi-supervised PP L S model and calculating monitoring indexes.
2. The statistical monitoring method of the citric acid fermentation liquefied clear liquid based on the near infrared spectrum as claimed in claim 1, characterized by comprising the following specific steps:
the method comprises the following steps: spectral data acquisition and preprocessing
1) Collecting samples: collecting samples of the citric acid fermentation liquefied clear liquid in different batches on different dates under normal operation process and abnormal conditions, and labeling the samples to obtain a sample set;
2) near infrared spectrum collection and sugar content determination of the citric acid fermentation liquefied clear liquid: measuring the sugar content of part of the sample set, and measuring the near infrared spectrum of all samples by using a near infrared spectrometer;
3) spectrum pretreatment and band selection of clear liquid: preprocessing the spectrum by using a multivariate scattering correction method, and selecting the waveband of the preprocessed spectrum to obtain effective information data of the spectrum;
step two: data classification
4) Recombining data in the modeled dataset: dividing data in the modeling data into two groups, wherein one group is labeled data, the other group is unlabeled data, and the two groups have equal data quantity;
5) dividing a modeling set and a prediction set: the method comprises the steps of dividing a sample set into two parts, wherein one part is a modeling data set and comprises a part of labeled data and a part of unlabeled data for obtaining parameters required by monitoring indexes, and the other part is a prediction set and also comprises a part of labeled data and a part of unlabeled data for testing the monitoring effect of a model;
step three: establishing a semi-supervised probability partial least square model
6) Establishing a model, namely performing semi-supervised PP L S modeling on the data rearranged in the step 5 to obtain relevant parameter values;
assume that the input variables and the output variables are:
Figure FDA0002520758910000021
N<k is the number of sample samples, Dx,DyThe number of the process variables and the quality variables respectively, and the model of the semi-supervised PP L S can be expressed as:
xn=Ptnxn
yn=Ctny+n
dividing the complete data set S into marking data S1And unmarked data S2S after treatment1And S2All of which are consistent sample number data, the complete data set S can be divided into marker data S1And unmarked data S2S after treatment1And S2All the data are data with consistent sample number;
step four: establishing a monitoring index
7) Establishing a monitoring index: determining a normal distribution rule of total sugar and total nitrogen under normal operation according to the estimated total sugar and total nitrogen, and defining a statistical distribution range of the total sugar and the total nitrogen under normal operation;
8) and (3) substituting the online detected near infrared spectrum into the correction model established in the step (6) to obtain estimated values of total sugar and total nitrogen, then judging whether the estimated values meet normal distribution, if so, judging that the process is normal in operation, and if so, judging that the process is failed in operation.
3. The method for statistical monitoring of citric acid fermentation liquefied serum based on near infrared spectrum according to claim 1, wherein the spectrum after pretreatment is band-selected by joint interval probability partial least squares.
4. The method for statistical monitoring of liquefied serum for citric acid fermentation based on near infrared spectroscopy as claimed in claim 1, wherein the sugar content of part of the sample set is measured by the fisher method.
5. The method for statistical monitoring of liquefied serum for citric acid fermentation based on near infrared spectroscopy as claimed in claim 1, wherein the "6 σ" criterion is applied in defining the statistical distribution range under normal operation of total sugar and total nitrogen.
6. The method of claim 1, wherein the complete data set S is divided into labeled data S1And unmarked data S2S after treatment1And S2All of which are consistent sample number data, the complete data set S can be divided into marker data S1And unmarked data S2S after treatment1And S2All data are consistent in sample number "
Specifically, it can be expressed as:
S=S1∪S2={(xn,yn)|n=1,,,N}∪{xk|k=N+1,,,K}
the complete likelihood function can be expressed as:
Figure FDA0002520758910000031
the parameters to be estimated are:
Figure FDA0002520758910000032
7. the statistical monitoring method of citric acid fermentation liquefied clear liquid based on near infrared spectrum according to claim 1, wherein the parameter is obtained by EM algorithm;
initializing parameters:
Figure FDA0002520758910000033
and E, calculating:
Figure FDA0002520758910000034
and (4) calculating M steps: thetak+1=argmaxQ(Θ,Θk)
When | L (theta)k+1)-L(Θk)|>10-5And k is k +1, and the step E of calculation is skipped.
8. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the steps of the method of any of claims 1 to 7 are implemented when the program is executed by the processor.
9. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 7.
10. A processor, characterized in that the processor is configured to run a program, wherein the program when running performs the method of any of claims 1 to 7.
CN202010490148.3A 2020-06-02 2020-06-02 Statistical monitoring method of citric acid fermentation liquefied clear liquid based on near infrared spectrum Pending CN111504944A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010490148.3A CN111504944A (en) 2020-06-02 2020-06-02 Statistical monitoring method of citric acid fermentation liquefied clear liquid based on near infrared spectrum

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010490148.3A CN111504944A (en) 2020-06-02 2020-06-02 Statistical monitoring method of citric acid fermentation liquefied clear liquid based on near infrared spectrum

Publications (1)

Publication Number Publication Date
CN111504944A true CN111504944A (en) 2020-08-07

Family

ID=71877091

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010490148.3A Pending CN111504944A (en) 2020-06-02 2020-06-02 Statistical monitoring method of citric acid fermentation liquefied clear liquid based on near infrared spectrum

Country Status (1)

Country Link
CN (1) CN111504944A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115953880B (en) * 2023-03-10 2023-05-16 山东柠檬生化有限公司 Monitoring and early warning system and method for citric acid production

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108507972A (en) * 2018-04-11 2018-09-07 江南大学 A kind of across the time apple sugar content prediction technique of near infrared spectrum based on distance metric and semi-supervised learning
CN109520962A (en) * 2017-09-19 2019-03-26 江南大学 A kind of grape wine near infrared spectrum detection method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109520962A (en) * 2017-09-19 2019-03-26 江南大学 A kind of grape wine near infrared spectrum detection method
CN108507972A (en) * 2018-04-11 2018-09-07 江南大学 A kind of across the time apple sugar content prediction technique of near infrared spectrum based on distance metric and semi-supervised learning

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
郝超 等: "基于近红外光谱的柠檬酸发酵液化清液PPLS监控", 《食品与发酵工业》 *
陈家益 等: "半监督鲁棒概率偏最小二乘模型及其在多采样率过程监控中的应用", 《信息与控制》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115953880B (en) * 2023-03-10 2023-05-16 山东柠檬生化有限公司 Monitoring and early warning system and method for citric acid production

Similar Documents

Publication Publication Date Title
Hu et al. Rapid detection of three quality parameters and classification of wine based on Vis-NIR spectroscopy with wavelength selection by ACO and CARS algorithms
CN102788752B (en) Non-destructive detection device and method of internal information of crops based on spectrum technology
Huang et al. Improved generalization of spectral models associated with Vis-NIR spectroscopy for determining the moisture content of different tea leaves
CN105445218B (en) The method for building up of middle infrared spectrum rapeseed protein content detection adaptive model
CN106560698A (en) Identification method for producing area of plant based on multiple detection technologies
CN103487537A (en) Detection method for producing areas of Xihulongjing tea based on genetic algorithm optimization
CN108181263A (en) The extraction of tobacco leaf genius loci and method of discrimination based near infrared spectrum
CN114295578B (en) Tobacco leaf conventional chemical composition general model modeling method based on near infrared spectrum
CN105784672A (en) Drug detector standardization method based on dual-tree complex wavelet algorithm
CN111795943A (en) A method for non-destructive detection of exogenous sucrose in tea based on near-infrared spectroscopy
CN108414611A (en) A kind of method in Rapid identification white tea storage time
CN102303377B (en) An online automatic measurement method for rubber Mooney viscosity
CN111504944A (en) Statistical monitoring method of citric acid fermentation liquefied clear liquid based on near infrared spectrum
CN103389360B (en) Based on the debutanizing tower butane content soft measuring method of probability principal component regression model
CN102353648A (en) Method for detecting fertilizer components
CN112651173B (en) A non-destructive testing method and generalizable system for agricultural product quality based on cross-domain spectral information
CN102288569A (en) Method for rapidly analyzing fiber biomass
Wang et al. Monitoring model for predicting maize grain moisture at the filling stage using NIRS and a small sample size
CN106841101A (en) The method of near-infrared quick detection wheat stalk rotten degree
CN107976416A (en) A kind of method by infrared spectrum prediction oil property
US20240404645A1 (en) Method for operating a chemical production system
CN117725519A (en) A method, system and application of Sichuan tobacco leaf quality evaluation based on machine learning
CN117214129A (en) Method for detecting quality of Alsentek apples with different maturity based on spectral information
CN117269106A (en) Hyperspectral model for rapidly predicting content of capsicum protein
CN117275604A (en) A method for predicting soil pH based on random forest model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200807

RJ01 Rejection of invention patent application after publication