Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, which are set forth by way of illustration only and are not intended to limit the scope of the invention.
Fig. 1 is a schematic flow chart of a good sound stability early warning method for a screw water chiller according to an embodiment of the present invention.
As shown in fig. 1, a good sound stability early warning method for a screw water chiller unit includes the following steps:
importing a plurality of original audio data, performing data cleaning on the original audio data, and taking the remaining original audio data after the data cleaning as target audio data to obtain a plurality of target audio data, wherein the original audio data are obtained through a screw water cooling unit device;
respectively extracting the characteristics of each target audio data to obtain corresponding characteristic points, and collecting all the extracted characteristic points to obtain a characteristic point data set;
constructing a training model, and training the training model according to the feature point data set to obtain an audio recognition model;
optimizing the audio recognition model to obtain an audio recognition optimization model;
and identifying the audio data to be identified according to the audio identification optimization model to obtain an identification result of the audio data.
It should be understood that data cleansing refers to the last procedure to find and correct recognizable errors in a data file, including checking data consistency, processing invalid and missing values, and the like.
And optimizing the audio recognition model according to the preset adjusting parameters to obtain an audio recognition optimization model.
Specifically, parameter tuning is performed by using a manual parameter tuning method, so that the optimal parameters corresponding to the audio recognition model can be obtained, the recognition accuracy of the audio recognition optimization model on audio can be further ensured, audio data of audio sample discharge can be detected in real time, and timely intervention can be performed for clinic.
In the above embodiment, the data of the original audio data are respectively cleaned to obtain a plurality of target audio data, data containing missing values can be screened out, more useful information data and information having larger influence on identification and recognition can be analyzed, a data set can be conveniently manufactured according to the obtained target audio data, so that a recognition model with higher recognition and recognition accuracy can be conveniently obtained, the features of the target audio data are respectively extracted to obtain a feature point data set, a training model is constructed, an audio recognition model is obtained according to the training of the feature point data set on the training model, the reliability and stability of good sound stability early warning can be effectively improved, an audio recognition optimization model is obtained according to the optimization processing of preset adjustment parameters on the audio recognition model, and the recognition result of the audio data is obtained according to the recognition processing of the audio recognition optimization model to be recognized, the method can further improve the audio identification accuracy, overcomes the defects of large workload, low efficiency and insufficient accuracy of the traditional good sound stability early warning, can automatically carry out intelligent detection and identification on a large amount of audio data, and intervenes in time for the good sound stability early warning according to the audio data detected in real time, and has the characteristics of high efficiency, strong stability and high accuracy.
Optionally, as an embodiment of the present invention, the process of respectively performing feature extraction on each target audio data to obtain corresponding feature points includes:
respectively carrying out dimensionality reduction on the target audio data to obtain a plurality of audio data subjected to dimensionality reduction;
and respectively extracting the features of the plurality of audio data subjected to dimension reduction by using a preset spectrogram to obtain a plurality of feature points.
It should be understood that in the field of machine learning and statistics, dimensionality reduction refers to the process of reducing the number of random variables under certain defined conditions to yield a set of "uncorrelated" principal variables. The dimension reduction of the data can save the storage space of a computer on one hand, and can eliminate the noise in the data and improve the performance of a machine learning algorithm on the other hand; the root of data dimension reduction: the data dimensionality is reduced, and the data after dimensionality reduction can represent the original data as much as possible.
It should be understood that, the LDA linear discriminant analysis algorithm is used to perform dimension reduction on a plurality of target audio data, so as to obtain a plurality of audio data after dimension reduction.
Specifically, a plurality of target audio data are projected on a low dimension respectively, so that an audio text is reduced from thousands of dimensions to k dimensions, the intra-class variance of the projected target audio data is minimum, and the inter-class variance is maximum; and compressing the target audio data which is reduced to the new feature space, and reserving information as much as possible to obtain a plurality of audio data after dimension reduction.
It should be understood that a Spectrogram (Spectrogram), whose abscissa is time, ordinate is frequency, and whose coordinate points are voice data energy, is a display image of time-series-related fourier analysis, which can reflect the transformation of the music signal spectrum with time. Because the three-dimensional information is expressed by adopting the two-dimensional plane, the size of the energy value is expressed by the color, and the deeper the color, the stronger the voice energy for expressing the point is.
The spectrogram displays a great deal of information related to the characteristics of the music signal, such as the change of frequency domain parameters such as formants, energy and the like along with time, and has the characteristics of a time domain waveform and a spectrogram. That is, the spectrogram itself contains all the spectral information of the music signal without any processing, so that the information of the spectrogram about the music is lossless.
The patterns in the spectrogram comprise transverse lines, random lines, vertical strips and the like, the transverse lines are black bands parallel to the time axis and are formants, the corresponding formant frequency and bandwidth can be determined according to the frequency and bandwidth corresponding to the transverse lines, and whether the transverse lines appear in the spectrogram of a section of audio is an important mark for judging whether the spectrogram is voiced sound or not is judged; the vertical bars are narrow black bars perpendicular to the time axis, each vertical bar corresponding to a fundamental tone, the start of a stripe corresponding to the start of a voiceprint pulse, the distance between stripes representing the fundamental tone, the denser the stripes representing the higher the frequency of the fundamental tone.
In the above embodiment, the dimension reduction processing on the target audio data is performed respectively to obtain the reduced audio data, the preset spectrogram is used to extract the features of the reduced audio data to obtain the feature points, so that the main features having a greater influence on the sound stability early warning of the screw water-cooling unit can be obtained, the calculation amount of the subsequent steps is reduced, and the subsequent support vector machine can obtain a higher accuracy rate only by using less training data.
Optionally, as an embodiment of the present invention, the training model according to the feature point data set to obtain an audio recognition model includes:
s1: randomly dividing the feature point data set into a feature point training set and a feature point testing set;
s2: constructing a model based on a support vector machine algorithm to obtain a support vector machine structure;
s3: inputting the feature point training set and the feature point testing set into the support vector machine structure together to search for an optimal recognition hyperplane, so as to obtain an optimal recognition hyperplane, and obtaining a support vector set and VC (virtual component interconnect) credibility according to the optimal recognition hyperplane;
s4: carrying out discrimination processing on the support vector set according to the VC credibility to obtain a discrimination function, and generating a training model according to the discrimination function;
s5: and carrying out model screening processing on the training model according to the feature point training set and the feature point testing set to obtain an audio recognition model.
It should be understood that, since the feature point data set is randomly divided into the feature point training set and the feature point testing set each time, the random proportion of each time is different, and the train _ test _ split function may be invoked for random division.
The Support Vector Machine (SVM) is a supervised learning algorithm, the SVM theory provides complexity for avoiding a high-dimensional space, an inner product function (namely a kernel function) of the space is directly used, and a solving method under the condition of linear divisibility is utilized to directly solve a decision problem of the corresponding high-dimensional space.
It should be understood that the VC credibility is the VC dimensional confidence or confidence risk, and the VC dimension is a measure for the function class, which can be simply understood as the complexity of the problem, and the higher the VC dimension is, the more complex a problem is. For example: many classification functions can easily achieve 100% accuracy on a sample set, but are confusing (i.e., so-called poor generalization ability, or poor generalization ability) when classified in reality. In this case, a classification function is selected that is complex enough (i.e. its VC dimension is high) to accurately remember each sample, but uniformly classify the data outside the sample in error.
Specifically, the feature point training set and the feature point testing set are input into the structure of the support vector machine, and the sample feature space of the support vector machine is used to find out the optimal identification hyperplane of each class of feature sample and other feature samples, so as to obtain the support vector set representing each class of sample and the corresponding VC credibility, form the discriminant function for distinguishing each feature, and obtain the training model.
In the embodiment, the feature point data set is randomly divided into a feature point training set and a feature point testing set, so that the objectivity of data can be ensured, human factors are reduced, the accuracy of a subsequent identification model is effectively improved, the feature point training set and the feature point testing set are input into an optimal identification hyperplane of a support vector machine structure together to search for an optimal identification hyperplane, a support vector set and VC reliability are obtained according to the optimal identification hyperplane, and a training model is obtained by performing discrimination processing on the support vector set according to the VC reliability; the audio recognition model is obtained by screening the training model according to the feature point training set and the feature point testing set, so that the accuracy of the audio data to be recognized can be kept at a higher level all the time, and the stability and reliability of audio recognition are improved.
Optionally, as an embodiment of the present invention, the process of step S5 includes:
s51: inputting the feature point training set into the training model according to preset iterative training times to perform iterative training to obtain a first detection model;
s52: inputting the feature point test set into the first detection model for detection to obtain a first accuracy, and judging whether the first accuracy reaches a preset expected value, if so, taking the first detection model as an audio recognition model, then performing optimization processing on the audio recognition model, and if not, executing the step S53;
s53: inputting the feature point test set into the training model for iterative training according to the preset iterative training times to obtain a second detection model;
s54: inputting the feature point training set into the second detection model for detection to obtain a second accuracy;
s55: and judging whether the second accuracy reaches the preset expected value, if so, taking the second detection model as the audio recognition model, then carrying out optimization processing on the audio recognition model, and if not, returning to the step S1.
Preferably, the preset expected value is 0.90.
In the above embodiment, a higher recognition accuracy can be ensured through the obtained first detection model and the second detection model, and an audio recognition model meeting expectations is obtained, when the first accuracy of the first detection model does not reach an expected value, the second detection model is obtained through training of the feature point test set, and the second accuracy is obtained through detection by using the feature point training set, which is equivalent to exchanging the training set and the test set, and the recognition model meeting expectations can be further ensured to be obtained, so that the accuracy of detecting audio data to be recognized by the audio recognition model meeting the expected value can be always kept at a higher level, and the stability and reliability of audio recognition are improved.
Fig. 2 is a block diagram of a good-sound stability early warning device of a screw water chiller according to an embodiment of the present invention.
Optionally, as another embodiment of the present invention, as shown in fig. 2, a good sound stability early warning device for a screw water chiller includes:
the data cleaning module is used for importing a plurality of original audio data, cleaning the original audio data, and obtaining a plurality of target audio data by taking the residual original audio data after the data cleaning as the target audio data, wherein the original audio data are obtained through a screw water cooling unit device;
the feature extraction module is used for respectively extracting features of the target audio data to obtain corresponding feature points, and collecting all the extracted feature points to obtain a feature point data set;
the model training module is used for constructing a training model and training the training model according to the feature point data set to obtain an audio recognition model;
the optimization processing module is used for optimizing the audio recognition model to obtain an audio recognition optimization model;
and the good tone stability early warning module is used for identifying and processing the audio data to be identified according to the audio identification optimization model to obtain an identification result of the audio data.
Optionally, as an embodiment of the present invention, the feature extraction module is specifically configured to:
respectively carrying out dimensionality reduction on the target audio data to obtain a plurality of audio data subjected to dimensionality reduction;
and respectively extracting the features of the plurality of audio data subjected to dimension reduction by using a preset spectrogram to obtain a plurality of feature points.
Optionally, as an embodiment of the present invention, the model training module is specifically configured to:
randomly dividing the feature point data set into a feature point training set and a feature point testing set;
constructing a model based on a support vector machine algorithm to obtain a support vector machine structure;
inputting the feature point training set and the feature point testing set into the support vector machine structure together to search for an optimal recognition hyperplane, so as to obtain an optimal recognition hyperplane, and obtaining a support vector set and VC (virtual component interconnect) credibility according to the optimal recognition hyperplane;
carrying out discrimination processing on the support vector set according to the VC credibility to obtain a discrimination function, and generating a training model according to the discrimination function;
and carrying out model screening processing on the training model according to the feature point training set and the feature point testing set to obtain an audio recognition model.
Optionally, as an embodiment of the present invention, the model training module is specifically configured to:
inputting the feature point training set into the training model according to preset iterative training times to perform iterative training to obtain a first detection model;
inputting the feature point test set into the first detection model for detection to obtain a first accuracy rate, judging whether the first accuracy rate reaches a preset expected value, if so, taking the first detection model as an audio recognition model, then carrying out optimization processing on the audio recognition model, and if not, inputting the feature point test set into the training model for iterative training according to the preset iterative training times to obtain a second detection model;
inputting the feature point training set into the second detection model for detection to obtain a second accuracy;
and judging whether the second accuracy reaches the preset expected value, if so, taking the second detection model as the audio recognition model, then carrying out optimization processing on the audio recognition model, and if not, randomly dividing the feature point data set into a feature point training set and a feature point testing set again.
Optionally, another embodiment of the present invention provides a good sound stability early warning device for a screw water chiller, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where when the processor executes the computer program, the good sound stability early warning method for the screw water chiller is implemented as described above. The device may be a computer or the like.
Optionally, another embodiment of the present invention provides a computer-readable storage medium, which stores a computer program, and when the computer program is executed by a processor, the method for early warning of sound stability of a screw water chiller according to the above is implemented.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, a division of a unit is merely a logical division, and an actual implementation may have another division, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed.
Units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment of the present invention.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. It will be understood that the technical solution of the present invention essentially contributes to the prior art, or all or part of the technical solution can be embodied in the form of a software product stored in a storage medium and including instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
While the invention has been described with reference to specific embodiments, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.