Disclosure of Invention
The invention aims to provide a magnetotelluric denoising method and a magnetotelluric denoising system, which are used for reducing the over-processing of data and improving the data denoising accuracy.
In order to achieve the purpose, the invention provides the following scheme:
a magnetotelluric denoising method, the method comprising:
measuring magnetotelluric pulse signals of different depths of the earth by using a magnetotelluric sounding method;
carrying out overlapping segmentation on the magnetotelluric pulse signals to obtain a plurality of sounding data segments;
determining the approximate entropy of each sounding data segment based on the approximate entropy theory;
determining the depth measuring data segment with the approximate entropy smaller than or equal to the approximate entropy threshold value as a depth measuring data segment to be denoised, and determining the depth measuring data segment with the approximate entropy larger than the approximate entropy threshold value as an effective depth measuring data segment;
constructing a matrix to be denoised according to the sounding data segment to be denoised;
performing singular value decomposition on the matrix to be denoised to obtain a singular value matrix, a left singular matrix and a right singular matrix of the matrix to be denoised;
multiplying each diagonal element of the singular value matrix, the column vector corresponding to the left singular matrix and the row vector corresponding to the right singular matrix to obtain a plurality of field source component matrixes of the sounding data segment to be denoised;
converting each field source component matrix into a component data segment;
determining the approximate entropy of each component data segment based on the approximate entropy theory;
screening the component data sections with the approximate entropy larger than that of the depth measuring data section to be denoised, and summing the screened component data sections to obtain the denoised depth measuring data section;
and reconstructing the denoised sounding data segment and the effective sounding data segment to obtain a reconstructed denoised magnetotelluric pulse signal.
Optionally, the performing overlapping segmentation on the magnetotelluric pulse signal to obtain a plurality of sounding data segments specifically includes:
for the magnetotelluric pulse signal x1,x2,…xwPerforming overlapping segmentation processing to obtain each depth measurement data segment Xv={x(v-1)*N+1,x(v-1)*N+2,…,x(v-1)*N+N};
Wherein x is1,x2,…xwRespectively the 1 st, 2 nd and w th magnetotelluric pulse signals, w is the length of the magnetotelluric pulse signal, x(v-1)*N+1、x(v-1)*N+2、x(v-1)*N+NThe data are respectively the 1 st, 2 nd and nth data of the v-th sounding data segment, N is the sounding data segment length, v is 1,2, …, l is the number of sounding data segments, l is (w- α N)/N (1- α), and α is the data overlap ratio.
Optionally, the determining the approximate entropy of each depth measurement data segment based on the approximate entropy theory specifically includes:
initializing the embedding dimension of the Henkel matrix as h;
constructing a Hankel matrix of the embedding dimension for each sounding data segment; each column in the hankel matrix is a mode vector;
using formulas
Determining the maximum distance between the p mode vector in the Henkel matrix and the q mode vector of the Henkel matrix;
counting the number of the p-th mode vector in the Hankel matrix, wherein the maximum distance between the p-th mode vector and all the mode vectors of the Hankel matrix is smaller than or equal to a mode fault tolerance threshold;
based on the approximate probability theory, according to the number, using formula Cp h(r)=Np h(r)/(N-h +1), determining an approximate probability value of the p-th mode vector in the hankel matrix;
using a formula based on the approximate probability value
Determining an approximate probability average value of each sounding data segment under the h embedding dimension;
increasing the embedding dimension h by 1, returning to the step of constructing a Hankel matrix with the embedding dimension h for each depth measuring data segment, and determining the approximate probability average value of each depth measuring data segment under the embedding dimension h + 1;
according to the approximate probability average value of each sounding data segment in the h embedding dimension and the approximate probability average value of each sounding data segment in the h +1 embedding dimension, a formula is utilized
Determining an approximate entropy of each sounding data segment;
wherein d is
max[y(p),y(q)]Is the maximum distance between the p-th mode vector in the Hankel matrix and the q-th mode vector in the Hankel matrix, y (p) is the p-th mode vector in the Hankel matrix, y (q) is the q-th mode vector in the Hankel matrix, and x (p + delta) is the Hankel matrixThe delta-th element of the p-th mode vector in the matrix, x (q + delta) is the delta-th element of the q-th mode vector in the hank matrix, delta is 1,2, …, h, C
p h(r) is the approximate probability value of the p-th pattern vector in the Hankel matrix, N
p h(r) is the number of maximum distances less than or equal to the mode fault tolerance threshold, r is the mode fault tolerance threshold, N is the sounding data segment length,
is an approximate probability average of the sounding data segment in the h embedding dimension,
is the approximate probability average, Ae, of the sounding data segment in the h +1 embedding dimension
iIs the approximate entropy of the ith sounding data segment.
Optionally, constructing a matrix to be denoised according to the sounding data segment to be denoised specifically includes:
according to the preset matrix dimension, the a-th sounding data segment X to be denoiseda={x(a-1)*N+1,x(a-1)*N+2,…,x(a-1)*N+NEqually dividing into a plurality of sections;
constructing a matrix to be denoised according to each section after the equal division
Wherein, XaFor the a-th sounding data segment to be denoised, x(a-1)*N+1、x(a-1)*N+2、x(a-1)*N+n、x(a-1)*N+1+n、x(a-1)*N+2n、xa*N+1-nAnd x(a-1)*N+NRespectively representing the 1 st, 2 nd, nth, N +1 st, 2 nth, N-N +1 th and nth data of the depth measuring data segment to be denoised, wherein N is the length of the depth measuring data segment, N is the length of a preset matrix, N is N/m, and m is the dimension of the preset matrix.
Optionally, the performing singular value decomposition on the matrix to be denoised to obtain a singular value matrix, a left singular matrix, and a right singular matrix of the matrix to be denoised, and then further includes:
determining the ratio of each singular value in the singular value matrix to the sum of all singular values;
drawing a singular value proportion broken line graph by taking the number of singular values in the singular value matrix as an abscissa and the ratio as an ordinate;
judging whether the vertical coordinates corresponding to the tail branches of the singular value proportion broken line graph are all smaller than a ratio threshold value or not, and obtaining a judgment result;
if the judgment result shows that the matrix to be denoised is the zero matrix, outputting a singular value matrix, a left singular matrix and a right singular matrix of the matrix to be denoised;
if the judgment result shows no, updating the preset matrix dimension, and returning to the step of' according to the preset matrix dimension, enabling the a-th depth sounding data segment X to be denoiseda={x(a-1)*N+1,x(a-1)*N+2,…,x(a-1)*N+NAnd equally dividing into multiple sections.
Optionally, reconstructing the denoised sounding data segment and the effective sounding data segment to obtain a reconstructed denoised magnetotelluric pulse signal, specifically including:
reconstructing the denoised sounding data segment and the effective sounding data segment by using a formula w ═ lN- (l-1) · α N to obtain a reconstructed denoised magnetotelluric pulse signal;
wherein w is the length of the magnetotelluric pulse signal, l is the number of the sounding data segments, alpha is the data overlapping rate, and N is the length of the sounding data segments.
A magnetotelluric denoising system, the system comprising:
the magnetotelluric pulse signal measuring module is used for measuring magnetotelluric pulse signals of different depths of the earth by utilizing a magnetotelluric sounding method;
a sounding data segment obtaining module, configured to perform overlapping segmentation on the magnetotelluric pulse signals to obtain multiple sounding data segments;
the approximate entropy determining module of the sounding data segments is used for determining the approximate entropy of each sounding data segment based on the approximate entropy theory;
the depth measuring data segment determining module is used for determining a depth measuring data segment with approximate entropy smaller than or equal to the approximate entropy threshold value as a depth measuring data segment to be denoised and determining a depth measuring data segment with approximate entropy larger than the approximate entropy threshold value as an effective depth measuring data segment;
the matrix construction module to be denoised is used for constructing a matrix to be denoised according to the sounding data segment to be denoised;
the singular value decomposition module is used for performing singular value decomposition on the matrix to be denoised to obtain a singular value matrix, a left singular matrix and a right singular matrix of the matrix to be denoised;
a field source component matrix obtaining module, configured to multiply each diagonal element of the singular value matrix, a column vector corresponding to the left singular matrix, and a row vector corresponding to the right singular matrix, to obtain a plurality of field source component matrices of the sounding data segment to be denoised;
a component data segment obtaining module for converting each field source component matrix into a component data segment;
the approximate entropy determining module of the component data segments is used for determining the approximate entropy of each component data segment based on the approximate entropy theory;
the denoised depth measurement data section obtaining module is used for screening the component data sections of which the approximate entropy is larger than that of the depth measurement data section to be denoised, and summing the screened component data sections to obtain the denoised depth measurement data section;
and the reconstruction denoising magnetotelluric pulse signal obtaining module is used for reconstructing the denoised sounding data segment and the effective sounding data segment to obtain a reconstruction denoising magnetotelluric pulse signal.
Optionally, the depth measurement data segment obtaining module specifically includes:
a sounding data segment obtaining submodule for obtaining the magnetotelluric pulse signal x1,x2,…xwPerforming overlapping segmentation processing to obtain each depth measurement data segment Xv={x(v-1)*N+1,x(v-1)*N+2,…,x(v-1)*N+N};
Wherein x is1,x2,…xwRespectively the 1 st, 2 nd and w th magnetotelluric pulse signals, w is the length of the magnetotelluric pulse signal, x(v-1)*N+1、x(v-1)*N+2、x(v-1)*N+NThe data are respectively the 1 st, 2 nd and nth data of the v-th sounding data segment, N is the sounding data segment length, v is 1,2, …, l is the number of sounding data segments, l is (w- α N)/N (1- α), and α is the data overlap ratio.
Optionally, the approximate entropy determining module of the sounding data segment specifically includes:
the embedded dimension initialization submodule is used for initializing the embedded dimension of the Hankel matrix to be h;
the Hankel matrix construction submodule is used for respectively constructing a Hankel matrix with an embedded dimension h for each sounding data segment; each column in the hankel matrix is a mode vector;
maximum distance determination submodule for utilizing a formula
Determining the maximum distance between the p mode vector in the Henkel matrix and the q mode vector of the Henkel matrix;
the number counting submodule is used for counting the number of the p-th mode vector in the Hankel matrix, wherein the maximum distance between the p-th mode vector and all the mode vectors of the Hankel matrix is smaller than or equal to a mode fault tolerance threshold;
an approximate probability value determining submodule for utilizing a formula C according to the number based on the approximate probability theoryp h(r)=Np h(r)/(N-h +1), determining an approximate probability value of the p-th mode vector in the hankel matrix;
an approximate probability average determination submodule for utilizing a formula according to the approximate probability value
Determining an approximate probability average value of each sounding data segment under the h embedding dimension;
the circulation submodule is used for increasing the embedding dimension h by 1, returning to the step of 'respectively constructing a Hankel matrix with the embedding dimension h for each sounding data segment', and determining the approximate probability average value of each sounding data segment under the embedding dimension h + 1;
an approximate entropy determination submodule of the sounding data segment for utilizing a formula according to the approximate probability average value of each sounding data segment in the h embedding dimension and the approximate probability average value of each sounding data segment in the h +1 embedding dimension
Determining an approximate entropy of each sounding data segment;
wherein d is
max[y(p),y(q)]Is the maximum distance between the p-th mode vector in the Heckel matrix and the q-th mode vector in the Heckel matrix, y (p) is the p-th mode vector in the Heckel matrix, y (q) is the q-th mode vector in the Heckel matrix, x (p + delta) is the delta-th element of the p-th mode vector in the Heckel matrix, x (q + delta) is the delta-th element of the q-th mode vector in the Heckel matrix, delta is 1,2, …, h, C
p h(r) is the approximate probability value of the p-th pattern vector in the Hankel matrix, N
p h(r) is the number of maximum distances less than or equal to the mode fault tolerance threshold, r is the mode fault tolerance threshold, N is the sounding data segment length,
is an approximate probability average of the sounding data segment in the h embedding dimension,
is the approximate probability average, Ae, of the sounding data segment in the h +1 embedding dimension
iIs the approximate entropy of the ith sounding data segment.
Optionally, the matrix construction module to be denoised specifically includes:
an equal division module used for dividing the a-th depth sounding data segment X to be denoised according to the preset matrix dimensiona={x(a-1)*N+1,x(a-1)*N+2,…,x(a-1)*N+NEqually dividing into a plurality of sections;
a sub-module for constructing the matrix to be denoised according to each segment after being equally divided
Wherein, XaFor the a-th sounding data segment to be denoised, x(a-1)*N+1、x(a-1)*N+2、x(a-1)*N+n、x(a-1)*N+1+n、x(a-1)*N+2n、xa*N+1-nAnd x(a-1)*N+NRespectively representing the 1 st, 2 nd, nth, N +1 st, 2 nth, N-N +1 th and nth data of the depth measuring data segment to be denoised, wherein N is the length of the depth measuring data segment, N is the length of a preset matrix, N is N/m, and m is the dimension of the preset matrix.
According to the specific embodiment provided by the invention, the invention discloses the following technical effects:
the invention discloses a magnetotelluric denoising method and a magnetotelluric denoising system, wherein magnetotelluric pulse signals are subjected to overlapping segmentation to obtain a plurality of sounding data sections, the approximate entropy of each sounding data section is determined, the sounding data section with the approximate entropy smaller than or equal to the approximate entropy threshold is determined as a sounding data section to be denoised, the sounding data section with the approximate entropy larger than the approximate entropy threshold is determined as an effective sounding data section, and the data sections are screened by selecting the approximate entropy before performing signal-noise separation by adopting SVD (singular value decomposition), so that the effective sounding data section is prevented from being over-processed; carrying out singular value decomposition on the sounding data segment to be denoised, multiplying a singular value matrix, a left singular matrix and a right singular matrix after the singular value decomposition to obtain a plurality of component matrices of the sounding data segment to be denoised, converting each component matrix into a component data segment, determining the approximate entropy of each component data segment, screening the component data segments of which the approximate entropy is greater than that of the sounding data segment to be denoised, summing the screened component data segments to obtain the denoised sounding data segment, and screening the decomposed components by adopting the approximate entropy to realize the purpose of denoising; and finally reconstructing the denoised depth sounding data segment and the effective depth sounding data segment to obtain a reconstructed denoised magnetotelluric pulse signal, so that the accuracy of data denoising is improved.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention aims to provide a magnetotelluric denoising method and a magnetotelluric denoising system, which are used for reducing the over-processing of data and improving the data denoising accuracy.
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
The invention provides a magnetotelluric denoising method in order to reduce the loss of effective signals and combine approximate entropy screening and SVD decomposition denoising, as shown in fig. 1 and fig. 2, the method comprises the following steps:
and S101, measuring magnetotelluric pulse signals of different depths of the earth by using a magnetotelluric sounding method.
And S102, carrying out overlapping segmentation on the magnetotelluric pulse signals to obtain a plurality of sounding data segments.
S103, determining the approximate entropy of each sounding data segment based on the approximate entropy theory.
S104, determining the depth measuring data segment with the approximate entropy smaller than or equal to the approximate entropy threshold value as a depth measuring data segment to be denoised, and determining the depth measuring data segment with the approximate entropy larger than the approximate entropy threshold value as an effective depth measuring data segment. Preferably, the approximate entropy threshold is Ae0,Ae0Typically between 0.5 and 1.
And S105, constructing a matrix to be denoised according to the sounding data segment to be denoised.
And S106, performing singular value decomposition on the matrix to be denoised to obtain a singular value matrix, a left singular matrix and a right singular matrix of the matrix to be denoised.
And S107, multiplying each diagonal element of the singular value matrix, the column vector corresponding to the left singular matrix and the row vector corresponding to the right singular matrix to obtain a plurality of field source component matrixes of the sounding data segment to be denoised.
And S108, converting each field source component matrix into a component data segment.
S109, based on the approximate entropy theory, the approximate entropy of each component data segment is determined.
S110, screening the component data sections of which the approximate entropy is larger than that of the depth measuring data section to be denoised, and summing the screened component data sections to obtain the denoised depth measuring data section.
And S111, reconstructing the denoised sounding data segment and the effective sounding data segment to obtain a reconstructed denoised magnetotelluric pulse signal.
The specific process is as follows:
step S102, carrying out overlapping segmentation on the magnetotelluric pulse signals to obtain a plurality of sounding data segments, which specifically comprises the following steps:
to magnetotelluric pulsating signal x1,x2,…xwPerforming overlapping segmentation processing to obtain each depth measurement data segment Xv={x(v-1)*N+1,x(v-1)*N+2,…,x(v-1)*N+N}。
Wherein x is1,x2,…xwRespectively the 1 st, 2 nd and w th magnetotelluric pulsesSignal, w is the length of the magnetotelluric pulse signal, x(v-1)*N+1、x(v-1)*N+2、x(v-1)*N+NThe data are respectively the 1 st, 2 nd and nth data of the v-th sounding data segment, N is the sounding data segment length, v is 1,2, …, l is the number of sounding data segments, l is (w- α N)/N (1- α), and α is the data overlap ratio.
Because the multidimensional matrix of the processing object decomposed by SVD has larger requirement on the running memory when the processing object is processed by computer software, the step firstly divides the one-dimensional vector into a plurality of data segments, and each data segment is constructed into the multidimensional matrix, thereby shortening the length of the processing object, and ensuring the feasibility of running, the continuity of data and the accuracy of screening the data segments.
S103, determining the approximate entropy of each sounding data segment based on the approximate entropy theory, and specifically comprising the following steps:
initializing an embedding dimension h of a Hankel matrix (Hankel matrix);
constructing a Hankel matrix with embedded dimensions for each sounding data segment; each column in the hankel matrix is a mode vector;
using formulas
Determining the maximum distance between the p-th mode vector in the Hankel matrix and the q-th mode vector of the Hankel matrix;
counting the number of the p-th mode vector in the Hankel matrix, wherein the maximum distance between the p-th mode vector and all the mode vectors of the Hankel matrix is smaller than or equal to the mode fault-tolerant threshold;
based on approximate probability theory, according to the number, using formula Cp h(r)=Np h(r)/(N-h +1), determining the approximate probability value of the p-th mode vector in the hankel matrix;
using a formula based on the approximate probability value
Determining an approximate probability average of each sounding data segment in the h-embedding dimension;
Increasing the embedding dimension h by 1, returning to the step of constructing a Hankel matrix with the embedding dimension h for each depth measuring data segment, and determining the approximate probability average value of each depth measuring data segment under the embedding dimension h + 1;
according to the approximate probability average value of each sounding data segment in the h embedding dimension and the approximate probability average value of each sounding data segment in the h +1 embedding dimension, a formula is utilized
Determining an approximate entropy of each sounding data segment;
wherein d is
max[y(p),y(q)]Is the maximum distance between the p-th mode vector in the Heckel matrix and the q-th mode vector in the Heckel matrix, y (p) is the p-th mode vector in the Heckel matrix, y (q) is the q-th mode vector in the Heckel matrix, x (p + delta) is the delta-th element of the p-th mode vector in the Heckel matrix, x (q + delta) is the delta-th element of the q-th mode vector in the Heckel matrix, delta is 1,2, …, h, C
p h(r) is the approximate probability value of the p-th pattern vector in the Hankel matrix, N
p h(r) is the number of maximum distances less than or equal to the mode fault tolerance threshold, r is the mode fault tolerance threshold, N is the sounding data segment length,
is an approximate probability average of the sounding data segment in the h embedding dimension,
is the approximate probability average, Ae, of the sounding data segment in the h +1 embedding dimension
iIs the approximate entropy of the ith sounding data segment.
The approximate entropy is adopted to carry out pre-screening on each segment of data before signal-noise separation, and noise has stronger directivity and more concentrated dispersion degree compared with effective signals, so that the data with better processing quality can be eliminated by the difference through the pre-screening by means of the approximate entropy, and the phenomenon that the effective signals are processed by denoising of the screening component when the data segment is the effective pure signals and the 'over-processing' of the screening component is still carried out is prevented.
Step S105, constructing a matrix to be denoised according to the sounding data segment to be denoised, which specifically comprises the following steps:
according to the preset matrix dimension, the a-th sounding data segment X to be denoiseda={x(a-1)*N+1,x(a-1)*N+2,…,x(a-1)*N+NEqually dividing into a plurality of sections;
constructing a matrix to be denoised according to each section after the equal division
Wherein, XaFor the a-th sounding data segment to be denoised, x(a-1)*N+1、x(a-1)*N+2、x(a-1)*N+n、x(a-1)*N+1+n、x(a-1)*N+2n、xa*N+1-nAnd x(a-1)*N+NRespectively representing the 1 st, 2 nd, nth, N +1 st, 2 nth, N-N +1 th and nth data of the depth measuring data segment to be denoised, wherein N is the length of the depth measuring data segment, N is the length of a preset matrix, N is N/m, and m is the dimension of the preset matrix.
Step S106, SVD decomposition: x
a=USV
TWherein U ═ U
1…U
m]Is a left singular matrix of m x m orthogonal,
is a matrix of singular values, the principal diagonal is the singular value of the matrix,
is X
a TX
aN × n right singular matrices. U shape
1、U
mRespectively 1 st and m-th elements, S, in the left singular matrix
1、S
mRespectively representing the 1 st and m-th elements, V, of the matrix of singular values
1 T、V
1 TRespectively representing the 1 st and nth elements in the right singular matrix.
The dimension and length of the matrix structure are selected very key during SVD decomposition, and the suppression effect on noise is better when the length of the matrix is less than or equal to the length of the noise through repeated tests.
Step S106, then further comprising:
determining the ratio of each singular value in the singular value matrix to the sum of all singular values; delta. in FIG. 2zFor the z-th singular value, delta, in the matrix of singular valuessumThe sum of singular values in a singular value matrix is obtained;
drawing a singular value proportion broken line graph by taking the number of singular values in the singular value matrix as an abscissa and the ratio as an ordinate;
judging whether the vertical coordinates corresponding to the tail branches of the singular value proportion broken line graph are all smaller than a ratio threshold value or not, and obtaining a judgment result; preferably, the threshold ratio is 0.001. The tail branch refers to the last part of the fold line in the singular value gravity fold line graph.
If the judgment result shows that the data segment is the deep sounding data segment to be denoised, outputting a singular value matrix, a left singular matrix and a right singular matrix of the deep sounding data segment to be denoised;
if the judgment result shows no, updating the preset matrix dimension, and returning to the step of' according to the preset matrix dimension, enabling the a-th sounding data segment X to be denoiseda={x(a-1)*N+1,x(a-1)*N+2,…,x(a-1)*N+NAnd equally dividing into multiple sections.
And judging whether the selection of the matrix dimension is correct or not by observing whether the tail branch of the line graph is less than 0.001 or not according to the singular value proportion line graph, and if the tail branch value of the line graph is more than 0.001, re-assigning the matrix dimension until the proportion value is less than 0.001.
Step S107, calculating all component matrixes according to the singular values and the left and right eigenvectors, connecting the component matrixes end to obtain signals transmitted by different field source information:
wherein, Xa 1、Xa mRespectively being the 1 st and the m-th component matrixes of the a-th sounding data segment to be denoised.
Step S108, the step of converting each field source component matrix in the field source matrix into a component data segment is the opposite step of "constructing a matrix to be denoised according to the sounding data segment to be denoised" in step S105.
In step S109, the method of determining the approximate entropy of each component data segment based on the approximate entropy theory is the same as that of step S103.
S110, the component data segment with the approximate entropy larger than that of the sounding data segment to be denoised is regarded as effective data to be reserved, and the component data segment with the approximate entropy smaller than that is regarded as noise data. And adding and summing the reserved effective components to realize the signal-noise separation of the sounding data section to be denoised. X in FIG. 2bRepresenting a denoised sounding data segment, Xb kThe k-th component matrix is shown, where k is 1,2, …, m.
Step S111, reconstructing the denoised sounding data segment and the effective sounding data segment to obtain a reconstructed denoised magnetotelluric pulse signal, which specifically comprises the following steps:
reconstructing the denoised depth sounding data segment and the effective depth sounding data segment by using a formula w ═ lN- (l-1) · α N to obtain a reconstructed denoised magnetotelluric pulse signal;
wherein w is the length of the magnetotelluric pulse signal, l is the number of the sounding data segments, alpha is the data overlapping rate, and N is the length of the sounding data segments.
The basic principle of the invention is as follows: selecting a data segment to be processed by using approximate entropy, decomposing a time sequence signal into a singular value matrix and left and right characteristic matrix vectors by using SVD (singular value decomposition) based on the property difference of a natural field source and an interference source, and then forming a plurality of component signals from different field source information by SVD inverse transformation, and calculating the approximate entropy of the component signals, wherein the approximate entropy of effective signals is larger (representing that the signal dispersion degree is larger); and the approximate entropy of the noise is smaller (with a certain polarization direction), so as to screen the components.
Therefore, the approximate entropy and the SVD are combined, so that the data over-processing can be reduced, and the component where the effective signal is located can be accurately selected to reconstruct the de-noised data segment. Under the conditions of low data quality and complex form noise of actually measured data, the magnetotelluric denoising technology based on approximate entropy and SVD decomposition can recover the change trend of an apparent resistivity-phase curve to obtain a continuous and smooth curve form.
According to the invention, the approximate entropy is adopted to pre-screen each segment of data before signal-noise separation, and noise has stronger directivity and more concentrated dispersion degree compared with effective signals, so that the data with better quality can be prevented from being further processed by pre-screening through the difference; and a plurality of singular values are extracted through the difference between the signal amplitudes, each singular value can calculate signal components containing different information, and the purpose of denoising is achieved through screening the components, so that the technology is irrelevant to the signal-to-noise ratio of data and is relevant to the number of the extracted singular values (the more the singular values are, the more the components are, the slower the processing speed is; the less the singular values are, the less the components are, effective signals can be lost); similarly, the invention does not carry out denoising according to the form, thereby simultaneously processing large-scale noise and sudden change noise without the limit of noise form.
The invention also provides a magnetotelluric denoising system, which comprises: the device comprises a magnetotelluric pulse signal measuring module, a sounding data segment obtaining module, an approximate entropy determining module of a sounding data segment, a sounding data segment determining module to be denoised, a matrix constructing module to be denoised, a singular value decomposition module, a field source component matrix obtaining module, a component data segment obtaining module, an approximate entropy determining module of a component data segment, a denoised sounding data segment obtaining module and a reconstructed denoised magnetotelluric pulse signal obtaining module.
And the magnetotelluric pulse signal measuring module is used for measuring magnetotelluric pulse signals of different depths of the earth by utilizing a magnetotelluric sounding method.
And the sounding data segment obtaining module is used for carrying out overlapping segmentation on the magnetotelluric pulse signals to obtain a plurality of sounding data segments.
And the approximate entropy determining module of the sounding data segment is used for determining the approximate entropy of each sounding data segment based on the approximate entropy theory.
And the depth data segment to be denoised determining module is used for determining the depth data segment with the approximate entropy smaller than or equal to the approximate entropy threshold as the depth data segment to be denoised and determining the depth data segment with the approximate entropy larger than the approximate entropy threshold as the effective depth data segment.
And the matrix construction module to be denoised is used for constructing the matrix to be denoised according to the sounding data segment to be denoised.
And the singular value decomposition module is used for performing singular value decomposition on the matrix to be denoised to obtain a singular value matrix, a left singular matrix and a right singular matrix of the matrix to be denoised.
And the field source component matrix obtaining module is used for multiplying each diagonal element of the singular value matrix, the column vector corresponding to the left singular matrix and the row vector corresponding to the right singular matrix to obtain a plurality of field source component matrixes of the sounding data section to be denoised.
A component data segment obtaining module for converting each field source component matrix into a component data segment.
And the approximate entropy determining module of the component data segments is used for determining the approximate entropy of each component data segment based on the approximate entropy theory.
And the denoised depth measurement data section obtaining module is used for screening the component data sections of which the approximate entropies are larger than that of the depth measurement data section to be denoised, and summing the screened component data sections to obtain the denoised depth measurement data section.
And the reconstruction denoising magnetotelluric pulse signal obtaining module is used for reconstructing the denoised depth sounding data segment and the effective depth sounding data segment to obtain a reconstruction denoising magnetotelluric pulse signal.
The sounding data segment obtaining module specifically comprises:
a sounding data segment obtaining submodule for obtaining magnetotelluric pulse signal x1,x2,…xwPerforming overlapping segmentation processing to obtain each depth measurement data segment Xv={x(v-1)*N+1,x(v-1)*N+2,…,x(v-1)*N+N}。
Wherein x is1,x2,…xwRespectively the 1 st, 2 nd and w th magnetotelluric pulse signals, w is the length of the magnetotelluric pulse signal, x(v-1)*N+1、x(v-1)*N+2、x(v-1)*N+NThe data are respectively the 1 st, 2 nd and nth data of the v-th sounding data segment, N is the sounding data segment length, v is 1,2, …, l is the number of sounding data segments, l is (w- α N)/N (1- α), and α is the data overlap ratio.
The approximate entropy determination module of the sounding data segment specifically comprises:
and the embedding dimension initialization submodule is used for initializing the embedding dimension h of the Hankel matrix.
And the Hankel matrix construction submodule is used for respectively constructing a Hankel matrix with an embedded dimension h for each depth sounding data segment. Each column in the hankel matrix is a mode vector.
Maximum distance determination submodule for utilizing a formula
And determining the maximum distance between the p-th mode vector in the Hankel matrix and the q-th mode vector of the Hankel matrix.
And the number counting submodule is used for counting the number of the p-th mode vector in the Hankel matrix, wherein the maximum distance between the p-th mode vector and all the mode vectors of the Hankel matrix is smaller than or equal to the mode fault-tolerant threshold.
An approximate probability value determining submodule for utilizing a formula C according to the number based on the approximate probability theoryp h(r)=Np h(r)/(N-h +1), determining the approximate probability value of the p-th mode vector in the Hankel matrix.
An approximate probability average determination submodule for utilizing a formula based on the approximate probability value
And determining the approximate probability average value of each sounding data segment under the h embedding dimension.
And the circulation submodule is used for increasing the embedding dimension h by 1, returning to the step of constructing a Hankel matrix with the embedding dimension h for each sounding data segment respectively, and determining the approximate probability average value of each sounding data segment under the embedding dimension h + 1.
An approximate entropy determination submodule of the sounding data segment for utilizing a formula according to the approximate probability average value of each sounding data segment in the h embedding dimension and the approximate probability average value of each sounding data segment in the h +1 embedding dimension
An approximate entropy for each sounding data segment is determined.
Wherein d is
max[y(p),y(q)]Is the maximum distance between the p-th mode vector in the Heckel matrix and the q-th mode vector in the Heckel matrix, y (p) is the p-th mode vector in the Heckel matrix, y (q) is the q-th mode vector in the Heckel matrix, x (p + delta) is the delta-th element of the p-th mode vector in the Heckel matrix, x (q + delta) is the delta-th element of the q-th mode vector in the Heckel matrix, delta is 1,2, …, h, C
p h(r) is the approximate probability value of the p-th pattern vector in the Hankel matrix, N
p h(r) is the number of maximum distances less than or equal to the mode fault tolerance threshold, r is the mode fault tolerance threshold, N is the sounding data segment length,
is an approximate probability average of the sounding data segment in the h embedding dimension,
is the approximate probability average, Ae, of the sounding data segment in the h +1 embedding dimension
iIs the approximate entropy of the ith sounding data segment.
The matrix construction module to be denoised specifically comprises:
an equal division module used for dividing the a-th depth sounding data segment X to be denoised according to the preset matrix dimensiona={x(a-1)*N+1,x(a-1)*N+2,…,x(a-1)*N+NAnd equally dividing into a plurality of sections.
A sub-module for constructing the matrix to be denoised according to each segment after being equally divided
Wherein, XaFor the a-th sounding data segment to be denoised, x(a-1)*N+1、x(a-1)*N+2、x(a-1)*N+n、x(a-1)*N+1+n、x(a-1)*N+2n、xa*N+1-nAnd x(a-1)*N+NRespectively representing the 1 st, 2 nd, nth, N +1 st, 2 nth, N-N +1 th and nth data of the depth measuring data segment to be denoised, wherein N is the length of the depth measuring data segment, N is the length of a preset matrix, N is N/m, and m is the dimension of the preset matrix.
The magnetotelluric denoising technology based on approximate entropy and SVD can reduce the phenomenon of 'over-processing' data under certain conditions; the requirement on the signal-to-noise ratio of data is low; the noise with complex forms can be suppressed.
The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. For the system disclosed by the embodiment, the description is relatively simple because the system corresponds to the method disclosed by the embodiment, and the relevant points can be referred to the method part for description.
The principles and embodiments of the present invention have been described herein using specific examples, which are provided only to help understand the method and the core concept of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed. In view of the above, the present disclosure should not be construed as limiting the invention.