[go: up one dir, main page]

CN119359606A - A multimedia image optimization method and system - Google Patents

A multimedia image optimization method and system Download PDF

Info

Publication number
CN119359606A
CN119359606A CN202411930713.8A CN202411930713A CN119359606A CN 119359606 A CN119359606 A CN 119359606A CN 202411930713 A CN202411930713 A CN 202411930713A CN 119359606 A CN119359606 A CN 119359606A
Authority
CN
China
Prior art keywords
brightness
image
layer
component
hsi
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202411930713.8A
Other languages
Chinese (zh)
Other versions
CN119359606B (en
Inventor
王振中
罗天
王娟
王丽霞
张金沙
陈芳
高江杰
孙畅
李佳伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Media Group
Original Assignee
China Media Group
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Media Group filed Critical China Media Group
Priority to CN202411930713.8A priority Critical patent/CN119359606B/en
Publication of CN119359606A publication Critical patent/CN119359606A/en
Application granted granted Critical
Publication of CN119359606B publication Critical patent/CN119359606B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/90Dynamic range modification of images or parts thereof
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/60Image enhancement or restoration using machine learning, e.g. neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/90Determination of colour characteristics

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Processing (AREA)

Abstract

本发明公开了一种多媒体图像优化方法及系统,涉及图像优化技术领域,包括获取待优化的多媒体图像并进行HSI颜色空间转换,提取HSI颜色空间中的亮度分量并进行亮度增强;调整增强后的亮度分量的对比度,并进行亮度区域的划分和动态范围优化;对优化后的亮度分量进行边缘增强,并重新组合生成增强后的HSI图像,将增强后的HSI图像转换回RGB颜色空间,得到优化后的多媒体图像。本发明实现了对图像亮度、对比度和细节的综合优化,提升了亮度层次感,解决了图像在高亮和暗部区域的细节丢失问题,改善了图像的边缘清晰度和色彩保真性,能够显著提升多媒体图像的视觉质量,满足多场景的实际应用需求。

The present invention discloses a multimedia image optimization method and system, which relates to the field of image optimization technology, including obtaining a multimedia image to be optimized and performing HSI color space conversion, extracting a brightness component in the HSI color space and performing brightness enhancement; adjusting the contrast of the enhanced brightness component, and performing brightness area division and dynamic range optimization; performing edge enhancement on the optimized brightness component, and recombining to generate an enhanced HSI image, and converting the enhanced HSI image back to the RGB color space to obtain an optimized multimedia image. The present invention realizes the comprehensive optimization of image brightness, contrast and details, improves the brightness layering, solves the problem of detail loss in the highlight and dark areas of the image, improves the edge clarity and color fidelity of the image, can significantly improve the visual quality of multimedia images, and meet the practical application needs of multiple scenes.

Description

Multimedia image optimization method and system
Technical Field
The invention relates to the technical field of image optimization, in particular to a multimedia image optimization method and system.
Background
With rapid development and wide application of multimedia technology, optimization of image quality has become a research area of great concern. The quality of multimedia images directly affects the visual experience of users, and especially in high-definition video, virtual Reality (VR), augmented Reality (AR) and other scenes, the requirements on image definition, brightness details and color expression are higher and higher. Conventional image optimization techniques typically focus on single-dimensional processing such as brightness enhancement, contrast adjustment, or color optimization. However, since the multimedia image is generally affected by noise, uneven illumination and color distortion of the acquisition device, these single optimization methods cannot comprehensively improve the image quality, and problems such as excessive enhancement, detail loss or unnatural color are easy to occur. In recent years, image processing methods based on color space are attracting attention, especially, the HSI color space can separate brightness from hue and saturation, and provides a new technical path for optimizing image quality. However, most of the existing methods only adjust a single channel or specific parameters, and cannot realize the omnibearing optimization of brightness, contrast and edge details.
Disclosure of Invention
The present invention has been made in view of the above-mentioned problems occurring in the conventional multimedia image optimization method and system.
Therefore, the problem to be solved by the present invention is that the existing method mostly adjusts only a single channel or specific parameters, and cannot realize the omnibearing optimization of brightness, contrast and edge details.
The technical scheme includes that the multimedia image optimization method comprises the steps of obtaining a multimedia image to be optimized, performing HSI color space conversion, extracting brightness components in the HSI color space, performing brightness enhancement, adjusting contrast of the enhanced brightness components, performing brightness region division and dynamic range optimization, performing edge enhancement on the optimized brightness components, recombining to generate an enhanced HSI image, and converting the enhanced HSI image back to an RGB color space to obtain the optimized multimedia image.
The method comprises the steps of obtaining a multimedia image to be optimized, performing HSI color space conversion, namely extracting original image data from multimedia equipment, uniformly converting the obtained image format into a standard format, and performing noise smoothing on the image through mean value filtering;
Normalizing the color channel data of the preprocessed original image, and calculating the brightness component I, the saturation S and the tone H of the image according to the normalized color channel data;
and obtaining a converted HSI color space and a corresponding HSI image.
The method comprises the steps of extracting a brightness component in an HSI color space and carrying out brightness enhancement, namely extracting a brightness component I from the HSI color space and taking the brightness component I as an independent channel, carrying out standardization on the extracted brightness component I, carrying out smoothing treatment on the brightness component through a bilateral filter, and outputting a final brightness component after finishing edge smoothing and standardization treatment;
applying a luminance transfer function to the luminance component enhances contrast and gradation of the original luminance:
Wherein A is the brightness value of the pixel after brightness conversion, a (x) is the brightness enhancement adjustment parameter, and I (x) is the brightness component;
Constructing a Gaussian pyramid for brightness components after brightness conversion, setting the pyramid as n layers, generating a plurality of resolution-level images, directly taking the converted brightness components by an initial layer, and carrying out Gaussian blur and downsampling on the previous layer image layer by layer from a second layer to generate each layer of low-resolution images until the n layer is reached;
after reaching the nth layer, obtaining a plurality of layers of Gaussian pyramids, wherein each layer represents low-frequency information with brightness in different spatial scales;
constructing a Laplacian pyramid based on the Gaussian pyramid, extracting high-frequency information from each layer of image of the Gaussian pyramid, upsampling the Gaussian pyramid image of the (l+1) th layer by a bilinear interpolation method, and calculating a Laplacian pyramid image L l (x, y) of the (L) th layer by using the following formula:
Where G l (x, y) is the first layer image of the gaussian pyramid, G l+1 (x, y) is the first +1 layer image of the gaussian pyramid, U () is an upsampling operation, G l+1 (x, y) is enlarged to the same resolution as the first layer G l (x, y), and method represents the interpolation method used for upsampling;
To be calculated to obtain Storing the high frequency component as a layer I Laplacian pyramid;
the last layer of the Gaussian pyramid is used as the lowest frequency information of the Laplacian pyramid, the high frequency information and the low frequency basic brightness information of each layer are integrated to form a complete Laplacian pyramid structure, and each layer of the Laplacian pyramid Is a multi-scale luminance component;
Inputting the brightness component of each layer into an initial convolution layer of the DCNN, capturing basic brightness characteristics through convolution operation, processing a convolution structure by using a ReLU activation function, and extracting basic details of brightness;
in the middle layer of DCNN, the output of the initial convolution layer H (z) is subjected to a multi-layer convolution process, each layer convolution being based on the output of the previous layer:
In the formula, Representing the luminance characteristics of the k-th layer output,For the layer k convolution kernel weights,For the luminance characteristics of the k-1 layer output,A bias term for the current layer;
Batch normalization of the output of each layer of convolution, and then using the ReLU activation function:
In the formula, Representing brightness characteristic output after the k-th layer convolution, batch normalization and activation processing, BN () being a batch normalization operation;
the brightness characteristics of the depth extraction are finally obtained through the layer-by-layer progressive processing of multi-layer convolution, batch normalization and activation functions;
based on the brightness component I, N is taken from each pixel point x in the image Calculating local brightness mean and standard deviation of pixel points x by pixel points in the N window, combining the local brightness mean and the local standard deviation of all the pixel points to generate a final illumination estimation graph T (x), and generating a weight matrix W (x) based on the illumination estimation graph;
The multi-scale brightness component and the depth brightness characteristic are adjusted to the same resolution through an up-sampling method and a down-sampling method;
channel combination is carried out on depth brightness characteristics of multiple channels by using an average value of all channels, the depth brightness characteristics are converted into single-channel brightness characteristics, after the formats are unified, weighted fusion is carried out through a weight matrix, so that fused brightness components are obtained, from the lowest frequency layer, laplacian pyramid components of each layer are sequentially subjected to up-sampling superposition, complete brightness components are gradually reconstructed, and the enhanced brightness components are obtained
The invention is used as a preferable scheme of the multimedia image optimization method, wherein the contrast of the enhanced brightness component is obtained and normalized by adjusting the enhanced brightness component, the solution group scale of the water circulation algorithm is defined as P, each solution represents a group of contrast parameters (s, d), wherein s is a contrast scaling factor, and d is a brightness offset;
initializing river solutions and streams, and simulating a process of searching an optimal solution by water flow;
And (3) carrying out contrast adjustment on the normalized brightness component according to the parameters (s, d) of the current solution:
In the formula, Is the luminance value after the adjustment,Is the normalized brightness value;
calculating an objective function value J for the adjusted brightness component, and evaluating the optimization effect of the current solution:
where M and N are the width and height of the image, representing the total range of pixels, Is the local luminance mean, x represents the position of the pixel in the horizontal direction, y represents the position of the pixel in the vertical direction;
updating river solutions and creek solutions according to the objective function values;
Setting the maximum iteration times, stopping iteration until the maximum iteration times are reached, selecting a solution with the minimum objective function value after the water circulation algorithm completes iteration, taking the parameter as the final optimal solution, using the optimal demodulation integral brightness component, and performing range clipping on the optimized brightness value to obtain the brightness component with the adjusted contrast
As a preferable scheme of the multimedia image optimization method, the method comprises the steps of dividing brightness areas and optimizing dynamic range, namely, dividing brightness componentsDivided into bright part, dark part and middle regulating area, threshold H andIf the brightness valueClassifying the pixel as a luminance region if the luminance valueClassifying the pixel as a dark region if the brightness value isClassifying the pixel as a middle adjustment region;
For a brightness region, reducing the output of brightness of a bright part by using a nonlinear compression function, for a dark part region, lifting dark part details by using logarithmic transformation, linearly adjusting an intermediate adjustment region, and balancing brightness transition;
The adjustment results of the bright portion, the dark portion and the intermediate tone region are combined into a final dynamic range optimized luminance component.
As a preferable scheme of the multimedia image optimization method, the method comprises the steps of carrying out edge enhancement on the optimized brightness component, and recombining to generate an enhanced HSI image, namely carrying out edge detail enhancement on the brightness component with optimized dynamic range by using a Laplacian operator, keeping the brightness component consistent with the tone and saturation in the original HSI image, and carrying out detail enhancement on the brightness componentReplacing the brightness channel in the original HSI image, and recombining the enhanced brightness component with the hue and saturation components in the HSI color space to form a final HSI image.
The method for optimizing the multimedia image comprises the steps of converting the enhanced HSI image back to an RGB color space, obtaining an optimized multimedia image, converting the HSI image into the RGB image through an image processing library OpenCV, obtaining the optimized multimedia image, storing the generated multimedia image into a standard format, and carrying out data backup.
It is another object of the present invention to provide a multimedia image optimizing system, which includes,
The image acquisition module is used for extracting original image data from the multimedia equipment and generating an HSI color space and a corresponding image;
the brightness enhancement module is used for extracting brightness components and enhancing the brightness components;
A contrast adjustment module for adjusting the contrast of the enhanced luminance component;
The range optimization module is used for carrying out region division on the optimized brightness components and carrying out dynamic range adjustment;
And the conversion output module is used for carrying out edge enhancement on the brightness component with the adjusted range, converting the HSI image into an RGB image and then storing the RGB image.
A computer device comprises a memory and a processor, wherein the memory stores a computer program, and the processor realizes the steps of a multimedia image optimization method when executing the computer program.
A computer readable storage medium having stored thereon a computer program which when executed by a processor implements the steps of a method of optimizing a multimedia image.
The invention has the beneficial effects that the comprehensive optimization of the brightness, contrast and details of the image is realized, the brightness layering sense is improved, the problem of detail loss of the image in the highlight and dark part areas is solved, the edge definition and color fidelity of the image are improved, the visual quality of the multimedia image can be obviously improved, and the practical application requirements of multiple scenes are met.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a method for optimizing multimedia images;
Fig. 2 is a schematic structural diagram of a multimedia image optimization system.
Detailed Description
In order that the above-recited objects, features and advantages of the present invention will become more readily apparent, a more particular description of the invention will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, but the present invention may be practiced in other ways other than those described, and persons skilled in the art will readily appreciate that the present invention is not limited to the specific embodiments disclosed below.
Further, reference herein to "one embodiment" or "an embodiment" means that a particular feature, structure, or characteristic can be included in at least one implementation of the invention. The appearances of the phrase "in one embodiment" in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments.
Embodiment 1, referring to fig. 1, is a first embodiment of the present invention, which provides a multimedia image optimization method, including,
S1, acquiring a multimedia image to be optimized, performing HSI color space conversion, extracting a brightness component in the HSI color space, and performing brightness enhancement;
specifically, acquiring a multimedia image to be optimized and performing HSI color space conversion refers to extracting original image data from multimedia equipment, uniformly converting the obtained image format into a standard format, and performing noise smoothing on the image through mean value filtering;
Normalizing the color channel data of the preprocessed original image, and calculating the brightness component I of the image according to the normalized color channel data:
wherein R, G, B is the normalized red, green and blue channel values respectively;
Calculating the saturation S of the image:
calculating the hue H of the image:
In the formula, Is an inverse cosine function;
and obtaining a converted HSI color space and a corresponding HSI image.
The method has the advantages that the image data in the multimedia equipment are uniformly converted into the standard format, format differences among the equipment are reduced, a consistency foundation is laid for subsequent image processing, high-frequency noise in the image is removed by means of average filtering, meanwhile, edge information of the image is reserved, clean data input is provided for normalization of the image color channel data, the image data of different equipment are distributed in the same standard range by means of normalization, brightness or contrast differences caused by the fact that the equipment is different are reduced, brightness component I represents brightness of the image, and the method is one of core targets of image optimization. The brightness information is extracted from the normalized data, so that the interference of image illumination and color factors is avoided, the generated HSI image separates brightness from color information, the subsequent independent enhancement operation on brightness components is facilitated, meanwhile, the influence on colors is avoided, in the process of generating the HSI image, the tone and saturation information of an original image are reserved, and a foundation is laid for the consistent display effect of the optimized image on different devices.
Further, extracting a luminance component in the HSI color space and performing luminance enhancement refers to extracting the luminance component I from the HSI color space and taking the luminance component I as an independent channel, performing smoothing on the luminance component through a bilateral filter after normalizing the extracted luminance component I, and outputting a final luminance component after finishing edge smoothing and normalization;
applying a luminance transfer function to the luminance component enhances contrast and gradation of the original luminance:
Wherein A is a brightness value of a pixel after brightness conversion, represents a brightness component after enhancement, is used for generating different brightness levels and enhancing contrast, a (x) is a brightness enhancement adjusting parameter and is dynamically associated with the brightness I (x) of the pixel, and is used for controlling the enhancement intensity of different brightness areas, and I (x) is the brightness component;
where c is a constant, no unit, for controlling the overall enhancement intensity, and is set according to enhancement requirements, for example, k=1 is taken to ensure that enhancement is significant at low luminance and enhancement is suppressed at high luminance, Is a positive number, is used for avoiding zero removal errors during calculation, is usually set to be a minimum value (such as 0.01), and has the main function of avoiding zero removal errors caused by the condition that the brightness is zero;
Constructing a Gaussian pyramid for brightness components after brightness conversion, setting the pyramid as n layers, generating a plurality of resolution-level images, directly taking the converted brightness components by an initial layer, and carrying out Gaussian blur and downsampling on the previous layer image layer by layer from a second layer to generate each layer of low-resolution images until the n layer is reached;
after reaching the nth layer, obtaining a plurality of layers of Gaussian pyramids, wherein each layer represents low-frequency information with brightness in different spatial scales;
constructing a Laplacian pyramid based on the Gaussian pyramid, extracting high-frequency information from each layer of image of the Gaussian pyramid, upsampling the Gaussian pyramid image of the (l+1) th layer by a bilinear interpolation method, and calculating a Laplacian pyramid image L l (x, y) of the (L) th layer by using the following formula:
Where G l (x, y) is the first layer image of the gaussian pyramid, representing the low frequency luminance component, mainly containing smooth and blurred luminance information, G l+1 (x, y) is the first +1 layer image of the gaussian pyramid, U () is the upsampling operation, G l+1 (x, y) is amplified to the same resolution as the first layer G l (x, y), and method represents the interpolation method used for upsampling, common methods include bilinear interpolation, bicubic interpolation and nearest neighbor interpolation, and bilinear interpolation is preferred in this scheme;
To be calculated to obtain Storing the high frequency component as a layer I Laplacian pyramid;
The high-frequency information is calculated layer by layer, so that each layer of Laplacian pyramid image extracts high-frequency details of a specific scale, and the fine brightness change of the image on each scale is reserved;
the last layer of the Gaussian pyramid is used as the lowest frequency information of the Laplacian pyramid, the high frequency information and the low frequency basic brightness information of each layer are integrated to form a complete Laplacian pyramid structure, and each layer of the Laplacian pyramid Is a multi-scale luminance component;
constructing a complete Laplacian pyramid structure to provide layering information for image reconstruction and detail enhancement so as to refine and reconstruct details with different scales in subsequent processing;
the high-frequency brightness information represents the detail part of each layer of image, and the low-frequency brightness information represents the overall brightness trend of the image;
The luminance component of each layer is input to an initial convolution layer of the DCNN, basic luminance characteristics are captured through convolution operation, a convolution structure is processed by using a ReLU activation function, and basic details of luminance are extracted:
In the formula, Is the brightness characteristic extracted by the initial convolution layer, is the basic characteristic expression of brightness under the current scale,Is a convolution kernel weight matrix used to extract the basic details of luminance information in the convolution operation,Is an offset term used for adjusting the brightness characteristic value after convolution to ensure the data distribution balance,Removing the negative value for the ReLU activation function, and reserving the positive part of the brightness information to ensure the nonlinear expression of the brightness characteristics, wherein I is a brightness component;
the initial convolution layer is mainly used for capturing basic details of the brightness component, and a ReLU activation function is used for removing negative values, so that nonlinear expression capacity can be improved, and loss of brightness information is avoided;
in the middle layer of DCNN, the output of the initial convolution layer H (z) is subjected to a multi-layer convolution process, each layer convolution being based on the output of the previous layer:
In the formula, Representing the luminance characteristics of the k-th layer output, including finer luminance information extracted by that layer,For the k-th layer convolution kernel weight, for extracting refined luminance details at the current level,The luminance characteristics of the output for the k-1 layer, as input to the current layer,The bias item is used for further adjusting the brightness value;
the multi-layer convolution enables the brightness characteristics to be progressively extracted and refined on different layers, so that details are more abundant;
Batch normalization of the output of each layer of convolution, and then using the ReLU activation function:
In the formula, Representing brightness characteristic output after the k-th layer convolution, batch normalization and activation processing, BN () being a batch normalization operation;
batch normalization normalizes features, ensures data distribution stability, and avoids gradient extinction or explosion. ReLU activation retains the positive part of brightness, enhancing nonlinear expression;
the brightness characteristics of the depth extraction are finally obtained through the layer-by-layer progressive processing of multi-layer convolution, batch normalization and activation functions;
the characteristic contains brightness information after multi-scale decomposition, and is refined on different levels;
based on the brightness component I, N is taken from each pixel point x in the image Calculating local brightness mean and standard deviation of pixel points x by pixel points in the N window, combining the local brightness mean and the local standard deviation of all the pixel points to generate a final illumination estimation graph T (x), and generating a weight matrix W (x) based on the illumination estimation graph;
the multi-scale luminance component and the depth luminance feature are adjusted to the same resolution by up-sampling and down-sampling methods, and in this embodiment, the specific adjustment method is:
Acquiring width w and height H of L (x, y) and H (x), respectively, comparing AndIf (if)L (x, y) is high resolution, H (x) is low resolution, otherwise H (x) is high resolution, L (x, y) is low resolution;
upsampling the low-resolution multi-scale luminance component L (x, y) to match the resolution of the depth luminance feature H (x), if the resolution of the depth luminance feature H (x) is high, downsampling the H (x) to ensure that it is consistent with the multi-scale component;
Channel combination is carried out on depth brightness characteristics of multiple channels by using an average value of all channels, the depth brightness characteristics are converted into single-channel brightness characteristics, after the formats are unified, weighted fusion is carried out through a weight matrix, the depth brightness characteristics extracted by DCNN (direct digital network) and multi-scale brightness components are weighted and fused, details of a bright part and a dark part can be dynamically balanced through the weight matrix generated by an illumination estimation graph, the fused brightness effect is more natural, the fused brightness components are obtained, the Laplace pyramid components of each layer are sequentially subjected to up-sampling superposition from the lowest frequency layer, the complete brightness components are gradually reconstructed, and the enhanced brightness components are obtained ;
By combining the depth brightness characteristics and the reconstruction process of the multi-scale components, the detail information of the brightness in different scales can be effectively integrated, so that the image with the enhanced brightness has more layering sense and the detail is natural.
The spatial position and the intensity difference of the pixels are simultaneously considered through the bilateral filter, so that the edge and texture information of the image can be effectively protected while high-efficiency noise reduction is realized. The method avoids the problem of edge blurring caused by the traditional filter and provides a high-quality basis for the subsequent processing of the image. The nonlinear adjustment of the original brightness is realized through the brightness conversion function, so that the contrast of the brightness of the image can be enhanced, and the bright area is brighter and the details of the dark area are richer. The introduction of the brightness adjustment parameters enables the brightness conversion process to be flexibly adjusted according to different scenes and image requirements, so that the image optimization effect which is more in line with the visual perception of human eyes is achieved. The multi-level low-frequency information of the image is extracted through the Gaussian pyramid, so that the brightness enhancement processing can be optimized for the image features of different scales, and the problem of local over-brightness or over-darkness caused by single-scale processing in the brightness enhancement process is avoided. The Laplacian pyramid is used for extracting high-frequency information of the image, so that edge features are subjected to key treatment in the enhancement process, and the sharpness and edge definition of the image are effectively improved. Through multi-layer convolution and batch normalization operation, the DCNN can extract multi-level detail features from the original brightness component, wherein the multi-level detail features comprise local brightness information and global brightness modes, so that the image is more layered. And a ReLU activation function and a batch normalization technology are introduced, so that the nonlinear expression capability of the neural network is enhanced, the convergence speed and stability of network training are improved, and the extracted brightness characteristics are ensured to have higher reliability. The multi-layer feature extraction mechanism of the DCNN can restore fine textures in the image while the brightness is enhanced, so that the optimized image is more similar to a real scene. The illumination condition of each pixel point in the image can be accurately described through the generation of the illumination estimation graph T (x), and the dynamic adjustment of the brightness component is realized by combining the weight matrix W (x), so that the problem of dark image or overexposure caused by uneven illumination is effectively solved. In the brightness fusion process, the weight matrix can balance the prominence of local details and the uniformity of global brightness distribution, so that the final image has local definition and maintains the overall visual consistency. By sampling and fusing the Laplacian pyramid components of each layer, detail information in the multi-scale brightness characteristics can be completely restored, and the optimized brightness components are ensured to show high-quality details under any resolution.
S2, adjusting the contrast of the enhanced brightness component, and dividing a brightness area and optimizing a dynamic range;
Specifically, adjusting the contrast of the enhanced luminance component refers to obtaining the enhanced luminance component and performing normalization processing, defining the solution population scale of the water circulation algorithm as P, wherein each solution represents a group of contrast parameters (s, d), s refers to a contrast scaling factor, which is used for adjusting the contrast of the image luminance, controlling the difference between luminance values, d refers to a luminance offset, which is used for adjusting the overall luminance level of the image, determining the offset of the luminance value in the whole image, and setting the initial range of the parameters;
initializing river solutions and streams, and simulating a process of searching an optimal solution by water flow;
And (3) carrying out contrast adjustment on the normalized brightness component according to the parameters (s, d) of the current solution:
In the formula, Is the luminance value after the adjustment,The normalized brightness value is in the range of 0, 1;
calculating an objective function value J for the adjusted brightness component, and evaluating the optimization effect of the current solution:
where M and N are the width and height of the image, representing the total range of pixels, Is the local luminance mean, x represents the position of the pixel in the horizontal direction, y represents the position of the pixel in the vertical direction;
updating river solutions and streams according to objective function values:
In the formula, AndIs a new value for the contrast scaling factor s and the brightness offset d in the updated river solution,AndIs the new value of the contrast scaling factor s and the brightness offset d in the updated streams solution,AndIs the original value of the contrast scaling factor s and the brightness offset d in the current river solution,AndIs the original value of the contrast scaling factor s and the brightness offset d in the current stream solution,AndIs the solution with the minimum objective function value in the current solution group, and represents the optimal solution of contrast scaling coefficient and brightness offset, r andThe random disturbance factor is in the range of 0,1 and is used for simulating uncertainty in natural water flow, increasing the diversity of searching, and r is calculated by a random number generation technology;
Setting the maximum iteration times, stopping iteration until the maximum iteration times are reached, selecting a solution with the minimum objective function value after the water circulation algorithm completes iteration, taking the parameter as the final optimal solution, using the optimal demodulation integral brightness component, and performing range clipping on the optimized brightness value to obtain the brightness component with the adjusted contrast
The normalization processing unifies the value ranges of the brightness components to 0,1, so that the difference of brightness ranges of different images is eliminated, and the subsequent contrast adjustment is more unified and standardized. The global search is realized through river solutions, the contrast optimization is ensured to cover a larger solution space, the situation that a local optimal solution is trapped in is avoided, the stream solution is responsible for local detail adjustment, so that the brightness detail is accurately adjusted on the basis of global optimization, the dynamic process of searching the optimal solution by simulating water flow is simulated, the global search capability and the local optimization capability are achieved, the optimization result is more accurate, the contrast adjustment function dynamically adjusts the brightness range and the overall brightness level through s and d, the image shows a clearer visual effect, the method is suitable for various image characteristics (such as a high-contrast scene or a low-contrast scene), and the overall visual quality is improved. The objective function value J quantifies the brightness adjustment effect, ensures the reliability of an optimization result through comprehensive evaluation of brightness uniformity and local details, limits the operation time of an algorithm by the maximum iteration times, avoids low calculation efficiency caused by overlarge solution space, ensures more uniform brightness distribution of an image after contrast adjustment through selecting a solution with the minimum objective function value, has optimal visual effect, performs range clipping on the optimized brightness component, ensures that pixel values are in a display range supported by equipment, improves the compatibility of the image on different equipment, and ensures the display effect of the optimized image.
Further, performing the division of the luminance area and the dynamic range optimization means to divide the luminance componentDivided into bright part, dark part and middle regulating area, threshold H andIf the brightness valueClassifying the pixel as a luminance region if the luminance valueClassifying the pixel as a dark region if the brightness value isClassifying the pixel as a middle adjustment region;
For a brightness region, reducing the output of brightness of a bright part by using a nonlinear compression function, for a dark part region, lifting dark part details by using logarithmic transformation, linearly adjusting an intermediate adjustment region, and balancing brightness transition;
The adjustment results of the bright portion, the dark portion and the intermediate tone region are combined into a final dynamic range optimized luminance component.
The combination mode ensures that details of different brightness areas can be clearly expressed visually, bright parts cannot be overexposed, dark part details are promoted, the intermediate adjustment area is natural in transition, and finally, the obtained brightness component is subjected to area self-adaptive adjustment, so that the brightness component has higher detail level sense and balanced brightness contrast.
Through the brightness region division, the accurate adjustment of different brightness regions can be realized without influencing the details of other regions, after an image is divided into a bright part, a dark part and a middle adjusting region, the most suitable optimization method can be adopted for each region, the details of the image are reserved to the greatest extent, the brightness region division is beneficial to the high efficiency of dynamic range optimization, the local contrast and the whole layering sense of the image are improved, the nonlinear compression function can effectively reduce the brightness of the excessively high bright part, the loss of the details of the high bright part in the image is avoided, the details of the bright part can be effectively enhanced while the natural transition is maintained, the overall impression of the image is improved, the high bright part is more smoothly transited to the middle adjusting region, the brightness range of the dark part region is stretched by logarithmic transformation, the details are easier to distinguish under the condition of dark light, the logarithmic transformation avoids the interference of a highlight region, enhances the low-brightness region, has higher optimization efficiency, linearly adjusts the intermediate adjustment region through smooth transition, balances the contrast difference between the bright part and the dark part, makes the image more natural, keeps the integrity of key details in the image, simultaneously provides good transition for the optimization of the bright part and the dark part, realizes the balance of the overall brightness distribution of the image through the adjustment of the bright part, the dark part and the intermediate adjustment region by dynamic range optimization, avoids the overexposure of the bright part region and the underexposure of the dark part region, ensures that each part of the image can be clearly presented, ensures the integral consistency and the naturalness of the image after integrating the adjustment results of the bright part, the dark part and the intermediate adjustment region, and has smooth transition of different regions in the merging process, thereby avoiding abrupt visual effects.
S3, carrying out edge enhancement on the optimized brightness component, and recombining to generate an enhanced HSI image, and converting the enhanced HSI image back to an RGB color space to obtain an optimized multimedia image;
Specifically, edge enhancement is performed on the optimized luminance component, and the enhanced HSI image is generated by recombination, namely, edge detail enhancement is performed on the luminance component with optimized dynamic range by using the Laplacian, the luminance component is consistent with the tone and saturation in the original HSI image, and the detail-enhanced luminance component is maintained Replacing the brightness channel in the original HSI image, and recombining the enhanced brightness component with the hue and saturation components in the HSI color space to form a final HSI image.
The method has the advantages that the edge outline is sharper through the extraction and superposition of the brightness high-frequency information, the quick transmission of visual information is facilitated, the structural characteristics of the image are clearer after the edge enhancement, the method is suitable for the image analysis of complex scenes, such as object recognition or image segmentation, the edge enhancement is only operated on brightness components, the color tone and the saturation are not directly processed, the unnatural change of the color of the image is avoided, the color tone and the saturation information are not affected through the replacement of the brightness channel, and the color of the image is more natural. And combining the enhanced brightness channel with hue and saturation components of the HSI image to generate a complete HSI image. The enhanced HSI image may be used directly in a visual perception system (such as autopilot or remote sensing image processing) or converted to an RGB image for display. The enhanced HSI image is suitable for display effects under different illumination conditions, and universality of the multimedia image is enhanced.
Further, the enhanced HSI image is converted back into an RGB color space, the optimized multimedia image is obtained by converting the HSI image into the RGB image through an image processing library OpenCV, and the generated multimedia image is stored as a standard format and is subjected to data backup.
Through the built-in optimization algorithm of OpenCV, realize quick and high-accuracy color space conversion, reduce manual calculation and potential error, RGB format is the mainstream standard of current display device, the image after the conversion can directly use on multimedia device, through accurate color space mapping, keep the color gradation of image, not influence tone and saturation when optimizing luminance and contrast, ensure through backup mechanism that data can not lose because of equipment failure or manual maloperation, the image with metadata is convenient for version management and follow-up optimization work, the application efficiency of multimedia image has been promoted.
Embodiment 2, referring to fig. 2, which is a second embodiment of the present invention, which is different from the previous embodiment, there is provided a multimedia image optimizing system, which includes,
The image acquisition module is used for extracting original image data from the multimedia equipment and generating an HSI color space and a corresponding image;
the brightness enhancement module is used for extracting brightness components and enhancing the brightness components;
A contrast adjustment module for adjusting the contrast of the enhanced luminance component;
The range optimization module is used for carrying out region division on the optimized brightness components and carrying out dynamic range adjustment;
And the conversion output module is used for carrying out edge enhancement on the brightness component with the adjusted range, converting the HSI image into an RGB image and then storing the RGB image.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present invention. The storage medium includes a U disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk, an optical disk, or other various media capable of storing program codes.
Logic and/or steps represented in the flowcharts or otherwise described herein, e.g., a ordered listing of executable instructions for implementing logical functions, can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. For the purposes of this description, a "computer-readable medium" can be any means that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
More specific examples (a non-exhaustive list) of the computer-readable medium include an electrical connection (an electronic device) having one or more wires, a portable computer diskette (a magnetic device), a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program is printed, as the program may be electronically captured, via, for instance, optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner, if necessary, and then stored in a computer memory.
It is to be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, may be implemented using any one or combination of techniques known in the art, discrete logic circuits with logic gates for implementing logic functions on data signals, application specific integrated circuits with appropriate combinational logic gates, programmable Gate Arrays (PGAs), field Programmable Gate Arrays (FPGAs), and the like.

Claims (10)

1.一种多媒体图像优化方法,其特征在于:包括,1. A multimedia image optimization method, characterized in that: it includes: 获取待优化的多媒体图像并进行HSI颜色空间转换,提取HSI颜色空间中的亮度分量并进行亮度增强;Acquire the multimedia image to be optimized and perform HSI color space conversion, extract the brightness component in the HSI color space and perform brightness enhancement; 调整增强后的亮度分量的对比度,并进行亮度区域的划分和动态范围优化;Adjust the contrast of the enhanced brightness component, and perform brightness area division and dynamic range optimization; 对优化后的亮度分量进行边缘增强,并重新组合生成增强后的HSI图像,将增强后的HSI图像转换回RGB颜色空间,得到优化后的多媒体图像。The optimized brightness components are edge enhanced and recombined to generate an enhanced HSI image, and the enhanced HSI image is converted back to RGB color space to obtain an optimized multimedia image. 2.如权利要求1所述的多媒体图像优化方法,其特征在于:所述获取待优化的多媒体图像并进行HSI颜色空间转换指从多媒体设备中提取原始图像数据,将得到的图像格式统一转换为标准格式并通过均值滤波对图像进行噪声平滑处理;2. The multimedia image optimization method according to claim 1, wherein: the obtaining of the multimedia image to be optimized and performing HSI color space conversion refers to extracting original image data from the multimedia device, converting the obtained image format into a standard format, and performing noise smoothing on the image by mean filtering; 对预处理后的原始图像的颜色通道数据进行归一化,根据归一化后的颜色通道数据计算图像的亮度分量I,饱和度S和色调H;Normalize the color channel data of the preprocessed original image, and calculate the brightness component I, saturation S and hue H of the image based on the normalized color channel data; 得到转换后的HSI颜色空间以及对应的HSI图像。Get the converted HSI color space and the corresponding HSI image. 3.如权利要求2所述的多媒体图像优化方法,其特征在于:所述提取HSI颜色空间中的亮度分量并进行亮度增强指从HSI颜色空间中提取亮度分量I并作为独立通道,对提取的亮度分量I进行标准化后通过双边滤波器对亮度分量进行平滑处理,在完成边缘平滑和标准化处理后输出最终的亮度分量;3. The multimedia image optimization method according to claim 2, wherein the extracting the brightness component in the HSI color space and performing brightness enhancement refers to extracting the brightness component I from the HSI color space and using it as an independent channel, normalizing the extracted brightness component I and then smoothing the brightness component through a bilateral filter, and outputting the final brightness component after completing edge smoothing and normalization processing; 对亮度分量应用亮度转换函数,增强原始亮度的对比和层次:Apply a brightness conversion function to the brightness component to enhance the contrast and depth of the original brightness: ; 式中,A是亮度转换后的像素亮度值,a(x)是亮度增强调节参数,I(x)是亮度分量;Where A is the pixel brightness value after brightness conversion, a(x) is the brightness enhancement adjustment parameter, and I(x) is the brightness component; 对亮度转换后的亮度分量构建高斯金字塔,设定金字塔为n层,生成多个分辨率层次的图像,初始层直接取转换后的亮度分量,从第二层起,逐层对前一层图像进行高斯模糊和下采样,生成每一层低分辨率图像,直到达到第n层;A Gaussian pyramid is constructed for the brightness component after brightness conversion. The pyramid is set to n layers to generate images at multiple resolution levels. The initial layer directly takes the converted brightness component. From the second layer onwards, the previous layer image is Gaussian blurred and downsampled layer by layer to generate low-resolution images at each layer until the nth layer is reached. 在达到第n层之后得到多层高斯金字塔,每层代表亮度在不同空间尺度的低频信息;After reaching the nth layer, a multi-layer Gaussian pyramid is obtained, each layer represents the low-frequency information of brightness at different spatial scales; 基于高斯金字塔构建拉普拉斯金字塔,从高斯金字塔的每一层图像中提取高频信息,通过双线性插值方法对第l+1层的高斯金字塔图像进行上采样,使用以下公式计算第l层的拉普拉斯金字塔图像Ll(x,y):The Laplacian pyramid is constructed based on the Gaussian pyramid. High-frequency information is extracted from each layer of the Gaussian pyramid image. The Gaussian pyramid image of the l+1th layer is upsampled by the bilinear interpolation method. The Laplacian pyramid image of the lth layer is calculated using the following formula: ; 式中,Gl(x,y)是高斯金字塔的第l层图像,Gl+1(x,y)是高斯金字塔的第l+1层图像,U()是上采样操作,将Gl+1(x,y)放大到与第l层Gl(x,y)相同的分辨率,method表示用于上采样的插值方法;Where G l (x, y) is the lth layer image of the Gaussian pyramid, G l+1 (x, y) is the l+1th layer image of the Gaussian pyramid, U() is the upsampling operation, which enlarges G l+1 (x, y) to the same resolution as the lth layer G l (x, y), and method represents the interpolation method used for upsampling; 将计算得到的存储为第l层拉普拉斯金字塔的高频分量;The calculated Stored as the high-frequency component of the l-th layer of the Laplacian pyramid; 将高斯金字塔的最后一层作为拉普拉斯金字塔的最低频信息,将每一层的高频信息与低频基础亮度信息整合,形成完整的拉普拉斯金字塔结构,拉普拉斯金字塔的各层为多尺度亮度分量;The last layer of the Gaussian pyramid is used as the lowest frequency information of the Laplace pyramid, and the high frequency information of each layer is integrated with the low frequency basic brightness information to form a complete Laplace pyramid structure. is the multi-scale brightness component; 将每一层的亮度分量输入到DCNN的初始卷积层,通过卷积操作捕捉基本亮度特征,并使用ReLU激活函数处理卷积结构,提取亮度的基础细节;The brightness component of each layer is input into the initial convolution layer of DCNN to capture the basic brightness features through convolution operation, and the convolution structure is processed using the ReLU activation function to extract the basic details of brightness; 在DCNN的中间层,对初始卷积层H(z)的输出进行多层卷积处理,每层卷积都基于上一层的输出:In the middle layer of DCNN, the output of the initial convolutional layer H(z) is processed by multiple layers of convolution, and each layer of convolution is based on the output of the previous layer: ; 式中,表示第k层输出的亮度特征,为第k层卷积核权重,为第k-1层输出的亮度特征,为当前层的偏置项;In the formula, represents the brightness feature of the k-th layer output, is the k-th convolution kernel weight, is the brightness feature output from the k-1th layer, is the bias item of the current layer; 对每一层卷积的输出进行批归一化,再使用ReLU激活函数:The output of each convolution layer is batch normalized and then the ReLU activation function is used: ; 式中,表示经过第k层卷积、批归一化和激活处理后的亮度特征输出,BN()是批归一化操作;In the formula, Represents the brightness feature output after the k-th layer of convolution, batch normalization and activation processing. BN() is the batch normalization operation; 经过多层卷积、批归一化和激活函数的层层递进处理,最终得到深度提取的亮度特征;After multiple layers of convolution, batch normalization and activation function, the deep extracted brightness features are finally obtained; 基于亮度分量I,在图像中的每个像素点x,取NN窗口内的像素点计算像素点x的局部亮度均值和标准差,将所有像素点的局部亮度均值和局部标准差结合,生成最终的光照估计图T(x),基于光照估计图生成权重矩阵W(x);Based on the brightness component I, at each pixel x in the image, take N The local brightness mean and standard deviation of pixel x are calculated for the pixels in the N windows, and the local brightness mean and local standard deviation of all pixels are combined to generate the final illumination estimation map T(x). The weight matrix W(x) is generated based on the illumination estimation map. 通过上采样和下采样方法将多尺度亮度分量和深度亮度特征调整到相同的分辨率;The multi-scale brightness components and deep brightness features are adjusted to the same resolution through upsampling and downsampling methods; 使用全通道的均值将多通道的深度亮度特征进行通道合并,转换为单通道亮度特征,在格式统一后,通过权重矩阵进行加权融合,得到融合后的亮度分量,从最低频率层开始,将每层的拉普拉斯金字塔分量依次进行上采样叠加,逐步重建出完整的亮度分量,得到增强后的亮度分量The multi-channel depth brightness features are merged using the mean of all channels and converted into single-channel brightness features. After the format is unified, weighted fusion is performed through the weight matrix to obtain the fused brightness component. Starting from the lowest frequency layer, the Laplacian pyramid components of each layer are upsampled and superimposed in turn to gradually reconstruct the complete brightness component to obtain the enhanced brightness component. . 4.如权利要求3所述的多媒体图像优化方法,其特征在于:所述调整增强后4. The multimedia image optimization method according to claim 3, characterized in that: after the adjustment and enhancement 的亮度分量的对比度指获取增强后的亮度分量并进行归一化处理,定义水循环算法的解群体规模为P,每个解代表一组对比度参数(s,d),其中s指对比度缩放系数,d指亮度偏移量;The contrast of the brightness component refers to obtaining the enhanced brightness component and normalizing it. The solution group size of the water cycle algorithm is defined as P, and each solution represents a set of contrast parameters (s, d), where s refers to the contrast scaling factor and d refers to the brightness offset; 初始化河流解和小溪解,模拟水流寻找最优解的过程;Initialize the river solution and the stream solution, and simulate the process of water flow to find the optimal solution; 根据当前解的参数(s,d),对归一化后的亮度分量进行对比度调整:According to the parameters (s, d) of the current solution, the contrast of the normalized brightness component is adjusted: ; 式中,是调整后的亮度值,是归一化后的亮度值;In the formula, is the adjusted brightness value, is the normalized brightness value; 对调整后的亮度分量计算目标函数值J,评估当前解的优化效果:Calculate the objective function value J for the adjusted brightness component to evaluate the optimization effect of the current solution: ; 式中,M和N是图像的宽度和高度,表示像素的总范围,是局部亮度均值,x表示像素在水平方向上的位置,y表示像素在垂直方向上的位置;Where M and N are the width and height of the image, representing the total range of pixels. is the local brightness mean, x represents the horizontal position of the pixel, and y represents the vertical position of the pixel; 根据目标函数值更新河流解和小溪解;Update the river solution and the stream solution according to the objective function value; 设定最大迭代次数,直到到达最大迭代次数停止迭代,在水循环算法完成迭代后,选择目标函数值最小的解,将其参数作为最终的最优解,使用最优解调整亮度分量,并对优化后的亮度值进行范围裁剪,得到对比度调整后的亮度分量Set the maximum number of iterations until the maximum number of iterations is reached and stop the iteration. After the water cycle algorithm completes the iteration, select the solution with the smallest objective function value and use its parameters as the final optimal solution. Use the optimal solution to adjust the brightness component and perform range clipping on the optimized brightness value to obtain the brightness component after contrast adjustment. . 5.如权利要求4所述的多媒体图像优化方法,其特征在于:所述进行亮度区域的划分和动态范围优化指将亮度分量划分为亮部,暗部和中间调区域,设定阈值H和,若亮度值,则将该像素归类为亮度区域,若亮度值,则将该像素归类为暗部区域,若亮度值,则将该像素归类为中间调区域;5. The multimedia image optimization method according to claim 4, wherein the brightness region division and dynamic range optimization refers to dividing the brightness component Divide into bright, dark and mid-tone areas, set the threshold H and , if the brightness value , then the pixel is classified as a brightness area. If the brightness value , then the pixel is classified as a dark area. If the brightness value , then the pixel is classified as a midtone area; 对于亮度区域,使用非线性压缩函数降低亮部亮度的输出,对于暗部区域,使用对数变换提升暗部细节,对于中间调区域进行线性调整,平衡亮度过渡;For the brightness area, a nonlinear compression function is used to reduce the output of the brightness of the bright part. For the dark area, a logarithmic transformation is used to enhance the dark details. For the mid-tone area, a linear adjustment is performed to balance the brightness transition. 将亮部、暗部和中间调区域的调整结果组合成最终的动态范围优化亮度分量。Combines adjustments to the highlight, shadow, and midtone areas into a final dynamic range optimized luminance component. 6.如权利要求5所述的多媒体图像优化方法,其特征在于:所述对优化后的亮度分量进行边缘增强,并重新组合生成增强后的HSI图像指使用拉普拉斯算子对动态范围优化后的亮度分量进行边缘细节增强,将亮度分量与原HSI图像中的色调和饱和度保持一致,将细节增强后的亮度分量替换原HSI图像中的亮度通道,将增强后的亮度分量与HSI颜色空间中的色调和饱和度分量重新组合,形成最终的HSI图像。6. The multimedia image optimization method according to claim 5, characterized in that: the edge enhancement of the optimized brightness component and the recombining to generate the enhanced HSI image refers to using the Laplace operator to enhance the edge details of the brightness component after dynamic range optimization, keeping the brightness component consistent with the hue and saturation in the original HSI image, and recombining the brightness component after detail enhancement to generate the enhanced HSI image. The brightness channel in the original HSI image is replaced, and the enhanced brightness component is recombined with the hue and saturation components in the HSI color space to form the final HSI image. 7.如权利要求6所述的多媒体图像优化方法,其特征在于:所述将增强后的HSI图像转换回RGB颜色空间,得到优化后的多媒体图像指通过图像处理库OpenCV将HSI图像转换为RGB图像后得到优化后的多媒体图像,将生成的多媒体图像保存为标准格式并进行数据备份。7. The multimedia image optimization method according to claim 6 is characterized in that: converting the enhanced HSI image back to the RGB color space to obtain the optimized multimedia image means converting the HSI image into an RGB image through the image processing library OpenCV to obtain the optimized multimedia image, saving the generated multimedia image in a standard format and performing data backup. 8.一种基于权利要求1-7任一所述的多媒体图像优化方法的多媒体图像优化系统,其特征在于:包括,8. A multimedia image optimization system based on the multimedia image optimization method according to any one of claims 1 to 7, characterized in that it comprises: 图像获取模块,用于从多媒体设备中提取原始图像数据并生成HSI颜色空间及对应图像;An image acquisition module is used to extract raw image data from multimedia devices and generate HSI color space and corresponding images; 亮度增强模块,用于提取亮度分量并对亮度分量进行增强;A brightness enhancement module, used for extracting and enhancing brightness components; 对比度调整模块,用于调整增强后的亮度分量的对比度;A contrast adjustment module, used for adjusting the contrast of the enhanced brightness component; 范围优化模块,用于对优化后的亮度分量进行区域划分并进行动态范围调整;A range optimization module, used for dividing the optimized brightness components into regions and adjusting the dynamic range; 转化输出模块,用于将范围调整后的亮度分量进行边缘增强后将HSI图像转换为RGB图像后进行保存。The conversion output module is used to perform edge enhancement on the brightness component after range adjustment and convert the HSI image into an RGB image for saving. 9.一种计算机设备,包括:存储器和处理器;所述存储器存储有计算机程序,其特征在于:所述处理器执行所述计算机程序时实现权利要求1至7中任一项所述的多媒体图像优化方法的步骤。9. A computer device, comprising: a memory and a processor; the memory stores a computer program, characterized in that: when the processor executes the computer program, the steps of the multimedia image optimization method described in any one of claims 1 to 7 are implemented. 10.一种计算机可读存储介质,其上存储有计算机程序,其特征在于:所述计算机程序被处理器执行时实现权利要求1至7中任一项所述的多媒体图像优化方法的步骤。10. A computer-readable storage medium having a computer program stored thereon, wherein the computer program, when executed by a processor, implements the steps of the multimedia image optimization method according to any one of claims 1 to 7.
CN202411930713.8A 2024-12-26 2024-12-26 A multimedia image optimization method and system Active CN119359606B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202411930713.8A CN119359606B (en) 2024-12-26 2024-12-26 A multimedia image optimization method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202411930713.8A CN119359606B (en) 2024-12-26 2024-12-26 A multimedia image optimization method and system

Publications (2)

Publication Number Publication Date
CN119359606A true CN119359606A (en) 2025-01-24
CN119359606B CN119359606B (en) 2025-04-18

Family

ID=94306897

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202411930713.8A Active CN119359606B (en) 2024-12-26 2024-12-26 A multimedia image optimization method and system

Country Status (1)

Country Link
CN (1) CN119359606B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119904604A (en) * 2025-03-04 2025-04-29 中氢投电力(北京)有限公司 An intelligent parking assistance system based on image acquisition

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100080485A1 (en) * 2008-09-30 2010-04-01 Liang-Gee Chen Chen Depth-Based Image Enhancement
CN101951523A (en) * 2010-09-21 2011-01-19 北京工业大学 Adaptive colour image processing method and system
CN106780379A (en) * 2016-12-08 2017-05-31 哈尔滨工业大学 The microscopical colour-image reinforcing method of one kind metering
CN112465711A (en) * 2020-10-30 2021-03-09 南京理工大学 Degraded image enhancement method for foggy environment
CN115131248A (en) * 2022-07-23 2022-09-30 南京信息工程大学 Underwater image restoration and contrast and edge enhancement method thereof

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100080485A1 (en) * 2008-09-30 2010-04-01 Liang-Gee Chen Chen Depth-Based Image Enhancement
CN101951523A (en) * 2010-09-21 2011-01-19 北京工业大学 Adaptive colour image processing method and system
CN106780379A (en) * 2016-12-08 2017-05-31 哈尔滨工业大学 The microscopical colour-image reinforcing method of one kind metering
CN112465711A (en) * 2020-10-30 2021-03-09 南京理工大学 Degraded image enhancement method for foggy environment
CN115131248A (en) * 2022-07-23 2022-09-30 南京信息工程大学 Underwater image restoration and contrast and edge enhancement method thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李昌利 等: ""基于多通道均衡化的水下彩色图像增强算法"", 《华中科技大学学报(自然科学版)》, 17 June 2019 (2019-06-17) *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119904604A (en) * 2025-03-04 2025-04-29 中氢投电力(北京)有限公司 An intelligent parking assistance system based on image acquisition

Also Published As

Publication number Publication date
CN119359606B (en) 2025-04-18

Similar Documents

Publication Publication Date Title
Wang et al. Local color distributions prior for image enhancement
CN112734650B (en) Virtual multi-exposure fusion based uneven illumination image enhancement method
CN112001863B (en) A deep learning based image restoration method for underexposed images
Fan et al. Integrating semantic segmentation and retinex model for low-light image enhancement
Ying et al. A bio-inspired multi-exposure fusion framework for low-light image enhancement
Li et al. Clustering based content and color adaptive tone mapping
Paul Adaptive tri-plateau limit tri-histogram equalization algorithm for digital image enhancement
CN113129236A (en) Single low-light image enhancement method and system based on Retinex and convolutional neural network
Garg et al. LiCENt: Low-light image enhancement using the light channel of HSL
Xu et al. Color-compensated multi-scale exposure fusion based on physical features
CN119359606B (en) A multimedia image optimization method and system
CN117974459A (en) Low-illumination image enhancement method integrating physical model and priori
Song et al. Multi-scale joint network based on Retinex theory for low-light enhancement
Wang et al. Single underwater image enhancement based on $ l_ {P} $-norm decomposition
CN119091379A (en) A machine vision unit for smart community security management system
CN118674655B (en) Image enhancement method and enhancement system in low light environment
Yang et al. Low-light image enhancement via feature restoration
CN120219880A (en) Image processing model training method, image enhancement method and electronic device
Yu et al. Adaptive inverse hyperbolic tangent algorithm for dynamic contrast adjustment in displaying scenes
Yang et al. Tone mapping based on multi-scale histogram synthesis
Yeganeh Cross dynamic range and cross resolution objective image quality assessment with applications
Liang et al. Reconstructing hdr image from a single filtered ldr image base on a deep hdr merger network
Zhang et al. A dynamic range adjustable inverse tone mapping operator based on human visual system
CN115953340A (en) Multi-exposure image fusion method
Chen et al. Low-light image enhancement network based on central difference convolution

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant