[go: up one dir, main page]

CN117319631A - A light field display system and image rendering method based on human eye tracking - Google Patents

A light field display system and image rendering method based on human eye tracking Download PDF

Info

Publication number
CN117319631A
CN117319631A CN202311235605.4A CN202311235605A CN117319631A CN 117319631 A CN117319631 A CN 117319631A CN 202311235605 A CN202311235605 A CN 202311235605A CN 117319631 A CN117319631 A CN 117319631A
Authority
CN
China
Prior art keywords
light field
display
rendering
human eye
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311235605.4A
Other languages
Chinese (zh)
Inventor
李希才
季明江
王元庆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University
Original Assignee
Nanjing University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University filed Critical Nanjing University
Priority to CN202311235605.4A priority Critical patent/CN117319631A/en
Publication of CN117319631A publication Critical patent/CN117319631A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/30Image reproducers
    • H04N13/398Synchronisation thereof; Control thereof
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/30Image reproducers
    • H04N13/366Image reproducers using viewer tracking
    • H04N13/368Image reproducers using viewer tracking for two or more viewers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/30Image reproducers
    • H04N13/366Image reproducers using viewer tracking
    • H04N13/383Image reproducers using viewer tracking for tracking with gaze detection, i.e. detecting the lines of sight of the viewer's eyes

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)

Abstract

本申请公开了一种基于人眼跟踪的光场显示系统及图像渲染方法,由于对观看者的双眼位置进行定位与跟踪,并计算出光场重建所需的实时视角、方位等基础信息,从而通过神经辐射场体渲染算法完成光场图像的实时渲染。结合人眼跟踪技术与神经辐射场渲染技术,使得光场显示器的空间带宽积得到大幅提高,并支持多人同时观看,不但降低了光场显示器对液晶面板高像素密度的设计依赖,而且提高了光场显示器的立体显示效果,使得观看者在不同观看位置处都可以获得较好的立体沉浸感。另外采用所提出的神经辐射场对光场图像进行重建,有效避免了深度反转的问题,同时还可以减少光场图像渲染算法的数据量和计算复杂度。

This application discloses a light field display system and image rendering method based on human eye tracking. By positioning and tracking the position of the viewer's eyes, and calculating the real-time viewing angle, orientation and other basic information required for light field reconstruction, the system can The neural radiation field rendering algorithm completes the real-time rendering of light field images. Combining human eye tracking technology and neural radiation field rendering technology, the spatial bandwidth product of the light field display is greatly improved, and it supports multiple people to watch at the same time. It not only reduces the design dependence of the light field display on the high pixel density of the LCD panel, but also improves the The three-dimensional display effect of the light field display allows viewers to obtain a better three-dimensional immersion at different viewing positions. In addition, the proposed neural radiation field is used to reconstruct the light field image, which effectively avoids the problem of depth inversion and can also reduce the data volume and computational complexity of the light field image rendering algorithm.

Description

Light field display system and image rendering method based on human eye tracking
Technical Field
The application relates to the fields of image processing, stereoscopic display and artificial intelligence, in particular to a light field display system based on human eye tracking and an image rendering method.
Background
Current stereoscopic display technology generally adopts a Parallax Barrier PB (Parallax Barrier) technology to generate Parallax so as to allow a viewer to obtain stereoscopic immersion. The stereoscopic display technology can be roughly divided into two main categories, namely auxiliary and non-auxiliary, and the stereoscopic display with auxiliary can be further divided into switch glasses, polarized glasses, VR helmets, chromatic aberration glasses and the like. Unassisted and can be divided into parallax barrier type and true three-dimensional displays. Parallax barrier type mainly comprises a lenticular lens grating, a slit grating, a directional backlight (Directional Backlight) technology and the like, and true three-dimensional display is further divided into integrated imaging, holographic display, light field display, volume display and other technologies. Among them, parallax barrier type stereoscopic display technology is the most mature display technology, but once the process control is poor, the stereoscopic display technology can cause larger stereoscopic crosstalk. The optical field display system is a geometrical optical system, which skillfully uses crosstalk, and the initial optical field image acquisition path is obtained by directly imaging through a lens array, but the direct display has the problem of depth inversion.
In the existing auxiliary stereoscopic display mode, the driver of the stereoscopic display will alternately render the left and right eye images, if the first frame is the left eye image, then the latter frame corresponds to the right eye image. The viewer wears liquid crystal shutter glasses, the shutter glasses are synchronized with the display card and the display in a wired or wireless manner, when the left eye image is displayed on the display, the glasses open the shutter of the left lens and simultaneously close the shutter of the right lens, and when the right eye image is displayed on the display, the glasses open the shutter of the right lens and simultaneously close the shutter of the left lens. Since the refresh rate is fast, the view of a certain eye which is not seen at the moment is synthesized by the brain according to the vision temporary storage effect, so that the stereoscopic image is watched from the display only by wearing the shutter type stereoscopic glasses interface in the coverage range of the synchronous signal.
The auxiliary stereoscopic display requires that the left and right lenses of the stereoscopic shutter glasses must be synchronized with the corresponding projection image signals, and a higher stereoscopic display refresh rate is required to obtain a good stereoscopic silence effect. The stereoscopic display technology has the advantages of low cost and good stereoscopic immersion, can realize seamless switching of the plane image and the stereoscopic image, and has the disadvantages of low stereoscopic brightness and easy visual fatigue.
The unassisted autostereoscopic display technology is generally called as "naked eye 3D display" technology, that is, two images with parallax can be seen by left and right eyes from a display screen without any tool, and reflected to the brain, so that a viewer can generate stereoscopic immersion. The unassisted stereoscopic display also utilizes the parallax principle of human eyes, and different display pictures are respectively provided for the left eye and the right eye of a viewer, so that the stereoscopic visual viewing effect is achieved. Currently, the unassisted stereoscopic display technology is mainly a multi-view stereoscopic display technology, and is commonly used in public places for displaying and propaganda because an observer can wear no glasses. However, the multi-viewpoint autostereoscopic display technique has disadvantages of reduced resolution, limited viewing angle, low brightness, large crosstalk, stereoscopic vertigo, and the like, and has relatively poor display effects.
The directional backlight technology is a practical stereo display technology in the existing unassisted stereo display technology, the directional backlight stereo display technology is matched with two groups of LEDs, and is matched with a fast-response LCD panel and a driving method, so that 3D content enters left and right eye interchange images of a viewer in a Sequential mode to generate parallax, the eyes of the viewer feel 3D stereo effect, and communication and interaction based on mobile equipment are greatly enhanced. The display device has the advantages that resolution and light transmittance are guaranteed, an existing display design architecture cannot be affected, and the 3D display effect is excellent. The defect is that the technology is still under development and the product is not mature.
Compared with the traditional parallax barrier type stereoscopic display technology, the light field display technology has the advantages of good stereoscopic immersion sense, multiple display information dimensions, no dizziness sense, high resolution, quick refreshing and the like, is considered as one of the most promising next generation reserve display technologies, and is a key technology for realizing large-scale stereoscopic display.
In summary, the prior art has the following disadvantages:
1. parallax barrier type auto-stereoscopic displays are limited by the resolution of the liquid crystal display panel, and the spatial bandwidth product is difficult to increase, and generally only about 40% is achieved.
2. The parallax barrier type autostereoscopic display generates stereoscopic crosstalk in the stereoscopic display process due to the optical structural characteristics thereof, and the parallax barrier type autostereoscopic display needs to modulate light to the left and right eye positions of a user due to the optical structural characteristics thereof, thereby causing lower brightness of the autostereoscopic display and aggravating the asthenopia phenomenon.
3. The dense viewpoint stereoscopic display technology has higher requirement on the display density of the liquid crystal panel, difficult increase of the spatial bandwidth product and large data volume of image rendering, so that the rendering speed of the stereoscopic image is slow.
4. The light field display adopts the traditional light ray chasing pipeline rendering scheme, so that the algorithm rendering process is complex, the speed is low, the accuracy is low, the depth inversion phenomenon can be generated, and the process of sampling and reconstructing the target model to be displayed is complex, thereby the display efficiency of the display is lower and the three-dimensional performance of the three-dimensional display is deteriorated.
Disclosure of Invention
In order to solve the problems, the application provides a light field display system based on human eye tracking and an image rendering method.
According to a first aspect of the present application, there is provided a light field display system based on human eye tracking, comprising:
the display control unit is used for providing three-dimensional display data and man-machine interaction related interaction information for the light field display;
the light field rendering unit is connected with the display control unit and is used for performing light field image real-time rendering according to the light field model and human eye position information output by the video input interface in the display control unit;
the driving units are connected with the light field rendering units, the driving units are in a modularized design, and the display resolution of each driving module is 2K multiplied by 1K or 4K multiplied by 2K and is used for carrying out partition refreshing and synchronous control on the high-density liquid crystal display panel;
the display unit is used for providing a display carrier for the presentation of the light field information and constructing a high-density display physical model with the resolution of 16K multiplied by 8K;
the high-speed eyeball three-dimensional tracking unit is connected with the display unit and is used for positioning and tracking the human eye position information of a viewer in real time and outputting the human eye position information from the serial bus according to a preset communication protocol;
the visual driving unit is connected with the light field rendering unit and the high-speed eyeball three-dimensional tracking unit and is used for bridging the visual angle information of the viewer with the light field rendering model so as to output expected visual angle rendering parameters for the light field rendering unit according to the three-dimensional position information of the human eye of the viewer.
The light field display system based on eye tracking is characterized in that the display control unit is a 2D interaction unit, a user can interact with an operating system by being provided with a 2D display screen on one hand, and can also switch between 2D and 3D switching control by a mouse or gestures by setting the working mode of the light field display to be a 2D working mode.
The light field display system based on human eye tracking is characterized in that the light field rendering unit is a parallel rendering core with high stability and high instantaneity, and adopts a nerve radiation field deep learning algorithm to conduct rendering.
The light field display system based on human eye tracking is characterized in that the driving unit is a Tcon driving module with independent driving and array synchronization functions, and the resolution of the driving board is 1920×1080 or 3840×2160.
The light field display system based on human eye tracking is characterized in that the visual driving unit provides visual angle information of image rendering for the light field rendering unit, wherein the image rendering adopts an inverse sequence ray integral rendering technology.
According to a second aspect of the present application, the present application provides a light field image rendering method based on eye tracking, which is characterized by comprising two stages of model training and model reasoning, and comprises the following steps:
a step of acquiring a target scene: the target scene to be displayed is used for providing an original training data set for the light field image rendering unit;
and (3) self-adaptive scene acquisition: sampling the target scene by adopting a multi-scale self-adaptive method according to the target scene, and generating a digital image signal to be output;
and a camera pose estimation step: estimating the pose of a shooting camera according to the sampled scene information;
training a nerve radiation field model: preprocessing the digital waveform signals output in the data acquisition step and outputting processing results;
obtaining an implicit expression step: according to a neural radiation field algorithm, training a model by adopting a fully connected network to obtain node information and weight parameters of the model;
a light field reproduction step: and converting the human eye position signal in the neural radiation field rendering step into visual angle information by taking parameters in the model implicit expression as network weights, so as to complete model reasoning. And rasterizing the data output according to the human eye position rendering step and converting the data into a display screen coordinate system so as to achieve the aim of light field display.
The light field image rendering method based on eye tracking is characterized in that in the target scene acquisition step, the data can be a real scene to be displayed or a virtual scene in a game or a three-dimensional model in the animation field.
The light field image rendering method based on human eye tracking is characterized in that the data acquisition step can sample a scene in real time through an aircraft, sample data of a light field camera or virtually sample a three-dimensional model by adopting a virtual camera.
The light field image rendering method based on human eye tracking is characterized in that in the step of obtaining model parameters, a reverse rendering algorithm is adopted in the rendering process, so that a front-looking observed image is obtained.
The light field image rendering method based on human eye tracking is characterized in that the light field rendering algorithm is a real-time rendering algorithm
The beneficial effects of this application are:
according to the light field display system and the image rendering method based on human eye tracking, the positions of eyes of a viewer are tracked, and corresponding image rendering viewpoint information is calculated, so that light field images are rendered through a neural radiation field reconstruction algorithm. The spatial bandwidth product of the light field stereoscopic display is greatly improved by combining with human eyes, and simultaneous watching of multiple persons is supported, so that the design requirement of the light field display on the high density of pixels of the liquid crystal panel is reduced, the display effect of the light field display is improved, a viewer can obtain better stereoscopic immersion feeling at different watching positions, and meanwhile, the data volume and the computational complexity of a light field image rendering algorithm can be reduced.
Drawings
FIG. 1 is a schematic diagram of a light field display system based on human eye tracking in accordance with the present application;
FIG. 2 is a schematic diagram of a light field display system based on eye tracking;
FIG. 3 is a schematic diagram of a driving circuit of a light field display system based on human eye tracking;
FIG. 4 is a flow chart of a light field image rendering method based on human eye tracking;
FIG. 5 is a schematic diagram of a light field acquisition and reproduction method based on human eye tracking;
FIG. 6 is a schematic diagram of a light field image rendering method based on human eye tracking;
Detailed Description
The present application is described in further detail below with reference to the accompanying drawings by way of specific embodiments.
Referring to fig. 1, the present application discloses a light field display system (hereinafter referred to as light field display) based on human eye tracking, which includes a display control unit 10, a light field rendering unit 11, a driving unit 12, a display unit 13, a high-speed eyeball three-dimensional tracking unit 14 and a visual driving unit 15, which are described below.
The display control unit 10 is arranged to provide stereoscopic display data and related interaction information for a light field display. In one embodiment, the display control unit 10 is a computer with high stability and high computing power. For example, the display control unit 10 may be a server, and of course, in other embodiments, the display control unit 10 may be a conventional computer or a computer with an HDMI video interface.
The light field rendering unit 11 is connected to the display control unit 10, and is configured to perform light field image rendering according to the light field model and the human eye position information output by the video input interface in the display control unit.
The driving unit 12 is connected with the light field rendering unit 11, and adopts a modularized design, and each module is driven to 1920×1080 in size and is used for carrying out partition refreshing and control on the high-density liquid crystal display panel.
The resolution of the driving unit 12 is 1920×1080, and the driving unit is used for displaying, distributing and refreshing the light field digital signal signals output by the light field rendering unit. In a particular embodiment, the driver may support HDMI, DP, or Type-C inputs.
The display unit 13 is used for providing display information with a physical resolution of 16K x 8K for a presentation carrier of the light field display information; in one embodiment, the display unit 13 is a composite display carrier composed of a liquid crystal display panel with high resolution and an optical microstructure. For example, the display unit 13 may be a display device composed of a backlight structure, a liquid crystal display panel, and a grating, and of course, in other embodiments, the display unit 13 may be a non-liquid crystal display panel. In another embodiment, as shown in fig. 2, the backlight module 24 provides a light source for the lcd panel, the eye tracking module 26 can track the viewing position of the user in real time, the input end of the high-density lcd module 27 is connected to the display driving module 23, and the front panel of the high-density lcd module is attached to the aspheric microlens array module 28.
And the high-speed eyeball three-dimensional tracking unit 14 is connected with the display unit 13 and is used for positioning and tracking the human eye position information of the viewer in real time and outputting the human eye position information from a serial port according to a preset communication protocol. In one embodiment, the high-speed three-dimensional eye tracking unit 14 is a visual tracking device having three-dimensional position detection and tracking functions for the human eye.
The visual driving unit 15 is connected with the light field rendering unit 11 and the high-speed eyeball three-dimensional tracking unit 14, and is used for bridging the visual angle information of the viewer and the light field rendering model so as to output expected visual angle rendering parameters for the light field rendering unit according to the three-dimensional information of the human eye of the viewer.
Accordingly, the light field display image rendering method based on human eye tracking is characterized by comprising two stages of model training and model reasoning, as shown in fig. 4, and comprises the following steps:
s41, a target scene acquisition step: the target scene to be displayed may provide the original training data set for the light field image rendering unit, as shown in fig. 4. In a specific embodiment, the target scene may be a virtual three-dimensional scene in computer software or may be a real object directly acquired in a real environment;
s42, self-adaptive scene acquisition: and according to the target scene, performing scene sampling on the target scene by adopting a multi-scale self-adaptive method to generate a digital image signal to be output. In a specific embodiment, sampling with different scales is performed according to the complexity of the model so as to reduce information redundancy and image acquisition of key areas;
s43, estimating the pose of the camera: and estimating the pose of the photographed camera according to the sampled scene information. In a specific embodiment, estimating the pose of the camera by carrying out feature extraction and feature matching on the acquired image;
s44, training a nerve radiation field model: according to the position and direction of the visual angle, taking the color as a supervision item, and performing model training by adopting a fully-connected network;
s45, obtaining an implicit expression step: according to a neural radiation field algorithm, in a specific embodiment, node information and weight parameters of a model are obtained from the model which is trained by the fully-connected network;
s46, light field reproduction: and converting the human eye position signal in the neural radiation field rendering step into visual angle information by taking parameters in the model implicit expression as network weights, so as to complete model reasoning. In a specific embodiment, the data output according to the human eye position rendering step is rasterized and then converted into a display screen coordinate system, so as to achieve the purpose of light field display.
According to the light field display system and the image rendering method based on human eye tracking, due to the fact that the positions of two eyes of a viewer are tracked, corresponding image rendering viewpoint information is calculated, and therefore light field image rendering is conducted through a nerve radiation field reconstruction algorithm. The spatial bandwidth product of the light field stereoscopic display is greatly improved by combining with human eyes, and simultaneous watching of multiple persons is supported, so that the design requirement of the light field display on the high density of pixels of the liquid crystal panel is reduced, the display effect of the light field display is improved, a viewer can obtain better stereoscopic immersion feeling at different watching positions, the data volume and the computational complexity of a light field image rendering algorithm can be reduced, and the method is specifically analyzed.
As shown in fig. 1, in the technical solution for displaying a light field provided in the present application, a display control unit is used to control displaying contents to be displayed, where a light field model output by the display control unit may output a standard three-dimensional point cloud model (such as a ply, pcl, txt, etc.), and may also be an implicit expression of the three-dimensional model, including a weight file of the model and a connection relationship between all fully connected layers. The data such as the nonstandard three-dimensional model is encoded and then transmitted through the standard HDMI output interface, so that the method has good compatibility, can be used for compression transmission, encryption transmission and the like of the optical field data, has high timeliness and safety, and the output resolution of the optical field data is 1920 multiplied by 1080.
As shown in fig. 1, for a large light field model, the rendering unit 11 needs to perform data segmentation, so as to distribute the segmented data to the arrayed parallel acceleration unit for accelerating the rendering algorithm, and the segmented data blocks respectively complete the rendering of the light field image in the arrayed rendering unit. In fig. 1, the data segmentation module, the light field image rendering module and the video interface module output are collectively called a light field rendering unit, and the light field data rendered by the light field rendering unit needs to be output to the driving unit through the video output interface module.
As shown in fig. 1, the driving unit 12 further drives the high-density liquid crystal display unit 13 to display after receiving the light field data of the light field rendering unit 11. In order to increase the spatial bandwidth product of the light field display, on one hand, the information amount of the light field display needs to be increased, and on the other hand, a high-density liquid crystal panel is used as a physical support, but as the display information amount increases, huge computing power expenditure is undoubtedly caused for the light field image rendering unit 11, so that the display system cannot realize smooth light field image playing and interaction.
Inspired by a peripheral vision mechanism, the application adopts the eye tracking technology to realize acceleration of a rendering algorithm and improvement of a display bandwidth product, and as shown in fig. 1, the application determines the watching position of a viewer through the high-speed eyeball three-dimensional tracking unit 14, so that display information is accurately generated by a user, the regions which are not watched by the viewer are not rendered with light field images, the data volume of rendering the light field images is reduced, a large number of effective pixels are saved, and more liquid crystal display resources are contributed to the effective viewer regions. Therefore, the spatial bandwidth product of the light field display is increased by the high-speed three-dimensional eye tracking unit 14, and the spatial bandwidth product (Space Bandwidth Product, SBP) is generally calculated by sbp=horizontal resolution×vertical resolution×field frequency×1.344. In order to achieve good stereoscopic viewing effect, it is generally required that the spatial bandwidth product SBP of the light field display is not less than 10 8 The visual driving unit 15 is therefore of central value in establishing a physical logical link between the light field rendering unit and the high-speed eye three-dimensional tracking unit.
As shown in fig. 2, which is a system structure diagram of a light field display system based on eye tracking in the present application, the system includes a video interface module 21, a unit module 22, a driving module 23, a backlight module 24, a light field rendering module 25, an eye tracking module 26, a high-density liquid crystal display module 27, and an aspheric lens array module 28, it can be seen that in the present application, the eye tracking module cooperates with the light field image rendering module and the like.
As shown in fig. 3, the video interface unit is composed of a unit module 31, a PC interface module 32, an ESD protection module 33, a signal conversion module 34, a display panel 35, an audio decoding module 36, a video synchronization interface module 37, and a core processor module 38, and adopts a modularized design concept, so that the design difficulty of the arrayed display screen drive is reduced.
Referring to fig. 4, an algorithm of a light field display image rendering method based on human eye tracking includes a target scene acquisition step S41, an adaptive scene acquisition step S42, a camera pose estimation step S43, a neural radiation field model training step S44, an implicit expression acquisition step S45, and a light field reproduction step S46. The target scene can obtain multi-view information of the scene after multi-scale self-adaptive sampling, pose estimation is carried out according to the sampled scene information, network input of a nerve radiation field can be obtained according to pose parameters and multi-view images after pose estimation is completed, a NeRF function is a function which is used for representing a continuous scene as a 5D vector input, the function comprises a 3D coordinate position x= (x, y, z) and a direction (theta, phi) of a space point, and the color c= (r, g, b) of the 3D point related to the view angle and the volume density sigma of the position are output. The bulk density σ (x) is understood to mean that at the spatial x-position an infinitely small derivative of the particle (derivative of the opacity at the position) is independent of the direction of radiation. Camera imaging radiation r (t) =o+td; the camera position o and the imaging point d constitute a camera imaging ray, which accompanies a range t of near and far points, the color C (r) of which is expressed in an integrated manner as:
wherein t is f And t n Representing the far and near points of the ray, respectively, σ representing the bulk density of the current point, c representing the RGB value of the current point, T (T) representing the ray from T n Cumulative transparency to this segment of t:
wherein t is n Representing the near point of the ray, t represents any point on the ray,σ represents the bulk density of the current point.
In the continuous radiation field, rendering is performed for any view angle, and the ray passing through each pixel of the target virtual camera needs to be integrated with the color, so that the color of each pixel is obtained, and an imaging picture under the view angle is rendered. And obtaining a difference value between the color and the actual color through rendering of the L2 loss calculation body as a loss function to supervise the learning of the neural network.
And (3) training the neural network is completed, and the scene is converted into implicit expression. If new visual angle information is needed to be obtained, the visual angle information of the current viewing position can be rendered by only inputting the visual angle information of a viewer, and the visual angle image can be matched with a physical coordinate system of the light field display after rasterization, so that a three-dimensional display effect is achieved.
As shown in fig. 5, in the present application, a three-dimensional scene 51 is captured by a capture lens array 52 through a capture plane 53, and this process is a capture process of light field display, and because each lens of the capture lens array 52 can capture specific information of a corresponding scene from different angles, each capture lens can capture specific unit images at specific positions of the capture plane 53, and all the unit images form a sub-image array 54. The sub-image array 54 is loaded onto the display plane 55, and a reconstruction lens array 56 is arranged behind the display plane, the reconstruction lens array 56 corresponds to the sub-image array 54 precisely, and a reconstructed three-dimensional scene 57 can be obtained according to the principle of reversibility of the light path, which is a reproduction process of the light field display.
As shown in fig. 6, in order to illustrate a light field image rendering method based on eye tracking in the present application, the aforementioned display plane 61 can reconstruct the light field information 63 of the three-dimensional scene by loading the sub-image array and the reconstruction lens array 62 above the display plane, but the light field information of the whole scene is not required to be actually viewed, and only the image required for human eye viewing needs to be generated. The light that the observer can see at the current position is called effective light, and the light that the observer cannot see at the current position is called ineffective light 66, so the eyes 64 and 65 of the observer are tracked by the eye tracking module, the eye positions are sent to the light field rendering unit, only an effective sub-image array for the observer is needed to be rendered by the nerve radiation field, and the data volume and the computational complexity of the light field image rendering algorithm can be effectively reduced.
In summary, since the positions of the eyes of the viewer are tracked and the corresponding image rendering viewpoint information is calculated, the rendering of the light field image is performed through the neural radiation field reconstruction algorithm. The spatial bandwidth product of the light field stereoscopic display is greatly improved by combining with human eyes, and simultaneous watching of multiple persons is supported, so that the design requirement of the light field display on the high density of pixels of the liquid crystal panel is reduced, the display effect of the light field display is improved, a viewer can obtain better stereoscopic immersion feeling at different watching positions, and meanwhile, the data volume and the computational complexity of a light field image rendering algorithm can be reduced. The method and the device combine the eye tracking technology to reduce the data volume of data rendering, thereby being capable of reducing the requirement of light field image rendering on high performance of a processor, reducing the design difficulty of a light field display, enabling the real-time rendering of the light field image and the design of a high-density liquid crystal display panel not to be the bottleneck for improving the space bandwidth product of the light field display any more, and enabling the output signal bandwidth of the light field display to break through the space bandwidth product (10 8 ) Limiting, the spatial bandwidth product refers to the information flux of optical imaging of the light field display screen, the spatial bandwidth product is abbreviated as SPB, spb=horizontal resolution×vertical resolution×field frequency×1.344, and the spatial bandwidth product SPB of the light field display screen needs to be greater than 10 8 And a better viewing effect can be obtained at any viewing angle by adopting the neural radiation field rendering technology.
In addition, the visual fatigue phenomenon caused by crosstalk and the like of the traditional parallax type auto-stereoscopic display, such as convergence point and focusing not in a unified plane and the like, is relieved due to the increase of the number of rays and the improvement of the spatial bandwidth product in light field display, so that the stereoscopic immersion and stereoscopic viewing comfort of the light field display are improved.
The foregoing is a further detailed description of the present application in connection with the specific embodiments, and it is not intended that the practice of the present application be limited to such descriptions. It will be apparent to those skilled in the art from this disclosure that several simple deductions or substitutions can be made without departing from the inventive concepts of the present application.

Claims (10)

1.一种基于人眼跟踪的光场显示系统,其特征在于,包括:1. A light field display system based on human eye tracking, characterized by including: 显示控制单元,用于为光场显示器提供立体显示数据和人机交互相关交互信息;A display control unit is used to provide stereoscopic display data and human-computer interaction related interaction information for the light field display; 光场渲染单元,与所述显示控制单元相连,用于根据显示控制单元中视频输入接口输出的光场模型和人眼位置信息进行光场图像实时渲染;A light field rendering unit, connected to the display control unit, used for real-time rendering of light field images based on the light field model and human eye position information output by the video input interface in the display control unit; 驱动单元,与所述光场渲染单元相连,所述驱动单元采用模块化设计,每个驱动模块的显示分辨率为2K×1K或4K×2K,用于对高密度液晶显示面板进行分区刷新与同步控制;The driving unit is connected to the light field rendering unit. The driving unit adopts a modular design. The display resolution of each driving module is 2K×1K or 4K×2K. It is used for partition refresh and high-density liquid crystal display panels. Synchronous control; 显示单元,用于为光场信息的呈现提供显示载体,构建分辨率为16K×8K的高密度显示物理模型;The display unit is used to provide a display carrier for the presentation of light field information and build a high-density display physical model with a resolution of 16K×8K; 高速眼球三维追踪单元,与所述显示单元相连接,用于实时定位和追踪观看者的人眼位置信息,将所述人眼位置信息根据预定的通信协议从串行总线输出;A high-speed eyeball three-dimensional tracking unit, connected to the display unit, is used to locate and track the human eye position information of the viewer in real time, and output the human eye position information from the serial bus according to a predetermined communication protocol; 视觉驱动单元,与所述光场渲染单元和高速眼球三维追踪单元相连,用于将观看者的视角信息与光场渲染模型进行桥接,以根据观看者的人眼三维位置信息为光场渲染单元输出期望的视角渲染参数。A visual drive unit, connected to the light field rendering unit and the high-speed eyeball three-dimensional tracking unit, is used to bridge the viewer's perspective information and the light field rendering model to create a light field rendering unit based on the viewer's three-dimensional position information of the human eye. Outputs the desired perspective rendering parameters. 2.如权利要求1所述的一种基于人眼跟踪的光场显示系统,其特征在于,所述显示控制单元为2D交互单元,用户可一方面可通过配备2D显示屏与操作系统进行交互,还可以通过设置光场显示器的工作模式为2D工作模式,所述的2D与3D切换控制可通过鼠标或者手势进行切换。2. A light field display system based on human eye tracking according to claim 1, characterized in that the display control unit is a 2D interactive unit, and the user can interact with the operating system by being equipped with a 2D display screen. , you can also set the working mode of the light field display to the 2D working mode, and the 2D and 3D switching control can be switched through the mouse or gestures. 3.如权利要求1所述的一种基于人眼跟踪的光场显示系统,其特征在于,所述光场渲染单元为具有高稳定度和高实时性的并行渲染核心,并采用神经辐射场深度学习算法进行渲染。3. A light field display system based on human eye tracking according to claim 1, characterized in that the light field rendering unit is a parallel rendering core with high stability and high real-time performance, and uses a neural radiation field. Deep learning algorithms for rendering. 4.如权利要求1所述的一种基于人眼跟踪的光场显示系统,其特征在于,所述驱动单元为具有独立驱动和阵列同步功能的Tcon驱动模块,驱动板的分辨率为1920×1080或3840×2160。4. A light field display system based on human eye tracking as claimed in claim 1, characterized in that the drive unit is a Tcon drive module with independent drive and array synchronization functions, and the resolution of the drive board is 1920× 1080 or 3840×2160. 5.如权利要求1至4中任一项所述的一种基于人眼跟踪的光场显示系统,其特征在于,所述视觉驱动单元为光场渲染单元提供图像渲染的视角信息,其中图像渲染采用逆序光线积分渲染技术。5. A light field display system based on human eye tracking according to any one of claims 1 to 4, characterized in that the visual driving unit provides the light field rendering unit with perspective information for image rendering, wherein the image Rendering uses reverse ray integration rendering technology. 6.一种基于人眼跟踪的光场图像渲染方法,其特征在于,包括模型训练和模型推理两个阶段,共包括如下步骤:6. A light field image rendering method based on human eye tracking, which is characterized by including two stages of model training and model inference, including the following steps: 获取目标场景步骤:需要显示的目标场景,用于可为光场图像渲染单元提供原始的训练数据集;Obtain the target scene step: the target scene that needs to be displayed is used to provide the original training data set for the light field image rendering unit; 自适应场景采集步骤:根据所述目标场景,采用多尺度自适应方法对目标场景进行采样,用来产生需要输出的数字图像信号;Adaptive scene acquisition step: According to the target scene, use a multi-scale adaptive method to sample the target scene to generate a digital image signal that needs to be output; 相机位姿估计步骤:根据采样到的场景信息对拍摄相机的位姿进行估计;Camera pose estimation step: estimate the pose of the shooting camera based on the sampled scene information; 神经辐射场模型训练步骤:对所述数据采集步骤中输出的数字波形信号进行预处理并输出处理结果;Neural radiation field model training step: preprocess the digital waveform signal output in the data acquisition step and output the processing result; 获得隐式表达步骤:根据神经辐射场算法,采用全连接网络进行模型训练,获得模型的节点信息和权重参数;Obtain implicit expression steps: According to the neural radiation field algorithm, use a fully connected network for model training to obtain the node information and weight parameters of the model; 光场重现步骤:以模型隐式表达中的参数作为网络权重,将所述神经辐射场渲染步骤人眼位置信号转换为视角信息,完成模型推理。对所述根据人眼位置渲染步骤输出的数据光栅化后转换到显示屏坐标系,以达到光场显示的目的。Light field reproduction step: Using the parameters in the implicit expression of the model as network weights, convert the human eye position signal in the neural radiation field rendering step into perspective information to complete model inference. The data output by the rendering step according to the human eye position is rasterized and then converted to the coordinate system of the display screen to achieve the purpose of light field display. 7.如权利要求6所述的一种基于人眼跟踪的光场图像渲染方法,其特征在于,所述目标场景获取步骤中,该数据可以是需要展示的真实场景,也可以是游戏中的虚拟场景或动画领域的三维模型。7. A light field image rendering method based on human eye tracking as claimed in claim 6, characterized in that in the step of obtaining the target scene, the data can be a real scene that needs to be displayed, or it can be a game scene. Three-dimensional models of virtual scenes or animation fields. 8.如权利要求6所述的一种基于人眼跟踪的光场图像渲染方法,其特征在于,所述数据采步骤可通过飞行器对场景进行实时采样、光场相机的采样数据,或采用虚拟相机对三维模型进行虚拟采样。8. A light field image rendering method based on human eye tracking as claimed in claim 6, characterized in that the data acquisition step can be performed by using an aircraft to sample the scene in real time, sampling data from a light field camera, or using virtual The camera virtually samples the 3D model. 9.如权利要求6所述的一种基于人眼跟踪的光场图像渲染方法,其特征在于,在获取模型参数步骤中,渲染过程采用逆向渲染算法,从而获得正视的观察影像。9. A light field image rendering method based on human eye tracking according to claim 6, characterized in that, in the step of obtaining model parameters, the rendering process adopts a reverse rendering algorithm to obtain an orthographic observation image. 10.如权利要求6至9中任一项所述的一种基于人眼跟踪的光场图像渲染方法,其特征在于,所述光场渲染算法为实时渲染算法。10. A light field image rendering method based on human eye tracking according to any one of claims 6 to 9, characterized in that the light field rendering algorithm is a real-time rendering algorithm.
CN202311235605.4A 2023-09-22 2023-09-22 A light field display system and image rendering method based on human eye tracking Pending CN117319631A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311235605.4A CN117319631A (en) 2023-09-22 2023-09-22 A light field display system and image rendering method based on human eye tracking

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311235605.4A CN117319631A (en) 2023-09-22 2023-09-22 A light field display system and image rendering method based on human eye tracking

Publications (1)

Publication Number Publication Date
CN117319631A true CN117319631A (en) 2023-12-29

Family

ID=89261455

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311235605.4A Pending CN117319631A (en) 2023-09-22 2023-09-22 A light field display system and image rendering method based on human eye tracking

Country Status (1)

Country Link
CN (1) CN117319631A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118337978A (en) * 2024-04-24 2024-07-12 广东金鼎移动传媒有限公司 Naked eye 3D imaging AI conversion method based on MOST optical fiber network
CN119228966A (en) * 2024-10-08 2024-12-31 南京大学 A digital sand table system using image rendering and a light field image rendering method
CN120143576A (en) * 2025-02-27 2025-06-13 北京画无界科技有限公司 A holographic imaging system and method for VR holographic theater

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10560689B2 (en) * 2017-11-28 2020-02-11 Paul Lapstun Viewpoint-optimized light field display
US20210132693A1 (en) * 2019-11-01 2021-05-06 Raxium, Inc. Light Field Displays Incorporating Eye Trackers and Methods for Generating Views for a Light Field Display Using Eye Tracking Information
CN114666564A (en) * 2022-03-23 2022-06-24 南京邮电大学 Method for synthesizing virtual viewpoint image based on implicit neural scene representation
CN115100360A (en) * 2022-07-28 2022-09-23 中国电信股份有限公司 Image generation method and device, storage medium and electronic equipment
CN115272575A (en) * 2022-07-28 2022-11-01 中国电信股份有限公司 Image generation method and device, storage medium and electronic equipment
CN115457188A (en) * 2022-09-19 2022-12-09 遥在(山东)数字科技有限公司 3D rendering display method and system based on fixation point
CN116071278A (en) * 2022-11-29 2023-05-05 西安交通大学 UAV aerial image synthesis method, system, computer equipment and storage medium
CN116418961A (en) * 2023-06-09 2023-07-11 深圳臻像科技有限公司 Light field display method and system based on three-dimensional scene stylization

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10560689B2 (en) * 2017-11-28 2020-02-11 Paul Lapstun Viewpoint-optimized light field display
US20210132693A1 (en) * 2019-11-01 2021-05-06 Raxium, Inc. Light Field Displays Incorporating Eye Trackers and Methods for Generating Views for a Light Field Display Using Eye Tracking Information
CN114666564A (en) * 2022-03-23 2022-06-24 南京邮电大学 Method for synthesizing virtual viewpoint image based on implicit neural scene representation
CN115100360A (en) * 2022-07-28 2022-09-23 中国电信股份有限公司 Image generation method and device, storage medium and electronic equipment
CN115272575A (en) * 2022-07-28 2022-11-01 中国电信股份有限公司 Image generation method and device, storage medium and electronic equipment
CN115457188A (en) * 2022-09-19 2022-12-09 遥在(山东)数字科技有限公司 3D rendering display method and system based on fixation point
CN116071278A (en) * 2022-11-29 2023-05-05 西安交通大学 UAV aerial image synthesis method, system, computer equipment and storage medium
CN116418961A (en) * 2023-06-09 2023-07-11 深圳臻像科技有限公司 Light field display method and system based on three-dimensional scene stylization

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118337978A (en) * 2024-04-24 2024-07-12 广东金鼎移动传媒有限公司 Naked eye 3D imaging AI conversion method based on MOST optical fiber network
CN119228966A (en) * 2024-10-08 2024-12-31 南京大学 A digital sand table system using image rendering and a light field image rendering method
CN119228966B (en) * 2024-10-08 2025-11-11 南京大学 Digital sand table system adopting image rendering and light field image rendering method
CN120143576A (en) * 2025-02-27 2025-06-13 北京画无界科技有限公司 A holographic imaging system and method for VR holographic theater

Similar Documents

Publication Publication Date Title
US12028502B2 (en) Three dimensional glasses free light field display using eye location
TWI813098B (en) Neural blending for novel view synthesis
EP3110518B1 (en) Image encoding and display
EP3111640B1 (en) Image encoding and display
KR101095392B1 (en) System and method for rendering 3-D images on a 3-D image display screen
CN117319631A (en) A light field display system and image rendering method based on human eye tracking
US20160105665A1 (en) Unassisted stereoscopic display device using directional backlight structure
CN117176936B (en) A freely expandable stereoscopic digital sandbox system and light field rendering method
CN108769664B (en) Naked eye 3D display method, device, equipment and medium based on human eye tracking
CN102005062A (en) Method and device for producing three-dimensional image for three-dimensional stereo display
CN112866676B (en) A naked-eye 3D display algorithm based on single-pixel multi-view reconstruction
JP2008522270A (en) System and method for composite view display of single 3D rendering
US10122987B2 (en) 3D system including additional 2D to 3D conversion
US12143561B2 (en) Image generation system and method
Wang et al. An introduction to immersive display technologies
Fadzli et al. 3D display for 3D telepresence: a review
Surman et al. Towards the reality of 3D imaging and display
CN114339191B (en) A naked-eye 3D display method based on multi-viewpoint reconstruction
HaiyangYu et al. Research summary on light field display technology based on projection
Ebisu et al. Realization of electronic 3D display combining multiview and volumetric solutions
Lu Research on Optical Display Technology of Virtual Reality Technology Based on Optical Image
Watanabe et al. Fast calculation method based on hidden region continuity for computer-generated holograms with multiple cameras in a sports scene
Zhang et al. 35.1: Tabletop three‐dimensional floating autostereoscopic display system with 360‐degree continuous visualization
Kejian et al. The development of stereoscopic display technology
Megyesi et al. 3D video visualization on the holovizio system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination