[go: up one dir, main page]

CN117319631A - Light field display system and image rendering method based on human eye tracking - Google Patents

Light field display system and image rendering method based on human eye tracking Download PDF

Info

Publication number
CN117319631A
CN117319631A CN202311235605.4A CN202311235605A CN117319631A CN 117319631 A CN117319631 A CN 117319631A CN 202311235605 A CN202311235605 A CN 202311235605A CN 117319631 A CN117319631 A CN 117319631A
Authority
CN
China
Prior art keywords
light field
rendering
display
unit
human eye
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311235605.4A
Other languages
Chinese (zh)
Inventor
李希才
季明江
王元庆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University
Original Assignee
Nanjing University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University filed Critical Nanjing University
Priority to CN202311235605.4A priority Critical patent/CN117319631A/en
Publication of CN117319631A publication Critical patent/CN117319631A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/30Image reproducers
    • H04N13/398Synchronisation thereof; Control thereof
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/30Image reproducers
    • H04N13/366Image reproducers using viewer tracking
    • H04N13/368Image reproducers using viewer tracking for two or more viewers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/30Image reproducers
    • H04N13/366Image reproducers using viewer tracking
    • H04N13/383Image reproducers using viewer tracking for tracking with gaze detection, i.e. detecting the lines of sight of the viewer's eyes

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)

Abstract

The application discloses a light field display system and an image rendering method based on human eye tracking, wherein the positions of eyes of a viewer are positioned and tracked, and basic information such as real-time viewing angles, directions and the like required by light field reconstruction is calculated, so that the real-time rendering of light field images is completed through a nerve radiation field body rendering algorithm. By combining the human eye tracking technology and the nerve radiation field rendering technology, the spatial bandwidth product of the light field display is greatly improved, and simultaneous watching of multiple persons is supported, so that the design dependence of the light field display on the high pixel density of the liquid crystal panel is reduced, the stereoscopic display effect of the light field display is improved, and a viewer can obtain better stereoscopic immersion feeling at different watching positions. In addition, the proposed nerve radiation field is adopted to reconstruct the light field image, so that the problem of depth inversion is effectively avoided, and meanwhile, the data volume and the computational complexity of a light field image rendering algorithm can be reduced.

Description

一种基于人眼跟踪的光场显示系统及图像渲染方法A light field display system and image rendering method based on human eye tracking

技术领域Technical field

本申请涉及图像处理、立体显示与人工智能领域,具体涉及一种基于人眼跟踪的光场显示系统及图像渲染方法。This application relates to the fields of image processing, stereoscopic display and artificial intelligence, and specifically relates to a light field display system and image rendering method based on human eye tracking.

背景技术Background technique

目前的立体显示器技术普遍采用视差屏障PB(Parallax Barrier,视差屏障)技术来产生视差从而让观看者获得立体沉浸感。立体显示技术大致可以分为有辅助和无辅助两大类,有辅助的立体显示器又可以分为开关眼镜、偏振眼镜、VR头盔、色差眼镜等。无辅助的又可以分为视差屏障型、和真三维显示。视差屏障型主要包括柱镜光栅、狭缝光栅、指向背光(Directional Backlight)技术等,真三维显示又分为集成成像、全息显示、光场显示、体显示等技术。其中视差屏障型立体显示技术是最为成熟的一类显示技术,但该类立体显示技术一旦工艺控制不好会造成较大的立体串扰。光场显示系统是一个几何光学系统,其巧妙的应用了串扰,其最初的光场图像获取途径就是通过透镜阵列直接成像得到,但是直接显示会出现深度反转的问题。Current stereoscopic display technology generally uses Parallax Barrier (PB) technology to generate parallax so that viewers can gain a sense of stereoscopic immersion. Stereoscopic display technology can be roughly divided into two categories: assisted and unassisted. Assisted stereoscopic displays can be divided into switch glasses, polarized glasses, VR helmets, chromatic glasses, etc. Unassisted can be divided into parallax barrier type, and true three-dimensional display. The parallax barrier type mainly includes lenticular grating, slit grating, directional backlight technology, etc. True three-dimensional display is divided into integrated imaging, holographic display, light field display, volume display and other technologies. Among them, parallax barrier stereoscopic display technology is the most mature type of display technology. However, if the process control of this type of stereoscopic display technology is not good, it will cause greater stereoscopic crosstalk. The light field display system is a geometric optical system that cleverly uses crosstalk. The initial way to obtain light field images is through direct imaging through a lens array, but direct display will cause depth inversion problems.

在现有的辅助式立体显示模式下,立体显示器的驱动程序将交替的渲染左右眼的图像,如第一帧为左眼图像,那么后一帧对应的是右眼图像。观看者配戴液晶快门眼镜,快门眼镜通过有线或无线的方式与显卡和显示器同步,当显示器上显示左眼图像时,眼镜打开左镜片的快门同时关闭右镜片的快门,当显示器上显示右眼图像时,眼镜打开右镜片的快门同时关闭左镜片的快门。由于刷新率较快,瞬时看不见的某只眼的视图将由大脑根据视觉暂存效应来合成,因此只要在同步信号的覆盖范围内戴上快门式立体眼镜接口从显示器上观看到立体影像。In the existing auxiliary stereoscopic display mode, the driver of the stereoscopic display will alternately render images for the left and right eyes. If the first frame is a left-eye image, then the next frame corresponds to a right-eye image. The viewer wears LCD shutter glasses, which are synchronized with the graphics card and monitor through wired or wireless means. When the left eye image is displayed on the monitor, the glasses open the shutter of the left lens and close the shutter of the right lens. When the right eye image is displayed on the monitor, When taking an image, the glasses open the shutter of the right lens and close the shutter of the left lens. Due to the fast refresh rate, the view of the temporarily invisible eye will be synthesized by the brain based on the visual temporary effect. Therefore, as long as the shutter-type stereoscopic glasses interface is worn within the coverage of the synchronization signal, the stereoscopic image can be viewed on the display.

辅助式立体显示要求立体快门眼镜的左右镜片必须与对应的投影图像信号保持同步,同时必须有较高的立体显示刷新率才会取得良好的立体沉静效果。该立体显示技术的优点是成本较低、立体沉浸感好,可实现平面图像与立体图像的无缝切换,但缺点是立体亮度低、容易视疲劳。Auxiliary stereoscopic display requires that the left and right lenses of the stereoscopic shutter glasses must be synchronized with the corresponding projected image signal, and must have a high refresh rate of the stereoscopic display to achieve a good stereoscopic calming effect. The advantages of this stereoscopic display technology are low cost, good stereoscopic immersion, and seamless switching between flat and stereoscopic images. However, the disadvantage is that the stereoscopic brightness is low and it is prone to visual fatigue.

无辅助式自由立体显示技术一般被称为“裸眼3D显示”技术,也就是不通过任何工具就能让左右两只眼睛从显示屏幕上看到两幅具有视差的画面,将它们反射到大脑,观看者就会产生立体沉浸感。无辅助式立体显示器也是利用了人眼的视差原理,通过为观看者左右两眼分别提供不同的显示画面,从而达到立体的视觉观看效果。当前无辅助式主要以多视点立体显示技术为主,由于观察者可以不佩戴眼镜,因此该技术常用在公共场所用于展示和宣传。不过,多视点自由立体显示技术的缺点是分辨率下降、观看视角有限、亮度低、串扰大有立体眩晕等现象显示效果相对较差。Unassisted autostereoscopic display technology is generally called "naked-eye 3D display" technology, which allows the left and right eyes to see two images with parallax from the display screen without using any tools, and reflects them to the brain. The viewer will have a sense of three-dimensional immersion. The unassisted stereoscopic display also takes advantage of the parallax principle of the human eye to achieve a three-dimensional visual viewing effect by providing different display images to the viewer's left and right eyes. Currently, the unassisted display technology is mainly based on multi-viewpoint stereoscopic display technology. Since the observer does not need to wear glasses, this technology is commonly used in public places for display and promotion. However, the shortcomings of multi-viewpoint free stereoscopic display technology are reduced resolution, limited viewing angles, low brightness, large crosstalk, and relatively poor display effects such as stereoscopic vertigo.

指向性背光技术是目前无辅助式立体显示技术中比较实用立体显示技术,指向背光立体显示技术搭配两组LED,配合快速响应的LCD面板和驱动方法,让3D内容以排序(Sequential)方式进入观看者的左右眼互换影像产生视差,进而让人眼感受到3D立体效果,极大地增强了基于移动设备的交流和互动。其优点是分辨率、透光率均有所保证,不会影响既有的显示设计架构,3D显示效果出色。不足是技术尚在开发,产品不成熟。Directional backlight technology is one of the more practical stereoscopic display technologies among current unassisted stereoscopic display technologies. The directional backlight stereoscopic display technology is paired with two sets of LEDs and a fast-response LCD panel and driving method to allow 3D content to be viewed in a sequential manner. The user's left and right eyes exchange images to produce parallax, which in turn allows the human eye to experience a 3D stereoscopic effect, greatly enhancing communication and interaction based on mobile devices. Its advantage is that the resolution and light transmittance are guaranteed, it will not affect the existing display design structure, and the 3D display effect is excellent. The disadvantage is that the technology is still under development and the products are immature.

与传统的视差屏障型立体显示技术相比,光场显示技术具有立体沉浸感好、显示信息维度多、没有眩晕感、分辨率高和快速刷新等优点,其被视为是最有前景的下一代储备显示技术之一,是实现大画幅立体显示的一个关键技术。Compared with traditional parallax barrier stereoscopic display technology, light field display technology has the advantages of good stereoscopic immersion, multiple dimensions of displayed information, no dizziness, high resolution and fast refresh, etc. It is regarded as the most promising next-generation technology. One of the generation reserve display technologies, it is a key technology to realize large-format three-dimensional display.

综上所述,现有技术存在如下缺点:To sum up, the existing technology has the following shortcomings:

1、视差屏障型自由立体显示器,由于受液晶显示面板分辨率的限制,一般空间带宽积很难提升,通常只能做到40%左右。1. Parallax barrier type free stereoscopic display, due to the limitation of the resolution of the liquid crystal display panel, it is generally difficult to increase the spatial bandwidth product, usually only about 40%.

2、视差屏障型自由立体显示器由于其光学结构特性,在立体显示的过程中会产生立体串扰,视差屏障型自由立体显示器由于其光学结构特性,需要将光线调制到用户的左右眼位置,从而导致自由立体显示器的亮度较低,视疲劳现象加剧。2. Due to its optical structural characteristics, the parallax barrier autostereoscopic display will produce stereoscopic crosstalk during the stereoscopic display process. Due to its optical structural characteristics, the parallax barrier autostereoscopic display needs to modulate the light to the position of the user's left and right eyes, resulting in The brightness of the autostereoscopic display is low, which aggravates visual fatigue.

3、密集视点立体显示技术,对液晶面板的显示密度要求较高,空间带宽积提升困难,图像渲染的数据量大,导致立体图像渲染速度变慢。3. Dense-viewpoint stereoscopic display technology requires high display density of the LCD panel, making it difficult to increase the spatial bandwidth product. The amount of data required for image rendering is large, resulting in slower stereoscopic image rendering speed.

4、光场显示器,由于采用传统的光线追击管线渲染方案,算法渲染流程复杂、速度慢、精度低还会产生深度反转的现象,对要显示的目标模型的采样和重构的流程复杂,从而使得显示器的显示效率较低恶化立体显示器的立体性能。4. For light field displays, due to the traditional ray tracing pipeline rendering scheme, the algorithmic rendering process is complex, slow, low-precision, and may cause depth inversion. The sampling and reconstruction process of the target model to be displayed is complex. As a result, the display efficiency of the display is lowered and the stereoscopic performance of the stereoscopic display is deteriorated.

发明内容Contents of the invention

为解决上述问题,本申请提供一种基于人眼跟踪的光场显示系统及图像渲染方法。In order to solve the above problems, this application provides a light field display system and image rendering method based on human eye tracking.

根据本申请的第一方面,本申请提供一种基于人眼跟踪的光场显示系统,包括:According to the first aspect of this application, this application provides a light field display system based on human eye tracking, including:

显示控制单元,用于为光场显示器提供立体显示数据和人机交互相关交互信息;A display control unit is used to provide stereoscopic display data and human-computer interaction related interaction information for the light field display;

光场渲染单元,与所述显示控制单元相连,用于根据显示控制单元中视频输入接口输出的光场模型和人眼位置信息进行光场图像实时渲染;A light field rendering unit, connected to the display control unit, used for real-time rendering of light field images based on the light field model and human eye position information output by the video input interface in the display control unit;

驱动单元,与所述光场渲染单元相连,所述驱动单元采用模块化设计,每个驱动模块的显示分辨率为2K×1K或4K×2K,用于对高密度液晶显示面板进行分区刷新与同步控制;The driving unit is connected to the light field rendering unit. The driving unit adopts a modular design. The display resolution of each driving module is 2K×1K or 4K×2K. It is used for partition refresh and high-density liquid crystal display panels. Synchronous control;

显示单元,用于为光场信息的呈现提供显示载体,构建分辨率为16K×8K的高密度显示物理模型;The display unit is used to provide a display carrier for the presentation of light field information and build a high-density display physical model with a resolution of 16K×8K;

高速眼球三维追踪单元,与所述显示单元相连接,用于实时定位和追踪观看者的人眼位置信息,将所述人眼位置信息根据预定的通信协议从串行总线输出;A high-speed eyeball three-dimensional tracking unit, connected to the display unit, is used to locate and track the human eye position information of the viewer in real time, and output the human eye position information from the serial bus according to a predetermined communication protocol;

视觉驱动单元,与所述光场渲染单元和高速眼球三维追踪单元相连,用于将观看者的视角信息与光场渲染模型进行桥接,以根据观看者的人眼三维位置信息为光场渲染单元输出期望的视角渲染参数。A visual drive unit, connected to the light field rendering unit and the high-speed eyeball three-dimensional tracking unit, is used to bridge the viewer's perspective information and the light field rendering model to create a light field rendering unit based on the viewer's three-dimensional position information of the human eye. Outputs the desired perspective rendering parameters.

所述的一种基于人眼跟踪的光场显示系统,其特征在于,所述显示控制单元为2D交互单元,用户可一方面可通过配备2D显示屏与操作系统进行交互,还可以通过设置光场显示器的工作模式为2D工作模式,所述的2D与3D切换控制可通过鼠标或者手势进行切换。The light field display system based on human eye tracking is characterized in that the display control unit is a 2D interactive unit. The user can interact with the operating system by equipping a 2D display screen, and can also set the light field display system. The working mode of the field display is a 2D working mode, and the 2D and 3D switching control can be switched through the mouse or gestures.

所述的一种基于人眼跟踪的光场显示系统,其特征在于,所述光场渲染单元为具有高稳定度和高实时性的并行渲染核心,并采用神经辐射场深度学习算法进行渲染。The light field display system based on human eye tracking is characterized in that the light field rendering unit is a parallel rendering core with high stability and high real-time performance, and uses a neural radiation field deep learning algorithm for rendering.

所述的一种基于人眼跟踪的光场显示系统,其特征在于,所述驱动单元为具有独立驱动和阵列同步功能的Tcon驱动模块,驱动板的分辨率为1920×1080或3840×2160。The light field display system based on human eye tracking is characterized in that the driving unit is a Tcon driving module with independent driving and array synchronization functions, and the resolution of the driving board is 1920×1080 or 3840×2160.

所述的一种基于人眼跟踪的光场显示系统,其特征在于,所述视觉驱动单元为光场渲染单元提供图像渲染的视角信息,其中图像渲染采用逆序光线积分渲染技术。The light field display system based on human eye tracking is characterized in that the visual driving unit provides the light field rendering unit with perspective information for image rendering, wherein the image rendering adopts reverse ray integration rendering technology.

根据本申请的第二方面,本申请提供一种基于人眼跟踪的光场图像渲染方法,其特征在于,包括模型训练和模型推理两个阶段,共包括如下步骤:According to the second aspect of this application, this application provides a light field image rendering method based on human eye tracking, which is characterized in that it includes two stages: model training and model inference, and includes the following steps:

获取目标场景步骤:需要显示的目标场景,用于可为光场图像渲染单元提供原始的训练数据集;Obtain the target scene step: the target scene that needs to be displayed is used to provide the original training data set for the light field image rendering unit;

自适应场景采集步骤:根据所述目标场景,采用多尺度自适应方法对目标场景进行采样,用来产生需要输出的数字图像信号;Adaptive scene acquisition step: According to the target scene, use a multi-scale adaptive method to sample the target scene to generate a digital image signal that needs to be output;

相机位姿估计步骤:根据采样到的场景信息对拍摄相机的位姿进行估计;Camera pose estimation step: estimate the pose of the shooting camera based on the sampled scene information;

神经辐射场模型训练步骤:对所述数据采集步骤中输出的数字波形信号进行预处理并输出处理结果;Neural radiation field model training step: preprocess the digital waveform signal output in the data acquisition step and output the processing result;

获得隐式表达步骤:根据神经辐射场算法,采用全连接网络进行模型训练,获得模型的节点信息和权重参数;Obtain implicit expression steps: According to the neural radiation field algorithm, use a fully connected network for model training to obtain the node information and weight parameters of the model;

光场重现步骤:以模型隐式表达中的参数作为网络权重,将所述神经辐射场渲染步骤人眼位置信号转换为视角信息,完成模型推理。对所述根据人眼位置渲染步骤输出的数据光栅化后转换到显示屏坐标系,以达到光场显示的目的。Light field reproduction step: Using the parameters in the implicit expression of the model as network weights, convert the human eye position signal in the neural radiation field rendering step into perspective information to complete model inference. The data output by the rendering step according to the human eye position is rasterized and then converted to the coordinate system of the display screen to achieve the purpose of light field display.

所述的一种基于人眼跟踪的光场图像渲染方法,其特征在于,所述目标场景获取步骤中,该数据可以是需要展示的真实场景,也可以是游戏中的虚拟场景或动画领域的三维模型。The light field image rendering method based on human eye tracking is characterized in that in the target scene acquisition step, the data can be a real scene that needs to be displayed, or it can be a virtual scene in a game or in the field of animation. 3D model.

所述的一种基于人眼跟踪的光场图像渲染方法,其特征在于,所述数据采步骤可通过飞行器对场景进行实时采样、光场相机的采样数据,或采用虚拟相机对三维模型进行虚拟采样。The described light field image rendering method based on human eye tracking is characterized in that the data acquisition step can use an aircraft to sample the scene in real time, sample data from a light field camera, or use a virtual camera to virtualize a three-dimensional model. sampling.

所述的一种基于人眼跟踪的光场图像渲染方法,其特征在于,在获取模型参数步骤中,渲染过程采用逆向渲染算法,从而获得正视的观察影像。The light field image rendering method based on human eye tracking is characterized in that, in the step of obtaining model parameters, the rendering process adopts a reverse rendering algorithm to obtain an orthographic observation image.

所述的一种基于人眼跟踪的光场图像渲染方法,其特征在于,所述光场渲染算法为实时渲染算法The light field image rendering method based on human eye tracking is characterized in that the light field rendering algorithm is a real-time rendering algorithm

本申请的有益效果是:The beneficial effects of this application are:

依上述实施例的采用一种基于人眼跟踪的光场显示系统及图像渲染方法,由于对观看者的双眼位置进行跟踪,并计算出对应的图像渲染视点信息,从而通过神经辐射场重建算法进行光场图像的渲染。结合人眼跟着使得光场立体显示器的空间带宽积得到大幅提高,并支持多人同时观看,从而降低了光场显示器对液晶面板像素高密度的设计要求,提高了光场显示器的显示效果,而且使得观看者在不同观看位置处都可以获得较好的立体沉浸感,同时还可以减少光场图像渲染算法的数据量和计算复杂度。According to the above embodiment, a light field display system and image rendering method based on human eye tracking are used. Since the position of the viewer's eyes is tracked and the corresponding image rendering viewpoint information is calculated, the neural radiation field reconstruction algorithm is used to perform Rendering of light field images. Combined with human eye tracking, the spatial bandwidth product of the light field stereoscopic display is greatly improved, and it supports multiple people to watch at the same time, thus reducing the design requirements of the light field display for the high pixel density of the liquid crystal panel, improving the display effect of the light field display, and This allows viewers to obtain better three-dimensional immersion at different viewing positions, while also reducing the amount of data and computational complexity of the light field image rendering algorithm.

附图说明Description of drawings

图1为本申请一种基于人眼跟踪的光场显示系统的原理图;Figure 1 is a schematic diagram of a light field display system based on human eye tracking in this application;

图2为本申请一种基于人眼跟踪的光场显示系统的结构示意图;Figure 2 is a schematic structural diagram of a light field display system based on human eye tracking in this application;

图3为本申请一种基于人眼跟踪的光场显示系驱动电路原理图;Figure 3 is a schematic diagram of a light field display system driving circuit based on human eye tracking in this application;

图4为本申请一种基于人眼跟踪的光场图像渲染方法的流程示意图;Figure 4 is a schematic flow chart of a light field image rendering method based on human eye tracking in this application;

图5为本申请一种基于人眼跟踪的光场采集与再现方法的示意图;Figure 5 is a schematic diagram of a light field acquisition and reproduction method based on human eye tracking in this application;

图6为本申请一种基于人眼跟踪的光场图像渲染方法的示意图;Figure 6 is a schematic diagram of a light field image rendering method based on human eye tracking in this application;

具体实施方式Detailed ways

下面通过具体实施方式结合附图对本申请作进一步详细说明。The present application will be further described in detail below through specific embodiments in conjunction with the accompanying drawings.

请参考图1,本申请公开了一种基于人眼跟踪的光场显示系统(以下简称光场显示器),其包括显示控制单元10、光场渲染单元11、驱动单元12、显示单元13、高速眼球三维跟踪单元14和视觉驱动单元15,下面分别说明。Please refer to Figure 1. This application discloses a light field display system based on human eye tracking (hereinafter referred to as light field display), which includes a display control unit 10, a light field rendering unit 11, a driving unit 12, a display unit 13, a high-speed The three-dimensional eye tracking unit 14 and the visual driving unit 15 are described separately below.

显示控制单元10用于为光场显示器提供立体显示数据和相关交互信息。在一具体实施例中,显示控制单元10为具有高稳定度和较高算力的计算机。例如,显示控制单元10可以是服务器,当然,在其他一些实施例中,显示控制单元10也可以是常规的计算机或是具备HDMI视频接口的计算机。The display control unit 10 is used to provide stereoscopic display data and related interactive information for the light field display. In a specific embodiment, the display control unit 10 is a computer with high stability and high computing power. For example, the display control unit 10 can be a server. Of course, in some other embodiments, the display control unit 10 can also be a conventional computer or a computer with an HDMI video interface.

光场渲染单元11与显示控制单元10相连,用于根据显示控制单元中视频输入接口输出的光场模型和人眼位置信息进行光场图像渲染。The light field rendering unit 11 is connected to the display control unit 10 and is used to render the light field image according to the light field model and human eye position information output by the video input interface in the display control unit.

驱动单元12与所述光场渲染单元11相连,所述驱动单元的采用模块化设计,每个模块驱动的尺寸为1920×1080,用于对高密度液晶显示面板进行分区刷新与控制。The driving unit 12 is connected to the light field rendering unit 11. The driving unit adopts a modular design. The size of each module driven is 1920×1080, and is used for partition refresh and control of high-density liquid crystal display panels.

驱动单元12的分辨率为1920×1080,用于对光场渲染单元输出的光场数字信号信号显示分配和刷新。在一具体实施例中,驱动可支持HDMI、DP或Type-C输入。The driving unit 12 has a resolution of 1920×1080 and is used to display, distribute and refresh the light field digital signal output by the light field rendering unit. In a specific embodiment, the driver may support HDMI, DP or Type-C input.

显示单元13用于为光场显示信息的呈现载体,提供物理分辨率为16K×8K的显示信息;在一具体实施例中,显示单元13为具有高分辨率的液晶显示面板和光学微结构组成的复合显示载体。例如,显示单元13可以为背光结构、液晶显示面板和光栅组成的显示装置,当然,在其他一些实施例中,显示单元13也可以是非液晶显示面板。在另一具体实施例中,如图2所示,背光模组24为液晶显示面板提供光源,人眼跟踪模块26可实时跟踪用户的观看位置,高密度液晶显示模块27的输入端与显示驱动模块23相连,高密度液晶显示模块的前面板与非球面微透镜阵列模块28贴合。The display unit 13 is used as a presentation carrier for light field display information and provides display information with a physical resolution of 16K×8K; in a specific embodiment, the display unit 13 is composed of a high-resolution liquid crystal display panel and an optical microstructure. composite display carrier. For example, the display unit 13 may be a display device composed of a backlight structure, a liquid crystal display panel and a grating. Of course, in some other embodiments, the display unit 13 may also be a non-liquid crystal display panel. In another specific embodiment, as shown in Figure 2, the backlight module 24 provides a light source for the liquid crystal display panel, the human eye tracking module 26 can track the user's viewing position in real time, and the input terminal of the high-density liquid crystal display module 27 is connected to the display driver. The modules 23 are connected, and the front panel of the high-density liquid crystal display module is bonded to the aspherical microlens array module 28 .

高速眼球三维追踪单元14,与所述显示单元13相连接,用于实时定位和追踪观看者的人眼位置信息,将所述人眼位置信息根据预定的通信协议从串口输出。在一具体实施例中,高速眼球三维追踪单元14为具有人眼三维位置检测和跟踪功能的视觉跟踪装置。The high-speed eyeball three-dimensional tracking unit 14 is connected to the display unit 13 and is used to locate and track the human eye position information of the viewer in real time, and output the human eye position information from the serial port according to a predetermined communication protocol. In a specific embodiment, the high-speed eyeball three-dimensional tracking unit 14 is a visual tracking device with human eye three-dimensional position detection and tracking functions.

视觉驱动单元15,与所述光场渲染单元11和高速眼球三维追踪单元14相连,用于将观看者的视角信息与光场渲染模型进行桥接,以根据观看者的人眼三维信息为光场渲染单元输出期望的视角渲染参数。The visual driving unit 15 is connected to the light field rendering unit 11 and the high-speed eyeball three-dimensional tracking unit 14, and is used to bridge the viewer's perspective information and the light field rendering model to form a light field based on the viewer's three-dimensional human eye information. The rendering unit outputs the desired perspective rendering parameters.

相应地,本申请提出的一种基于人眼跟踪的光场显图像渲染方法,其特征在于,包括模型训练和模型推理两个阶段,如图4所示,共包括如下步骤:Accordingly, this application proposes a light field display image rendering method based on human eye tracking, which is characterized by including two stages of model training and model inference, as shown in Figure 4, including the following steps:

S41、获取目标场景步骤:需要显示的目标场景,可为光场图像渲染单元提供原始的训练数据集,如图4所示。在一具体实施例中,目标场景可以是计算机软件中虚拟三维场景也可以是现实环境中对真实物体直接采集;S41. Step of obtaining the target scene: The target scene that needs to be displayed can provide the original training data set for the light field image rendering unit, as shown in Figure 4. In a specific embodiment, the target scene can be a virtual three-dimensional scene in computer software or a direct collection of real objects in the real environment;

S42、自适应场景采集步骤:根据所述目标场景,采用多尺度自适应方法对目标场景进行场景采样,来产生需要输出的数字图像信号。在一具体实施例中,根据模型的复杂度进行不同尺度的采样以减少信息冗余和重点区域的图像获取;S42. Adaptive scene collection step: According to the target scene, use a multi-scale adaptive method to perform scene sampling on the target scene to generate digital image signals that need to be output. In a specific embodiment, sampling at different scales is performed according to the complexity of the model to reduce information redundancy and obtain images of key areas;

S43、相机位姿估计步骤:根据采样到的场景信息对拍摄的相机的位姿进行估计。在一具体实施例中,通过对采集的图像进行特征提取、特征匹配对相机位姿进行估计;S43. Camera pose estimation step: estimate the pose of the captured camera based on the sampled scene information. In a specific embodiment, the camera pose is estimated by performing feature extraction and feature matching on the collected images;

S44、神经辐射场模型训练步骤:根据视角的位置和方向,以颜色作为监督项,采用全连接网络进行模型训练;S44. Neural radiation field model training steps: Based on the position and direction of the viewing angle, color is used as a supervision item, and a fully connected network is used for model training;

S45、获得隐式表达步骤:根据神经辐射场算法,在一具体实施例中,从全连接网络训练完成的模型中获得模型的节点信息和权重参数;S45. Obtain implicit expression step: According to the neural radiation field algorithm, in a specific embodiment, obtain the node information and weight parameters of the model from the model that has been trained by the fully connected network;

S46、光场重现步骤:以模型隐式表达中的参数作为网络权重,将所述神经辐射场渲染步骤人眼位置信号转换为视角信息,完成模型推理。在一具体实施例中,对所述根据人眼位置渲染步骤输出的数据光栅化后转换到显示屏坐标系,以达到光场显示的目的。S46. Light field reproduction step: Use the parameters in the implicit expression of the model as network weights to convert the human eye position signal in the neural radiation field rendering step into perspective information to complete model inference. In a specific embodiment, the data output from the rendering step based on the human eye position is rasterized and then converted to the coordinate system of the display screen to achieve the purpose of light field display.

本申请提出的一种基于人眼跟踪的光场显示系统及图像渲染方法,由于对观看者的双眼位置进行跟踪,并计算出对应的图像渲染视点信息,从而通过神经辐射场重建算法进行光场图像的渲染。结合人眼跟着使得光场立体显示器的空间带宽积得到大幅提高,并支持多人同时观看,从而降低了光场显示器对液晶面板像素高密度的设计要求,提高了光场显示器的显示效果,而且使得观看者在不同观看位置处都可以获得较好的立体沉浸感,同时还可以减少光场图像渲染算法的数据量和计算复杂度,下面具体分析。This application proposes a light field display system and image rendering method based on human eye tracking. Since the position of the viewer's eyes is tracked and the corresponding image rendering viewpoint information is calculated, the light field is reconstructed through a neural radiation field reconstruction algorithm. Rendering of images. Combined with human eye tracking, the spatial bandwidth product of the light field stereoscopic display is greatly improved, and it supports multiple people to watch at the same time, thus reducing the design requirements of the light field display for the high pixel density of the liquid crystal panel, improving the display effect of the light field display, and This allows viewers to obtain better three-dimensional immersion at different viewing positions, and at the same time, it can also reduce the amount of data and computational complexity of the light field image rendering algorithm, which will be analyzed in detail below.

如图1所示,本申请所提光场显示技术方案中使用显示控制单元来控制显示所需要显示的内容,其中显示控制单元输出的光场模型,除了可以输出标准的三维点云模型(.ply、.pcl、.txt等格式),还可以是三维模型的隐式表达,包含模型的权重文件和各全连接层间的连接关系。通过将非标准的三维模型等数据经过编码后通过标准的HDMI输出接口进行传输,不但具有较好的兼容性,还可用于对光场数据压缩传输和加密传输等,具有较高的时效性和安全性,光场数据的输出分辨率为1920×1080。As shown in Figure 1, the light field display technical solution proposed in this application uses a display control unit to control the content that needs to be displayed. The light field model output by the display control unit can, in addition to outputting a standard three-dimensional point cloud model (. ply, .pcl, .txt and other formats), it can also be an implicit expression of the three-dimensional model, including the weight file of the model and the connection relationship between each fully connected layer. By encoding non-standard three-dimensional model and other data and then transmitting it through the standard HDMI output interface, it not only has better compatibility, but can also be used for compressed transmission and encrypted transmission of light field data, etc., with high timeliness and For security, the output resolution of light field data is 1920×1080.

如图1所示,对于一个大型光场模型,渲染单元11需要进行数据分割,其目的是为了将分割后的数据分发到阵列化并行加速单元中进行渲染算法的加速,分割后的数据块分别在阵列化渲染单元中完成光场图像的渲染。图1中数据分割模块、光场图像渲染模块、视频接口模块输出统称为光场渲染单元,经过光场渲染单元渲染后的光场数据需要经过视频输出接口模块输出至驱动单元。As shown in Figure 1, for a large light field model, the rendering unit 11 needs to perform data segmentation. The purpose is to distribute the segmented data to the arrayed parallel acceleration unit to accelerate the rendering algorithm. The segmented data blocks are respectively The rendering of the light field image is completed in the arrayed rendering unit. In Figure 1, the data segmentation module, light field image rendering module, and video interface module output are collectively referred to as the light field rendering unit. The light field data rendered by the light field rendering unit needs to be output to the drive unit through the video output interface module.

如图1所示,驱动单元12在接收到所述光场渲染单元11的光场数据后进一步驱动高密度液晶显示单元13进行显示。为了提高光场显示器的空间带宽积,一方面需要提高光场显示器的信息量,另一方面要高密度的液晶面板作为物理支撑,但是随着显示信息量的增加对光场图像渲染单元11无疑会造成巨大的算力开销,导致显示系统无法实现流畅的光场图像播放与交互。As shown in FIG. 1 , after receiving the light field data from the light field rendering unit 11 , the driving unit 12 further drives the high-density liquid crystal display unit 13 to display. In order to increase the spatial bandwidth product of the light field display, on the one hand, it is necessary to increase the information amount of the light field display, and on the other hand, a high-density liquid crystal panel is required as a physical support. However, with the increase in the amount of displayed information, there is no doubt that the light field image rendering unit 11 This will cause huge computing power overhead, causing the display system to be unable to achieve smooth light field image playback and interaction.

受周边视觉机制启发,本申请采用人眼跟踪技术来实现渲染算法的加速以及显示带宽积的提升,进行如图1所示,本申请通过高速眼球三维追踪单元14来确定观看者的观看位置,从而让显示信息精准的面向用户生成,对哪些没有观看者观看的区域不进行光场图像的渲染,从而减少了光场图像渲染的数据量,并节约了大量的有效像素,使得更多的液晶显示资源贡献到有效的观看者区域。因此,由于高速眼球三维追踪单元14的存在,就使得光场显示器的空间带宽积提升成为了可能,空间带宽积(Space Bandwidth Product,SBP)通常的计算方法为SBP=水平分辨率*垂直分辨率*场频*1.344。为了达到良好的立体观看效果,一般要求光场显示器的空间带宽积SBP≥108,因此视觉驱动单元15的核心价值就在于建立起了光场渲染单元和高速眼球三维追踪单元间的物理逻辑衔接。Inspired by the peripheral vision mechanism, this application uses human eye tracking technology to accelerate the rendering algorithm and improve the display bandwidth product. As shown in Figure 1, this application uses a high-speed eyeball three-dimensional tracking unit 14 to determine the viewing position of the viewer. This allows the display information to be accurately generated for users, and the light field image will not be rendered in areas that are not viewed by viewers, thus reducing the amount of data for light field image rendering and saving a large number of effective pixels, allowing more LCD Display resources contribute to the valid viewer area. Therefore, due to the existence of the high-speed eyeball three-dimensional tracking unit 14, it is possible to increase the spatial bandwidth product of the light field display. The usual calculation method of the spatial bandwidth product (Space Bandwidth Product, SBP) is SBP=horizontal resolution*vertical resolution *Field frequency*1.344. In order to achieve good stereoscopic viewing effects, the spatial bandwidth product SBP of the light field display is generally required to be ≥ 10 8 . Therefore, the core value of the visual drive unit 15 lies in establishing the physical and logical connection between the light field rendering unit and the high-speed eyeball three-dimensional tracking unit. .

如图2所示,为本申请一种基于人眼跟踪的光场显示系统的系统结构图,系统包括视频接口模块21、单元模块22、驱动模块23、背光模块24光场渲染模块25、人眼跟踪模块26、高密度液晶显示模块27非球面透镜阵列模块28,可以看到,在本申请中,人眼跟踪模块与光场图像渲染模块等协同工作。As shown in Figure 2, it is a system structure diagram of a light field display system based on human eye tracking in this application. The system includes a video interface module 21, a unit module 22, a driver module 23, a backlight module 24, a light field rendering module 25, Eye tracking module 26, high-density liquid crystal display module 27, aspherical lens array module 28, it can be seen that in this application, the human eye tracking module and the light field image rendering module work together.

如图3所示,视频接口单元由单元模块31、PC接口模块32、ESD保护模块33、信号转换模块34、显示面板35、音频解码模块36、视频同步接口模块37、以及核心处理器模块38组成,采用模块化的设计思想,从而降低了阵列化显示屏驱动的设计难度。As shown in Figure 3, the video interface unit consists of a unit module 31, a PC interface module 32, an ESD protection module 33, a signal conversion module 34, a display panel 35, an audio decoding module 36, a video synchronization interface module 37, and a core processor module 38 It adopts modular design ideas, thereby reducing the difficulty of designing the array display driver.

请参照图4,一种基于人眼跟踪的光场显示图像渲染方法,算法包括获取目标场景步骤S41、自适应场景采集步骤S42、相机姿态估计步骤S43、神经辐射场模型训练步骤S44、获得隐式表达步骤S45、光场重现步骤S46。目标场景经过多尺度自适应采样后可以获得场景的多视角信息,根据采样到的场景信息进行位姿估计,完成位姿估计后根据位姿参数和多视角图像即可得到神经辐射场的网络输入,NeRF函数是将一个连续的场景表示为一个输入为5D向量的函数,包括一个空间点的3D坐标位置x=(x,y,z)以及方向(θ,φ),输出为与视角相关的该3D点的颜色c=(r,g,b)以及该位置的体密度σ。体密度σ(x)可以理解为,在空间x位置,颗粒无限小的一个微分(不透明度在位置上的微分)与辐射方向无关。相机成像辐射线r(t)=o+td;相机位置o和成像点d构成了相机成像射线,并伴随一个近点和远点的范围t,这条射线的颜色C(r)用积分的方式表示为:Please refer to Figure 4, a light field display image rendering method based on human eye tracking. The algorithm includes the step of obtaining the target scene S41, the adaptive scene acquisition step S42, the camera attitude estimation step S43, the neural radiation field model training step S44, and obtaining the hidden scene. The formula expresses step S45 and light field reproduction step S46. After multi-scale adaptive sampling of the target scene, the multi-view information of the scene can be obtained. The pose estimation is performed based on the sampled scene information. After the pose estimation is completed, the network input of the neural radiation field can be obtained based on the pose parameters and multi-view images. , the NeRF function represents a continuous scene as a function whose input is a 5D vector, including the 3D coordinate position x = (x, y, z) and direction (θ, φ) of a spatial point, and the output is perspective-related. The color of the 3D point c = (r, g, b) and the volume density σ of the position. The volume density σ(x) can be understood as, at the x position in space, a differential of infinitesimal particles (the differential of the opacity at the position) has nothing to do with the radiation direction. Camera imaging radiation r(t)=o+td; camera position o and imaging point d constitute the camera imaging ray, accompanied by a near point and far point range t. The color C(r) of this ray is calculated by integrating The way is expressed as:

其中,tf和tn分别表示射线的远点和近点,σ表示当前点的体密度,c表示当前点的RGB值,T(t)表示的是射线从tn到t这一段的累计透明度:Among them, t f and t n represent the far point and near point of the ray respectively, σ represents the volume density of the current point, c represents the RGB value of the current point, and T(t) represents the accumulation of the ray from t n to t. transparency:

其中,tn表示射线的近点,t表示射线上的任意一点,σ表示当前点的体密度。Among them, t n represents the near point of the ray, t represents any point on the ray, and σ represents the volume density of the current point.

在连续的辐射场中,针对任意视角进行渲染,就需要对穿过目标虚拟相机的每个像素的射线,求取上述颜色积分,从而得到每个像素的颜色,渲染出该视角下的成像图片。通过L2损失计算体渲染得到颜色与实际颜色的差值作为损失函数监督神经网络的学习。In a continuous radiation field, to render for any viewing angle, it is necessary to calculate the above color integral for the ray passing through each pixel of the target virtual camera to obtain the color of each pixel and render the imaging image from that viewing angle. . The difference between the color and the actual color obtained by calculating volume rendering through L2 loss is used as the loss function to supervise the learning of the neural network.

完成神经网络的训练,将场景转换为隐式表达。如果想要获得新的视角信息,只需要输入观看者的视角信息即可渲染出当前观看位置的视角信息,该视角图像经过光栅化后即可与光场显示器的物理坐标系进行匹配,达到立体显示效果。Complete the training of the neural network and convert the scene into an implicit expression. If you want to obtain new perspective information, you only need to input the viewer's perspective information to render the perspective information of the current viewing position. After rasterization, the perspective image can be matched with the physical coordinate system of the light field display to achieve stereoscopic display effect.

如图5所示,为本申请一种基于人眼跟踪的光场采集与再现方法的示意图,采集平面53通过采集透镜阵列52拍摄三维场景51,此过程为光场显示的采集过程,因为采集透镜阵列52的每个透镜都可以从不同角度采集到对应场景的特定信息,所以每个采集透镜都可以在采集平面53的特定位置采集到特定的单元图像,所有的单元图像组成了子图像阵列54。将子图像阵列54加载到显示平面55上,在显示平面后方有一个重建透镜阵列56,重建透镜阵列56要与子图像阵列54精确对应,根据光路可逆的原理就可以获得重建三维场景57,此过程为光场显示的再现过程。As shown in Figure 5, it is a schematic diagram of a light field acquisition and reproduction method based on human eye tracking in this application. The acquisition plane 53 captures the three-dimensional scene 51 through the acquisition lens array 52. This process is the acquisition process of the light field display, because the acquisition process Each lens of the lens array 52 can collect specific information of the corresponding scene from different angles, so each collection lens can collect a specific unit image at a specific position of the collection plane 53, and all unit images form a sub-image array. 54. Load the sub-image array 54 onto the display plane 55. There is a reconstruction lens array 56 behind the display plane. The reconstruction lens array 56 should accurately correspond to the sub-image array 54. According to the principle of reversible optical path, the reconstructed three-dimensional scene 57 can be obtained. The process is the reproduction process of light field display.

如图6所示,为本申请一种基于人眼跟踪的光场图像渲染方法的示意图,前文提到的显示平面61通过加载子图像阵列和显示平面上方的重建透镜阵列62可以重构三维场景的光场信息63,但是在实际观看时并不需要将整个场景光场信息,仅需要生成人眼观看所需要的图像即可。观察者在当前位置可以看到的光线称之为有效光线,而观察者在当前位置看不到的光线称之为无效光线66,所以通过人眼跟踪模块对观看者的眼睛64、65进行跟踪,将人眼位置发送给光场渲染单元,通过神经辐射场只需要渲染出针对观察者有效的子图像阵列,从而可以有效减少光场图像渲染算法的数据量和计算复杂度。As shown in Figure 6, it is a schematic diagram of a light field image rendering method based on human eye tracking in this application. The display plane 61 mentioned above can reconstruct a three-dimensional scene by loading the sub-image array and the reconstruction lens array 62 above the display plane. The light field information 63 is not required for actual viewing. It is only necessary to generate the image required for human eyes to view. The light that the observer can see at the current position is called effective light, and the light that the observer cannot see at the current position is called invalid light 66. Therefore, the viewer's eyes 64 and 65 are tracked through the human eye tracking module. , the human eye position is sent to the light field rendering unit, and only the sub-image array effective for the observer needs to be rendered through the neural radiation field, which can effectively reduce the data volume and computational complexity of the light field image rendering algorithm.

综上所述,由于对观看者的双眼位置进行跟踪,并计算出对应的图像渲染视点信息,从而通过神经辐射场重建算法进行光场图像的渲染。结合人眼跟着使得光场立体显示器的空间带宽积得到大幅提高,并支持多人同时观看,从而降低了光场显示器对液晶面板像素高密度的设计要求,提高了光场显示器的显示效果,而且使得观看者在不同观看位置处都可以获得较好的立体沉浸感,同时还可以减少光场图像渲染算法的数据量和计算复杂度。本申请结合人眼跟踪技术减少数据渲染的数据量,从而能够降低光场图像渲染对处理器高性能的要求,降低光场显示器的设计难度,使得光场图像的实时渲染和高密度液晶显示面板的设计不再是提升光场显示器空间带宽积的瓶颈,使光场显示器的输出信号带宽突破现有技术的空间带宽积(108)限制,空间带宽积是指光场显示屏光学成像的信息通量,空间带宽积简写为SPB,SPB=水平分辨率*垂直分辨率*场频*1.344,光场显示屏的空间带宽积SPB需要大于108,并且采用神经辐射场渲染技术能够在任意视角都可以获得较好的观看效果。In summary, since the position of the viewer's eyes is tracked and the corresponding image rendering viewpoint information is calculated, the light field image is rendered through the neural radiation field reconstruction algorithm. Combined with human eye tracking, the spatial bandwidth product of the light field stereoscopic display is greatly improved, and it supports multiple people to watch at the same time, thus reducing the design requirements of the light field display for the high pixel density of the liquid crystal panel, improving the display effect of the light field display, and This allows viewers to obtain better three-dimensional immersion at different viewing positions, while also reducing the amount of data and computational complexity of the light field image rendering algorithm. This application combines human eye tracking technology to reduce the amount of data for data rendering, thereby reducing the high-performance requirements of light field image rendering on the processor, reducing the design difficulty of light field displays, and enabling real-time rendering of light field images and high-density liquid crystal display panels. The design is no longer the bottleneck to improve the spatial bandwidth product of the light field display, allowing the output signal bandwidth of the light field display to break through the spatial bandwidth product (10 8 ) limit of the existing technology. The spatial bandwidth product refers to the information of the optical imaging of the light field display. Flux, spatial bandwidth product is abbreviated as SPB, SPB = horizontal resolution * vertical resolution * field frequency * 1.344. The spatial bandwidth product SPB of the light field display needs to be greater than 10 8 , and the neural radiation field rendering technology can be used at any viewing angle You can get better viewing effects.

另外,传统视差型号自由立体显示器由于串扰等造成的视觉疲劳现象,如汇聚点和调焦不在统一平面等,由于光场显示中光线数量的增加和空间带宽积的提高,视觉疲劳现象也会得到缓解,从而提高光场显示器的立体沉浸感和立体观看舒适度。In addition, the visual fatigue phenomenon caused by traditional parallax free stereoscopic displays due to crosstalk, etc., such as the convergence point and focusing are not in the same plane, will also be affected by the increase in the number of light rays and the increase in spatial bandwidth product in the light field display. Alleviation, thereby improving the stereoscopic immersion and stereoscopic viewing comfort of light field displays.

以上内容是结合具体的实施方式对本申请所作的进一步详细说明,不能认定本申请的具体实施只局限于这些说明。对于本申请所属技术领域的普通技术人员来说,在不脱离本申请发明构思的前提下,还可以做出若干简单推演或替换。The above content is a further detailed description of the present application in combination with specific implementation modes, and it cannot be concluded that the specific implementation of the present application is limited to these descriptions. For those of ordinary skill in the technical field to which this application belongs, several simple deductions or substitutions can be made without departing from the inventive concept of this application.

Claims (10)

1. A light field display system based on eye tracking, comprising:
the display control unit is used for providing three-dimensional display data and man-machine interaction related interaction information for the light field display;
the light field rendering unit is connected with the display control unit and is used for performing light field image real-time rendering according to the light field model and human eye position information output by the video input interface in the display control unit;
the driving units are connected with the light field rendering units, the driving units are in a modularized design, and the display resolution of each driving module is 2K multiplied by 1K or 4K multiplied by 2K and is used for carrying out partition refreshing and synchronous control on the high-density liquid crystal display panel;
the display unit is used for providing a display carrier for the presentation of the light field information and constructing a high-density display physical model with the resolution of 16K multiplied by 8K;
the high-speed eyeball three-dimensional tracking unit is connected with the display unit and is used for positioning and tracking the human eye position information of a viewer in real time and outputting the human eye position information from the serial bus according to a preset communication protocol;
the visual driving unit is connected with the light field rendering unit and the high-speed eyeball three-dimensional tracking unit and is used for bridging the visual angle information of the viewer with the light field rendering model so as to output expected visual angle rendering parameters for the light field rendering unit according to the three-dimensional position information of the human eye of the viewer.
2. The light field display system based on eye tracking as claimed in claim 1, wherein the display control unit is a 2D interaction unit, a user can interact with the operating system by providing a 2D display screen on one hand, and can also set the operation mode of the light field display to be a 2D operation mode, and the 2D and 3D switching control can be switched by a mouse or a gesture.
3. The eye-tracking based light field display system of claim 1, wherein the light field rendering unit is a parallel rendering core with high stability and high real-time, and performs rendering using a neural radiation field deep learning algorithm.
4. The eye-tracking based light field display system of claim 1, wherein the driving unit is a Tcon driving module having independent driving and array synchronization functions, and the resolution of the driving board is 1920 x 1080 or 3840 x 2160.
5. The eye-tracking based light field display system of any one of claims 1-4, wherein the visual drive unit provides visual angle information for image rendering to the light field rendering unit, wherein the image rendering employs an inverse ray-integral rendering technique.
6. The light field image rendering method based on human eye tracking is characterized by comprising two stages of model training and model reasoning, and comprises the following steps:
a step of acquiring a target scene: the target scene to be displayed is used for providing an original training data set for the light field image rendering unit;
and (3) self-adaptive scene acquisition: sampling the target scene by adopting a multi-scale self-adaptive method according to the target scene, and generating a digital image signal to be output;
and a camera pose estimation step: estimating the pose of a shooting camera according to the sampled scene information;
training a nerve radiation field model: preprocessing the digital waveform signals output in the data acquisition step and outputting processing results;
obtaining an implicit expression step: according to a neural radiation field algorithm, training a model by adopting a fully connected network to obtain node information and weight parameters of the model;
a light field reproduction step: and converting the human eye position signal in the neural radiation field rendering step into visual angle information by taking parameters in the model implicit expression as network weights, so as to complete model reasoning. And rasterizing the data output according to the human eye position rendering step and converting the data into a display screen coordinate system so as to achieve the aim of light field display.
7. The method for rendering a light field image based on eye tracking as claimed in claim 6, wherein in said target scene obtaining step, the data may be a real scene to be displayed, or may be a virtual scene in a game or a three-dimensional model in an animation field.
8. The method of claim 6, wherein the data sampling step is performed by sampling the scene in real time by an aircraft, sampling data by a light field camera, or virtually sampling a three-dimensional model by a virtual camera.
9. The method of light field image rendering based on eye tracking as claimed in claim 6, wherein in the step of obtaining model parameters, a reverse rendering algorithm is adopted in the rendering process, so as to obtain a front-looking observation image.
10. A light field image rendering method based on human eye tracking as claimed in any one of claims 6 to 9, wherein the light field rendering algorithm is a real-time rendering algorithm.
CN202311235605.4A 2023-09-22 2023-09-22 Light field display system and image rendering method based on human eye tracking Pending CN117319631A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311235605.4A CN117319631A (en) 2023-09-22 2023-09-22 Light field display system and image rendering method based on human eye tracking

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311235605.4A CN117319631A (en) 2023-09-22 2023-09-22 Light field display system and image rendering method based on human eye tracking

Publications (1)

Publication Number Publication Date
CN117319631A true CN117319631A (en) 2023-12-29

Family

ID=89261455

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311235605.4A Pending CN117319631A (en) 2023-09-22 2023-09-22 Light field display system and image rendering method based on human eye tracking

Country Status (1)

Country Link
CN (1) CN117319631A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118337978A (en) * 2024-04-24 2024-07-12 广东金鼎移动传媒有限公司 Naked eye 3D imaging AI conversion method based on MOST optical fiber network
CN119228966A (en) * 2024-10-08 2024-12-31 南京大学 A digital sand table system using image rendering and a light field image rendering method

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10560689B2 (en) * 2017-11-28 2020-02-11 Paul Lapstun Viewpoint-optimized light field display
US20210132693A1 (en) * 2019-11-01 2021-05-06 Raxium, Inc. Light Field Displays Incorporating Eye Trackers and Methods for Generating Views for a Light Field Display Using Eye Tracking Information
CN114666564A (en) * 2022-03-23 2022-06-24 南京邮电大学 Method for synthesizing virtual viewpoint image based on implicit neural scene representation
CN115100360A (en) * 2022-07-28 2022-09-23 中国电信股份有限公司 Image generation method and device, storage medium and electronic equipment
CN115272575A (en) * 2022-07-28 2022-11-01 中国电信股份有限公司 Image generation method and device, storage medium and electronic equipment
CN115457188A (en) * 2022-09-19 2022-12-09 遥在(山东)数字科技有限公司 3D rendering display method and system based on fixation point
CN116071278A (en) * 2022-11-29 2023-05-05 西安交通大学 UAV aerial image synthesis method, system, computer equipment and storage medium
CN116418961A (en) * 2023-06-09 2023-07-11 深圳臻像科技有限公司 Light field display method and system based on three-dimensional scene stylization

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10560689B2 (en) * 2017-11-28 2020-02-11 Paul Lapstun Viewpoint-optimized light field display
US20210132693A1 (en) * 2019-11-01 2021-05-06 Raxium, Inc. Light Field Displays Incorporating Eye Trackers and Methods for Generating Views for a Light Field Display Using Eye Tracking Information
CN114666564A (en) * 2022-03-23 2022-06-24 南京邮电大学 Method for synthesizing virtual viewpoint image based on implicit neural scene representation
CN115100360A (en) * 2022-07-28 2022-09-23 中国电信股份有限公司 Image generation method and device, storage medium and electronic equipment
CN115272575A (en) * 2022-07-28 2022-11-01 中国电信股份有限公司 Image generation method and device, storage medium and electronic equipment
CN115457188A (en) * 2022-09-19 2022-12-09 遥在(山东)数字科技有限公司 3D rendering display method and system based on fixation point
CN116071278A (en) * 2022-11-29 2023-05-05 西安交通大学 UAV aerial image synthesis method, system, computer equipment and storage medium
CN116418961A (en) * 2023-06-09 2023-07-11 深圳臻像科技有限公司 Light field display method and system based on three-dimensional scene stylization

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118337978A (en) * 2024-04-24 2024-07-12 广东金鼎移动传媒有限公司 Naked eye 3D imaging AI conversion method based on MOST optical fiber network
CN119228966A (en) * 2024-10-08 2024-12-31 南京大学 A digital sand table system using image rendering and a light field image rendering method

Similar Documents

Publication Publication Date Title
US12028502B2 (en) Three dimensional glasses free light field display using eye location
TWI813098B (en) Neural blending for novel view synthesis
US10257492B2 (en) Image encoding and display
EP3110518B1 (en) Image encoding and display
US8704882B2 (en) Simulated head mounted display system and method
US7787009B2 (en) Three dimensional interaction with autostereoscopic displays
CN117319631A (en) Light field display system and image rendering method based on human eye tracking
US20160105665A1 (en) Unassisted stereoscopic display device using directional backlight structure
CN101909219A (en) Stereoscopic display method, tracking type stereoscopic display and image processing device
CN105093551B (en) Real tri-dimension display system during landscape environment shows in real time and display packing
CN117176936B (en) A freely expandable stereoscopic digital sandbox system and light field rendering method
CN102005062A (en) Method and device for producing three-dimensional image for three-dimensional stereo display
Shen et al. Virtual stereo content rendering technology review for light-field display
TWI829552B (en) Real-world view display method, video pass-through system and computer readable storage medium
CN204496117U (en) 3d glasses
CN102063735B (en) Method and device for manufacturing three-dimensional image source by changing viewpoint angles
US12143561B2 (en) Image generation system and method
JP2008522270A (en) System and method for composite view display of single 3D rendering
Fadzli et al. 3D display for 3D telepresence: a review
Yan et al. Research summary on light field display technology based on projection
CN102256160B (en) Stereo image processing equipment and method
Delaney Forget the funny glasses [autostereoscopic display systems]
Peterka et al. Evolution of the varrier autostereoscopic vr display: 2001-2007
CN113611181B (en) Stereoscopic display method and device for virtual simulation scene
Thatte et al. Real-World Virtual Reality With Head-Motion Parallax

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination