CN103051934A

CN103051934A - Intelligent television human-machine interaction method, device and system

Info

Publication number: CN103051934A
Application number: CN2011103124784A
Authority: CN
Inventors: 刘宏; 钱跃良; 王向东
Original assignee: Institute of Computing Technology of CAS
Current assignee: Institute of Computing Technology of CAS
Priority date: 2011-10-14
Filing date: 2011-10-14
Publication date: 2013-04-17

Abstract

The present invention provides a smart TV human-computer interaction device, comprising: a remote controller module, adapted to receive user operations, and generate corresponding control signals; a server module, adapted to receive and analyze the control signals to obtain instructions; an area of interest analysis module, adapted to intercept and analyze the current TV picture according to the instruction of analyzing the current picture, so as to obtain the potential area and feed it back to the server module; and an output module, adapted to output the image of the potential area and the selected sensory area Region of interest image. And provide a smart TV human-computer interaction system based on the device, which also includes a TV module, which is suitable for receiving instructions from the output module, operating according to the instructions, and receiving and displaying data from the output module. Correspondingly, a smart TV human-computer interaction method based on the above-mentioned smart TV human-computer interaction device is also provided. The user can conveniently click on the target of interest in the TV screen by using the remote controller.

Description

Smart TV human-computer interaction method, device and system

技术领域 technical field

本发明涉及智能电视领域，特别是一种智能电视人机交互方法、装置和系统。The invention relates to the field of smart TVs, in particular to a smart TV human-computer interaction method, device and system.

背景技术 Background technique

随着社会的发展，电视将逐渐发展成为一个开放的业务承载平台，成为家庭智能娱乐终端。智能电视将拥有传统电视厂商所不具备的应用平台优势。智能电视将实现网络搜索、IP电视、视频点播(VOD)、数字音乐、网络新闻、网络视频电话等各种应用服务。电视机正在成为继计算机、手机之后的第三种信息访问终端，用户可随时访问自己需要的信息；电视机也将成为一种智能设备，实现电视、网络和程序之间跨平台搜索；智能电视还将是一个“娱乐中心”，用户可以搜索电视频道、录制电视节目、能够播放卫星和有线电视节目以及网络视频。智能电视的到来，顺应了电视机高清化、网络化、智能化的趋势。With the development of society, TV will gradually develop into an open service carrying platform and become a home intelligent entertainment terminal. Smart TV will have application platform advantages that traditional TV manufacturers do not have. Smart TV will realize various application services such as network search, IP TV, video on demand (VOD), digital music, network news, and network video calling. TV is becoming the third information access terminal after computers and mobile phones, and users can access the information they need at any time; TV will also become a smart device, enabling cross-platform search between TV, the Internet and programs; Smart TV It will also be an "entertainment center" where users can search for TV channels, record TV shows, and be able to play satellite and cable TV programming, as well as Internet video. The arrival of smart TVs conforms to the trend of high-definition, networked and intelligent TV sets.

除了这些“网络化”、“智能化”之外，作为电视的基本功能，观众对电视节目更感兴趣，比如电视剧、人物访谈、体育赛事、旅游风光等电视节目。有时候，观众往往对电视画面中的某一特定目标感兴趣，比如某一演员，运动员，某一标志建筑等，并想进一步了解该目标的详细情况，就需要对画面中的某一目标进行方便的点选。和电脑上不同的是，要在电视上实现这些功能，只能依靠一个遥控器。如何利用遥控器，对电视画面中感兴趣的目标进行方便的点选，以便进行后续的信息处理或网络搜索，需要设计一种方便快捷的人机交互方法和系统。In addition to these "networking" and "intelligence", as the basic functions of TV, viewers are more interested in TV programs, such as TV dramas, character interviews, sports events, tourism and other TV programs. Sometimes, viewers are often interested in a certain target in the TV picture, such as a certain actor, athlete, a certain landmark building, etc., and want to know more about the details of the target, they need to carry out an investigation on a certain target in the picture. Easy to click. Different from the computer, to realize these functions on the TV, you can only rely on a remote control. How to use the remote control to conveniently click on the target of interest in the TV screen for subsequent information processing or network search, it is necessary to design a convenient and fast human-computer interaction method and system.

发明内容 Contents of the invention

本发明要解决的技术问题是提供一种智能电视人机交互方法、装置和系统，使用户利用遥控器即可对电视画面中感兴趣的目标进行方便的点选。The technical problem to be solved by the present invention is to provide a smart TV human-computer interaction method, device and system, so that the user can conveniently click on the target of interest in the TV screen by using a remote control.

根据本发明的一个方面，提供一种智能电视人机交互装置，包括：遥控器模块，适于接收用户操作，产生相应的控制信号；所述控制信号包括：分析当前画面的控制信号和选择控制信号；服务器模块，适于接收并解析所述控制信号以获取指令；所述指令包括：与分析当前画面的控制信号相对应的分析当前画面的指令、与选择控制信号相对应的选定感兴趣区域指令；感兴趣区域分析模块，适于根据所述分析当前画面的指令，截取当前电视画面并对其进行分析，以获取潜在区域并反馈给服务器模块；所述服务器模块还适于将接收到的反馈数据绘制在原始图像上，成为潜在区域图像；所述服务器模块还适于接收选定感兴趣区域指令，将选定的反馈数据绘制在原始图像上，成为选定的感兴趣区域图像；和输出模块，适于输出所述潜在区域图像和选定的感兴趣区域图像。According to one aspect of the present invention, a smart TV human-computer interaction device is provided, including: a remote controller module, adapted to receive user operations, and generate corresponding control signals; the control signals include: analyzing the control signal of the current picture and selecting the control signal; the server module is adapted to receive and analyze the control signal to obtain instructions; the instructions include: an instruction for analyzing the current picture corresponding to the control signal for analyzing the current picture, and a selected interest corresponding to the selection control signal Region instructions; the region of interest analysis module is adapted to intercept the current TV image and analyze it according to the instruction of the analysis of the current image, so as to obtain the potential region and feed it back to the server module; the server module is also adapted to receive the Draw the feedback data on the original image to become a potential region image; the server module is also adapted to receive an instruction for selecting a region of interest, and draw the selected feedback data on the original image to become a selected region of interest image; and an output module adapted to output the latent region image and the selected ROI image.

可选的，所述的智能电视人机交互装置，还包括：信息检索模块，适于根据所述选定感兴趣区域指令，对选定感兴趣区域进一步进行目标分析、识别和信息检索。Optionally, the smart TV human-computer interaction device further includes: an information retrieval module, adapted to further perform target analysis, identification and information retrieval on the selected ROI according to the instruction of the selected ROI.

根据本发明的另一个方面，提供一种智能电视人机交互系统，包括：上述的智能电视人机交互装置；和电视模块，适于接收输出模块发来的指令并根据指令进行操作。According to another aspect of the present invention, a smart TV human-computer interaction system is provided, including: the above-mentioned smart TV human-computer interaction device; and a TV module adapted to receive instructions from the output module and operate according to the instructions.

可选的，所述电视模块接收并显示来自输出模块的数据。Optionally, the TV module receives and displays data from the output module.

根据本发明的又一个方面，提供一种基于上述智能电视人机交互装置的智能电视人机交互方法，包括：步骤一、接收用户操作，产生控制信号；所述控制信号包括：分析当前画面的控制信号；步骤二、解析该信号以获取指令；所述指令包括：与分析当前画面的控制信号相对应的分析当前画面的指令；步骤三、根据所述分析当前画面的指令，对当前电视画面进行分析以获取潜在区域，将潜在区域绘制在原始图像上，成为潜在区域图像，并输出潜在区域图像；步骤四、接收用户操作，产生控制信号；所述控制信号包括：选择控制信号；步骤五、解析该信号以获取指令；所述指令包括：与选择控制信号相对应的选定感兴趣区域指令；步骤六、基于选定感兴趣区域指令，将选定的反馈数据绘制在原始图像上，成为选定的感兴趣区域图像，并输出选定的感兴趣区域图像。According to another aspect of the present invention, there is provided a smart TV human-computer interaction method based on the smart TV human-computer interaction device, including: step 1, receiving user operations, and generating a control signal; the control signal includes: analyzing the current picture Control signal; Step 2, analyze this signal to obtain instruction; Described instruction comprises: the instruction of analyzing current picture corresponding to the control signal of analyzing current picture; Step 3, according to the instruction of described analyzing current picture, to current TV picture Perform analysis to obtain the potential area, draw the potential area on the original image to become a potential area image, and output the potential area image; step 4, receive user operation and generate a control signal; the control signal includes: select the control signal; step 5 , analyzing the signal to obtain instructions; the instructions include: a selected region of interest instruction corresponding to the selection control signal; step 6, drawing the selected feedback data on the original image based on the selected region of interest instruction, becomes the selected ROI image, and outputs the selected ROI image.

可选的，所述的智能电视人机交互方法，还包括：步骤七、利用选定感兴趣区域所覆盖的图像信息，对选定感兴趣区进一步进行目标分析、识别和信息检索。Optionally, the smart TV human-computer interaction method further includes: Step 7: Use the image information covered by the selected ROI to further perform target analysis, identification and information retrieval on the selected ROI.

可选的，在步骤一中，当用户按下遥控器模块的某个功能键，可以产生分析当前画面的控制信号；在步骤四中，基于潜在区域，当用户按下遥控器模块的某个数字键或字母键，可以产生选择控制信号。Optionally, in step 1, when the user presses a certain function key of the remote control module, a control signal for analyzing the current picture can be generated; in step 4, based on the potential area, when the user presses a certain function key of the remote control module Number keys or letter keys can generate selection control signals.

可选的，步骤三还包括：截取当前电视画面，对当前画面进行潜在区域分析以获取潜在区域；对潜在区域按位置进行标号；将位置和标号信息反馈给服务器模块。Optionally, Step 3 further includes: intercepting the current TV picture, analyzing the potential area of the current picture to obtain the potential area; labeling the potential area according to the position; feeding back the position and label information to the server module.

可选的，所述潜在区域分析包括人脸检测。Optionally, the latent region analysis includes face detection.

可选的，所述潜在区域用矩形表示；所述位置信息包括矩形左上角坐标和矩形右下角坐标表示；所述标号信息包括数字标号。Optionally, the potential area is represented by a rectangle; the location information includes the coordinates of the upper left corner of the rectangle and the coordinates of the lower right corner of the rectangle; and the label information includes a digital label.

与现有技术相比，本发明的优点在于：Compared with the prior art, the present invention has the advantages of:

(1)通过自动对当前画面进行分析，提供给观众一些潜在区域，观众利用普通遥控器，就可以方便的进行目标区域的点选，该方案使用简捷，符合当前观众的使用习惯，是一种方便有效的人机交互方式；(1) By automatically analyzing the current screen, some potential areas are provided to the audience, and the audience can easily select the target area by using a common remote control. This solution is simple to use and conforms to the current usage habits of the audience. Convenient and effective human-computer interaction;

(2)确定了感兴趣区域之后，可以根据实际需要进行后续的处理，比如识别该目标，根据识别结果进行网络搜索相关信息等操作。(2) After the area of interest is determined, follow-up processing can be performed according to actual needs, such as identifying the target, and performing operations such as searching for relevant information on the Internet according to the identification result.

附图说明 Description of drawings

图1是本发明一个实施例中提供的智能电视人机交互装置的结构框图；Fig. 1 is a structural block diagram of an intelligent TV human-computer interaction device provided in an embodiment of the present invention;

图2是本发明一个实施例中提供的智能电视人机交互系统的结构框图；Fig. 2 is a structural block diagram of the intelligent TV human-computer interaction system provided in one embodiment of the present invention;

图3是图2中的智能电视人机交互系统的示例的工作过程流程图；Fig. 3 is the working process flowchart of the example of the smart TV human-computer interaction system in Fig. 2;

图4是本发明一个实施例中感兴趣区域分析模块的分析结果示意图；Fig. 4 is a schematic diagram of the analysis results of the region of interest analysis module in one embodiment of the present invention;

图5是本发明一个实施例中提供的智能电视人机交互方法流程图。Fig. 5 is a flowchart of a human-computer interaction method for a smart TV provided in an embodiment of the present invention.

具体实施方式 Detailed ways

为了使本发明的目的、技术方案及优点更加清楚明白，以下结合附图，对本发明进一步详细说明。应当理解，此处所描述的具体实施例仅仅用以解释本发明，并不用于限定本发明。In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

根据本发明一个实施例，提供一种智能电视人机交互装置。如图1所示，该装置包括：According to an embodiment of the present invention, a smart TV human-computer interaction device is provided. As shown in Figure 1, the device includes:

遥控器模块101，适于接收用户操作，并产生相应的控制信号。例如，用户按下遥控器模块的某个功能键，可以产生分析当前画面的控制信号；又例如，为了对自动分析得到的潜在区域进行选择，确定真正感兴趣的区域，按下遥控器模块的某个数字键或字母键，可以产生数字或字母选择控制信号。本实施例中，所述遥控器模块101可以是长虹电视机遥控器或创维电视机遥控器。在本发明的其他实施例中，用户也可以利用带鼠标定位功能的遥控器模块101，将鼠标直接移至感兴趣区域内部，并按确定键发出数字选择信号。The remote control module 101 is adapted to receive user operations and generate corresponding control signals. For example, the user presses a certain function key of the remote control module to generate a control signal for analyzing the current picture; A number key or letter key can generate a number or letter selection control signal. In this embodiment, the remote control module 101 may be a Changhong TV remote control or a Skyworth TV remote control. In other embodiments of the present invention, the user can also use the remote control module 101 with the mouse positioning function to directly move the mouse to the inside of the region of interest, and press the OK key to send a digital selection signal.

服务器模块103，适于接收并解析遥控器模块101发来的信号以获取指令(指令格式可以是二进制编码格式，不同的编码代表不同的指令)。如果信号中包含的指令是分析当前画面的指令或选择某一区域的指令(例如数字键)，进行如下处理：The server module 103 is adapted to receive and analyze the signal sent by the remote control module 101 to obtain instructions (the instruction format may be a binary code format, and different codes represent different instructions). If the instruction contained in the signal is an instruction to analyze the current screen or an instruction to select a certain area (such as a number key), proceed as follows:

(1)如果是分析当前画面的指令，服务器模块103将调用感兴趣区域分析模块104，接收感兴趣区域分析模块104的分析结果(即反馈数据)，服务器模块103将接收到的反馈数据绘制在原始图像上成为潜在区域图像，并将该潜在区域图像发送给输出模块105；和(1) If it is an instruction to analyze the current picture, the server module 103 will invoke the ROI analysis module 104 to receive the analysis result (i.e. feedback data) of the ROI analysis module 104, and the server module 103 will draw the received feedback data on the become a latent area image on the original image, and send the latent area image to the output module 105; and

(2)如果是选择当前画面中某区域的指令，即选定感兴趣区域指令(例如数字键)，服务器模块103将接收到的指令与所述反馈数据进行比较，然后将反馈数据中的被选择区域的数据绘制在原始图像上，成为选定的感兴趣区域图像，并将该感兴趣区域图像发送给输出模块105。(2) If it is an instruction to select an area in the current screen, that is, an instruction to select an area of interest (for example, a number key), the server module 103 will compare the received instruction with the feedback data, and then compare The data of the selected region is drawn on the original image to become a selected ROI image, and the ROI image is sent to the output module 105 .

感兴趣区域分析模块104，适于对当前电视画面进行分析，具体包括：The region of interest analysis module 104 is suitable for analyzing the current TV picture, specifically including:

(1)截取当前电视画面，对当前画面进行感兴趣区域分析。感兴趣的图像区域可以通过自动分析图像特征(包括但不限于边缘特征、纹理特征等)完成，或者通过模式识别中的目标检测和识别算法进行特定类的目标检测(比如人脸检测等)，也可以利用其他技术对潜在区域进行自动检测，在此不限制使用的具体方法。(1) Intercept the current TV picture, and analyze the region of interest on the current picture. The image area of interest can be completed by automatically analyzing image features (including but not limited to edge features, texture features, etc.), or through target detection and recognition algorithms in pattern recognition for specific types of target detection (such as face detection, etc.), Other technologies can also be used to automatically detect potential regions, and the specific method used is not limited here.

下面以人脸检测为例，说明感兴趣区域分析模块104对人脸目标的检测过程(可以利用opencv中公开的人脸检测的训练和检测模块)：首先对电视节目中的人脸进行标注，作为人脸分类器的正例样本，再收集部分不含人脸的图片作为反例样本，调用人脸检测的训练模块，生成人脸检测分类器。在待检测图片上，调用人脸检测模块，扫描整个图像，利用已经训练好的人脸分类器对图像中的人脸目标进行检测，并给出最终检测到的人脸区域。具体的技术细节和操作过程，为目前学术界和工业界共知的技术，在此不再详细描述。其他的目标训练和检测方法，类似于此。也可以利用其他技术对潜在的目标进行检测，在此不限制使用的具体方法。Take face detection as an example below to illustrate the detection process of the ROI analysis module 104 to the face target (the training and detection module of the face detection disclosed in opencv can be utilized): first the faces in the TV program are marked, As the positive samples of the face classifier, some pictures without faces are collected as negative samples, and the training module of face detection is called to generate the face detection classifier. On the picture to be detected, call the face detection module, scan the entire image, use the trained face classifier to detect the face target in the image, and give the finally detected face area. The specific technical details and operation process are currently well-known technologies in the academic and industrial circles, and will not be described in detail here. Other object training and detection methods are similar to this. Potential targets can also be detected using other techniques, and the specific methods used are not limited here.

这些分析得到的感兴趣区域可能是人脸等区域，可以用矩形或圆形参数表示，其位置信息可以用矩形表示，也可以用圆或椭圆信息表示，比如矩形的左上角坐标，矩形的长宽等信息。在本发明的其他实施例中，上述区域可以以其他可用的形状参数表示。The regions of interest obtained by these analyzes may be areas such as faces, which can be represented by rectangle or circle parameters, and their position information can be represented by rectangle, circle or ellipse information, such as the coordinates of the upper left corner of the rectangle, the length of the rectangle wide information. In other embodiments of the present invention, the aforementioned regions may be represented by other available shape parameters.

(2)对分析得到当前画面中的潜在区域按位置进行标号；并将位置和标号信息反馈给服务器模块103。例如，标号可以从1开始，可以是1、2、3等数字键。在本发明的其他实施例中，上述标号方式可以以其他可用的方式进行，例如字母编号、汉字编号等。(2) Label the analyzed potential areas in the current frame by position; and feed back the position and label information to the server module 103 . For example, the label can start from 1, and can be number keys such as 1, 2, 3, etc. In other embodiments of the present invention, the above-mentioned numbering methods can be implemented in other available ways, such as letter numbers, Chinese character numbers, and the like.

感兴趣区域分析模块104的分析结果可以用下面的数据格式表示：The analysis results of the region of interest analysis module 104 can be expressed in the following data formats:

{m，(1，(x11，y11)，(x12，y12))，(2，(x21，y21)，(x22，y22))，...}{m, (1, (x11, y11), (x12, y12)), (2, (x21, y21), (x22, y22)), ...}

其中m表示分析得到的m个潜在区域，(1，(x11，y11)，(x12，y12))表示第一个感兴趣区域的矩形表示，用矩形左上角坐标和矩形右下角坐标表示。Among them, m represents the m potential regions obtained by the analysis, and (1, (x11, y11), (x12, y12)) represents the rectangular representation of the first region of interest, expressed by the coordinates of the upper left corner of the rectangle and the coordinates of the lower right corner of the rectangle.

例如，如图4所示，分析结果可表示为：For example, as shown in Figure 4, the analysis results can be expressed as:

{2，(1，(100，50)，(200，200))，(2，(250，70)，(320，180))}。{2, (1, (100, 50), (200, 200)), (2, (250, 70), (320, 180))}.

上述的服务器模块103和感兴趣区域分析模块104可以由微处理器芯片分别实现，或在一个微处理器芯片上完成服务器模块103和感兴趣区域分析模块104的功能。The server module 103 and the region of interest analysis module 104 mentioned above can be respectively implemented by a microprocessor chip, or the functions of the server module 103 and the region of interest analysis module 104 can be completed on one microprocessor chip.

输出模块105，接收并输出服务器模块103发来的数据。The output module 105 receives and outputs the data sent by the server module 103 .

例如，如图4所示，服务器模块103接收到的指令是遥控器按下数字键1，即选择了第一个感兴趣区域，则服务器模块删除其他的感兴趣2，仅保留感兴趣1，并将数据{1，(1，(100，50)，(200，200))}表示的图形绘制到原始图像上，然后发送给输出模块105，输出模块105将该数据发送给电视机进行显示。For example, as shown in Figure 4, the instruction received by the server module 103 is that the remote controller presses the number key 1, that is, the first area of interest is selected, then the server module deletes other interest 2, and only keeps interest 1, Draw the graph represented by the data {1, (1, (100, 50), (200, 200))} onto the original image, and then send it to the output module 105, and the output module 105 sends the data to the TV for display .

进一步的，该智能电视人机交互装置还可以包括：信息检索模块106。Further, the smart TV human-computer interaction device may also include: an information retrieval module 106 .

当服务器模块103接收到的指令是选择当前画面中某一区域(例如数字键)，服务器模块103还调用信息检索模块106。When the instruction received by the server module 103 is to select a certain area (such as a number key) in the current screen, the server module 103 also calls the information retrieval module 106 .

信息检索模块106适于接收服务器模块转发来的观众感兴趣区域数据信息，比如{1，(1，(100，50)，(200，200))}，并在当前电视画面中截取感兴趣区域所覆盖的图像信息，利用服务器模块存储的数据，对区域内容进一步进行目标分析和识别，包括并不限于，演员，运动员，采访嘉宾，标志性建筑等。以人脸为例，服务器模块可以提前存储该电视剧中出现的角色的人脸图像，并对这些人脸图像进行了名字标注，建立人脸库。将感区域图像信息和人脸库中的图像信息进行比对，从而确定感兴趣区域中的角色身份。根据该识别结果对该目标进行服务器端或者网络检索，并得到最终的检索结果反馈给服务器模块103。信息检索方法包括并不限于图像匹配，目标匹配，关键字检索等，并可以利用特定电视节目的演员信息，嘉宾信息等。信息检索的结果可以是图像、视频或者文字等信息。The information retrieval module 106 is adapted to receive the viewer's region of interest data information forwarded by the server module, such as {1, (1, (100, 50), (200, 200))}, and intercept the region of interest in the current TV screen The covered image information uses the data stored in the server module to further analyze and identify the target of the regional content, including but not limited to, actors, athletes, interview guests, landmark buildings, etc. Taking human faces as an example, the server module can store the facial images of characters appearing in the TV series in advance, and tag these facial images with names to build a human face library. Compare the image information of the sensing area with the image information in the face database to determine the identity of the character in the area of interest. The target is searched on the server side or on the network according to the recognition result, and the final search result is obtained and fed back to the server module 103 . Information retrieval methods include but are not limited to image matching, target matching, keyword retrieval, etc., and can utilize actor information, guest information, etc. of a specific TV program. The results of information retrieval can be information such as images, videos, or texts.

服务器模块103接收感兴趣区域分析模块104和信息检索模块106的分析结果和检索结果，并将其发送给输出模块105。输出模块105可以将该数据发送给电视机进行显示。在本发明的其他实施例中，也可以发送给其他设备进行表达，以给用户提供指示(包括但不限于图形、色彩、声音)。The server module 103 receives the analysis results and retrieval results from the ROI analysis module 104 and the information retrieval module 106 and sends them to the output module 105 . The output module 105 can send the data to the TV for display. In other embodiments of the present invention, the expressions may also be sent to other devices to provide instructions (including but not limited to graphics, colors, and sounds) to users.

结合上述装置，本发明一个实施例中，提供一种智能电视人机交互系统。如图2所示，该系统包括上述的智能电视人机交互装置和电视模块107。所述电视模块107适于接收输出模块105发来的具体指令(包括传统遥控器涉及的命令，比如换台，调音量等等)和显示图像，并根据指令进行操作，所述操作、处理过程皆为目前学术界和工业界的公知技术，不再赘述。In combination with the above devices, an embodiment of the present invention provides a smart TV human-computer interaction system. As shown in FIG. 2 , the system includes the above-mentioned smart TV human-computer interaction device and a TV module 107 . The TV module 107 is suitable for receiving specific instructions sent by the output module 105 (including commands related to traditional remote controllers, such as changing channels, adjusting volume, etc.) and displaying images, and operating according to the instructions. The operation and processing process All are known technologies in the current academic and industrial circles, and will not be repeated here.

如图3所示，该系统的工作过程如下：As shown in Figure 3, the working process of the system is as follows:

S101、遥控器按键发出分析当前画面信号；即当观众对当前画面中的某区域感兴趣，希望进一步了解该目标的信息时，需要按下遥控器上的某个功能按键；S101. The button of the remote controller sends a signal for analyzing the current picture; that is, when the viewer is interested in a certain area in the current picture and wants to know more about the target, he needs to press a certain function button on the remote controller;

S102、服务器模块解析该信号，调用感兴趣区域分析模块；S102. The server module analyzes the signal, and calls the ROI analysis module;

S103、感兴趣区域分析模块将分析结果反馈给服务器模块；感兴趣区域分析模块首先截取当前画面，自动分析当前画面，利用算法获取用户潜在区域，并按顺序进行标号，最后将潜在的感兴趣区域位置信息和标号信息反馈给服务器模块；S103, the region of interest analysis module feeds back the analysis results to the server module; the region of interest analysis module first intercepts the current picture, automatically analyzes the current picture, uses an algorithm to obtain the potential regions of the user, and labels them in order, and finally lists the potential regions of interest The position information and label information are fed back to the server module;

S104、服务器模块将编号之后的分析结果数据绘制到原始图像，并发送给输出模块；S104. The server module draws the numbered analysis result data to the original image, and sends it to the output module;

S105、遥控器按数字键选择某一个感兴趣区域；用户根据屏幕显示的自动分析得到的潜在区域，利用数字键进行点选；S105. The remote controller presses the number keys to select a certain area of interest; the user uses the number keys to click on the potential area obtained from the automatic analysis displayed on the screen;

S106、服务器模块解析该信号，调用信息检索模块；S106. The server module parses the signal, and invokes the information retrieval module;

S107、信息检索模块根据用户点选的感兴趣区域，对目标进行检索并将检索结果反馈给服务器模块；检索结果可以是图像、视频或者文字信息，网页信息等；S107. The information retrieval module retrieves the target according to the region of interest selected by the user and feeds the retrieval result back to the server module; the retrieval result can be image, video or text information, webpage information, etc.;

S108、服务器模块将检索结果发送给输出模块。S108. The server module sends the retrieval result to the output module.

与上述的装置相应的，本发明一个实施例中，提供一种智能电视人机交互方法，主要包括根据画面内容自动生成潜在区域，确定感兴趣区域两个步骤。如图5所示，所述方法具体包括：Corresponding to the above-mentioned device, in one embodiment of the present invention, a smart TV human-computer interaction method is provided, which mainly includes two steps of automatically generating a potential area according to screen content and determining an area of interest. As shown in Figure 5, the method specifically includes:

S201、接收用户操作，产生控制信号；所述控制信号包括：分析当前画面的控制信号；其过程与上述遥控器模块101的工作过程相同，不再赘述；S201. Receive a user operation and generate a control signal; the control signal includes: analyzing the control signal of the current screen; the process is the same as that of the above-mentioned remote control module 101, and will not be described again;

S202、解析该信号以获取指令；所述指令包括：与分析当前画面的控制信号相对应的分析当前画面的指令；S202. Analyze the signal to obtain an instruction; the instruction includes: an instruction for analyzing the current image corresponding to the control signal for analyzing the current image;

S203、根据所述分析当前画面的指令，对当前电视画面进行分析以获取潜在区域，将潜在区域绘制在原始图像上，成为潜在区域图像，并输出潜在区域图像；其过程与上述服务器模块103、感兴趣区域分析模块104的工作过程相同，不再赘述；S203. According to the instruction for analyzing the current picture, analyze the current TV picture to obtain the potential area, draw the potential area on the original image to become a potential area image, and output the potential area image; the process is the same as that of the server module 103, The working process of the region of interest analysis module 104 is the same and will not be repeated;

S204、接收用户操作，产生控制信号；所述控制信号包括：选择控制信号；S204. Receive a user operation and generate a control signal; the control signal includes: a selection control signal;

S205、解析该信号以获取指令；所述指令包括：与选择控制信号相对应的选定感兴趣区域指令；S205. Analyze the signal to obtain an instruction; the instruction includes: an instruction for selecting a region of interest corresponding to the selection control signal;

S206、基于选定感兴趣区域指令，将选定的反馈数据绘制在原始图像上，成为选定的感兴趣区域图像，并输出选定的感兴趣区域图像。该过程与上述服务器模块103描述相同，不再赘述。S206. Draw the selected feedback data on the original image based on the instruction of selecting the region of interest to become the image of the selected region of interest, and output the image of the selected region of interest. This process is the same as the description of the server module 103 above, and will not be repeated here.

进一步的，上述方法在步骤S206后还包括：Further, the above method also includes after step S206:

S207、利用选定感兴趣区域所覆盖的图像信息，对选定感兴趣区进一步进行目标分析和识别。该过程与上述服务器模块103、信息检索模块106的描述相同，不再赘述。S207. Using the image information covered by the selected ROI, further perform target analysis and recognition on the selected ROI. This process is the same as the description of the server module 103 and the information retrieval module 106 above, and will not be repeated here.

利用上述方法和系统，如果观众对当前电视画面中的某些部分感兴趣，想进一步了解相关信息，需要按动遥控器上的某一按钮，遥控器通过无线信号发送给电视机，电视机解析这一指令，截取当前图像，利用内容分析模块，对当前图像进行分析，通过一定的算法，得到一些用户可能感兴趣的区域，并进行标号，同时在当前图像中显示出来，由用户利用遥控器的数字键对标示出的目标框进行选择。该结果反馈给电视机，由信息检索模块，根据选择的目标，进行后台搜索或网络搜索，将结果呈现出来，供观众了解更多的信息。Using the above method and system, if the viewer is interested in some parts of the current TV screen and wants to know more about relevant information, he needs to press a certain button on the remote control, and the remote control sends a wireless signal to the TV, and the TV resolves This command intercepts the current image, uses the content analysis module to analyze the current image, and obtains some areas that the user may be interested in through a certain algorithm, and labels them, and displays them in the current image at the same time, and the user uses the remote control Number keys to select the marked target frame. The result is fed back to the TV, and the information retrieval module performs background search or network search according to the selected target, and presents the result for the viewer to learn more information.

应该注意到并理解，在不脱离后附的权利要求所要求的本发明的精神和范围的情况下，能够对上述详细描述的本发明做出各种修改和改进。因此，要求保护的技术方案的范围不受所给出的任何特定示范教导的限制。It should be noted and understood that various modifications and improvements can be made to the invention described in detail above without departing from the spirit and scope of the invention as claimed in the appended claims. Accordingly, the scope of the claimed technical solution is not limited by any particular exemplary teaching given.

Claims

1. A smart TV human-computer interaction device, comprising:

The remote control module is adapted to receive user operations and generate corresponding control signals; the control signals include: analyzing the control signals of the current screen and selecting the control signals;

The server module is adapted to receive and analyze the control signal to obtain instructions; the instructions include: an instruction for analyzing the current picture corresponding to the control signal for analyzing the current picture, and an instruction for selecting the region of interest corresponding to the selection control signal ;

The region of interest analysis module is adapted to intercept and analyze the current TV picture according to the instruction of analyzing the current picture, so as to obtain the potential area and feed it back to the server module; the server module is also suitable for receiving the feedback data drawing on the original image to become a potential region image; the server module is further adapted to receive an instruction for selecting a region of interest, and draw the selected feedback data on the original image to become a selected region of interest image; and

An output module adapted to output the latent region image and the selected ROI image.

2. The smart TV human-computer interaction device according to claim 1, further comprising:

The information retrieval module is adapted to further perform target analysis, identification and information retrieval on the selected ROI according to the instruction of selecting ROI.

3. A smart TV human-computer interaction system, comprising:

The smart TV human-computer interaction device according to any one of claims 1-2; and

The television module is suitable for receiving instructions from the output module and operating according to the instructions.

4. The smart TV human-computer interaction system according to claim 3, wherein the TV module receives and displays data from the output module.

5. A smart TV human-computer interaction method, comprising:

Step 1, receiving user operation and generating a control signal; the control signal includes: analyzing the control signal of the current screen;

Step 2, analyzing the signal to obtain an instruction; the instruction includes: an instruction for analyzing the current image corresponding to the control signal for analyzing the current image;

Step 3. According to the instruction of analyzing the current picture, analyze the current TV picture to obtain the potential area, draw the potential area on the original image to become a potential area image, and output the potential area image;

Step 4, receiving user operation and generating a control signal; the control signal includes: selection control signal;

Step 5, analyzing the signal to obtain an instruction; the instruction includes: an instruction for selecting a region of interest corresponding to the selection control signal;

Step 6: Drawing the selected feedback data on the original image based on the instruction of selecting the region of interest to become the image of the selected region of interest, and outputting the image of the selected region of interest.

6. The intelligent TV human-computer interaction method according to claim 5, further comprising:

Step 7: Use the image information covered by the selected ROI to further perform target analysis, recognition and information retrieval on the selected ROI.

7. The intelligent TV human-computer interaction method according to claim 5, wherein,

In step 1, when the user presses a certain function key of the remote controller module, a control signal for analyzing the current screen can be generated;

In step four, based on the potential area, when the user presses a number key or letter key of the remote controller module, a selection control signal may be generated.

8. The intelligent TV human-computer interaction method according to claim 5, wherein, step 3 also includes:

Capture the current TV picture, and analyze the potential area of the current picture to obtain the potential area;

Label potential regions by location;

Feed back location and label information to the server module.

9. The smart TV human-computer interaction method according to claim 8, wherein the potential region analysis includes face detection.

10. The intelligent TV human-computer interaction method according to claim 8, wherein the potential area is represented by a rectangle; the location information includes coordinates of the upper left corner of the rectangle and the coordinates of the lower right corner of the rectangle; and the label information includes a digital label.