CN107944501A

CN107944501A - Image-recognizing method and device

Info

Publication number: CN107944501A
Application number: CN201711310937.9A
Authority: CN
Inventors: 林形省
Original assignee: Beijing Xiaomi Mobile Software Co Ltd
Current assignee: Beijing Xiaomi Mobile Software Co Ltd
Priority date: 2017-12-11
Filing date: 2017-12-11
Publication date: 2018-04-20

Abstract

The disclosure provides an image recognition method and device. The disclosed image recognition method includes: acquiring image features and size information of an object to be recognized; and identifying the object to be recognized according to the image feature and size information. The present disclosure can improve the accuracy of image recognition and improve user experience.

Description

Image recognition method and device

技术领域technical field

本公开涉及图像识别技术，尤其涉及一种图像识别方法及装置。The present disclosure relates to image recognition technology, in particular to an image recognition method and device.

背景技术Background technique

随着人工智能的发展，图像识别技术日趋完善。具体地，图像识别的发展经历了三个阶段：文字识别、数字图像处理与识别和物体识别。With the development of artificial intelligence, image recognition technology is becoming more and more perfect. Specifically, the development of image recognition has gone through three stages: character recognition, digital image processing and recognition, and object recognition.

目前，对物体的识别基本都是基于物品的图像特征。这样，当两个物品的外观相似时，当前的图像识别就不能区分开这两个物体。At present, the recognition of objects is basically based on the image features of the items. In this way, current image recognition cannot distinguish two objects when their appearances are similar.

发明内容Contents of the invention

为克服相关技术中存在的问题，本公开提供一种图像识别方法及装置。所述技术方案如下：In order to overcome the problems existing in related technologies, the present disclosure provides an image recognition method and device. Described technical scheme is as follows:

根据本公开实施例的第一方面，提供一种图像识别方法。该方法包括：According to the first aspect of the embodiments of the present disclosure, an image recognition method is provided. The method includes:

获取待识别物体的图像特征及尺寸信息；Obtain the image features and size information of the object to be recognized;

根据图像特征及尺寸信息，识别待识别物体。According to the image features and size information, identify the object to be identified.

本公开的实施例提供的技术方案可以包括以下有益效果：通过图像特征及尺寸信息识别待识别物体，相比仅基于图像特征识别待识别物体，可进一步区分外观类似但尺寸有差别的物体，提高图像识别的准确度，提升用户体验。The technical solutions provided by the embodiments of the present disclosure may include the following beneficial effects: the object to be recognized can be identified through image features and size information, compared to identifying objects to be recognized based only on image features, objects with similar appearances but different sizes can be further distinguished, improving The accuracy of image recognition improves user experience.

可选地，上述获取待识别物体的图像特征及尺寸信息，可以包括：Optionally, the acquisition of image features and size information of the object to be identified may include:

通过摄像头获取待识别物体的图像特征及深度信息；Obtain the image features and depth information of the object to be recognized through the camera;

根据深度信息，获得尺寸信息。According to the depth information, the size information is obtained.

可选地，上述摄像头为单目摄像头或双目摄像头。Optionally, the aforementioned cameras are monocular cameras or binocular cameras.

可选地，上述根据图像特征及所述尺寸信息，识别待识别物体，可具体包括：Optionally, the above-mentioned identifying the object to be identified based on the image features and the size information may specifically include:

根据图像特征，确定第一物体集合，该第一物体集合至少包括待识别物体；Determining a first object set according to the image features, the first object set includes at least the object to be identified;

在第一物体集合中，根据尺寸信息识别待识别物体。In the first object set, identify the object to be identified according to the size information.

可选地，上述图像识别方法还可包括：Optionally, the above-mentioned image recognition method may also include:

获取待识别物体的标签信息；Obtain the label information of the object to be identified;

相应地，上述根据图像特征及所述尺寸信息，识别待识别物体，可以包括：Correspondingly, the above-mentioned identification of the object to be identified based on the image features and the size information may include:

根据图像特征及尺寸信息，确定第二物体集合，该第二物体集合至少包括待识别物体；Determining a second object set according to the image feature and size information, the second object set includes at least the object to be identified;

在第二物体集合中，根据标签信息识别待识别物体。In the second object set, identify the object to be identified according to the tag information.

本公开的实施例提供的技术方案可以包括以下有益效果：通过图像特征及尺寸信息初步识别待识别物体，确定待识别物体所在的集合，即第二物体集合，在该第二物体集合内进一步根据标签信息识别出待识别物体，从而实现外观类似且尺寸相当的物体的识别，进一步提高图像识别的准确度及用户体验。The technical solutions provided by the embodiments of the present disclosure may include the following beneficial effects: preliminarily identify the object to be identified through image features and size information, determine the set where the object to be identified is located, that is, the second object set, and in the second object set further according to The tag information identifies the object to be recognized, so as to realize the recognition of objects with similar appearance and size, and further improve the accuracy of image recognition and user experience.

根据本公开实施例的第二方面，提供一种图像识别装置。该图像识别装置包括：第一获取模块和识别模块；其中，According to a second aspect of the embodiments of the present disclosure, an image recognition device is provided. The image recognition device includes: a first acquisition module and a recognition module; wherein,

第一获取模块，被配置为获取待识别物体的图像特征及尺寸信息；The first acquisition module is configured to acquire image features and size information of the object to be identified;

识别模块，被配置为根据图像特征及尺寸信息，识别待识别物体。The identification module is configured to identify the object to be identified according to image features and size information.

可选地，上述第一获取模块被配置为：通过摄像头获取待识别物体的图像特征及深度信息；根据深度信息，获得尺寸信息。Optionally, the above-mentioned first acquisition module is configured to: acquire image features and depth information of the object to be identified through a camera; and acquire size information according to the depth information.

可选地，上述摄像头可以为单目摄像头或双目摄像头。Optionally, the aforementioned cameras may be monocular cameras or binocular cameras.

可选地，上述识别模块包括：Optionally, the above identification module includes:

确定子模块，被配置为根据图像特征，确定第一物体集合，该第一物体集合至少包括待识别物体；The determining submodule is configured to determine a first object set according to image features, and the first object set includes at least the object to be identified;

识别子模块，被配置为在第一物体集合中，根据尺寸信息识别待识别物体。The identification submodule is configured to identify the object to be identified according to the size information in the first object set.

可选地，上述图像识别装置还包括：Optionally, the above-mentioned image recognition device also includes:

第二获取模块，被配置为获取待识别物体的标签信息；The second acquisition module is configured to acquire label information of the object to be identified;

相应地，识别模块被配置为：Accordingly, the recognition module is configured to:

根据本公开实施例的第三方面，提供一种图像识别装置。该图像识别装置包括：处理器和用于存储可执行指令的存储器；其中，处理器被配置为执行所述可执行指令，以执行如第一方面任一项所述的图像识别方法。According to a third aspect of the embodiments of the present disclosure, an image recognition device is provided. The image recognition device includes: a processor and a memory for storing executable instructions; wherein the processor is configured to execute the executable instructions to perform the image recognition method according to any one of the first aspect.

根据本公开实施例的第四方面，提供一种计算机可读存储介质，其上存储有计算机程序，该计算机程序被处理器执行时实现如第一方面任一项所述的图像识别方法。According to a fourth aspect of the embodiments of the present disclosure, there is provided a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the image recognition method according to any one of the first aspect is implemented.

应当理解的是，以上的一般描述和后文的细节描述仅是示例性和解释性的，并不能限制本公开。It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the present disclosure.

附图说明Description of drawings

为了更清楚地说明本公开实施例或现有技术中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本公开的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present disclosure or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some embodiments of the present disclosure. Those skilled in the art can also obtain other drawings based on these drawings without any creative effort.

图1是根据一示例性实施例示出的一种图像识别方法的流程图；Fig. 1 is a flowchart of an image recognition method shown according to an exemplary embodiment;

图2是根据另一示例性实施例示出的一种图像识别方法的流程图；Fig. 2 is a flowchart of an image recognition method according to another exemplary embodiment;

图3是根据又一示例性实施例示出的一种图像识别方法的流程图；Fig. 3 is a flowchart of an image recognition method according to yet another exemplary embodiment;

图4是根据一示例性实施例示出的一种图像识别装置的结构示意图；Fig. 4 is a schematic structural diagram of an image recognition device according to an exemplary embodiment;

图5是根据另一示例性实施例示出的一种图像识别装置的结构示意图；Fig. 5 is a schematic structural diagram of an image recognition device according to another exemplary embodiment;

图6是根据又一示例性实施例示出的一种图像识别装置的结构示意图；Fig. 6 is a schematic structural diagram of an image recognition device according to yet another exemplary embodiment;

图7是根据一示例性实施例示出的一种图像识别装置框图。Fig. 7 is a block diagram of an image recognition device according to an exemplary embodiment.

通过上述附图，已示出本公开明确的实施例，后文中将有更详细的描述。这些附图和文字描述并不是为了通过任何方式限制本公开构思的范围，而是通过参考特定实施例为本领域技术人员说明本公开的概念。By means of the above-mentioned drawings, certain embodiments of the present disclosure have been shown and will be described in more detail hereinafter. These drawings and written description are not intended to limit the scope of the disclosed concept in any way, but to illustrate the disclosed concept for those skilled in the art by referring to specific embodiments.

具体实施方式Detailed ways

这里将详细地对示例性实施例进行说明，其示例表示在附图中。下面的描述涉及附图时，除非另有表示，不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反，它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present disclosure as recited in the appended claims.

本公开的说明书和权利要求书中的术语“第一”、“第二”等是用于区别类似的对象，而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换，以便这里描述的本公开的实施例例如能够以除了在这里图示或描述的那些以外的顺序实施。此外，术语“包括”和“具有”以及他们的任何变形，意图在于覆盖不排他的包含，例如，包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元，而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second" and the like in the specification and claims of the present disclosure are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances such that the embodiments of the disclosure described herein are, for example, capable of practice in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" and "having", as well as any variations thereof, are intended to cover a non-exclusive inclusion, for example, a process, method, system, product or device comprising a sequence of steps or elements is not necessarily limited to the expressly listed instead, may include other steps or elements not explicitly listed or inherent to the process, method, product or apparatus.

多个，包括两个或者两个以上。Multiple, including two or more.

和/或，应当理解，对于本公开中使用的术语“和/或”，其仅仅是一种描述关联对象的关联关系，表示可以存在三种关系。例如，A和/或B，可以表示：单独存在A，同时存在A和B，单独存在B这三种情况。And/or, it should be understood that the term "and/or" used in the present disclosure is only an association relationship describing associated objects, indicating that there may be three relationships. For example, A and/or B may mean that A exists alone, A and B exist simultaneously, and B exists alone.

由于实际应用中存在很多外观类似的物体(例如，部分智能手机的外观类似)，这些外观类似的物体的图像特征基本相同，因此，仅通过图像特征不能区分开这些物体，导致图像识别率低。考虑到这些外观类似的物体的尺寸可能会存在差异，因此，本公开在图像识别时引入尺寸信息，结合图像特征和尺寸信息识别物体，提高图像识别的准确度，提升用户体验。Since there are many similar-looking objects in practical applications (for example, some smartphones have similar appearances), the image features of these similar-looking objects are basically the same. Therefore, these objects cannot be distinguished only by image features, resulting in a low image recognition rate. Considering that there may be differences in the sizes of these objects with similar appearances, the present disclosure introduces size information during image recognition, combines image features and size information to identify objects, improves the accuracy of image recognition, and improves user experience.

图1是根据一示例性实施例示出的一种图像识别方法的流程图。本实施例提供一种图像识别方法，该方法可以由图像识别装置来执行，该装置可通过硬件和/或软件的方式实现。Fig. 1 is a flow chart of an image recognition method according to an exemplary embodiment. This embodiment provides an image recognition method, which can be executed by an image recognition device, and the device can be realized by means of hardware and/or software.

图像识别装置可以是手持设备、可穿戴设备、车载装置、计算机和服务器等具有图像识别功能的设备。手持设备可以为智能手机、平板电脑及个人数字助理等。可穿戴设备包括增强现实(Augmented Reality，AR)设备和虚拟现实(Virtual Reality，VR)设备等仿真设备。The image recognition device may be a device with image recognition function such as a handheld device, a wearable device, a vehicle-mounted device, a computer, and a server. Handheld devices can be smartphones, tablet computers, personal digital assistants, and the like. Wearable devices include simulation devices such as augmented reality (Augmented Reality, AR) devices and virtual reality (Virtual Reality, VR) devices.

如图1所示，该图像识别方法包括以下步骤：As shown in Figure 1, the image recognition method includes the following steps:

在步骤101中，获取待识别物体的图像特征及尺寸信息。In step 101, image features and size information of an object to be recognized are acquired.

图像特征可包括图像的颜色特征、纹理特征、形状特征和空间关系特征。其中，颜色特征和纹理特征为全局特征，描述了图像或图像区域所对应的景物的表面性质；形状特征有两类表示方法，一类是轮廓特征，另一类是区域特征，图像的轮廓特征主要针对物体的外边界，而图像的区域特征则关系到整个形状区域；空间关系特征，是指图像中分割出来的多个目标之间的相互的空间位置或相对方向关系，这些关系也可分为连接或邻接关系、交叠或重叠关系和包含或包容关系等。Image features may include image color features, texture features, shape features and spatial relationship features. Among them, the color feature and the texture feature are global features, which describe the surface properties of the scene corresponding to the image or image area; there are two types of representation methods for the shape feature, one is the contour feature, and the other is the regional feature, the contour feature of the image It is mainly aimed at the outer boundary of the object, while the regional features of the image are related to the entire shape area; the spatial relationship feature refers to the mutual spatial position or relative direction relationship between multiple targets segmented in the image, and these relationships can also be divided into For connection or adjacency relationship, overlap or overlap relationship and containment or containment relationship, etc.

尺寸信息，用于描述待识别物体的外在尺寸的信息，外在尺寸例如为长宽高等中的一个或多个。该尺寸信息的获取可以是执行该步骤的主体通过其他设备获取，或者，执行该步骤的主体通过其内置部件获取，例如，智能手机通过其内置摄像头获取尺寸信息；或者，执行该步骤的主体已存储有该尺寸信息，获取的过程即读取的过程。Dimension information, used to describe the information of the external dimensions of the object to be recognized, such as one or more of the length, width, and height. The size information can be obtained by the subject performing this step through other devices, or the subject performing this step obtains it through its built-in components, for example, a smart phone obtains the size information through its built-in camera; or, the subject performing this step has The size information is stored, and the process of obtaining is the process of reading.

在步骤102中，根据图像特征及尺寸信息，识别待识别物体。In step 102, the object to be recognized is recognized according to the image feature and size information.

该步骤表明，本公开在图像识别时，不单纯基于图像特征，还要结合尺寸信息。这样，当多个尺寸存在差异的物体之间不能通过图像特征进行辨别时，可以通过本公开提供的图像识别方法，即图像特征结合尺寸信息来进行区分，提高图像识别率。This step shows that in the present disclosure, image recognition is not based solely on image features, but also combined with size information. In this way, when multiple objects with different sizes cannot be distinguished by image features, the image recognition method provided in the present disclosure can be used to distinguish them by combining image features with size information to improve the image recognition rate.

综上所述，本实施例提供的图像识别方法，通过图像特征及尺寸信息识别待识别物体，相比仅基于图像特征识别待识别物体，可进一步区分外观类似但尺寸有差别的物体，提高图像识别的准确度，提升用户体验。To sum up, the image recognition method provided in this embodiment identifies objects to be recognized through image features and size information. Compared with identifying objects to be recognized based only on image features, it can further distinguish objects with similar appearances but different sizes, and improve image quality. The accuracy of recognition improves user experience.

图2是根据另一示例性实施例示出的一种图像识别方法的流程图。参考图2，在图1所示流程的基础上，步骤101、获取待识别物体的图像特征及尺寸信息，可以具体包括：Fig. 2 is a flowchart of an image recognition method according to another exemplary embodiment. Referring to FIG. 2, on the basis of the process shown in FIG. 1, step 101, acquiring image features and size information of the object to be identified may specifically include:

在步骤201中，通过摄像头获取待识别物体的图像特征及深度信息。In step 201, image features and depth information of an object to be recognized are acquired through a camera.

该步骤包括：通过摄像头获取待识别物体的图像特征，以及，通过摄像头获取待识别物体的深度信息。This step includes: acquiring image features of the object to be identified through the camera, and acquiring depth information of the object to be identified through the camera.

其中，通过摄像头获取待识别物体的图像特征的具体描述可参考相关技术。接下来说明如何通过摄像头获取待识别物体的深度信息。Wherein, for the specific description of acquiring the image features of the object to be recognized through the camera, reference may be made to related technologies. Next, how to obtain the depth information of the object to be recognized through the camera.

对于深度信息，可以理解，在摄像机坐标系中，以垂直成像平面并穿过镜面中心的直线为Z轴，若物体在摄像机坐标系的坐标为(X，Y，Z)，则其中的Z值即为物体在该摄像机成像平面的深度信息。For the depth information, it can be understood that in the camera coordinate system, the straight line perpendicular to the imaging plane and passing through the center of the mirror is the Z axis. If the coordinates of the object in the camera coordinate system are (X, Y, Z), then the Z value That is, the depth information of the object in the imaging plane of the camera.

实现时，可以通过摄像头获取深度信息。When implemented, the depth information can be obtained through the camera.

深度信息的获取方式至少可包括以下两种：The acquisition methods of depth information may include at least the following two methods:

获取方式一，单目深度估计方法Obtaining method 1, monocular depth estimation method

单目深度估计方法是基于一幅图像来估计它的深度信息。例如，单目深度估计方法可包括基于图像内容理解的深度估计方法和基于聚焦的深度估计方法等。The monocular depth estimation method is based on an image to estimate its depth information. For example, monocular depth estimation methods may include depth estimation methods based on image content understanding, depth estimation methods based on focusing, and the like.

获取方式二，双目深度估计方法Obtaining method 2, binocular depth estimation method

通过两个摄像头成像，因为两个摄像头之间存在一定的距离，所以同一景物通过两个镜头所成的图像存在视差，通过视差估计出深度信息。Imaging through two cameras, because there is a certain distance between the two cameras, there is parallax in the image of the same scene through the two lenses, and the depth information is estimated through the parallax.

基于上述分析可知，本公开涉及的摄像头可为单目摄像头或双目摄像头，具体实现可根据时机需求进行设置，本公开不予限制。Based on the above analysis, it can be seen that the camera involved in the present disclosure may be a monocular camera or a binocular camera, and the specific implementation may be set according to timing requirements, which is not limited in the present disclosure.

在步骤202中，根据深度信息，获得尺寸信息。In step 202, size information is obtained according to the depth information.

根据深度信息，计算得到待识别物体的尺寸信息。具体可通过待识别物体的多个边缘点坐标(包括深度信息)计算获得尺寸信息。其中，尺寸信息包括待识别物体的三维大小信息。According to the depth information, the size information of the object to be recognized is calculated. Specifically, the size information can be obtained by calculating multiple edge point coordinates (including depth information) of the object to be recognized. Wherein, the size information includes three-dimensional size information of the object to be recognized.

可选地，步骤102、根据图像特征及所述尺寸信息，识别待识别物体，可具体包括：Optionally, step 102, identifying the object to be identified according to the image features and the size information may specifically include:

在步骤203中，根据图像特征，确定第一物体集合。In step 203, a first object set is determined according to image features.

该第一物体集合至少包括待识别物体。The first set of objects includes at least objects to be identified.

由于外观类似的物体的图像特征基本相同，因此，该步骤通过图像特征初步确定待识别物体所在的集合，也就是第一物体集合。Since the image features of objects with similar appearance are basically the same, in this step, the set in which the object to be recognized is located is preliminarily determined based on the image features, that is, the first object set.

在步骤204中，在第一物体集合中，根据尺寸信息识别待识别物体。In step 204, in the first object set, identify the object to be identified according to the size information.

通过该步骤，在第一物体集合中匹配到与待识别物体的尺寸信息相应的物体，完成图像识别。Through this step, an object corresponding to the size information of the object to be recognized is matched in the first object set, and the image recognition is completed.

图3是根据又一示例性实施例示出的一种图像识别方法的流程图。参考图3，在图1所示流程的基础上，该图像识别方法还可包括：Fig. 3 is a flow chart of an image recognition method according to yet another exemplary embodiment. Referring to FIG. 3, on the basis of the process shown in FIG. 1, the image recognition method may also include:

在步骤301中，获取待识别物体的标签信息。In step 301, tag information of an object to be recognized is acquired.

标签信息包括待识别物体的名称。进一步地，标签信息还可以包括材质、制造商和产地等。The tag information includes the name of the object to be identified. Further, the label information may also include material, manufacturer and place of origin, etc.

该标签信息的获取方式与上述尺寸信息的获取方式类似。例如，该标签信息可已存储于执行该图像识别方法的主体，通过读取的方式获取；或者，实时获取标签信息，示例性地，终端设备通过摄像头获取待识别物体的标签信息，等等。The way to acquire the label information is similar to the way to acquire the above-mentioned size information. For example, the tag information may have been stored in the main body executing the image recognition method and obtained by reading; or, the tag information may be obtained in real time, for example, the terminal device obtains the tag information of the object to be recognized through a camera, and so on.

需说明的是，本公开不限定步骤301及步骤101的执行顺序，二者可以同时被执行，或者先后被执行，例如，先执行步骤101再执行步骤301，或者，先执行步骤301再执行步骤101。It should be noted that the present disclosure does not limit the order of execution of step 301 and step 101, the two may be executed simultaneously or successively, for example, step 101 is executed first and then step 301 is executed, or step 301 is executed first and then step 301 is executed 101.

此时，步骤102、根据图像特征及尺寸信息，识别待识别物体，可以包括：At this time, step 102, identifying the object to be identified according to the image features and size information may include:

在步骤302中，根据图像特征及尺寸信息，确定第二物体集合。In step 302, a second object set is determined according to image features and size information.

该第二物体集合至少包括待识别物体。The second set of objects includes at least objects to be identified.

可以理解，当多个外观相似的物体，其中部分或全部物体的尺寸信息也相当时，此时，根据图像特征及尺寸信息就不能识别出待识别物体，只能确定该待识别物体所在的集合，即第二物体集合。It can be understood that when there are multiple objects with similar appearance, some or all of them have similar size information, at this time, the object to be recognized cannot be identified according to the image features and size information, and only the set of the object to be recognized can be determined. , that is, the second set of objects.

在步骤303中，在第二物体集合中，根据标签信息识别待识别物体。In step 303, in the second object set, identify the object to be identified according to the tag information.

由于标签信息中包含各物体的基本信息，例如，名称，因此，根据标签信息可以在第二物体集合中准确识别出待识别物体。Since the tag information includes basic information of each object, such as a name, the object to be recognized can be accurately identified in the second object set according to the tag information.

综上所述，本实施例提供的方法，通过图像特征及尺寸信息初步识别待识别物体，确定待识别物体所在的集合，即第二物体集合，在该第二物体集合内进一步根据标签信息识别出待识别物体，从而实现外观类似且尺寸相当的物体的识别，进一步提高图像识别的准确度及用户体验。To sum up, the method provided in this embodiment initially identifies the object to be identified through image features and size information, determines the set where the object to be identified is located, that is, the second object set, and further identifies objects in the second object set according to the label information. The object to be recognized can be identified, so as to realize the recognition of objects with similar appearance and size, and further improve the accuracy of image recognition and user experience.

下述为本公开装置实施例，可以用于执行本公开方法实施例。对于本公开装置实施例中未披露的细节，请参照本公开方法实施例。The following are device embodiments of the present disclosure, which can be used to implement the method embodiments of the present disclosure. For details not disclosed in the disclosed device embodiments, please refer to the disclosed method embodiments.

图4是根据一示例性实施例示出的一种图像识别装置的结构示意图。本公开提供一种图像识别装置，该装置可通过硬件和/或软件的方式实现。Fig. 4 is a schematic structural diagram of an image recognition device according to an exemplary embodiment. The present disclosure provides an image recognition device, which can be realized by means of hardware and/or software.

参照图4，该图像识别装置40包括第一获取模块41和识别模块42。Referring to FIG. 4 , the image recognition device 40 includes a first acquisition module 41 and a recognition module 42 .

该第一获取模块41，被配置为获取待识别物体的图像特征及尺寸信息。The first acquiring module 41 is configured to acquire image features and size information of the object to be identified.

该识别模块42，被配置为根据图像特征及尺寸信息，识别待识别物体。The identification module 42 is configured to identify the object to be identified according to image features and size information.

综上所述，本实施例提供的图像识别装置，通过图像特征及尺寸信息识别待识别物体，相比仅基于图像特征识别待识别物体，可进一步区分外观类似但尺寸有差别的物体，提高图像识别的准确度，提升用户体验。To sum up, the image recognition device provided in this embodiment can identify objects to be recognized through image features and size information. Compared with identifying objects to be recognized based only on image features, it can further distinguish objects with similar appearances but different sizes, and improve image quality. The accuracy of recognition improves user experience.

可选地，第一获取模块41被配置为：通过摄像头获取待识别物体的图像特征及深度信息；根据深度信息，获得尺寸信息。Optionally, the first acquiring module 41 is configured to: acquire image features and depth information of the object to be identified through a camera; and acquire size information according to the depth information.

可选地，上述摄像头可以为单目摄像头或双目摄像头等。Optionally, the aforementioned camera may be a monocular camera or a binocular camera.

图5是根据另一示例性实施例示出的一种图像识别装置的结构示意图。参考图5，在图4所示结构的基础上，图像识别装置50中，识别模块42可以包括：确定子模块421和识别子模块422。其中，Fig. 5 is a schematic structural diagram of an image recognition device according to another exemplary embodiment. Referring to FIG. 5 , on the basis of the structure shown in FIG. 4 , in the image recognition device 50 , the recognition module 42 may include: a determination submodule 421 and a recognition submodule 422 . in,

该确定子模块421，被配置为根据图像特征，确定第一物体集合，该第一物体集合至少包括待识别物体。The determining sub-module 421 is configured to determine a first object set according to image features, and the first object set includes at least the object to be identified.

该识别子模块422，被配置为在第一物体集合中，根据尺寸信息识别待识别物体。The identification submodule 422 is configured to identify the object to be identified according to the size information in the first object set.

图6是根据又一示例性实施例示出的一种图像识别装置的结构示意图。参考图6，在图4所示结构的基础上，图像识别装置60还包括：第二获取模块61。Fig. 6 is a schematic structural diagram of an image recognition device according to yet another exemplary embodiment. Referring to FIG. 6 , on the basis of the structure shown in FIG. 4 , the image recognition device 60 further includes: a second acquisition module 61 .

该第二获取模块61，被配置为获取待识别物体的标签信息。The second acquiring module 61 is configured to acquire tag information of the object to be identified.

相应地，该实施例中，识别模块42被配置为：根据图像特征及尺寸信息，确定第二物体集合，该第二物体集合至少包括待识别物体；在第二物体集合中，根据标签信息识别待识别物体。Correspondingly, in this embodiment, the identification module 42 is configured to: determine the second object set according to the image features and size information, the second object set includes at least the object to be identified; in the second object set, identify object to be identified.

本实施例通过图像特征及尺寸信息初步识别待识别物体，确定待识别物体所在的集合，即第二物体集合，在该第二物体集合内进一步根据标签信息识别出待识别物体，从而实现外观类似且尺寸相当的物体的识别，进一步提高图像识别的准确度及用户体验。In this embodiment, the object to be identified is preliminarily identified through image features and size information, and the set where the object to be identified is determined, that is, the second object set. In the second object set, the object to be identified is further identified according to the label information, so as to achieve a similar appearance. Moreover, the recognition of objects of comparable size further improves the accuracy of image recognition and user experience.

图7是根据一示例性实施例示出的一种图像识别装置框图。参照图7，该图像识别装置800包括可以包括以下一个或多个组件：处理组件802，存储器804，电源组件806，多媒体组件808，音频组件810，输入/输出(input/output，简称：I/O)接口812，传感器组件814，以及通信组件816。Fig. 7 is a block diagram of an image recognition device according to an exemplary embodiment. 7, the image recognition device 800 includes one or more of the following components: a processing component 802, a memory 804, a power supply component 806, a multimedia component 808, an audio component 810, an input/output (input/output, referred to as: I/ O) interface 812, sensor component 814, and communication component 816.

处理组件802通常控制图像识别装置800的整体操作，诸如与显示，数据通信，相机操作和记录操作相关联的操作。处理组件802可以包括一个或多个处理器820来执行指令，以完成上述的方法的全部或部分步骤。此外，处理组件802可以包括一个或多个模块，便于处理组件802和其他组件之间的交互。例如，处理组件802可以包括多媒体模块，以方便多媒体组件808和处理组件802之间的交互。The processing component 802 generally controls the overall operations of the image recognition device 800, such as operations associated with display, data communication, camera operations, and recording operations. The processing component 802 may include one or more processors 820 to execute instructions to complete all or part of the steps of the above method. Additionally, processing component 802 may include one or more modules that facilitate interaction between processing component 802 and other components. For example, processing component 802 may include a multimedia module to facilitate interaction between multimedia component 808 and processing component 802 .

存储器804被配置为存储各种类型的数据以支持在图像识别装置800的操作。这些数据的示例包括用于在图像识别装置800上操作的任何应用程序或方法的指令等。存储器804可以由任何类型的易失性或非易失性存储设备或者它们的组合实现，如静态随机存取存储器(Static Random Access Memory，简称：SRAM)，电可擦除可编程只读存储器(Electrically Erasable Programmable Read-Only Memory，简称：EEPROM)，可擦除可编程只读存储器(Erasable Programmable Read Only Memory，简称：EPROM)，可编程只读存储器(Programmable Red-Only Memory，简称：PROM)，只读存储器(Read-Only Memory，简称：ROM)，磁存储器，快闪存储器，磁盘或光盘。The memory 804 is configured to store various types of data to support operations in the image recognition device 800 . Examples of such data include instructions for any application programs or methods operating on the image recognition device 800 , and the like. The memory 804 can be implemented by any type of volatile or non-volatile storage device or their combination, such as Static Random Access Memory (Static Random Access Memory, referred to as: SRAM), Electrically Erasable Programmable Read-Only Memory (EPROM) Electrically Erasable Programmable Read-Only Memory (EEPROM for short), Erasable Programmable Read Only Memory (EPROM for short), Programmable Red-Only Memory (PROM for short), Read-Only Memory (Read-Only Memory, ROM for short), magnetic memory, flash memory, magnetic disk or optical disk.

电源组件806为图像识别装置800的各种组件提供电力。电源组件806可以包括电源管理系统，一个或多个电源，及其他与为图像识别装置800生成、管理和分配电力相关联的组件。The power supply component 806 provides power to various components of the image recognition device 800 . Power components 806 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for image recognition device 800 .

多媒体组件808包括在所述图像识别装置800和用户之间的提供一个输出接口的屏幕。在一些实施例中，屏幕可以包括OLED显示屏和触摸面板(Touch Panel，简称：TP)。如果OLED显示屏包括触摸面板，OLED显示屏可以被实现为触摸屏，以接收来自用户的输入信号。触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。所述触摸传感器可以不仅感测触摸或滑动动作的边界，而且还检测与所述触摸或滑动操作相关的持续时间和压力。The multimedia component 808 includes a screen providing an output interface between the image recognition device 800 and the user. In some embodiments, the screen may include an OLED display and a touch panel (Touch Panel, TP for short). If the OLED display includes a touch panel, the OLED display may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touches, swipes, and gestures on the touch panel. The touch sensor may not only sense a boundary of a touch or swipe action, but also detect duration and pressure associated with the touch or swipe action.

音频组件810被配置为输出和/或输入音频信号。例如，音频组件810包括一个麦克风(Microphone，简称：MIC)，当图像识别装置800处于按摩模式时，麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器804或经由通信组件816发送。在一些实施例中，音频组件810还包括一个扬声器，用于输出音频信号。The audio component 810 is configured to output and/or input audio signals. For example, the audio component 810 includes a microphone (Microphone, MIC for short), and when the image recognition device 800 is in the massage mode, the microphone is configured to receive an external audio signal. Received audio signals may be further stored in memory 804 or sent via communication component 816 . In some embodiments, the audio component 810 also includes a speaker for outputting audio signals.

I/O接口812为处理组件802和外围接口模块之间提供接口，上述外围接口模块可以是按钮等。这些按钮可包括但不限于：主页按钮、音量按钮、启动按钮和锁定按钮。The I/O interface 812 provides an interface between the processing component 802 and a peripheral interface module, and the peripheral interface module may be a button or the like. These buttons may include, but are not limited to: a home button, volume buttons, start button, and lock button.

传感器组件814包括一个或多个传感器，用于为图像识别装置800提供各个方面的状态评估。Sensor assembly 814 includes one or more sensors for providing image recognition device 800 with various aspects of status assessment.

通信组件816被配置为便于图像识别装置800和其他设备之间有线或无线方式的通信。图像识别装置800可以接入基于通信标准的无线网络，如无线保真(Wireless-Fidelity，简称：Wi-Fi)，2G或3G，或它们的组合。在一个示例性实施例中，所述通信组件816还包括近场通信(Near Field Communication，简称：NFC)模块，以促进短程通信。例如，在NFC模块可基于射频识别(Radio Frequency Identification，简称：RFID)技术，红外数据协会(Infrared Data Association，简称：IrDA)技术，超宽带(Ultra Wideband，简称：UWB)技术，蓝牙(Bluetooth，简称：BT)技术和其他技术来实现。The communication component 816 is configured to facilitate wired or wireless communication between the image recognition apparatus 800 and other devices. The image recognition apparatus 800 may access a wireless network based on a communication standard, such as Wireless-Fidelity (Wireless-Fidelity, Wi-Fi for short), 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 816 further includes a near field communication (Near Field Communication, NFC for short) module to facilitate short-range communication. For example, the NFC module can be based on radio frequency identification (Radio Frequency Identification, referred to as: RFID) technology, infrared data association (Infrared Data Association, referred to as: IrDA) technology, ultra wideband (Ultra Wideband, referred to as: UWB) technology, Bluetooth (Bluetooth, Abbreviation: BT) technology and other technologies to achieve.

在示例性实施例中，图像识别装置800可以被一个或多个应用专用集成电路(Application Specific Integrated Circuit，简称：ASIC)、数字信号处理器(DigitalSignal Processor，简称：DSP)、数字信号处理设备(Digital Signal Processing Device，简称：DSPD)、可编程逻辑器件(Programmable Logic Device，简称：PLD)、现场可编程门阵列(Field Programmable Gate Array，简称：FPGA)、控制器、微控制器、微处理器或其他电子元件实现，用于执行上述方法。In an exemplary embodiment, the image recognition apparatus 800 may be implemented by one or more application-specific integrated circuits (Application Specific Integrated Circuit, ASIC for short), digital signal processors (Digital Signal Processor, DSP for short), digital signal processing devices ( Digital Signal Processing Device, referred to as: DSPD), programmable logic device (Programmable Logic Device, referred to as: PLD), field programmable gate array (Field Programmable Gate Array, referred to as: FPGA), controller, microcontroller, microprocessor or other electronic components to implement the above method.

在示例性实施例中，还提供了一种包括指令的非临时性计算机可读存储介质，例如包括指令的存储器804，上述指令可由图像识别装置800的处理器820执行以完成上述方法。例如，所述非临时性计算机可读存储介质可以是ROM、随机存取存储器(Random AccessMemory，简称：RAM)、只读光盘(Compact Disc Read-Only Memory，简称：CD-ROM)、磁带、软盘和光数据存储设备等。In an exemplary embodiment, there is also provided a non-transitory computer-readable storage medium including instructions, such as the memory 804 including instructions, which can be executed by the processor 820 of the image recognition device 800 to complete the above method. For example, the non-transitory computer-readable storage medium may be ROM, random access memory (Random Access Memory, referred to as: RAM), read-only disc (Compact Disc Read-Only Memory, referred to as: CD-ROM), magnetic tape, floppy disk and optical data storage devices, etc.

一种非临时性计算机可读存储介质，当所述存储介质中的指令由终端设备的处理器执行时，使得终端设备能够执行该方法：获取待识别物体的图像特征及尺寸信息；根据图像特征及尺寸信息，识别待识别物体。A non-transitory computer-readable storage medium, when the instructions in the storage medium are executed by the processor of the terminal device, the terminal device can execute the method: acquire the image feature and size information of the object to be identified; according to the image feature and size information to identify the object to be identified.

本领域技术人员在考虑说明书及实践这里公开的发明后，将容易想到本公开的其它实施方案。本申请旨在涵盖本公开的任何变型、用途或者适应性变化，这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的，本公开的真正范围和精神由下面的权利要求书指出。Other embodiments of the present disclosure will be readily apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any modification, use or adaptation of the present disclosure, and these modifications, uses or adaptations follow the general principles of the present disclosure and include common knowledge or conventional technical means in the technical field not disclosed in the present disclosure . The specification and examples are to be considered exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.

应当理解的是，本公开并不局限于上面已经描述并在附图中示出的精确结构，并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求书来限制。It should be understood that the present disclosure is not limited to the precise constructions which have been described above and shown in the drawings, and various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims

1. An image recognition method, characterized in that, comprising:

Obtain the image features and size information of the object to be recognized;

The object to be recognized is recognized according to the image feature and the size information.

2. The method according to claim 1, wherein said acquiring image features and size information of the object to be identified comprises:

Obtaining image features and depth information of the object to be identified through a camera;

According to the depth information, the size information is obtained.

3. The method according to claim 1, wherein the identifying the object to be identified according to the image features and the size information comprises:

Determine a first object set according to the image features, the first object set includes at least the object to be identified;

In the first object set, the object to be identified is identified according to the size information.

4. The method according to any one of claims 1 to 3, characterized in that the method further comprises:

Acquiring tag information of the object to be identified;

Correspondingly, the identifying the object to be identified according to the image feature and the size information includes:

determining a second object set according to the image features and the size information, the second object set including at least the object to be identified;

In the second object set, the object to be identified is identified according to the tag information.

5. An image recognition device, characterized in that it comprises:

The first acquisition module is configured to acquire image features and size information of the object to be identified;

The identification module is configured to identify the object to be identified according to the image features and the size information.

6. The device according to claim 5, wherein the first acquisition module is configured to:

According to the depth information, the size information is obtained.

7. The device according to claim 5, wherein the identification module comprises:

The determining submodule is configured to determine a first object set according to the image features, and the first object set includes at least the object to be identified;

The identification submodule is configured to identify the object to be identified in the first object set according to the size information.

8. The device according to any one of claims 5 to 7, further comprising:

The second acquiring module is configured to acquire the label information of the object to be identified;

Correspondingly, the identification module is configured to:

9. An image recognition device, characterized in that it comprises:

processor and memory for storing executable instructions;

Wherein, the processor is configured to execute the executable instructions to perform the image recognition method according to any one of claims 1-4.

10. A computer-readable storage medium, on which a computer program is stored, wherein when the computer program is executed by a processor, the image recognition method according to any one of claims 1 to 4 is implemented.