
CN106709531B - Method and device for identifying substances used by multi-process matched tobacco materials - Google Patents


Info

Publication number
CN106709531B
Authority
CN
China
Prior art keywords
matching
information
materials
tobacco
tobacco material
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710042035.5A
Other languages
Chinese (zh)
Other versions
CN106709531A (en)
Inventor
冯伟华
李国政
赵乐
周浩
王洪波
郝辉
张鹏
邱建华
樊美娟
鲁平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Tobacco Henan Industrial Co Ltd
Zhengzhou Tobacco Research Institute of CNTC
Original Assignee
China Tobacco Henan Industrial Co Ltd
Zhengzhou Tobacco Research Institute of CNTC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Tobacco Henan Industrial Co Ltd and Zhengzhou Tobacco Research Institute of CNTC
Priority to CN201710042035.5A
Publication of CN106709531A
Application granted
Publication of CN106709531B
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/22 Matching criteria, e.g. proximity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/22 Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Image Analysis (AREA)

Abstract

The invention provides a multi-process matching method and device for identifying substances used in cigarette materials. The method first scans a paper document or picture of tobacco materials and recognizes the cigarette material information in it. The recognized information is then matched against a cigarette material licensing standard database; if the match succeeds, the result is displayed. If the match fails, the information is matched against a historical experience library; if that match succeeds, the result is displayed. The invention addresses the prior-art problem that tobacco material standards are inconsistent and take many forms, which makes identification and matching difficult. Because the constructed historical experience library is self-learning, a second match against it after an unsuccessful match against the licensing standard database improves matching accuracy.

Figure 201710042035

Description

A multi-process matching method and device for identifying substances used in cigarette materials

Technical field

The invention belongs to the technical field of text recognition and detection, and in particular relates to a multi-process matching method and device for identifying substances used in cigarette materials.

Background

Cigarette materials are the materials used in cigarette production, including cigarette paper, steel-stamp printing ink for cigarette paper, filter rod forming paper, cigarette tipping paper, cigarette inner liner paper, cigarette frame paper, packaging paper (carton and pack), seal paper, water-based adhesive for cigarettes, hot-melt adhesive for cigarettes, cellulose diacetate tow for cigarettes, polypropylene fiber tow for cigarettes, and triacetin for cigarettes.

Substances used in cigarette materials are the raw materials used in producing cigarette materials, the substances added for the intended use that improve, or help to improve, their quality and characteristics, and the processing aids added to keep the production process running smoothly rather than to improve the quality or characteristics of the final product.

The use of cigarette materials directly affects the safety of cigarette products. Tobacco companies have drawn up lists of permitted substances, and every cigarette material that goes into a cigarette product must be reviewed for compliance against the standard. At present, permitted substances are audited manually: the batch substance information reported by suppliers is checked for compliance item by item. Because there are many kinds of cigarette materials, the categories of permitted substances are diverse, and the reported substance data is not standardized, the review workload is very large and places a heavy burden on reviewers. In addition, tobacco material standards are inconsistent and take many forms, which makes identification and matching difficult.

Summary of the invention

The purpose of the invention is to provide a multi-process matching method and device for identifying substances used in cigarette materials, so as to solve the prior-art problem that tobacco material standards are inconsistent and take many forms, which makes identification and matching difficult.

To solve the above technical problem, the technical solution of the invention is as follows.

The invention provides an intelligent multi-process matching method for identifying substances used in cigarette materials, including the following method schemes:

Method scheme 1 includes the following steps:

1) Scan a paper document or picture of tobacco materials and recognize the cigarette material information in it;

2) Match the recognized cigarette material information against the cigarette material licensing standard database and, if the match succeeds, display the result; the cigarette material licensing standard database stores information on the substances permitted for use in cigarette materials;

3) If the match fails, match the cigarette material information against the historical experience library and, if that match succeeds, display the result; the historical experience library stores information on substances used in cigarette materials that were previously matched successfully or unsuccessfully.

Method scheme 2, on the basis of method scheme 1, further includes the step of storing the result of matching against the historical experience library in the historical experience library.
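The two-stage lookup and the storing step of method schemes 1 and 2 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the class `SubstanceMatcher`, the `MatchResult` record, the stage labels, and the use of an in-memory set and dictionary in place of the two databases are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Set


@dataclass
class MatchResult:
    query: str
    permitted_name: Optional[str]  # None means no permitted substance was found
    stage: str                     # "standard", "experience", or "unmatched"


class SubstanceMatcher:
    """Hypothetical two-stage matcher: standard database first, experience library second."""

    def __init__(self, permitted: Set[str], experience: Dict[str, Optional[str]]):
        # permitted: names stored in the licensing standard database (assumed exact strings)
        # experience: earlier outcomes, alias -> permitted name, or None for a known failure
        self.permitted = permitted
        self.experience = experience

    def identify(self, name: str) -> MatchResult:
        key = name.strip()
        if key in self.permitted:              # first match: licensing standard database
            return MatchResult(key, key, "standard")
        if key in self.experience:             # second match: historical experience library
            return MatchResult(key, self.experience[key], "experience")
        return MatchResult(key, None, "unmatched")

    def record(self, name: str, permitted_name: Optional[str]) -> None:
        # Method scheme 2: store the confirmed outcome so later lookups can reuse it.
        self.experience[name.strip()] = permitted_name
```

For example, with `{"calcium carbonate"}` as the permitted set and `{"limestone": "calcium carbonate"}` as the experience library, `identify("limestone")` falls through the first stage and is resolved by the second.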

Method schemes 3 and 4, on the basis of method schemes 1 and 2 respectively: when the cigarette material information is matched against the historical experience library, the feature set of the cigarette material information is measured with the formula

Figure BDA0001215146150000021 (formula image)

and the results are displayed in descending order of matching similarity; here C_i is the center of a feature set cluster, x is a distribution point of the cluster, k is the number of feature data sets, and E is the optimal value of the result.
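The formula itself survives here only as an image reference. One expression that is consistent with the symbol definitions given (cluster centers C_i, cluster points x, k feature sets, optimum E) is the within-cluster sum of squared errors used in k-means clustering. This is an assumption, not the patent's confirmed formula; S_i is a symbol introduced here for the set of points assigned to the cluster with center C_i.

```latex
% Assumed reconstruction, not taken from the patent figure:
% within-cluster sum of squared errors over k clusters.
E = \sum_{i=1}^{k} \sum_{x \in S_i} \lVert x - C_i \rVert^{2}
```

Under this reading, a smaller E means that the feature points lie closer to their cluster centers.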

Method schemes 5 and 6, on the basis of method schemes 1 and 2 respectively, recognize the cigarette material information in the paper document or picture of tobacco materials by OCR.

The invention also provides a multi-process matching device for identifying substances used in cigarette materials, including the following device schemes:

Device scheme 1 includes the following modules:

a module for scanning a paper document or picture of tobacco materials and recognizing the cigarette material information in it;

a module for matching the recognized cigarette material information against the cigarette material licensing standard database and displaying the result if the match succeeds; the cigarette material licensing standard database stores information on the substances permitted for use in cigarette materials;

a module for matching the cigarette material information against the historical experience library if the match fails and displaying the result if that match succeeds; the historical experience library stores information on substances used in cigarette materials that were previously matched successfully or unsuccessfully.

Device scheme 2, on the basis of device scheme 1, further includes a module for storing the result of matching against the historical experience library in the historical experience library.

Device schemes 3 and 4, on the basis of device schemes 1 and 2 respectively: when the cigarette material information is matched against the historical experience library, the feature set of the cigarette material information is measured with the formula

Figure BDA0001215146150000022 (formula image)

and the results are displayed in descending order of matching similarity; here C_i is the center of a feature set cluster, x is a distribution point of the cluster, k is the number of feature data sets, and E is the optimal value of the result.
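As a worked numeric illustration, the sketch below computes a value E for a set of clusters on the assumption that the formula (available here only as an image reference) is the k-means within-cluster sum of squared distances between each point x and its cluster center C_i. That reading, the helper names `centroid` and `sse`, and the representation of features as tuples of floats are assumptions made for the example.

```python
from typing import Sequence, Tuple

Point = Tuple[float, ...]


def centroid(points: Sequence[Point]) -> Point:
    """Mean of the points, used as the cluster center C_i."""
    n = len(points)
    dims = len(points[0])
    return tuple(sum(p[d] for p in points) / n for d in range(dims))


def sse(clusters: Sequence[Sequence[Point]]) -> float:
    """Sum over all clusters of squared distances from each point to its center."""
    total = 0.0
    for points in clusters:
        center = centroid(points)
        for x in points:
            total += sum((xi - ci) ** 2 for xi, ci in zip(x, center))
    return total


# Two clusters (k = 2): the first has center (1, 0), the second has a single point.
example = [[(0.0, 0.0), (2.0, 0.0)], [(5.0, 5.0)]]
print(sse(example))
```

In the example each point of the first cluster is at squared distance 1 from the center (1, 0), and the single-point cluster contributes nothing.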

Device schemes 5 and 6, on the basis of device schemes 1 and 2 respectively, recognize the cigarette material information in the paper document or picture of tobacco materials by OCR.

The beneficial effects of the invention are as follows.

After recognizing the cigarette material information, the multi-process matching method and device first run a fast lookup against the cigarette material licensing standard library and match the information against its entries. If that match fails, the information is then matched against the constructed historical experience library, which provides intelligent matching and prediction for the text information of substances used in cigarette materials. This addresses the heavy workload and inaccuracy of manually auditing permitted substances in the prior art, improves the efficiency of cigarette material reviewers, and reduces the error rate of manual review.

Moreover, the cigarette material licensing standard database cannot necessarily cover every chemical formula, English name, common name and so on of every substance, so a single matching pass often fails to produce a result. A self-learning historical experience library is therefore constructed: when matching against the licensing standard database fails, a second match is performed against the historical experience library, which improves matching accuracy.

Description of the drawings

Fig. 1 is a schematic diagram of the structure of the main components of the invention;

Fig. 2 is a schematic diagram of the network connections between the main components of the invention;

Fig. 3 is a flow chart of the invention;

Fig. 4 is the paper substance composition sheet provided by the supplier;

Fig. 5 is a schematic diagram of the cigarette material licensing standard database;

Fig. 6 is a schematic diagram of the historical experience library;

Fig. 7 shows the matching result.

Detailed description

To make the purpose, technical solutions, and advantages of the invention clearer, the invention is described in further detail below with reference to the drawings and an embodiment. The specific embodiment described here only explains the invention and does not limit it.

As shown in Fig. 1, the invention mainly comprises a handheld device and a server side. The handheld device includes a handheld scanning module, a first data transceiver module, and a terminal display module; the server side includes an OCR recognition module, a second data transceiver module, an intelligent matching module, a cigarette material licensing standard database module, and a matching experience module.

Specifically:

The handheld scanning module scans the chemical text information, picture information, or combined text-and-picture information of substances used in cigarette materials, and is also used to confirm the matching result information shown on the terminal.

The first data transceiver module sends the scanned picture data from the handheld device to the server side for intelligent matching, and receives the matching result that is sent back.

The second data transceiver module receives the picture data sent from the handheld device and sends the matching result information to the handheld device.

The OCR recognition module scans and recognizes the picture data sent by the first data transceiver module and identifies the specific cigarette material information in the picture.

The cigarette material licensing standard database module stores the standard information on substances permitted in cigarette materials.

The historical experience database module stores matching experience data and historical data.

The terminal display module displays the matching result information, sorted by matching similarity.

Fig. 2 is a schematic diagram of the network connections between the main components of the invention, which mainly comprise a smart handheld device 1, a wireless router 2, an application server 3, and a database server 4.

Specifically:

The smart handheld device 1 scans pictures of the chemical text information of substances used in cigarette materials, sends the data through its data transceiver module and the wireless router 2 to the database server 4, and is also used to confirm matching results.

The wireless router 2 carries the data that the smart handheld device 1 sends to the database server 4 and the comparison and matching results that the application server 3 sends to the smart handheld device 1.

The application server 3 performs OCR on the picture data of the chemical text information of substances used in cigarette materials stored in the database server 4, compares the recognized text with the permitted-substance standard information and the matching historical experience database information stored in the database server 4, and sends the comparison result through the wireless router 2 to the smart handheld device 1.

The database server 4 stores the scanned pictures of the chemical text information of substances used in cigarette materials and the data sent by the smart handheld device 1, the permitted-substance standard information, and the matching experience data and historical data.
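The patent does not specify a wire format for the exchange between the handheld device and the servers. As a rough sketch of what the two messages might carry, the following uses JSON with a base64-encoded picture; the names `ScanRequest`, `Candidate`, `device_id`, and the encoding choices are hypothetical.

```python
import base64
import json
from dataclasses import asdict, dataclass
from typing import List


@dataclass
class ScanRequest:
    """Hypothetical upload message from the handheld device."""
    device_id: str
    image_b64: str  # scanned picture, base64-encoded so that it fits in JSON

    @classmethod
    def from_image(cls, device_id: str, image: bytes) -> "ScanRequest":
        return cls(device_id, base64.b64encode(image).decode("ascii"))

    def image_bytes(self) -> bytes:
        return base64.b64decode(self.image_b64)


@dataclass
class Candidate:
    """Hypothetical entry in the matching result returned to the terminal."""
    recognized_text: str
    permitted_name: str
    similarity: float


def encode_request(request: ScanRequest) -> str:
    return json.dumps(asdict(request))


def decode_request(payload: str) -> ScanRequest:
    return ScanRequest(**json.loads(payload))


def encode_response(candidates: List[Candidate]) -> str:
    # The terminal displays results by similarity, so send them highest first.
    ordered = sorted(candidates, key=lambda c: c.similarity, reverse=True)
    return json.dumps([asdict(c) for c in ordered], ensure_ascii=False)
```

Sorting on the server side keeps the terminal display module simple: it can render the list in the order received.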

As shown in Fig. 3, the main flow of the invention is as follows:

1) A handheld scanning device scans the tobacco material paper document line by line to capture pictures of the paper substance composition submitted by the material supplier (for example, calcium carbonate, CaCO3). Fig. 4 shows the paper substance composition sheet provided by the supplier in this embodiment.

2) The scanning device sends the scanned picture information to the server side wirelessly.

3) The server side uses the OCR feature recognition module to recognize the text or chemical symbols in the picture; in this example it recognizes the text "limestone" in the "limestone" picture.

4) The text "limestone" is checked against the cigarette material licensing standard database using a search technique with full pattern matching. If "limestone" appears on the permitted list, the result is fed back to the user terminal for display and the first match is complete. In this embodiment, however, the cigarette material licensing standard database is as shown in Fig. 5, "limestone" is not found on the permitted list, the first match fails, and a second match is required. The cigarette material licensing standard database stores the names of the substances permitted by the cigarette material licensing standard.

5) Fuzzy intelligent matching is carried out on the features of the text of the substance name "limestone", or on the feature set of the picture segment, and a predictive match is made in the historical experience library, which is shown in Fig. 6. The entry "limestone-calcium carbonate" is found in the historical experience library. The results are shown in Fig. 7: candidates are listed from the top of the table by computed similarity, with higher similarity nearer the top. This shows that "limestone" is within the permitted range for cigarette materials and corresponds to record No. 5 in the cigarette material licensing standard database. The result is displayed on the terminal, and the second matching process is complete.

6) Once the operator has confirmed the final matching result, it is automatically added to the historical experience library.
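Steps 5) and 6) can be illustrated with a small ranking sketch. The patent does not say how similarity is computed beyond its formula image, so this example swaps in the string ratio from Python's standard `difflib.SequenceMatcher`; the function names, the 0.6 threshold, and the sample library entries are hypothetical.

```python
from difflib import SequenceMatcher
from typing import Dict, List, Tuple


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1] (a stand-in for the patent's measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def rank_candidates(
    query: str, experience: Dict[str, str], threshold: float = 0.6
) -> List[Tuple[str, str, float]]:
    """Return (alias, permitted name, score) rows, highest score first."""
    scored = [
        (alias, permitted, similarity(query, alias))
        for alias, permitted in experience.items()
    ]
    kept = [row for row in scored if row[2] >= threshold]
    return sorted(kept, key=lambda row: row[2], reverse=True)


def confirm(experience: Dict[str, str], query: str, permitted: str) -> None:
    """Step 6: after operator confirmation, add the pair to the experience library."""
    experience[query] = permitted


# Hypothetical experience library: alias -> permitted substance name.
library = {
    "limestone": "calcium carbonate",
    "CaCO3": "calcium carbonate",
    "glycerol triacetate": "triacetin",
}
ranked = rank_candidates("limest0ne", library)  # OCR misread of "limestone"
if ranked:
    confirm(library, "limest0ne", ranked[0][1])
```

A fuzzy score of this kind lets an OCR misreading such as "limest0ne" still reach the "limestone" entry, and the confirmation step means the next scan of the same misreading is resolved directly.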

In addition, the invention provides a multi-process matching device for identifying substances used in cigarette materials, including the following modules:

a module for scanning a paper document or picture of tobacco materials and recognizing the cigarette material information in it;

a module for matching the recognized cigarette material information against the cigarette material licensing standard database and displaying the result if the match succeeds; the cigarette material licensing standard database stores information on the substances permitted for use in cigarette materials;

a module for matching the cigarette material information against the historical experience library if the match fails and displaying the result if that match succeeds; the historical experience library stores information on substances used in cigarette materials that were previously matched successfully or unsuccessfully.

The device above is in effect a computer solution based on the corresponding method flow of the invention, that is, a software architecture, and each of the modules above is a process or program corresponding to a step of the method flow. Because the method has been described clearly and completely above, the device is not described in further detail.

Claims (8)

1. A multi-process matching method for identifying substances used in tobacco materials, characterized by comprising the following steps:
1) scanning and recognizing the tobacco material information in a paper document or picture of the tobacco material;
2) matching the recognized tobacco material information against a tobacco material permission standard database, and if the matching is successful, displaying the result; wherein the tobacco material permission standard database stores information on the substances permitted for use in tobacco materials;
3) if the matching is unsuccessful, matching the tobacco material information against a historical experience library, and if the matching is successful, displaying the result; wherein the historical experience library stores information on substances used in tobacco materials that were previously matched successfully or unsuccessfully;
wherein successful matching means that the tobacco material information is determined to be stored in the tobacco material permission standard database or in the historical experience library;
and wherein the tobacco material permission standard database includes calcium carbonate, and the historical experience library includes a unique mapping between limestone and calcium carbonate and a unique mapping between CaCO3 and calcium carbonate.
2. The method according to claim 1, further comprising the step of storing the result of matching against the historical experience library in the historical experience library.
3. The multi-process matching method for identifying substances used in tobacco materials according to claim 1 or 2, wherein, when the tobacco material information is matched against the historical experience library, the feature set of the tobacco material information is measured with the formula
Figure FDA0002629659040000011 (formula image)
and the results are displayed in descending order of matching similarity; wherein C_i is the center of a feature set cluster, x is a distribution point of the cluster, k is the number of feature data sets, and E is the optimal value of the result.
4. The multi-process matching method for identifying substances used in tobacco materials according to claim 1 or 2, wherein the tobacco material information in the paper document or picture of the tobacco material is recognized by OCR.
5. A multi-process matching device for identifying substances used in tobacco materials, characterized by comprising the following modules:
a module for scanning and recognizing the tobacco material information in a paper document or picture of the tobacco material;
a module for matching the recognized tobacco material information against a tobacco material permission standard database and displaying the result if the matching is successful; wherein the tobacco material permission standard database stores information on the substances permitted for use in tobacco materials;
a module for matching the tobacco material information against a historical experience library if the matching is unsuccessful and displaying the result if the matching is successful; wherein the historical experience library stores information on substances used in tobacco materials that were previously matched successfully or unsuccessfully;
wherein successful matching means that the tobacco material information is determined to be stored in the tobacco material permission standard database or in the historical experience library;
and wherein the tobacco material permission standard database includes calcium carbonate, and the historical experience library includes a unique mapping between limestone and calcium carbonate and a unique mapping between CaCO3 and calcium carbonate.
6. The multi-process matching device for identifying substances used in tobacco materials according to claim 5, further comprising a module for storing the result of matching against the historical experience library in the historical experience library.
7. The multi-process matching device for identifying substances used in tobacco materials according to claim 5 or 6, wherein, when the tobacco material information is matched against the historical experience library, the feature set of the tobacco material information is measured with the formula
Figure FDA0002629659040000021 (formula image)
and the results are displayed in descending order of matching similarity; wherein C_i is the center of a feature set cluster, x is a distribution point of the cluster, k is the number of feature data sets, and E is the optimal value of the result.
8. The multi-process matching device for identifying substances used in tobacco materials according to claim 5 or 6, wherein the tobacco material information in the paper document or picture of the tobacco material is recognized by OCR.
CN201710042035.5A 2017-01-20 2017-01-20 Method and device for identifying substances used by multi-process matched tobacco materials Active CN106709531B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710042035.5A CN106709531B (en) 2017-01-20 2017-01-20 Method and device for identifying substances used by multi-process matched tobacco materials

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710042035.5A CN106709531B (en) 2017-01-20 2017-01-20 Method and device for identifying substances used by multi-process matched tobacco materials

Publications (2)

Publication Number Publication Date
CN106709531A (en) 2017-05-24
CN106709531B (en) 2020-10-13

Family

ID=58910017

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710042035.5A Active CN106709531B (en) 2017-01-20 2017-01-20 Method and device for identifying substances used by multi-process matched tobacco materials

Country Status (1)

Country Link
CN (1) CN106709531B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107860868B (en) * 2017-11-03 2020-09-18 福建中烟工业有限责任公司 Tobacco matching method and system
CN111143409A (en) * 2019-12-13 2020-05-12 中国航空工业集团公司西安飞机设计研究所 Aluminum alloy material design verification method for airworthiness certification
CN113537191B (en) * 2021-06-30 2023-12-29 福建迦百农信息技术有限公司 Cigarette case spray code extraction and identification method
CN116882936B (en) * 2023-07-26 2025-02-18 企程通(北京)咨询管理有限公司 Expense management method and system based on expense card

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN88100316A (en) * 1987-01-20 1988-08-10 R·J·雷诺兹烟草公司 computer integrated manufacturing system
CN104133839A (en) * 2014-06-24 2014-11-05 国家电网公司 Data processing method and system with intelligent detection function
CN104133841A (en) * 2014-06-24 2014-11-05 国家电网公司 Data processing method and data processing system with system detection and image identification functions

Also Published As

Publication number Publication date
CN106709531A (en) 2017-05-24

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant