CN112732703B - Metadata processing method, metadata processing apparatus, and readable storage medium - Google Patents
Metadata processing method, metadata processing apparatus, and readable storage medium Download PDFInfo
- Publication number
- CN112732703B CN112732703B CN202110354795.6A CN202110354795A CN112732703B CN 112732703 B CN112732703 B CN 112732703B CN 202110354795 A CN202110354795 A CN 202110354795A CN 112732703 B CN112732703 B CN 112732703B
- Authority
- CN
- China
- Prior art keywords
- metadata
- enterprise
- standard
- template
- item
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2457—Query processing with adaptation to user needs
- G06F16/24573—Query processing with adaptation to user needs using data annotations, e.g. user-defined metadata
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/64—Protecting data integrity, e.g. using checksums, certificates or signatures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/04—Manufacturing
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/30—Computing systems specially adapted for manufacturing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Computer Security & Cryptography (AREA)
- Business, Economics & Management (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Tourism & Hospitality (AREA)
- Strategic Management (AREA)
- Library & Information Science (AREA)
- Computational Linguistics (AREA)
- Primary Health Care (AREA)
- General Business, Economics & Management (AREA)
- Economics (AREA)
- Manufacturing & Machinery (AREA)
- Bioethics (AREA)
- Computer Hardware Design (AREA)
- Software Systems (AREA)
- Quality & Reliability (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
技术领域technical field
本申请涉及工业互联网的数据处理领域,尤其涉及一种元数据处理方法、元数据处理装置及可读存储介质。The present application relates to the data processing field of the Industrial Internet, and in particular, to a metadata processing method, a metadata processing device, and a readable storage medium.
背景技术Background technique
工业互联网是新一代信息通信技术与工业经济融合的全新工业生态、关键基础设施的新型应用模式,通过人、机、物的全面互联,实现全要素、全产业链、价值链的全面连接,推动企业智能化发展。The Industrial Internet is a new application model of a new industrial ecology and key infrastructure that integrates a new generation of information and communication technology with the industrial economy. Intelligent development of enterprises.
用户依托工业互联网标识解析系统来访问保存机器、原材料、零部件及产品等物件相关信息的服务器,并通过工业互联网标识实现异主、异地、异构信息的智能关联,为信息共享以及产品全生命周期管理提供支撑。而工业互联网数据因为企业所处行业、类型、规模、产品等差异,导致数据千差万别,目前很多工业企业都有自己的元数据模板,格式不统一,互通性较差,容易形成信息孤岛,信息没有得到有效利用。Users rely on the industrial Internet identification analysis system to access the server that saves information about objects such as machines, raw materials, parts, and products, and realize the intelligent association of different owners, different places, and heterogeneous information through the industrial Internet identification, for information sharing and product life. Cycle management provides support. The industrial Internet data varies greatly due to the differences in the industry, type, scale, and product of the enterprises. At present, many industrial enterprises have their own metadata templates. The format is not uniform and the interoperability is poor. It is easy to form information islands, and there is no information. be used effectively.
发明内容SUMMARY OF THE INVENTION
本申请实施例中提供了一种元数据处理方法、元数据处理装置及可读存储介质,用于解决工业互联网系统中元数据模板格式不统一导致数据互通性差的问题。Embodiments of the present application provide a metadata processing method, a metadata processing device, and a readable storage medium, which are used to solve the problem of poor data interoperability caused by inconsistent metadata template formats in an industrial Internet system.
本申请第一方面实施例提供一种元数据处理方法,应用于元数据系统,包括:The embodiment of the first aspect of the present application provides a metadata processing method, which is applied to a metadata system, including:
接收企业上传的企业元数据模板,其中,所述企业元数据模板中包括所述企业配置的至少一个企业元数据项;receiving an enterprise metadata template uploaded by an enterprise, wherein the enterprise metadata template includes at least one enterprise metadata item configured by the enterprise;
将所述企业元数据模板中的每个企业元数据项与第一标准元数据模板中的标准元数据项进行相似度匹配,获得匹配结果,其中,所述匹配结果包括匹配成功的第一企业元数据项和匹配失败的第二企业元数据项;Perform similarity matching between each enterprise metadata item in the enterprise metadata template and the standard metadata item in the first standard metadata template to obtain a matching result, wherein the matching result includes the first enterprise that has been successfully matched The metadata item and the second enterprise metadata item that failed to match;
基于所述匹配结果,确定反馈信息,其中,针对所述第一企业元数据项,所述反馈信息中包括与所述第一企业元数据项匹配的标准数据项的标识和描述信息,针对所述第二企业元数据项,将所述第二企业元数据项添加至所述第一标准元数据模板,为所述第二企业元数据项配置新标识和新描述信息,所述反馈信息中包括所述新标识和新描述信息;Based on the matching result, feedback information is determined, wherein, for the first enterprise metadata item, the feedback information includes identification and description information of a standard data item matching the first enterprise metadata item, and for the first enterprise metadata item, the identification and description information are included in the feedback information. the second enterprise metadata item, add the second enterprise metadata item to the first standard metadata template, configure a new identifier and new description information for the second enterprise metadata item, and the feedback information Include the new identification and new description information;
将所述反馈信息发送至所述企业,以使得所述企业基于所述反馈信息建立本地数据与所述第一标准元数据模板中各标准数据项间的映射关系。The feedback information is sent to the enterprise, so that the enterprise establishes a mapping relationship between local data and each standard data item in the first standard metadata template based on the feedback information.
本申请第二方面实施例提供一种元数据处理装置,应用于元数据系统,包括:The embodiment of the second aspect of the present application provides a metadata processing apparatus, which is applied to a metadata system, including:
接收模块,用于接收企业上传的企业元数据模板,其中,所述企业元数据模板中包括所述企业配置的至少一个企业元数据项;a receiving module, configured to receive an enterprise metadata template uploaded by an enterprise, wherein the enterprise metadata template includes at least one enterprise metadata item configured by the enterprise;
匹配模块,用于将所述企业元数据模板中的每个企业元数据项与第一标准元数据模板中的标准元数据项进行相似度匹配,获得匹配结果,其中,所述匹配结果包括匹配成功的第一企业元数据项和匹配失败的第二企业元数据项;A matching module, configured to perform similarity matching between each enterprise metadata item in the enterprise metadata template and the standard metadata item in the first standard metadata template to obtain a matching result, wherein the matching result includes matching A successful first enterprise metadata item and a failed match for a second enterprise metadata item;
确定模块,用于基于所述匹配结果,确定反馈信息,其中,针对所述第一企业元数据项,所述反馈信息中包括与所述第一企业元数据项匹配的标准数据项的标识和描述信息,针对所述第二企业元数据项,将所述第二企业元数据项添加至所述第一标准元数据模板,为所述第二企业元数据项配置新标识和新描述信息,所述反馈信息中包括所述新标识和新描述信息;A determination module, configured to determine feedback information based on the matching result, wherein, for the first enterprise metadata item, the feedback information includes an identifier of a standard data item that matches the first enterprise metadata item and description information, for the second enterprise metadata item, adding the second enterprise metadata item to the first standard metadata template, and configuring a new identifier and new description information for the second enterprise metadata item, The feedback information includes the new identification and new description information;
反馈模块,用于将所述反馈信息发送至所述企业,以使得所述企业基于所述反馈信息建立本地数据与所述第一标准元数据模板中各标准数据项间的映射关系。A feedback module, configured to send the feedback information to the enterprise, so that the enterprise establishes a mapping relationship between local data and each standard data item in the first standard metadata template based on the feedback information.
本申请第三方面实施例提供一种元数据处理装置,应用于元数据系统,所述元数据处理装置包括处理器,所述处理器用于执行存储器中存储的计算机程序时实现如前述第一方面所述的元数据处理方法的步骤。Embodiments of a third aspect of the present application provide a metadata processing apparatus, which is applied to a metadata system. The metadata processing apparatus includes a processor, and the processor is configured to implement the foregoing first aspect when executing a computer program stored in a memory. The steps of the metadata processing method.
本申请第四方面实施例提供了一种可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现如前述第一方面实施例中所述的元数据处理方法的步骤。Embodiments of the fourth aspect of the present application provide a readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the steps of the metadata processing method described in the foregoing first aspect embodiment .
本申请实施例提供的技术方案中,元数据系统中维护了不同类型的元数据模板,在接收到企业上传的企业元数据模板情况下,将企业元数据模板中的每个企业元数据项与第一标准元数据模板中的标准元数据项进行相似度匹配,获得匹配结果,针对匹配成功的企业元数据项,表示系统中的第一标准元数据模板中已经收录了该元数据项,将收录的标准元数据项对应的标记和描述信息反馈至企业。而针对匹配失败的企业元数据项,为其分配新的标识和定义其新描述信息,并反馈给企业。这样,企业就可以通过反馈信息明确自己的元数据模板中的各企业元数据项与第一标准元数据模板的各标准元数据项间的差异,以及相关的描述定义,将自己的企业元数据模板按系统标准的元数据模板进行修改,进而针对本地数据建立起本地数据与所述第一标准元数据模板中各标准数据项间的映射关系,以确保能够准确进行数据传输,数据能有效被利用。In the technical solutions provided by the embodiments of the present application, different types of metadata templates are maintained in the metadata system, and when an enterprise metadata template uploaded by an enterprise is received, each enterprise metadata item in the enterprise metadata template is associated with The standard metadata items in the first standard metadata template are subjected to similarity matching, and a matching result is obtained. For the enterprise metadata items that are successfully matched, it means that the metadata item has been included in the first standard metadata template in the system. The tags and description information corresponding to the included standard metadata items are fed back to the enterprise. For the enterprise metadata items that fail to match, new identifiers and new description information are assigned to them, and fed back to the enterprise. In this way, the enterprise can clarify the differences between each enterprise metadata item in its own metadata template and each standard metadata item in the first standard metadata template through the feedback information, as well as the relevant description definitions, and convert its own enterprise metadata The template is modified according to the metadata template of the system standard, and then the mapping relationship between the local data and each standard data item in the first standard metadata template is established for the local data, so as to ensure that the data can be transmitted accurately and the data can be effectively stored. use.
附图说明Description of drawings
此处所说明的附图用来提供对本申请的进一步理解,构成本申请的一部分,本申请的示意性实施例及其说明用于解释本申请,并不构成对本申请的不当限定。在附图中:The drawings described herein are used to provide further understanding of the present application and constitute a part of the present application. The schematic embodiments and descriptions of the present application are used to explain the present application and do not constitute an improper limitation of the present application. In the attached image:
图1为本申请第一实施例提供的一种元数据处理方法的流程图;FIG. 1 is a flowchart of a method for processing metadata according to a first embodiment of the present application;
图2为本申请第一实施例提供的工业互联网标识解析系统的示意图;FIG. 2 is a schematic diagram of an industrial Internet identification resolution system provided by the first embodiment of the present application;
图3为本申请第一实施例提供的标准元数据模板数据项集合的示意图;3 is a schematic diagram of a set of standard metadata template data items provided by the first embodiment of the present application;
图4为本申请第一实施例提供的元数据系统确定企业进行企业模板上传匹配的流程图;4 is a flowchart of the metadata system provided by the first embodiment of the present application for determining an enterprise to upload and match an enterprise template;
图5为本申请第一实施例提供的对企业元数据模板中的企业元数据项和第一标准元数据模板中的标准元数据项进行相似度匹配的流程图;5 is a flowchart of similarity matching between enterprise metadata items in the enterprise metadata template and standard metadata items in the first standard metadata template provided by the first embodiment of the present application;
图6为本申请第一实施例提供的工业互联网标识解析系统进行标识查询时的流程图;FIG. 6 is a flowchart of the identification query performed by the industrial Internet identification analysis system provided by the first embodiment of the present application;
图7为本申请第一实施例提供的工业互联网标识解析系统构成区块链网络进行标准元数据模板更新的示意图;FIG. 7 is a schematic diagram of updating a standard metadata template by forming a blockchain network by an industrial Internet identification resolution system provided by the first embodiment of the present application;
图8为本申请第二实施例提供的一种元数据处理装置的结构示意图;FIG. 8 is a schematic structural diagram of a metadata processing apparatus according to a second embodiment of the present application;
图9为本申请第三实施例提供的另一种元数据处理装置的结构示意图。FIG. 9 is a schematic structural diagram of another metadata processing apparatus provided by a third embodiment of the present application.
具体实施方式Detailed ways
下面通过附图以及具体实施例对本发明技术方案做详细的说明,应当理解本申请实施例以及实施例中的具体特征是对本申请技术方案的详细的说明,而不是对本申请技术方案的限定,在不冲突的情况下,本申请实施例以及实施例中的技术特征可以相互组合。The technical solutions of the present invention will be described in detail below with reference to the accompanying drawings and specific embodiments. If there is no conflict, the embodiments of the present application and the technical features in the embodiments may be combined with each other.
本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。The term "and/or" in this article is only an association relationship to describe the associated objects, indicating that there can be three kinds of relationships, for example, A and/or B, it can mean that A exists alone, A and B exist at the same time, and A and B exist independently B these three cases. In addition, the character "/" in this document generally indicates that the related objects are an "or" relationship.
实施例Example
请参考图1,本发明第一实施例提供一种元数据处理方法,应用于元数据系统,该方法包括如下步骤:Referring to FIG. 1 , a first embodiment of the present invention provides a method for processing metadata, which is applied to a metadata system. The method includes the following steps:
S101:接收企业上传的企业元数据模板,其中,所述企业元数据模板中包括所述企业配置的至少一个企业元数据项;S101: Receive an enterprise metadata template uploaded by an enterprise, wherein the enterprise metadata template includes at least one enterprise metadata item configured by the enterprise;
S102:将所述企业元数据模板中的每个企业元数据项与第一标准元数据模板中的标准元数据项进行相似度匹配,获得匹配结果,其中,所述匹配结果包括匹配成功的第一企业元数据项和匹配失败的第二企业元数据项;S102: Perform similarity matching between each enterprise metadata item in the enterprise metadata template and the standard metadata item in the first standard metadata template to obtain a matching result, wherein the matching result includes the first matching successfully an enterprise metadata item and a second enterprise metadata item that failed to match;
S103:基于所述匹配结果,确定反馈信息,其中,针对所述第一企业元数据项,所述反馈信息中包括与所述第一企业元数据项匹配的标准数据项的标识和描述信息,针对所述第二企业元数据项,将所述第二企业元数据项添加至所述第一标准元数据模板,为所述第二企业元数据项配置新标识和新描述信息,所述反馈信息中包括所述新标识和新描述信息;S103: Determine feedback information based on the matching result, wherein, for the first enterprise metadata item, the feedback information includes identification and description information of a standard data item matching the first enterprise metadata item, For the second enterprise metadata item, add the second enterprise metadata item to the first standard metadata template, configure a new identification and new description information for the second enterprise metadata item, and the feedback The information includes the new identification and new description information;
S104:将所述反馈信息发送至所述企业,以使得所述企业基于所述反馈信息建立本地数据与所述第一标准元数据模板中各标准数据项间的映射关系。S104: Send the feedback information to the enterprise, so that the enterprise establishes a mapping relationship between local data and each standard data item in the first standard metadata template based on the feedback information.
具体的,本实施例提供的元数据处理方法,主要应用于元数据系统,该元数据系统应用于工业互联网标识解析系统中的国际根节点、递归节点、国家顶级节点、二级节点和企业节点中的任意一个或多个组合,工业互联网标识解析系统的递归节点、国家顶级节点、二级节点和企业节点均为区块链节点。Specifically, the metadata processing method provided in this embodiment is mainly applied to a metadata system, and the metadata system is applied to the international root node, recursive node, national top-level node, secondary node and enterprise node in the industrial Internet identification resolution system Any one or more combinations of the above, the recursive nodes, national top-level nodes, secondary nodes and enterprise nodes of the industrial Internet identification resolution system are all blockchain nodes.
具体的,请参见图2,工业互联网标识解析系统包括国际根节点、国家顶级节点、二级节点、企业节点以及递归节点。其中,国际根节点是标识体系的最高层级服务节点,面向全球范围的根区数据管理和根解析服务。国家顶级节点是指面向一个国家的提供标识编码注册和标识解析服务的节点。二级节点是指一个行业或者区域内部的标识解析公共服务节点,能够面向行业或区域提供标识编码注册和标识解析服务,以及完成相关的标识业务管理、标识应用对接等。二级节点包含一个或多个服务器。企业节点是指一个企业内部的标识解析服务节点,能够面向特定企业提供标识编码注册和标识解析服务。企业节点也包括一个或多个标识解析服务器,企业数据存储在企业节点的内部。递归节点是指标识解析体系的关键性入口设施,负责路由功能,将解析请求路由至对应的节点,并能够通过缓存等技术手段提升整体服务性能。节点部署引入区块链架构,企业节点、二级节点、国家顶级节点都在区块链中,标识数据服务是在区块链节点上,也即,每个节点均能提供相应的标识解析服务。Specifically, please refer to Fig. 2, the industrial Internet identification resolution system includes an international root node, a national top-level node, a second-level node, an enterprise node, and a recursive node. Among them, the international root node is the highest-level service node of the identification system, which is oriented to the global root zone data management and root resolution services. A country's top-level node refers to a node that provides identification code registration and identification resolution services for a country. A secondary node refers to a public service node for identification resolution within an industry or region, which can provide identification code registration and identification resolution services for the industry or region, and complete related identification business management, identification application docking, etc. Secondary nodes contain one or more servers. An enterprise node refers to an identification resolution service node within an enterprise, which can provide identification code registration and identification resolution services for a specific enterprise. The enterprise node also includes one or more identity resolution servers, and enterprise data is stored inside the enterprise node. The recursive node refers to the key entry facility that identifies the parsing system, is responsible for the routing function, routes the parsing request to the corresponding node, and can improve the overall service performance through technical means such as caching. Node deployment introduces blockchain architecture. Enterprise nodes, secondary nodes, and national top-level nodes are all in the blockchain, and the identification data service is on the blockchain nodes, that is, each node can provide corresponding identification resolution services. .
各节点的承建方负责本节点标识域里的标识的注册和所注册的标识的信息的录入。例如,二级标识解析节点的承建方负责关于行业的标识的注册和所注册的标识的信息的录入,这样,已注册的标识的信息就存储于二级标识解析节点所对应的标识解析服务器中。The contractor of each node is responsible for the registration of the logo in the logo domain of the node and the entry of the registered logo information. For example, the contractor of the secondary identification resolution node is responsible for the registration of the industry identification and the entry of the registered identification information. In this way, the registered identification information is stored in the identification resolution server corresponding to the secondary identification resolution node. .
工业互联网异主、异地、异构数据的交换中会涉及元数据模板。元数据模板中包含了各元数据项,元数据算是一种电子式目录,为了达到编制目录的目的,必须在描述并收藏数据的内容或特色,进而达成协助数据检索的目的。例如:有一条学生信息记录,其中包括字段姓名(name)、年龄(age)、性别(male)、班级(class)等,那么name、age、male、class就是元数据。通过它们的描述,一条关于学生信息的数据记录就产生。工业互联网涉及的企业众多,各自企业均有自己的元模板,通用性较差,没有统一规范化,这样,会导致上传至系统的数据存在重复,或者因元数据项名称定义不一致(比如有的企业命名为“标识”,有的企业命名为“ID”),系统向企业请求数据时,因元数据模板不一致,导致数据无法有效获取。有必要进行清洗统一,便于后续元数据模板上传、主数据上传以及分析监管工作。Metadata templates are involved in the exchange of heterogeneous, heterogeneous and heterogeneous data in the Industrial Internet. Metadata template contains various metadata items. Metadata is a kind of electronic catalog. In order to achieve the purpose of cataloguing, it is necessary to describe and collect the content or characteristics of the data, so as to achieve the purpose of assisting data retrieval. For example: There is a student information record, which includes the fields name (name), age (age), gender (male), class (class), etc., then name, age, male, and class are metadata. From their descriptions, a data record of student information is generated. There are many companies involved in the Industrial Internet, and each company has its own meta-template, which is less versatile and has no unified standardization. This will lead to duplication of data uploaded to the system, or inconsistent definitions of metadata item names (for example, some companies Named "identity", and some companies named it "ID"), when the system requests data from the enterprise, the data cannot be obtained effectively due to inconsistent metadata templates. It is necessary to clean and unify to facilitate subsequent metadata template upload, master data upload, and analysis and supervision.
进而,企业会上传自己的企业元数据模板至元数据系统,元数据系统通过步骤S101接收到该企业元数据模板后,会确定与之对应的第一标准元数据模板。具体的,可通过如下步骤实现:Further, the enterprise will upload its own enterprise metadata template to the metadata system, and after the metadata system receives the enterprise metadata template through step S101, it will determine the corresponding first standard metadata template. Specifically, it can be achieved through the following steps:
确定所述企业元数据模板所属的目标类型,其中,所述元数据系统中包括行业类型和综合类型对应的标准元数据模板;determining the target type to which the enterprise metadata template belongs, wherein the metadata system includes standard metadata templates corresponding to industry types and comprehensive types;
确定与所述目标类型对应的第一标准元数据模板。A first standard metadata template corresponding to the target type is determined.
具体的,在本实施例中,工业互联网标识解析中,涉及通过标识解析获得的在工业互联网生产和供应链中产生的数据。分为属性数据和事件数据。属性数据是指描述对象固有的、区别于其他实体的属性特征的数据,事件数据是指用于描述工业产品在生产、流通、使用过程中记录的由位置、状态、所有权、管理权等变化所产生的数据。元数据系统维护通用统一的元数据模板,涵盖范围广、行业全,规定了工业互联网标识解析核心元数据的描述方法、元数据模型、标识解析核心元数据描述及核心元数据扩展原则和方法。Specifically, in this embodiment, the industrial Internet identification analysis involves data generated in the industrial Internet production and supply chain obtained through identification analysis. Divided into attribute data and event data. Attribute data refers to the data that describe the attribute characteristics inherent in the object and distinguish it from other entities. Event data refers to the data used to describe the changes of location, state, ownership, management rights, etc. recorded during the production, circulation, and use of industrial products. generated data. The metadata system maintains a universal and unified metadata template, covering a wide range and all industries, and specifies the description method, metadata model, core metadata description and core metadata extension principles and methods of identification resolution for the Industrial Internet.
举例来说,系统维护的典型的标准元数据模板如图3所示。由图3可见,系统维护了的标准元数据模板包括不同类型的数据项,包括事件类型和属性类型,属性类型又包括了人员类型、产品类型、机器类型、物料类型、方法类型和环境类型等,不同类型下还设置了更细类型的数据项,比如:产品类型下设置有主体数据项、位置数据项和对象数据项,主体数据项中包括了生产企业、流通企业、使用单位/机构/企业,对应的生产企业项中又包括了生产企业名称、生产企业统一代码、生产企业注册地址、生产企业联系信息等。以此类推,其他类型元数据项以及下属元数据项也可通过这样的形式展开配置,在此,本实施例不做赘述。当然,在具体实施过程中还可根据实际需要配置更多类型的数据项,在此,本实施例不做限制。需要说明的是,图3仅是一种标准元数据模板的示意,并不构成对标准元数据模板的限定,在具体实施过程中,标准元数据模板还可以包括比图示更多或更少的元数据项。For example, a typical standard metadata template maintained by the system is shown in Figure 3. As can be seen from Figure 3, the standard metadata templates maintained by the system include different types of data items, including event types and attribute types, and attribute types include personnel types, product types, machine types, material types, method types, and environment types. , and more detailed types of data items are set under different types. For example, the main data item, location data item and object data item are set under the product type. Enterprise, the corresponding production enterprise item also includes the name of the production enterprise, the unified code of the production enterprise, the registered address of the production enterprise, and the contact information of the production enterprise. By analogy, other types of metadata items and subordinate metadata items can also be deployed and configured in this form, and details are not described here in this embodiment. Of course, in the specific implementation process, more types of data items may also be configured according to actual needs, which are not limited in this embodiment. It should be noted that FIG. 3 is only a schematic diagram of a standard metadata template, and does not constitute a limitation on the standard metadata template. In the specific implementation process, the standard metadata template may also include more or less than the illustration. metadata item.
进一步,图3中展示了不同类型的数据项有对应的标识,并且标准元数据模板中的标准元数据项也配置有唯一的标识和对应的描述信息,由图3所示,标准元数据模板中包括对象数据项的标识为2.2。对应的描述信息中包含了该元数据项的相关定义,比如:数据类型、数据格式、权限信息等,在具体实施过程中,描述信息可根据实际需要进行配置,在此,本实施例不做限制。Further, Figure 3 shows that different types of data items have corresponding identifiers, and the standard metadata items in the standard metadata template are also configured with unique identifiers and corresponding description information. As shown in Figure 3, the standard metadata template The identification of the object data item included in it is 2.2. The corresponding description information contains the relevant definitions of the metadata item, such as: data type, data format, permission information, etc. During the specific implementation process, the description information can be configured according to actual needs, and this embodiment does not limit.
为了能够确定企业上传的企业元数据模板是否规范,系统会确定与之做相似度匹配的第一标准元数据模板。具体的,在本实施例中,如图2所示,各企业节点归属于对应的二级节点,二级节点可以是一个地区的节点,该地区的所有企业节点均属于该二级节点维护,各二级节点维护了不同类型的标准元数据模板,涉及行业型和综合型。行业型包括了针对进入国民经济分类的各行业对应的标准元数据模板,综合型包括了针对涉及多个行业的企业的标准元数据模板。In order to be able to determine whether the enterprise metadata template uploaded by the enterprise is standardized, the system will determine the first standard metadata template for similarity matching. Specifically, in this embodiment, as shown in FIG. 2 , each enterprise node belongs to the corresponding secondary node, the secondary node may be a node in a region, and all enterprise nodes in the region belong to the maintenance of the secondary node, Each secondary node maintains different types of standard metadata templates, involving industry type and comprehensive type. The industry type includes standard metadata templates for various industries that enter the national economic classification, and the comprehensive type includes standard metadata templates for enterprises involved in multiple industries.
进而,请参照图4,企业在上传元数据模板时,系统可判断该企业是否为二级节点企业,如果是,则从其所属的二级节点维护的标准元数据模板中确定出第一标准元数据模板。在该二级节点中,基于企业的类型,具体的,判断该企业是行业类型还是综合型企业,再分别按照这两种类型对应的方式确定对应的第一标准元数据模板。Further, please refer to FIG. 4, when an enterprise uploads a metadata template, the system can determine whether the enterprise is a secondary node enterprise, and if so, determine the first standard from the standard metadata template maintained by the secondary node to which it belongs. Metadata template. In the secondary node, based on the type of the enterprise, specifically, it is determined whether the enterprise is an industry type or a comprehensive enterprise, and then the corresponding first standard metadata template is determined according to the corresponding methods of the two types.
进一步,判断是行业类型还是综合型可通过但不限于如下方式:Further, to determine whether it is an industry type or a comprehensive type, the following methods can be used but not limited to:
第一种:企业主动选定方式。将所述企业的选定类型作为目标类型,确定与所述目标类型对应的第一标准元数据模板。The first: the enterprise chooses the method actively. Using the selected type of the enterprise as a target type, a first standard metadata template corresponding to the target type is determined.
具体的,在本实施例中,企业在上传自己的企业元数据模板时,可以选定类型,包括选定其所属的行业。如果企业选定单一的行业,如图4中的服装、机械、石油、汽车等其中任意一个,还需要判断元数据库中是否存有企业所选行业的标准元数据模板,如果存有,就将其作为第一标准元数据模板,完成企业元数据的上传与匹配。元数据库中还未存储该所选行业的标准元数据模板,可根据国民经济分类的中类,添加该企业上传的企业元数据模板作为行业标准元数据模板。Specifically, in this embodiment, when an enterprise uploads its own enterprise metadata template, it can select a type, including selecting an industry to which it belongs. If the enterprise selects a single industry, such as clothing, machinery, petroleum, automobile, etc. in Figure 4, it is necessary to judge whether there is a standard metadata template of the industry selected by the enterprise in the metadata database. It is used as the first standard metadata template to complete the uploading and matching of enterprise metadata. The standard metadata template of the selected industry has not been stored in the metadata database. According to the middle category of the national economic classification, the enterprise metadata template uploaded by the enterprise can be added as the industry standard metadata template.
如果企业选定的类型为综合型,需要判断是否有明确的多行业类别。在判断时,可通过企业选定的行业作为判断依据。举例来说,企业可以勾选多个行业进行企业元数据模板上传。如图4中的服装、机械、石油、汽车等其中任意两个及两个以上,则确定其有明确的多行业类别。则将对应的多个行业中每个行业的标准元数据模板整合起来作为第一标准元数据模板进行匹配。或者,将企业元数据模板分别与选择的各个行业的标准元数据模板进行匹配。If the type selected by the enterprise is a comprehensive type, it is necessary to judge whether there is a clear multi-industry category. In the judgment, the industry selected by the enterprise can be used as the basis for judgment. For example, an enterprise can select multiple industries to upload the enterprise metadata template. As shown in Figure 4, any two or more of clothing, machinery, petroleum, automobiles, etc., are determined to have clear multi-industry categories. Then, the standard metadata templates of each industry in the corresponding multiple industries are integrated as the first standard metadata template for matching. Alternatively, match the enterprise metadata template to the standard metadata template for each industry of your choice.
如果没有明确的多行业类别,则采用系统自动确定方式。基于所述企业的属性信息确定目标类型,确定与所述目标类型对应的第一标准元数据模板。If there is no clear multi-industry category, the system will automatically determine it. A target type is determined based on the attribute information of the enterprise, and a first standard metadata template corresponding to the target type is determined.
具体的,在本实施例中,企业在上传自己的企业元数据模板时,可借助企业注册元数据系统时企业填写的“企业名称”、“生产经营类型”等基本属性信息,进行网络检索和库内检索。可通过网络检索到该企业生产经营的相关信息,所属行业等。进而确定其关联的行业,将关联的行业对应的标准元数据模板整合形成第一标准元数据模板。或者,选取相似度最高的2个行业,分别随机抽取40%元数据,在剩余行业中,随机抽取20%元数据,组成形成第一标准元数据模板,进而完成企业元数据模板上传与匹配。在具体实施过程中,上述随机抽取的比例可根据实际需要进行配置,在此,本实施例不做限制。Specifically, in this embodiment, when an enterprise uploads its own enterprise metadata template, it can use the basic attribute information such as "enterprise name" and "production and operation type" that the enterprise fills in when the enterprise registers the metadata system to perform network retrieval and retrieval. Search within the library. The relevant information of the production and operation of the enterprise, the industry it belongs to, etc. can be retrieved through the network. Further, the associated industry is determined, and the standard metadata templates corresponding to the associated industries are integrated to form a first standard metadata template. Alternatively, select the two industries with the highest similarity, and randomly extract 40% of the metadata respectively. In the remaining industries, randomly select 20% of the metadata to form the first standard metadata template, and then complete the upload and matching of the enterprise metadata template. In a specific implementation process, the above-mentioned ratio of random extraction may be configured according to actual needs, which is not limited in this embodiment.
举例来说,钢铁企业A上传自己的企业元数据模板时,系统通过其企业名称或代码查询到钢铁企业A属于钢铁行业,进而,系统可直接根据钢铁行业的标准元数据模板来对企业元数据模板进行相似性匹配。也可将钢铁行业作为关联行业中的一个。或者,系统通过其企业名称或代码查询到钢铁企业A生产带钢,带钢通常是作为工业领域的相关生产物料,所以,系统将根据物料类的标准元数据模板来对企业元数据模板进行相似性匹配。也可将物料行业作为关联行业中的一个。在通过检索的方式得到多个关联行业基础上,就可以将关联的行业对应的标准元数据模板整合形成第一标准元数据模板。或者,选取相似度最高的2个行业,分别随机抽取40%元数据,在剩余行业中,随机抽取20%元数据,组成形成第一标准元数据模板,进而完成企业元数据模板上传与匹配。在具体实施过程中,上述随机抽取的比例可根据实际需要进行配置,在此,本实施例不做限制。For example, when iron and steel enterprise A uploads its own enterprise metadata template, the system finds out that iron and steel enterprise A belongs to the iron and steel industry through its enterprise name or code. Furthermore, the system can directly analyze the enterprise metadata according to the standard metadata template of the iron and steel industry. Template for similarity matching. The steel industry can also be considered as one of the associated industries. Or, the system finds that steel company A produces strip steel through its company name or code. Strip steel is usually used as a related production material in the industrial field. Therefore, the system will compare the enterprise metadata template according to the standard metadata template of the material class. Sexual match. You can also use the material industry as one of the associated industries. On the basis of obtaining multiple associated industries through retrieval, the standard metadata templates corresponding to the associated industries can be integrated to form a first standard metadata template. Alternatively, select the two industries with the highest similarity, and randomly extract 40% of the metadata respectively. In the remaining industries, randomly select 20% of the metadata to form the first standard metadata template, and then complete the upload and matching of the enterprise metadata template. In a specific implementation process, the above-mentioned ratio of random extraction may be configured according to actual needs, which is not limited in this embodiment.
当然,还可以直接基于标准元数据模板的自动匹配方式。具体的,确定所述企业元数据模板的文本信息与每个类型的标准元数据模板的文本信息间的相似性,基于所述相似性确定目标类型,确定与所述目标类型对应的第一标准元数据模板。Of course, it can also be directly based on the automatic matching method of standard metadata templates. Specifically, the similarity between the text information of the enterprise metadata template and the text information of each type of standard metadata template is determined, the target type is determined based on the similarity, and the first standard corresponding to the target type is determined Metadata template.
具体的,在本实施例中,系统维护了不同类型的标准元数据模板,各个标准元数据模板的标准元数据项均配置有描述信息,描述信息以文本方式设置,所以,可以整合得到每个标准元数据模板的文本信息。同理,针对企业上传的企业元数据模板,也可整合得到对应的文本信息。进而可通过语义分析的方式得到文本信息间的相似性,将相似度最高的标准元数据模板与企业元数据模板进行相似度匹配的第一标准元数据模板。或者,选取相似度最高的2个标准元数据模板分别随机抽取40%元数据,在剩余模板中,随机抽取20%元数据,组成形成第一标准元数据模板,进而完成企业元数据模板上传与匹配。在具体实施过程中,上述随机抽取的比例可根据实际需要进行配置,在此,本实施例不做限制。Specifically, in this embodiment, the system maintains different types of standard metadata templates, the standard metadata items of each standard metadata template are configured with description information, and the description information is set in the form of text. Therefore, each standard metadata template can be integrated to obtain Text information for standard metadata templates. Similarly, for the enterprise metadata template uploaded by the enterprise, the corresponding text information can also be integrated. Furthermore, the similarity between the text information can be obtained by means of semantic analysis, and the standard metadata template with the highest similarity is matched with the enterprise metadata template for the first standard metadata template. Alternatively, select the two standard metadata templates with the highest similarity to randomly extract 40% of the metadata, and randomly select 20% of the metadata from the remaining templates to form the first standard metadata template, and then complete the uploading and matching of the enterprise metadata template. match. In a specific implementation process, the above-mentioned ratio of random extraction may be configured according to actual needs, which is not limited in this embodiment.
进一步,如果上传企业元数据模板的企业不是二级节点企业,则企业在上传自己的企业元数据模板时,可借助企业注册元数据系统时企业填写的“企业名称”、“生产经营类型”等基本属性信息,进行网络检索和库内检索。可通过网络检索到该企业生产经营的相关信息,所属行业等。进而确定其关联的行业,将关联的行业对应的标准元数据模板整合形成第一标准元数据模板。或者,选取相似度最高的2个行业,分别随机抽取40%元数据,在剩余行业中,随机抽取20%元数据,组成形成第一标准元数据模板,进而完成企业元数据模板上传与匹配。在具体实施过程中,随机抽取的比例可根据实际需要进行配置,在此,本实施例不做限制。Further, if the enterprise uploading the enterprise metadata template is not a second-level node enterprise, when uploading its own enterprise metadata template, the enterprise can use the "enterprise name", "production and operation type" that the enterprise filled in when the enterprise registered the metadata system, etc. Basic attribute information for network retrieval and in-library retrieval. The relevant information of the production and operation of the enterprise, the industry it belongs to, etc. can be retrieved through the network. Further, the associated industry is determined, and the standard metadata templates corresponding to the associated industries are integrated to form a first standard metadata template. Alternatively, select the two industries with the highest similarity, and randomly extract 40% of the metadata respectively. In the remaining industries, randomly select 20% of the metadata to form the first standard metadata template, and then complete the upload and matching of the enterprise metadata template. In the specific implementation process, the ratio of random extraction can be configured according to actual needs, which is not limited in this embodiment.
进而,确定出第一标准元数据模板后,执行步骤S102,将所述企业元数据模板中的每个企业元数据项与第一标准元数据模板中的标准元数据项进行相似度匹配,获得匹配结果。具体的,请参考图5,步骤S102可通过如下步骤实现:Further, after the first standard metadata template is determined, step S102 is performed to perform similarity matching between each enterprise metadata item in the enterprise metadata template and the standard metadata item in the first standard metadata template to obtain match results. Specifically, please refer to FIG. 5 , step S102 can be implemented by the following steps:
基于语义分析算法确定每个所述企业元数据项的特征向量和每个所述标准元数据项的特征向量;determining a feature vector of each of the enterprise metadata items and a feature vector of each of the standard metadata items based on a semantic analysis algorithm;
基于每个所述企业元数据项的特征向量和每个所述标准元数据项的特征向量,确定每个所述企业元数据项与每个所述标准元数据项间的相似度;determining the similarity between each of the enterprise metadata items and each of the standard metadata items based on the feature vector of each of the enterprise metadata items and the feature vector of each of the standard metadata items;
如果企业元数据项与标准元数据项间的相似度满足阈值条件,所述匹配结果为匹配成功,如果企业元数据项与标准元数据项间的相似度不满足所述阈值条件,所述匹配结果为匹配失败。If the similarity between the enterprise metadata item and the standard metadata item satisfies the threshold condition, the matching result is successful; if the similarity between the enterprise metadata item and the standard metadata item does not meet the threshold condition, the matching The result is a match failure.
具体的,在本实施例中,针对企业上传的企业元数据模板中的每个企业元数据项的名称、描述信息等进行语义分析后形成特征向量。而针对第一标准元数据模板中的每个标准元数据项,系统中也配置了对应的标识、描述信息,描述信息中包括了该标准元数据项的相关说明和定义。同理,基于语义分析算法也可确定出每个标准元数据项的特征向量。这样,就可以计算得到每个企业元数据项与每个标准元数据项的特征向量间的相似性。Specifically, in this embodiment, a feature vector is formed after semantic analysis is performed on the name and description information of each enterprise metadata item in the enterprise metadata template uploaded by the enterprise. For each standard metadata item in the first standard metadata template, corresponding identification and description information are also configured in the system, and the description information includes relevant descriptions and definitions of the standard metadata item. Similarly, the feature vector of each standard metadata item can also be determined based on the semantic analysis algorithm. In this way, the similarity between the feature vectors of each enterprise metadata item and each standard metadata item can be calculated.
进而,根据每个企业元数据项与每个标准元数据项的特征向量间的相似性,如果相似度满足阈值条件,则确定匹配结果为匹配成功,将其作为匹配成功的第一企业元数据项。剩余的企业元数据项,确定匹配结果为匹配失败,将其作为匹配失败的第二企业元数据项。其中,阈值条件可以是大于预设阈值,预设阈值可根据实际需要设定为90%、95%等数值,在具体实施过程中,预设阈值可根据实际需要进行设定,在此,本实施例不做限制。Further, according to the similarity between the feature vectors of each enterprise metadata item and each standard metadata item, if the similarity satisfies the threshold condition, the matching result is determined to be successful, and it is used as the first enterprise metadata that is successfully matched. item. For the remaining enterprise metadata items, it is determined that the matching result is a matching failure, and it is regarded as the second enterprise metadata item whose matching fails. The threshold condition may be greater than the preset threshold, and the preset threshold may be set to 90%, 95% and other values according to actual needs. In the specific implementation process, the preset threshold may be set according to actual needs. The embodiment is not limited.
举例来说,假设A企业上传的企业元数据模板中包括:制造企业名称、制造企业统一代码、制造企业注册地址、制造企业联系信息和制造企业国别。For example, assume that the enterprise metadata template uploaded by Company A includes: the name of the manufacturing enterprise, the unified code of the manufacturing enterprise, the registered address of the manufacturing enterprise, the contact information of the manufacturing enterprise, and the country of the manufacturing enterprise.
元数据系统维护的与A企业上传的企业元数据模板对应的产品类的第一标准元数据模板包括:生产企业名称、生产企业统一代码、生产企业注册地址和生产企业联系信息。The first standard metadata template of the product category maintained by the metadata system and corresponding to the enterprise metadata template uploaded by Company A includes: the name of the manufacturer, the unified code of the manufacturer, the registered address of the manufacturer, and the contact information of the manufacturer.
假设第一标准元数据模板为图3所示,由图3可确定第一标准元数据模板中各标准元数据项的标识信息包括:生产企业名称为2.1.1.1,生产企业统一代码为2.1.1.2,生产企业注册地址为2.1.1.3,生产企业联系信息为2.1.1.4。Assuming that the first standard metadata template is shown in Figure 3, it can be determined from Figure 3 that the identification information of each standard metadata item in the first standard metadata template includes: the name of the manufacturer is 2.1.1.1, and the unified code of the manufacturer is 2.1. 1.2, the registered address of the manufacturer is 2.1.1.3, and the contact information of the manufacturer is 2.1.1.4.
进而,根据步骤S102确定出企业元数据模板中的制造企业名称与生产企业名称间的相似度大于预设阈值,假设预设阈值为95%,则确定企业元数据模板中的“制造企业名称”与第一标准元数据模板中的“生产企业名称”的相似度大于95%,相似度匹配成功。进而,通过步骤S103为其分配与第一标准元数据模板中的“生产企业名称”对应的标识2.1.1.1。并且,反馈信息中也包括了关于“生产企业名称”的描述信息,描述信息中包括了正确的标准元数据项的名称以及相关的数据定义等。同理,针对“制造企业统一代码”采用同样的方式确定出其在第一标准元数据模板中的标识为2.1.1.2。针对“制造企业注册地址”采用同样的方式确定出其在第一标准元数据模板中的标识为2.1.1.3。针对“制造企业联系信息”采用同样的方式确定出其在第一标准元数据模板中的标识为2.1.1.4。Further, according to step S102, it is determined that the similarity between the manufacturing enterprise name and the manufacturing enterprise name in the enterprise metadata template is greater than the preset threshold, and assuming that the preset threshold is 95%, then determine the "manufacturing enterprise name" in the enterprise metadata template. If the similarity with the "manufacturing company name" in the first standard metadata template is greater than 95%, the similarity matching is successful. Further, through step S103, the identifier 2.1.1.1 corresponding to the "production enterprise name" in the first standard metadata template is assigned to it. In addition, the feedback information also includes description information about the "name of the manufacturer", and the description information includes the correct standard metadata item name and related data definitions. In the same way, the identification of the "uniform code of manufacturing enterprises" in the first standard metadata template is determined as 2.1.1.2 in the same way. In the same way, the "registered address of the manufacturing enterprise" is identified as 2.1.1.3 in the first standard metadata template. For "manufacturing enterprise contact information", its identification in the first standard metadata template is determined as 2.1.1.4 in the same manner.
而针对“制造企业国别”,第一标准元数据模板中没有与之匹配的标准元数据项,可将其添加至第一标准元数据模板,并为其配置新标识为2.1.1.5以及新描述信息。反馈信息中也会包含该元数据项的标识和描述信息,并将其标注为新的元数据项,包括:制造企业名称为2.1.1.1,制造企业统一代码为2.1.1.2,制造企业注册地址为2.1.1.3,制造企业联系信息为2.1.1.4,制造企业国别(新)为2.1.1.5(新)。For "manufacturing enterprise country", if there is no standard metadata item in the first standard metadata template, it can be added to the first standard metadata template and configured with new identifiers 2.1.1.5 and new Description. The feedback information will also include the identification and description information of the metadata item, and mark it as a new metadata item, including: the name of the manufacturing enterprise is 2.1.1.1, the unified code of the manufacturing enterprise is 2.1.1.2, and the registered address of the manufacturing enterprise It is 2.1.1.3, the manufacturing company contact information is 2.1.1.4, and the manufacturing company country (new) is 2.1.1.5 (new).
由于反馈信息中包括了针对企业元数据模板中的各企业元数据项的相关匹配信息,包括与之匹配成功的正确的标准元数据项的标识和描述信息,描述信息中包括了正确的名称和数据定义,所以,企业可以基于反馈信息对虽匹配成功,但相似度没有达到100%,存在不一致的企业元数据项进行修正,与系统对应的标准元数据项保持一致,下次上传时相似度就是100%。并且,根据修正后的企业元数据模板中的各元数据项,确定与之关联的本地数据。这样,当元数据系统向企业请求标准元数据项的主数据时,企业能够准确响应,并将系统请求的数据上传至系统,以提高企业间数据的流通性,提高数据的利用率,实现相关对象的数据全生命周期的管理和监控。Because the feedback information includes the relevant matching information for each enterprise metadata item in the enterprise metadata template, including the identification and description information of the correct standard metadata item that has been successfully matched, the description information includes the correct name and Therefore, based on the feedback information, the enterprise can correct the enterprise metadata items that have been matched successfully, but the similarity has not reached 100%, and there are inconsistencies, and it is consistent with the standard metadata items corresponding to the system. is 100%. And, according to each metadata item in the revised enterprise metadata template, the local data associated therewith is determined. In this way, when the metadata system requests the master data of standard metadata items from the enterprise, the enterprise can respond accurately and upload the data requested by the system to the system, so as to improve the circulation of data between enterprises, improve the utilization rate of data, and realize the relevant Object data lifecycle management and monitoring.
在本实施例中,针对工业领域,应用于产品追溯的数据,主要包括产品的属性数据(从工业软件中获取的数据,比如产品品牌)、产品的成品数据(rfid等现场采集到的数据)。为了能够实现数据可追溯,元数据系统按前述方式统一系统中的元数据模板,实现数据清洗,数据清洗后。明确了数据字段,数据定义,不同字段的映射方式,从而产业链上下游不同企业内数据能够互通。In this embodiment, for the industrial field, the data applied to product traceability mainly includes attribute data of the product (data obtained from industrial software, such as product brand), finished product data of the product (data collected on site such as rfid) . In order to realize data traceability, the metadata system unifies the metadata templates in the system according to the aforementioned method to realize data cleaning, after data cleaning. The data fields, data definitions, and mapping methods of different fields are clarified, so that the data of different enterprises in the upstream and downstream of the industrial chain can be exchanged.
为了实现数据对象的全生命周期的管理,本实施例的方法还包括:In order to realize the management of the whole life cycle of the data object, the method of this embodiment further includes:
接收所述企业基于所述反馈信息上传的与所述第一标准元数据模板对应的第一主数据;receiving first master data corresponding to the first standard metadata template uploaded by the enterprise based on the feedback information;
基于所述第一主数据,确定与所述第一标准元数据模板关联的第二标准元数据模板,建立所述第一标准元数据模板与所述第二标准元数据模板的映射关系。Based on the first master data, a second standard metadata template associated with the first standard metadata template is determined, and a mapping relationship between the first standard metadata template and the second standard metadata template is established.
具体的,在前述实施例中介绍了系统维护了不同类型的标准元数据模板,其中,包括了事件类型和属性类型,也即动态类型和静态类型。在工业互联网中,一个产品对象可能关联多个企业,比如生产企业对应的是属性类型的元数据模板,而运输企业和运营企业针对该产品对象的是事件类型的元数据模板。Specifically, in the foregoing embodiments, it is introduced that the system maintains different types of standard metadata templates, including event types and attribute types, that is, dynamic types and static types. In the Industrial Internet, a product object may be associated with multiple companies. For example, the production company corresponds to the attribute type metadata template, while the transportation company and the operation company use the event type metadata template for the product object.
这样,如果系统获得权限得到企业基于反馈信息上传的与第一标准元数据模板对应的第一主数据,就可以基于第一主数据,确定与第一标准元数据模板关联的第二标准元数据模板,建立第一标准元数据模板与第二标准元数据模板的映射关系。举例来说,钢铁生产企业、钢铁流通企业、钢铁使用企业,按各自对应的标准元数据模板上传主数据,钢铁生产企业基于第一标准元数据模板上传的主数据中包括了钢铁的标识或者生产企业名称、代码等,如果钢铁流通企业基于第二标准元数据模板上传了同样的钢铁的标识或者生产企业名称、代码等,建立第一标准元数据模板与第二标准元数据模板间的映射关系。这样,当权限用户通过钢铁的标识进行查询时,可将第一标准元数据模板和第二标准元数据模板均作为查询结果反馈至该权限用户。同时,如果权限用户具备访问企业本地数据的权限,第一标准元数据模板对应的企业和第二标准元数据对应的企业还会各自反馈对应的数据作为查询结果。这样,就可以将企业间的数据关联起来,实现对象数据全生命周期的管理。In this way, if the system obtains permission to obtain the first master data corresponding to the first standard metadata template uploaded by the enterprise based on the feedback information, it can determine the second standard metadata associated with the first standard metadata template based on the first master data template, to establish a mapping relationship between the first standard metadata template and the second standard metadata template. For example, iron and steel production enterprises, iron and steel distribution enterprises, and iron and steel users upload master data according to their corresponding standard metadata templates. The master data uploaded by iron and steel manufacturers based on the first standard metadata template includes the identification or production of iron and steel. Enterprise name, code, etc. If the steel distribution enterprise uploads the same steel logo or production enterprise name, code, etc. based on the second standard metadata template, establish the mapping relationship between the first standard metadata template and the second standard metadata template . In this way, when an authorized user conducts a query through the identification of iron and steel, both the first standard metadata template and the second standard metadata template can be fed back to the authorized user as query results. At the same time, if the authorized user has the right to access the local data of the enterprise, the enterprise corresponding to the first standard metadata template and the enterprise corresponding to the second standard metadata will also respectively feed back the corresponding data as the query result. In this way, the data between enterprises can be associated, and the management of the whole life cycle of object data can be realized.
进一步,在本实施例中,系统还可以预先建立起不同类型标准元数据模板间的映射关系,比如针对电视类的标准元数据模板和电视屏幕的标准元数据模板间的映射关系。或者,矿物原料的标准元数据模板和带钢的标准元数据模板间的映射关系等,在具体实施过程中,均可根据实际需要进行设置 ,在此,本实施例不做限制。Further, in this embodiment, the system may also pre-establish a mapping relationship between different types of standard metadata templates, such as a mapping relationship between a standard metadata template for a TV class and a standard metadata template for a TV screen. Alternatively, the mapping relationship between the standard metadata template of mineral raw materials and the standard metadata template of strip steel, etc., can be set according to actual needs in the specific implementation process, which is not limited in this embodiment.
本实施例中的元数据系统主要应用于工业互联网标识解析系统中的递归节点、国家顶级节点、二级节点中,每个节点均可提供解析查询的入口,查询的方式可通过如下步骤实现:The metadata system in this embodiment is mainly applied to recursive nodes, national top-level nodes, and secondary nodes in the industrial Internet identification resolution system. Each node can provide an entry for analysis and query, and the query method can be implemented through the following steps:
接收查询请求,所述查询请求中包括待查询标识;receiving a query request, where the query request includes an identifier to be queried;
将所述查询请求发送至所述递归节点,以使得所述递归节点分别向所述国家顶级节点和所述二级节点发出解析请求,以使得所述国家顶级节点和/或所述二级节点解析出与所述待查询标识对应的目标元数据项和所述目标元数据项所在的目标企业节点;sending the query request to the recursive node, so that the recursive node sends a parsing request to the country top-level node and the second-level node, respectively, so that the country top-level node and/or the second-level node Parse out the target metadata item corresponding to the identifier to be queried and the target enterprise node where the target metadata item is located;
接收所述目标企业节点反馈的与所述目标元数据项对应的目标主数据;receiving target master data corresponding to the target metadata item fed back by the target enterprise node;
将所述目标元数据项和所述目标主数据反馈至所述查询请求的请求端。The target metadata item and the target main data are fed back to the requester of the query request.
具体的,在本实施例中,接入系统的各个企业节点维护自己的数据,每个二级节点、国家顶级节点、以及企业节点中均设置有标识数据服务,可以供用户进行数据查询。本实施例中的标识可指物品/设备/机器的工业互联网标识,也可指其他用途的标识,例如用户标识、产品类型标识等,在具体实施过程中,可根据实际需要进行设置,在此,本实施例不做限制。标识包括标识编码和标识值列表,标识编码相当于“身份证”,具有唯一性。也可说,一个标识编码表征一个标识,或表征标识背后的物品、设备、机器、管理员等。具体的,标识编码包括前缀和后缀,前缀与后缀之间用“/”分开,举例来讲,标识编码88.100.1/xxx中,“88.100.1”是前缀,“xxx”是后缀。可将产品类型标识之外的、表征物品/设备/机器/用户的其他标识统称为“元数据标识”,同一品类的多个元数据标识与同一产品类型标识相关联。Specifically, in this embodiment, each enterprise node of the access system maintains its own data, and each secondary node, national top-level node, and enterprise node are provided with identification data services for users to query data. The identifiers in this embodiment may refer to the industrial Internet identifiers of items/equipment/machines, and may also refer to identifiers for other purposes, such as user identifiers, product type identifiers, etc. In the specific implementation process, they can be set according to actual needs. Here , which is not limited in this embodiment. The identification includes an identification code and a list of identification values. The identification code is equivalent to an "identity card" and is unique. It can also be said that an identification code represents an identification, or an object, equipment, machine, administrator, etc. behind the identification. Specifically, the identification code includes a prefix and a suffix, and the prefix and the suffix are separated by "/". For example, in the identification code 88.100.1/xxx, "88.100.1" is the prefix and "xxx" is the suffix. Other identifiers other than product type identifiers that characterize items/equipment/machines/users may be collectively referred to as "metadata identifiers", and multiple metadata identifiers of the same category are associated with the same product type identifier.
进而,请参考图6,通过节点的标识数据服务接收查询请求,该查询请求中包括了待查询标识,该将所述查询请求发送至递归节点,递归节点配置有路由功能,将该解析请求先发送至对应的二级节点,二级节点的标识解析功能主要是为其分配的标识提供公共解析服务。对于由二级节点自身分配的标识编码,二级节点在其标识解析功能中配置该分配标识相应的路由信息。当收到标识解析请求时,如果二级节点保存有标识编码对应路由等信息,则直接回复该标识编码对应的企业节点通信地址以及对应的目标元数据项等信息,如果二级节点没有该标识相应的路由等信息,该二级节点返回信息为空。Further, please refer to FIG. 6 , a query request is received through the identification data service of the node, the query request includes the identification to be queried, the query request is sent to the recursive node, and the recursive node is configured with a routing function, and the parsing request is first It is sent to the corresponding secondary node, and the identification resolution function of the secondary node is mainly to provide public resolution services for the assigned identification. For the identification code allocated by the secondary node itself, the secondary node configures routing information corresponding to the allocated identification in its identification resolution function. When receiving an identification resolution request, if the secondary node saves information such as the route corresponding to the identification code, it will directly reply to the information such as the communication address of the enterprise node corresponding to the identification code and the corresponding target metadata item. If the secondary node does not have the identification code Corresponding routing and other information, the return information of the secondary node is empty.
如果二级节点返回信息为空,则将解析请求发送至国家顶级节点进行解析,进而通过国家顶级节点定位至目标二级节点,再由目标二级节点回复该标识编码对应的企业节点通信地址以及对应的目标元数据项等信息,再根据地址访问企业节点,如果请求方具备访问权限,企业节点将目标元数据项和对应的目标主数据反馈至系统,系统将目标元数据项以及对应的目标主数据反馈给请求方。同时,可将该待查询标识涉及的标准元数据模板也作为查询结果反馈至请求端。在具体实施过程中,查询结果可根据实际需要进行配置,在此,本实施例不做限制。If the return information of the secondary node is empty, the parsing request will be sent to the national top-level node for analysis, and then the target secondary node will be located through the national top-level node, and then the target secondary node will reply to the enterprise node communication address corresponding to the identification code and The corresponding target metadata item and other information, and then access the enterprise node according to the address. If the requester has access rights, the enterprise node feeds back the target metadata item and the corresponding target master data to the system, and the system sends the target metadata item and the corresponding target. The master data is fed back to the requester. At the same time, the standard metadata template involved in the identifier to be queried can also be fed back to the requester as the query result. In the specific implementation process, the query result may be configured according to actual needs, which is not limited in this embodiment.
进一步,在本实施例中,请参考图7,节点部署引入区块链架构,企业节点、二级节点、国家顶级节点都在区块链中,标识数据服务是在区块链节点上。基于区块链系统的特点,每个节点分配标识不容易被篡改,以确保数据的安全性。并且,可将查询记录存储在对应的节点中,以实现信息追溯,并且,基于区块链技术,还可以对上传数据端和查询数据端进行有效的身份认证,确保数据的安全。Further, in this embodiment, please refer to FIG. 7 , the node deployment introduces a blockchain architecture, enterprise nodes, secondary nodes, and national top-level nodes are all in the blockchain, and the identification data service is on the blockchain nodes. Based on the characteristics of the blockchain system, the identification of each node is not easily tampered with to ensure data security. In addition, the query record can be stored in the corresponding node to realize information traceability, and based on the blockchain technology, the upload data terminal and the query data terminal can also be effectively authenticated to ensure data security.
进一步,各节点中维护的标准元数据模板需要更新时,系统向区块链网络中的各个节点发起投票事件,该投票事件中包括了原有版本的第一标准元数据模板和新版本的第一标准元数据模板的元数据项。以及该第一标准元数据模板所属的行业。以使区块链中的节点响应该投票事件,基于共识机制更新进行投票。如果投票通过,则进行更新,并将更新记录保存至相关的区块链节点,使得更新记录可追溯。基于区块链的投票机制进行第一标准元数据模板的更新,投票结果反映了区块链中大部分节点的更新意愿,可以确保更新的公平合理,升级更新更优化更精确。各节点中标准元数据模板的加密、数据的权限管理等均可以通过区块链网络中的共识机制进行投票来确定,以确保数据的安全性。Further, when the standard metadata template maintained in each node needs to be updated, the system initiates a voting event to each node in the blockchain network, and the voting event includes the first standard metadata template of the original version and the first standard metadata template of the new version. A metadata item of a standard metadata template. and the industry to which the first standard metadata template belongs. In order to make the nodes in the blockchain respond to the voting event and vote based on the consensus mechanism update. If the vote is passed, the update is performed and the update record is saved to the relevant blockchain node, so that the update record can be traced back. The blockchain-based voting mechanism is used to update the first standard metadata template. The voting results reflect the update willingness of most nodes in the blockchain, which can ensure that the update is fair and reasonable, and the upgrade and update are more optimized and accurate. The encryption of standard metadata templates in each node and the management of data rights can be determined by voting through the consensus mechanism in the blockchain network to ensure data security.
请参见图8,本发明的第二实施例提供了一种元数据处理装置,应用于元数据系统,包括:Referring to FIG. 8, a second embodiment of the present invention provides a metadata processing apparatus, which is applied to a metadata system, including:
接收模块801,用于接收企业上传的企业元数据模板,其中,所述企业元数据模板中包括所述企业配置的至少一个企业元数据项;A receiving
匹配模块802,用于将所述企业元数据模板中的每个企业元数据项与第一标准元数据模板中的标准元数据项进行相似度匹配,获得匹配结果,其中,所述匹配结果包括匹配成功的第一企业元数据项和匹配失败的第二企业元数据项;A
确定模块803,用于基于所述匹配结果,确定反馈信息,其中,针对所述第一企业元数据项,所述反馈信息中包括与所述第一企业元数据项匹配的标准数据项的标识和描述信息,针对所述第二企业元数据项,将所述第二企业元数据项添加至所述第一标准元数据模板,为所述第二企业元数据项配置新标识和新描述信息,所述反馈信息中包括所述新标识和新描述信息;A
反馈模块804,用于将所述反馈信息发送至所述企业,以使得所述企业基于所述反馈信息建立本地数据与所述第一标准元数据模板中各标准数据项间的映射关系。The
在一种可选实现方式中,所述匹配模块具体用于:In an optional implementation manner, the matching module is specifically used for:
基于语义分析算法确定每个所述企业元数据项的特征向量和每个所述标准元数据项的特征向量;determining a feature vector of each of the enterprise metadata items and a feature vector of each of the standard metadata items based on a semantic analysis algorithm;
基于每个所述企业元数据项的特征向量和每个所述标准元数据项的特征向量,确定每个所述企业元数据项与每个所述标准元数据项间的相似度;determining the similarity between each of the enterprise metadata items and each of the standard metadata items based on the feature vector of each of the enterprise metadata items and the feature vector of each of the standard metadata items;
如果企业元数据项与标准元数据项间的相似度满足阈值条件,所述匹配结果为匹配成功,如果企业元数据项与标准元数据项间的相似度不满足所述阈值条件,所述匹配结果为匹配失败。If the similarity between the enterprise metadata item and the standard metadata item satisfies the threshold condition, the matching result is successful; if the similarity between the enterprise metadata item and the standard metadata item does not meet the threshold condition, the matching The result is a match failure.
在一种可选实现方式中,所述元数据处理装置还包括确定模块,所述确定模块具体用于:In an optional implementation manner, the metadata processing apparatus further includes a determination module, and the determination module is specifically configured to:
在所述将所述企业元数据模板中每个企业元数据项与第一标准元数据模板中的标准元数据项进行相似度匹配之前,确定所述企业元数据模板所属的目标类型,其中,所述元数据系统中包括行业类型和综合类型对应的标准元数据模板;Before performing similarity matching between each enterprise metadata item in the enterprise metadata template and the standard metadata item in the first standard metadata template, determine the target type to which the enterprise metadata template belongs, wherein, The metadata system includes standard metadata templates corresponding to industry types and comprehensive types;
确定与所述目标类型对应的第一标准元数据模板。A first standard metadata template corresponding to the target type is determined.
在一种可选实现方式中,所述确定模块具体用于:In an optional implementation manner, the determining module is specifically used for:
将所述企业的选定类型作为目标类型;或the selected type of said business as the target type; or
基于所述企业的属性信息确定目标类型;或determine a target type based on attribute information of the business; or
确定所述企业元数据模板的文本信息与每个类型的标准元数据模板的文本信息间的相似性,基于所述相似性确定目标类型。Similarity between the textual information of the enterprise metadata template and the textual information of each type of standard metadata template is determined, and the target type is determined based on the similarity.
在一种可选实现方式中,所述元数据处理装置还包括映射模块,所述映射模块具体用于:In an optional implementation manner, the metadata processing apparatus further includes a mapping module, and the mapping module is specifically configured to:
将所述反馈信息发送至所述企业之后,接收所述企业基于所述反馈信息上传的与所述第一标准元数据模板对应的第一主数据;After sending the feedback information to the enterprise, receiving first master data corresponding to the first standard metadata template uploaded by the enterprise based on the feedback information;
基于所述第一主数据,确定与所述第一标准元数据模板关联的第二标准元数据模板,建立所述第一标准元数据模板与所述第二标准元数据模板的映射关系。Based on the first master data, a second standard metadata template associated with the first standard metadata template is determined, and a mapping relationship between the first standard metadata template and the second standard metadata template is established.
在一种可选的实施例中,所述元数据系统应用于工业互联网标识解析系统中的递归节点、国家顶级节点、二级节点中的任意一个或多个组合,所述工业互联网标识解析系统的递归节点、国家顶级节点、二级节点和企业节点均为区块链节点。In an optional embodiment, the metadata system is applied to any one or more combinations of recursive nodes, national top-level nodes, and secondary nodes in the industrial Internet identification resolution system. The recursive nodes, national top-level nodes, secondary nodes and enterprise nodes are all blockchain nodes.
在一种可选实现方式中,所述元数据处理装置还包括查询模块,所述确定模块具体用于:In an optional implementation manner, the metadata processing apparatus further includes a query module, and the determination module is specifically configured to:
接收查询请求,所述查询请求中包括待查询标识;receiving a query request, where the query request includes an identifier to be queried;
将所述查询请求发送至所述递归节点,以使得所述递归节点分别向所述国家顶级节点和所述二级节点发出解析请求,以使得所述国家顶级节点和/或所述二级节点解析出与所述待查询标识对应的目标元数据项和所述目标元数据项所在的目标企业节点;sending the query request to the recursive node, so that the recursive node sends a parsing request to the country top-level node and the second-level node, respectively, so that the country top-level node and/or the second-level node Parse out the target metadata item corresponding to the identifier to be queried and the target enterprise node where the target metadata item is located;
接收所述目标企业节点反馈的与所述目标元数据项对应的目标主数据;receiving target master data corresponding to the target metadata item fed back by the target enterprise node;
将所述目标元数据项和所述目标主数据反馈至所述查询请求的请求端。The target metadata item and the target main data are fed back to the requester of the query request.
本实施例中的元数据处理装置进行数据处理的详细过程已在前述第一实施例中详细阐述,可参见第一实施例中的内容,在此,本实施例不再赘述。The detailed process of data processing performed by the metadata processing apparatus in this embodiment has been described in detail in the foregoing first embodiment, and reference may be made to the content in the first embodiment, which will not be repeated in this embodiment.
请参见图9,本发明的第三实施例提供了一种元数据处理装置,该实施例的装置包括:处理器901、存储器902以及存储在所述存储器中并可在所述处理器上运行的计算机程序,例如第一实施例中元数据处理方法对应的程序。所述处理器执行所述计算机程序时实现上述第一实施例中各元数据处理方法中的步骤。或者,所述处理器执行所述计算机程序时实现上述第二实施例的装置中各模块/单元的功能。Referring to FIG. 9, a third embodiment of the present invention provides an apparatus for processing metadata. The apparatus in this embodiment includes: a
示例性的,所述计算机程序可以被分割成一个或多个模块/单元,所述一个或者多个模块/单元被存储在所述存储器中,并由所述处理器执行,以完成本发明。所述一个或多个模块/单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述所述计算机程序在所述计算机装置中的执行过程。Exemplarily, the computer program may be divided into one or more modules/units, and the one or more modules/units are stored in the memory and executed by the processor to accomplish the present invention. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions, and the instruction segments are used to describe the execution process of the computer program in the computer device.
所述元数据处理装置可包括,但不仅限于,处理器、存储器。本领域技术人员可以理解,所述示意图9仅仅是计算机装置的示例,并不构成对装置的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如所述元数据处理装置还可以包括输入输出设备、网络接入设备、总线等。The metadata processing apparatus may include, but is not limited to, a processor and a memory. Those skilled in the art can understand that the schematic diagram 9 is only an example of a computer device, and does not constitute a limitation to the device. It may include more or less components than the one shown, or combine some components, or For example, the metadata processing apparatus may further include an input/output device, a network access device, a bus, and the like.
所称处理器901可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器 (Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列 (Field-Programmable Gate Array,FPGA) 或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等,所述处理器是所述计算机装置的控制中心,利用各种接口和线路连接整个计算机装置的各个部分。The so-called
所述存储器902可用于存储所述计算机程序和/或模块,所述处理器通过运行或执行存储在所述存储器内的计算机程序和/或模块,以及调用存储在存储器内的数据,实现所述计算机装置的各种功能。所述存储器可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序(比如声音播放功能、图像播放功能等)等;存储数据区可存储根据手机的使用所创建的数据(比如音频数据、视频数据等)等。此外,存储器可以包括高速随机存取存储器,还可以包括非易失性存储器,例如硬盘、内存、插接式硬盘,智能存储卡(Smart Media Card, SMC),安全数字(Secure Digital, SD)卡,闪存卡(Flash Card)、至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。The
进一步,该元数据处理装置所包括的处理器901还具有以下功能:Further, the
接收企业上传的企业元数据模板,其中,所述企业元数据模板中包括所述企业配置的至少一个企业元数据项;receiving an enterprise metadata template uploaded by an enterprise, wherein the enterprise metadata template includes at least one enterprise metadata item configured by the enterprise;
将所述企业元数据模板中的每个企业元数据项与第一标准元数据模板中的标准元数据项进行相似度匹配,获得匹配结果,其中,所述匹配结果包括匹配成功的第一企业元数据项和匹配失败的第二企业元数据项;Perform similarity matching between each enterprise metadata item in the enterprise metadata template and the standard metadata item in the first standard metadata template to obtain a matching result, wherein the matching result includes the first enterprise that has been successfully matched The metadata item and the second enterprise metadata item that failed to match;
基于所述匹配结果,确定反馈信息,其中,针对所述第一企业元数据项,所述反馈信息中包括与所述第一企业元数据项匹配的标准数据项的标识和描述信息,针对所述第二企业元数据项,将所述第二企业元数据项添加至所述第一标准元数据模板,为所述第二企业元数据项配置新标识和新描述信息,所述反馈信息中包括所述新标识和新描述信息;Based on the matching result, feedback information is determined, wherein, for the first enterprise metadata item, the feedback information includes identification and description information of a standard data item matching the first enterprise metadata item, and for the first enterprise metadata item, the identification and description information are included in the feedback information. the second enterprise metadata item, add the second enterprise metadata item to the first standard metadata template, configure a new identifier and new description information for the second enterprise metadata item, and the feedback information Include the new identification and new description information;
将所述反馈信息发送至所述企业,以使得所述企业基于所述反馈信息建立本地数据与所述第一标准元数据模板中各标准数据项间的映射关系。The feedback information is sent to the enterprise, so that the enterprise establishes a mapping relationship between local data and each standard data item in the first standard metadata template based on the feedback information.
进一步,该元数据处理装置所包括的处理器901还具有以下功能:Further, the
基于语义分析算法确定每个所述企业元数据项的特征向量和每个所述标准元数据项的特征向量;determining a feature vector of each of the enterprise metadata items and a feature vector of each of the standard metadata items based on a semantic analysis algorithm;
基于每个所述企业元数据项的特征向量和每个所述标准元数据项的特征向量,确定每个所述企业元数据项与每个所述标准元数据项间的相似度;determining the similarity between each of the enterprise metadata items and each of the standard metadata items based on the feature vector of each of the enterprise metadata items and the feature vector of each of the standard metadata items;
如果企业元数据项与标准元数据项间的相似度满足阈值条件,所述匹配结果为匹配成功,如果企业元数据项与标准元数据项间的相似度不满足所述阈值条件,所述匹配结果为匹配失败。If the similarity between the enterprise metadata item and the standard metadata item satisfies the threshold condition, the matching result is successful; if the similarity between the enterprise metadata item and the standard metadata item does not meet the threshold condition, the matching The result is a match failure.
进一步,该元数据处理装置所包括的处理器901还具有以下功能:Further, the
在所述将所述企业元数据模板中每个企业元数据项与第一标准元数据模板中的标准元数据项进行相似度匹配之前,确定所述企业元数据模板所属的目标类型,其中,所述元数据系统中包括行业类型和综合类型对应的标准元数据模板;Before performing similarity matching between each enterprise metadata item in the enterprise metadata template and the standard metadata item in the first standard metadata template, determine the target type to which the enterprise metadata template belongs, wherein, The metadata system includes standard metadata templates corresponding to industry types and comprehensive types;
确定与所述目标类型对应的第一标准元数据模板。A first standard metadata template corresponding to the target type is determined.
进一步,该元数据处理装置所包括的处理器901还具有以下功能:Further, the
将所述企业的选定类型作为目标类型;或the selected type of said business as the target type; or
基于所述企业的属性信息确定目标类型;或determine a target type based on attribute information of the business; or
确定所述企业元数据模板的文本信息与每个类型的标准元数据模板的文本信息间的相似性,基于所述相似性确定目标类型。Similarity between the textual information of the enterprise metadata template and the textual information of each type of standard metadata template is determined, and the target type is determined based on the similarity.
进一步,该元数据处理装置所包括的处理器901还具有以下功能:Further, the
在所述将所述反馈信息发送至所述企业之后,接收所述企业基于所述反馈信息上传的与所述第一标准元数据模板对应的第一主数据;After the sending the feedback information to the enterprise, receiving first master data corresponding to the first standard metadata template uploaded by the enterprise based on the feedback information;
基于所述第一主数据,确定与所述第一标准元数据模板关联的第二标准元数据模板,建立所述第一标准元数据模板与所述第二标准元数据模板的映射关系。Based on the first master data, a second standard metadata template associated with the first standard metadata template is determined, and a mapping relationship between the first standard metadata template and the second standard metadata template is established.
进一步,所述元数据系统应用于工业互联网标识解析系统中的递归节点、国家顶级节点、二级节点中的任意一个或多个组合,所述工业互联网标识解析系统的递归节点、国家顶级节点、二级节点和企业节点均为区块链节点。Further, the metadata system is applied to any one or more combinations of recursive nodes, national top-level nodes, and secondary nodes in the industrial Internet identification resolution system, and the recursive nodes, national top-level nodes, Both secondary nodes and enterprise nodes are blockchain nodes.
进一步,该元数据处理装置所包括的处理器901还具有以下功能:Further, the
接收查询请求,所述查询请求中包括待查询标识;receiving a query request, where the query request includes an identifier to be queried;
将所述查询请求发送至所述递归节点,以使得所述递归节点分别向所述国家顶级节点和所述二级节点发出解析请求,以使得所述国家顶级节点和/或所述二级节点解析出与所述待查询标识对应的目标元数据项和所述目标元数据项所在的目标企业节点;sending the query request to the recursive node, so that the recursive node sends a parsing request to the country top-level node and the second-level node, respectively, so that the country top-level node and/or the second-level node Parse out the target metadata item corresponding to the identifier to be queried and the target enterprise node where the target metadata item is located;
接收所述目标企业节点反馈的与所述目标元数据项对应的目标主数据;receiving target master data corresponding to the target metadata item fed back by the target enterprise node;
将所述目标元数据项和所述目标主数据反馈至所述查询请求的请求端。The target metadata item and the target main data are fed back to the requester of the query request.
本发明第四实施例提供了一种计算机可读存储介质,其上存储有计算机程序,本发明第二实施例中的所述元数据处理装置集成的功能单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明实现上述第一实施例的元数据处理方法中的全部或部分流程,也可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一计算机可读存储介质中,该计算机程序在被处理器执行时,可实现上述各个方法实施例的步骤。其中,所述计算机程序包括计算机程序代码,所述计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,所述计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。The fourth embodiment of the present invention provides a computer-readable storage medium on which a computer program is stored. If the functional units integrated in the metadata processing apparatus in the second embodiment of the present invention are implemented in the form of software functional units and When sold or used as a stand-alone product, it may be stored on a computer-readable storage medium. Based on this understanding, the present invention realizes all or part of the process in the metadata processing method of the first embodiment, and can also be completed by instructing the relevant hardware through a computer program, and the computer program can be stored in a computer-readable In the storage medium, when the computer program is executed by the processor, the steps of the foregoing method embodiments can be implemented. Wherein, the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file or some intermediate form, and the like. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, U disk, removable hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read-Only Memory) , Random Access Memory (RAM, Random Access Memory), electric carrier signal, telecommunication signal and software distribution medium, etc. It should be noted that the content contained in the computer-readable media may be appropriately increased or decreased according to the requirements of legislation and patent practice in the jurisdiction, for example, in some jurisdictions, according to legislation and patent practice, the computer-readable media Electric carrier signals and telecommunication signals are not included.
尽管已描述了本发明的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本发明范围的所有变更和修改。Although preferred embodiments of the present invention have been described, additional changes and modifications to these embodiments may occur to those skilled in the art once the basic inventive concepts are known. Therefore, the appended claims are intended to be construed to include the preferred embodiment and all changes and modifications that fall within the scope of the present invention.
显然,本领域的技术人员可以对本发明进行各种改动和变型而不脱离本发明的精神和范围。这样,倘若本发明的这些修改和变型属于本发明权利要求及其等同技术的范围之内,则本发明也意图包含这些改动和变型在内。It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit and scope of the invention. Thus, provided that these modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include these modifications and variations.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110354795.6A CN112732703B (en) | 2021-03-23 | 2021-03-23 | Metadata processing method, metadata processing apparatus, and readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110354795.6A CN112732703B (en) | 2021-03-23 | 2021-03-23 | Metadata processing method, metadata processing apparatus, and readable storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112732703A CN112732703A (en) | 2021-04-30 |
CN112732703B true CN112732703B (en) | 2022-04-12 |
Family
ID=75596286
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110354795.6A Active CN112732703B (en) | 2021-03-23 | 2021-03-23 | Metadata processing method, metadata processing apparatus, and readable storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112732703B (en) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113111636B (en) * | 2021-05-17 | 2024-04-12 | 京东科技控股股份有限公司 | Data uniqueness standard identification method and device |
CN113377740A (en) * | 2021-05-28 | 2021-09-10 | 中国铁道科学研究院集团有限公司电子计算技术研究所 | Railway metadata management method, application method and device |
CN113377741A (en) * | 2021-05-28 | 2021-09-10 | 中国铁道科学研究院集团有限公司电子计算技术研究所 | Method and device for managing metadata of railway engineering design |
CN113486051B (en) * | 2021-06-16 | 2023-06-13 | 中国联合网络通信集团有限公司 | Industrial internet identification analysis method, system, node and storage medium |
CN113535938B (en) * | 2021-07-22 | 2024-06-28 | 北京明略软件系统有限公司 | Standard data construction method, system, equipment and medium based on content identification |
CN114021540B (en) * | 2021-11-11 | 2025-03-14 | 浙江太美医疗科技股份有限公司 | Data processing method, device, equipment and storage medium |
CN114139501A (en) * | 2021-11-24 | 2022-03-04 | 山东浪潮工业互联网产业股份有限公司 | Conversion docking method of identification analysis system data based on micro-service architecture |
CN114925661A (en) * | 2022-05-18 | 2022-08-19 | 浪潮工业互联网股份有限公司 | Method, device, equipment and medium for generating enterprise identification data |
CN116010672A (en) * | 2022-12-12 | 2023-04-25 | 山东浪潮质量链科技有限公司 | Data processing method and device based on identification analysis system |
CN116959455A (en) * | 2023-07-17 | 2023-10-27 | 中国信息通信研究院 | An audio counterfeiting method based on voiceprint recognition |
CN117453721B (en) * | 2023-10-29 | 2024-07-19 | 江苏信而泰智能装备有限公司 | Production management data acquisition system based on big data |
CN119691028A (en) * | 2025-02-21 | 2025-03-25 | 浪潮通用软件有限公司 | Metadata-based automatic main data acquisition method |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103530334B (en) * | 2013-09-29 | 2018-01-23 | 北大医疗信息技术有限公司 | Based on the data matching system and method for comparing template |
CN107832371A (en) * | 2017-10-27 | 2018-03-23 | 安徽经邦软件技术有限公司 | A kind of Excel intelligent Matchings import system and method based on financial statement |
CN109684450B (en) * | 2018-12-20 | 2023-07-14 | 中国科学院沈阳自动化研究所 | A Semantic Identification-Based Industrial Network Data Distribution Service System and Method |
CN109871382A (en) * | 2019-02-13 | 2019-06-11 | 北京明略软件系统有限公司 | A kind of implementation method and device of tables of data access java standard library |
EP3722968A1 (en) * | 2019-04-12 | 2020-10-14 | Basf Se | Data extraction system |
CN110377633A (en) * | 2019-06-21 | 2019-10-25 | 深圳壹账通智能科技有限公司 | Method for processing report data, device, computer equipment and storage medium |
-
2021
- 2021-03-23 CN CN202110354795.6A patent/CN112732703B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN112732703A (en) | 2021-04-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112732703B (en) | Metadata processing method, metadata processing apparatus, and readable storage medium | |
CN112699160B (en) | Metadata template upgrade method, device and readable storage medium | |
CN112699089B (en) | Data sharing system, data sharing method and device | |
CN111459985B (en) | Identification information processing method and device | |
CN107220837A (en) | A kind of destination object source tracing method and device | |
CN110503373A (en) | A kind of material-flow method and device based on block chain network | |
US20120310906A1 (en) | Building Information Tracking System and Method of Use | |
WO2021208762A1 (en) | Data storage and query | |
WO2018201887A1 (en) | Data response method, apparatus, terminal device, and medium | |
US9766881B2 (en) | Social project management system and marketplace | |
CN105431844A (en) | Third party search applications for a search system | |
US20120203782A1 (en) | Method and system for data provenance management in multi-layer systems | |
US10877960B2 (en) | Normalizing user identification across disparate systems | |
CN106570674A (en) | Supply chain information query method and device | |
CN112104726A (en) | Real-time monitoring system for object storage of industrial Internet of things | |
CN112612802B (en) | Real-time data middle station processing method, device and platform | |
CN111008220A (en) | Method and device for dynamic identification of data source, storage medium and electronic device | |
CN116402446B (en) | Cargo management method, device, medium and electronic equipment | |
CN110737662B (en) | Data analysis method, device, server and computer storage medium | |
WO2017173978A1 (en) | Big data exchange method, device, and system | |
CN106649678B (en) | Data processing method and system | |
US20130262662A1 (en) | Methods and systems for smart adapters in a social media content analytics environment | |
CN113780903B (en) | Asset object management method and device, electronic equipment and computer storage medium | |
CN115016821B (en) | Grayscale distribution method, device, computer equipment and storage medium for callback data | |
JP2017004139A (en) | Format conversion management apparatus and format conversion management method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |