CN118278962A - Evaluation method and device for data asset value - Google Patents
Evaluation method and device for data asset value Download PDFInfo
- Publication number
- CN118278962A CN118278962A CN202410359786.XA CN202410359786A CN118278962A CN 118278962 A CN118278962 A CN 118278962A CN 202410359786 A CN202410359786 A CN 202410359786A CN 118278962 A CN118278962 A CN 118278962A
- Authority
- CN
- China
- Prior art keywords
- data
- asset
- data asset
- cost
- evaluated
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0278—Product appraisal
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0201—Market modelling; Market analysis; Collecting market data
- G06Q30/0206—Price or cost determination based on market factors
Landscapes
- Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Engineering & Computer Science (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- Finance (AREA)
- Entrepreneurship & Innovation (AREA)
- Game Theory and Decision Science (AREA)
- Economics (AREA)
- Marketing (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
技术领域Technical Field
本发明涉及数据资产价值评估领域,尤其涉及一种数据资产价值的评估方法和装置。The present invention relates to the field of data asset value assessment, and in particular to a method and device for assessing the value of data assets.
背景技术Background technique
数据资产价值难评估,虽然数据资产价值已经得到了普遍认可,但目前仍缺少跨区域、跨行业的数据定价和评估体系或方法,数据资产价值评估处困境。估值过程主观性较大、缺少量化指标、技术应用程度低,使得数据资产估值结果不准确。The value of data assets is difficult to assess. Although the value of data assets has been widely recognized, there is still a lack of cross-regional and cross-industry data pricing and evaluation systems or methods, and data asset value assessment is in a dilemma. The valuation process is highly subjective, lacks quantitative indicators, and has a low level of technology application, making the data asset valuation results inaccurate.
在实现本发明过程中,申请人发现现有技术中至少存在如下问题:In the process of implementing the present invention, the applicant discovered that there are at least the following problems in the prior art:
现有技术缺乏基于数据资产全生命周期的资产价值评估。Existing technologies lack asset value assessment based on the entire life cycle of data assets.
发明内容Summary of the invention
本发明实施例提供一种数据资产价值的评估方法和装置,解决了现有技术缺乏基于数据资产全生命周期的资产价值评估的问题。The embodiments of the present invention provide a method and device for evaluating the value of data assets, which solve the problem that the prior art lacks asset value evaluation based on the entire life cycle of data assets.
为达上述目的,一方面,本发明实施例提供一种数据资产价值的评估方法,包括:To achieve the above objectives, on the one hand, an embodiment of the present invention provides a method for evaluating the value of data assets, including:
根据预设评估目标,收集待评估数据资产,针对所述待评估数据资产,通过多种数据加工步骤对所述待评估数据资产进行加工处理,得到所述待评估数据资产对应的至少一个数据资产表和数据生产成本;According to the preset evaluation target, data assets to be evaluated are collected, and the data assets to be evaluated are processed through multiple data processing steps to obtain at least one data asset table and data production cost corresponding to the data assets to be evaluated;
针对每个数据资产表,根据血缘传播算法分摊所述待评估数据资产对应的数据生产成本,得到所述数据资产表的成本信息;For each data asset table, the data production cost corresponding to the data asset to be evaluated is apportioned according to the lineage propagation algorithm to obtain the cost information of the data asset table;
针对每个数据资产表,通过层次分析法对所述数据资产表对应的数据管理要素进行评价,得到所述数据资产表的数据管理分数;For each data asset table, the data management elements corresponding to the data asset table are evaluated by the hierarchical analysis method to obtain the data management score of the data asset table;
根据所有数据资产表的数据管理分数和成本信息,评估所述待评估数据资产的资产回报率。The asset return rate of the data asset to be evaluated is evaluated based on the data management scores and cost information of all data asset tables.
进一步地,所述根据预设评估目标,收集待评估数据资产,针对所述待评估数据资产,通过多种数据加工步骤对所述待评估数据资产进行加工处理,得到所述待评估数据资产对应的至少一个数据资产表和数据生产成本,包括:Furthermore, the data assets to be evaluated are collected according to the preset evaluation target, and the data assets to be evaluated are processed through multiple data processing steps to obtain at least one data asset table and data production cost corresponding to the data assets to be evaluated, including:
根据预设评估目标,确定对应的待评估数据资产的范围,根据所述待评估数据资产的范围收集所述待评估数据资产;According to the preset evaluation target, determine the scope of the corresponding data assets to be evaluated, and collect the data assets to be evaluated according to the scope of the data assets to be evaluated;
对所述待评估数据资产进行数据清洗,得到清洗后数据,所述数据清洗包括:去除重复数据和修复数据错误;Performing data cleaning on the data assets to be evaluated to obtain cleaned data, wherein the data cleaning includes: removing duplicate data and repairing data errors;
对不同来源、格式和结构的清洗后数据进行数据整合,按预设的统一数据存储要求进行存储,得到存储数据;Integrate the cleaned data from different sources, formats and structures, store them according to the preset unified data storage requirements, and obtain stored data;
根据所述存储数据的性质、来源和用途对所述存储数据进行分类并标记标签;Classify and label the stored data according to the nature, source and purpose of the stored data;
将所述存储数据中的敏感信息替换为指定替代信息,得到所述存储数据对应的脱敏数据,对所述脱敏数据进行加密;Replacing the sensitive information in the stored data with specified replacement information to obtain desensitized data corresponding to the stored data, and encrypting the desensitized data;
将经脱敏和加密的存储数据按分类存储为至少一个数据资产表;storing the desensitized and encrypted stored data into at least one data asset table by category;
根据从所述待评估数据资产到所述至少一个数据资产表的数据加工步骤收集所述待评估数据资产对应的数据生产成本;Collecting data production costs corresponding to the data assets to be evaluated according to the data processing steps from the data assets to be evaluated to the at least one data asset table;
其中,所述数据生产成本包括:硬件建设成本、软件建设成本、运维成本、资源成本、数据采购成本、数据咨询成本、人力支出成本和场地费用成本。Among them, the data production cost includes: hardware construction cost, software construction cost, operation and maintenance cost, resource cost, data procurement cost, data consulting cost, manpower expenditure cost and site fee cost.
进一步地,所述针对每个数据资产表,根据血缘传播算法分摊所述待评估数据资产对应的数据生产成本,得到所述数据资产表的成本信息,包括:Furthermore, for each data asset table, the data production cost corresponding to the data asset to be evaluated is apportioned according to the lineage propagation algorithm to obtain the cost information of the data asset table, including:
针对每个数据资产表,根据血缘传播算法生成所述数据资产表对应的加工工序血缘关系;所述加工工序血缘关系表示数据加工步骤之间的父子关系;For each data asset table, a processing procedure blood relationship corresponding to the data asset table is generated according to a blood relationship propagation algorithm; the processing procedure blood relationship represents a parent-child relationship between data processing steps;
根据所述数据资产表对应的加工工序血缘关系,将所述待评估数据资产对应的数据生产成本,分摊为用于得到所述数据资产表的各数据加工步骤各自对应的数据生产成本;According to the blood relationship of the processing procedures corresponding to the data asset table, the data production cost corresponding to the data asset to be evaluated is apportioned into the data production costs corresponding to each data processing step used to obtain the data asset table;
根据所述加工工序血缘关系,累计用于得到所述数据资产表的各数据加工步骤各自对应的数据生产成本,得到所述数据资产表的成本信息。According to the blood relationship of the processing procedures, the data production costs corresponding to the respective data processing steps used to obtain the data asset table are accumulated to obtain the cost information of the data asset table.
进一步地,所述针对每个数据资产表,通过层次分析法对所述数据资产表对应的数据管理要素进行评价,得到所述数据资产表的数据管理分数,包括:Furthermore, for each data asset table, the data management elements corresponding to the data asset table are evaluated by the analytic hierarchy process to obtain the data management score of the data asset table, including:
根据所述数据资产表对应的数据管理要素构建层次分析法的判断矩阵;Constructing a judgment matrix of the hierarchical analysis method according to the data management elements corresponding to the data asset table;
根据所述判断矩阵比较所述数据资产表对应的不同数据管理要素之间的相对重要性,确定所述数据资产表对应的各数据管理要素的权重;Comparing the relative importance of different data management elements corresponding to the data asset table according to the judgment matrix, and determining the weight of each data management element corresponding to the data asset table;
对所述判断矩阵的每一列进行归一化处理,计算所述数据资产表对应的各数据管理要素的权重向量;Normalizing each column of the judgment matrix and calculating the weight vector of each data management element corresponding to the data asset table;
根据所述数据资产表对应的各数据管理要素的权重向量计算所述数据资产表对应的各数据管理要素的综合权重;Calculating the comprehensive weight of each data management element corresponding to the data asset table according to the weight vector of each data management element corresponding to the data asset table;
根据数据资产表对应的各数据管理要素及其综合权重,确定所述数据资产表的数据管理分数;Determine the data management score of the data asset table according to the data management elements corresponding to the data asset table and their comprehensive weights;
所述数据管理要素包括基本要素、质量要素和使用要素;The data management elements include basic elements, quality elements and usage elements;
所述基本要素包括数据规模;The basic elements include data size;
所述质量要素包括:准确性、一致性、完整性、规范性、时效性;The quality elements include: accuracy, consistency, completeness, standardization, and timeliness;
所述使用要素包括:可访问度和使用热度。The usage factors include: accessibility and usage popularity.
进一步地,所述根据所有数据资产表的数据管理分数和成本信息,评估所述待评估数据资产的资产回报率,包括:Furthermore, evaluating the asset return rate of the data asset to be evaluated based on the data management scores and cost information of all data asset tables includes:
针对每个数据资产表,将所述数据资产表的数据管理分数和成本信息相加,得到所述数据资产表的总体价值;For each data asset table, add the data management score and cost information of the data asset table to obtain the overall value of the data asset table;
根据所述数据资产表的应用场景和贡献程度,确定所述数据资产表的总体价值对应的权重;Determine the weight corresponding to the overall value of the data asset table according to the application scenario and contribution degree of the data asset table;
根据所有数据资产表的各总体价值对应的权重,对所述所有数据资产表的总体价值进行加权平局,得到所述待评估数据资产的资产回报率。According to the weights corresponding to the overall values of all data asset tables, the overall values of all data asset tables are weighted and averaged to obtain the asset return rate of the data assets to be evaluated.
另一方面,本发明实施例提供一种数据资产价值的评估装置,包括:On the other hand, an embodiment of the present invention provides a data asset value evaluation device, including:
数据资产采集单元,用于根据预设评估目标,收集待评估数据资产,针对所述待评估数据资产,通过多种数据加工步骤对所述待评估数据资产进行加工处理,得到所述待评估数据资产对应的至少一个数据资产表和数据生产成本;A data asset collection unit, configured to collect data assets to be evaluated according to preset evaluation targets, and process the data assets to be evaluated through a plurality of data processing steps to obtain at least one data asset table and data production cost corresponding to the data assets to be evaluated;
数据资产成本传播单元,用于针对每个数据资产表,根据血缘传播算法分摊所述待评估数据资产对应的数据生产成本,得到所述数据资产表的成本信息;A data asset cost propagation unit, for allocating the data production cost corresponding to the data asset to be evaluated according to a lineage propagation algorithm for each data asset table, to obtain cost information of the data asset table;
数据资产管理单元,用于针对每个数据资产表,通过层次分析法对所述数据资产表对应的数据管理要素进行评价,得到所述数据资产表的数据管理分数;A data asset management unit, for evaluating the data management elements corresponding to each data asset table by means of a hierarchical analysis method, to obtain a data management score for the data asset table;
数据资产评估单元,用于根据所有数据资产表的数据管理分数和成本信息,评估所述待评估数据资产的资产回报率。The data asset evaluation unit is used to evaluate the asset return rate of the data asset to be evaluated based on the data management scores and cost information of all data asset tables.
进一步地,所述数据资产采集单元,包括:Furthermore, the data asset collection unit includes:
数据资产收集模块,用于根据预设评估目标,确定对应的待评估数据资产的范围,根据所述待评估数据资产的范围收集所述待评估数据资产;A data asset collection module, used to determine the scope of corresponding data assets to be evaluated according to a preset evaluation target, and collect the data assets to be evaluated according to the scope of the data assets to be evaluated;
数据资产清洗模块,用于对所述待评估数据资产进行数据清洗,得到清洗后数据,所述数据清洗包括:去除重复数据和修复数据错误;A data asset cleaning module is used to clean the data assets to be evaluated to obtain cleaned data, wherein the data cleaning includes: removing duplicate data and repairing data errors;
数据资产存储模块,用于对不同来源、格式和结构的清洗后数据进行数据整合,按预设的统一数据存储要求进行存储,得到存储数据;The data asset storage module is used to integrate cleaned data from different sources, formats and structures, store them according to preset unified data storage requirements, and obtain stored data;
数据资产分类模块,用于根据所述存储数据的性质、来源和用途对所述存储数据进行分类并标记标签;A data asset classification module, used to classify and label the stored data according to the nature, source and purpose of the stored data;
数据资产脱敏加密模块,用于将所述存储数据中的敏感信息替换为指定替代信息,得到所述存储数据对应的脱敏数据,对所述脱敏数据进行加密;A data asset desensitizing and encryption module, used to replace sensitive information in the stored data with specified replacement information, obtain desensitized data corresponding to the stored data, and encrypt the desensitized data;
数据资产表生成模块,用于将经脱敏和加密的存储数据按分类存储为至少一个数据资产表;A data asset table generation module, used to store the desensitized and encrypted storage data into at least one data asset table by category;
数据生产成本收集模块,用于根据从所述待评估数据资产到所述至少一个数据资产表的数据加工步骤收集所述待评估数据资产对应的数据生产成本;A data production cost collection module, configured to collect data production costs corresponding to the data asset to be evaluated according to the data processing steps from the data asset to be evaluated to the at least one data asset table;
其中,所述数据生产成本包括:硬件建设成本、软件建设成本、运维成本、资源成本、数据采购成本、数据咨询成本、人力支出成本和场地费用成本。Among them, the data production cost includes: hardware construction cost, software construction cost, operation and maintenance cost, resource cost, data procurement cost, data consulting cost, manpower expenditure cost and site fee cost.
进一步地,所述数据资产成本传播单元,包括:Furthermore, the data asset cost propagation unit includes:
加工工序血缘关系确定模块,用于针对每个数据资产表,根据血缘传播算法生成所述数据资产表对应的加工工序血缘关系;所述加工工序血缘关系表示数据加工步骤之间的父子关系;A processing procedure blood relationship determination module is used to generate, for each data asset table, a processing procedure blood relationship corresponding to the data asset table according to a blood relationship propagation algorithm; the processing procedure blood relationship represents a parent-child relationship between data processing steps;
成本分摊模块,用于根据所述数据资产表对应的加工工序血缘关系,将所述待评估数据资产对应的数据生产成本,分摊为用于得到所述数据资产表的各数据加工步骤各自对应的数据生产成本;A cost allocation module, for allocating the data production cost corresponding to the data asset to be evaluated into the data production costs corresponding to each data processing step for obtaining the data asset table according to the blood relationship of the processing steps corresponding to the data asset table;
成本信息确定模块,用于根据所述加工工序血缘关系,累计用于得到所述数据资产表的各数据加工步骤各自对应的数据生产成本,得到所述数据资产表的成本信息。The cost information determination module is used to accumulate the data production costs corresponding to each data processing step used to obtain the data asset table according to the processing process blood relationship, so as to obtain the cost information of the data asset table.
进一步地,所述数据资产管理单元,包括:Furthermore, the data asset management unit includes:
判断矩阵构建模块,用于根据所述数据资产表对应的数据管理要素构建层次分析法的判断矩阵;A judgment matrix construction module, used to construct a judgment matrix of the hierarchical analysis method according to the data management elements corresponding to the data asset table;
管理要素权重确定模块,用于根据所述判断矩阵比较所述数据资产表对应的不同数据管理要素之间的相对重要性,确定所述数据资产表对应的各数据管理要素的权重;A management element weight determination module, used to compare the relative importance of different data management elements corresponding to the data asset table according to the judgment matrix, and determine the weight of each data management element corresponding to the data asset table;
权重向量确定模块,用于对所述判断矩阵的每一列进行归一化处理,计算所述数据资产表对应的各数据管理要素的权重向量;A weight vector determination module, used to normalize each column of the judgment matrix and calculate the weight vector of each data management element corresponding to the data asset table;
综合权重确定模块,用于根据所述数据资产表对应的各数据管理要素的权重向量计算所述数据资产表对应的各数据管理要素的综合权重;A comprehensive weight determination module, used to calculate the comprehensive weight of each data management element corresponding to the data asset table according to the weight vector of each data management element corresponding to the data asset table;
数据管理分数确定单元,用于根据数据资产表对应的各数据管理要素及其综合权重,确定所述数据资产表的数据管理分数;A data management score determination unit, used to determine the data management score of the data asset table according to the data management elements corresponding to the data asset table and their comprehensive weights;
所述数据管理要素包括基本要素、质量要素和使用要素;The data management elements include basic elements, quality elements and usage elements;
所述基本要素包括数据规模;The basic elements include data size;
所述质量要素包括:准确性、一致性、完整性、规范性、时效性;The quality elements include: accuracy, consistency, completeness, standardization, and timeliness;
所述使用要素包括:可访问度和使用热度。The usage factors include: accessibility and usage popularity.
进一步地,所述数据资产评估单元,包括:Furthermore, the data asset evaluation unit includes:
总体价值确定模块,用于针对每个数据资产表,将所述数据资产表的数据管理分数和成本信息相加,得到所述数据资产表的总体价值;An overall value determination module, for each data asset table, adding the data management score and cost information of the data asset table to obtain the overall value of the data asset table;
总体价值权重确定模块,用于根据所述数据资产表的应用场景和贡献程度,确定所述数据资产表的总体价值对应的权重;An overall value weight determination module, used to determine the weight corresponding to the overall value of the data asset table according to the application scenario and contribution degree of the data asset table;
资产回报率确定模块,用于根据所有数据资产表的各总体价值对应的权重,对所述所有数据资产表的总体价值进行加权平局,得到所述待评估数据资产的资产回报率。The asset return rate determination module is used to perform weighted averaging on the overall values of all data asset tables according to the weights corresponding to the overall values of all data asset tables to obtain the asset return rate of the data assets to be evaluated.
上述技术方案具有如下有益效果:通过从获取数据资产、经多种数据加工步骤进行加工,再对待评估数据资产对应的数据生产成本进行分摊确定数据资产表的成本信息,通过层次分析法对数据管理要素进行分析,得到所述数据资产表的数据管理分数,结合成本信息和数据管理分数,确定数据资产价值,实现了从数据资产的数据源到使用数据资产的全生命周期的资产价值评估,达到简化复杂的评估步骤,更全面准确的对数据资产的价值进行评估的效果。建立数据资产价值评估指标体系,从数据在不同阶段的影响因素出发,完善数据资产价值评估框架。在拥有大量数据,预期进行数据交易的行业、企业级应用中可以有较广泛的用途。The above technical solution has the following beneficial effects: by acquiring data assets, processing them through multiple data processing steps, and then allocating the data production costs corresponding to the data assets to be evaluated to determine the cost information of the data asset table, analyzing the data management elements through the hierarchical analysis method, and obtaining the data management score of the data asset table, combining the cost information and the data management score to determine the value of the data assets, and realizing the asset value evaluation of the entire life cycle from the data source of the data asset to the use of the data asset, so as to simplify the complex evaluation steps and evaluate the value of the data assets more comprehensively and accurately. Establish a data asset value evaluation indicator system, starting from the influencing factors of data at different stages, and improve the data asset value evaluation framework. It can have a wide range of uses in industries and enterprise-level applications that have large amounts of data and are expected to conduct data transactions.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings required for use in the embodiments or the description of the prior art will be briefly introduced below. Obviously, the drawings described below are only some embodiments of the present invention. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying creative work.
图1是本发明实施例之一的一种数据资产价值的评估方法的流程图;FIG1 is a flow chart of a method for evaluating the value of data assets according to one embodiment of the present invention;
图2是本发明实施例之一的一种数据资产价值的评估装置的架构图;FIG2 is a schematic diagram of a data asset value evaluation device according to one embodiment of the present invention;
图3是本发明实施例之一的图检索算法的检索过程示意图;FIG3 is a schematic diagram of a retrieval process of a graph retrieval algorithm according to one embodiment of the present invention;
图4是本发明实施例之一的数据资产评估模型示意图;FIG4 is a schematic diagram of a data asset evaluation model according to an embodiment of the present invention;
图5是本发明实施例之一的一种数据资产价值的评估方法的另一流程图。FIG. 5 is another flow chart of a method for evaluating the value of data assets according to one embodiment of the present invention.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The following will be combined with the drawings in the embodiments of the present invention to clearly and completely describe the technical solutions in the embodiments of the present invention. Obviously, the described embodiments are only part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of the present invention.
一方面,如图1所示,本发明实施例提供一种数据资产价值的评估方法,包括:On the one hand, as shown in FIG1 , an embodiment of the present invention provides a method for evaluating the value of data assets, including:
步骤S10,根据预设评估目标,收集待评估数据资产,针对所述待评估数据资产,通过多种数据加工步骤对所述待评估数据资产进行加工处理,得到所述待评估数据资产对应的至少一个数据资产表和数据生产成本;Step S10, according to the preset evaluation target, collect the data assets to be evaluated, and process the data assets to be evaluated through multiple data processing steps to obtain at least one data asset table and data production cost corresponding to the data assets to be evaluated;
步骤S11,针对每个数据资产表,根据血缘传播算法分摊所述待评估数据资产对应的数据生产成本,得到所述数据资产表的成本信息;Step S11, for each data asset table, allocating the data production cost corresponding to the data asset to be evaluated according to the lineage propagation algorithm to obtain the cost information of the data asset table;
步骤S12,针对每个数据资产表,通过层次分析法对所述数据资产表对应的数据管理要素进行评价,得到所述数据资产表的数据管理分数;Step S12, for each data asset table, evaluating the data management elements corresponding to the data asset table by using the analytic hierarchy process to obtain the data management score of the data asset table;
步骤S13,根据所有数据资产表的数据管理分数和成本信息,评估所述待评估数据资产的资产回报率。Step S13, evaluating the asset return rate of the data asset to be evaluated based on the data management scores and cost information of all data asset tables.
在一些实施例中,数据全生命周期可以依据从生产到消费的链路分为“采”、“建”、“管”、“用”四个阶段,“采”代表汇聚异构数据的全过程,“建”代表数据治理的完备和连贯,“管”指全量数据资产的治理与管理,“用”则强调数据资产的应用与回报。基于数据全生命周期建立资产价值评估方法,从数据在不同阶段的影响要素出发,完善数据资产价值评估框架。充分考虑数据资产在发挥价值的路径里面的所有对价值因素有意义的因素,对数据资产进行剖析,分类,以及针对不同类型的因素采用不同的技术化手段进行计算,得到了数据资产价值评估的完整且符合数据资产真实属性的如图4所示的数据资产评估模型,采的阶段,整体从采集数据资产(待评估数据资产)和采集数据生产成本(例如数据成本识别和录入)开始,建的阶段,数据资产经加工,以血缘传播算法(血缘图算法)为主导的成本分配“计算器”为基础,管的阶段,根据数据管理的基本要素、质量要素、使用要素进行数据管理评分,用的阶段,结合数据管理评分和数据生产成本,同时考虑非经济因素和经济因素对数据资产的价值进行评估,结合数据资产要素评价体系和数据应用市场价值分析,对数据资产价值进行评估,确定数据资产的回报率。针对不同类型的数据资产,可以采用不同的技术化手段来进行计算和评估其价值,以下是一些常用的技术化手段和对应的因素,定量分析:通过数据的数量和属性进行计算和评估,例如,对于销售数据资产,可以使用数据挖掘和统计分析方法来计算其销售额、增长率等指标;财务分析:通过财务数据进行计算和评估,例如,对于财务数据资产,可以使用财务比率分析、财务模型等方法来评估其价值和潜在收益;风险评估:通过评估数据资产的风险和不确定性来计算其价值,例如,对于市场数据资产,可以使用风险模型和概率分析来评估其市场价值和风险;品牌价值评估:通过评估品牌数据资产对企业价值的影响来计算其价值,例如,可以使用品牌评估模型和市场调研数据来计算品牌价值;技术评估:通过评估技术数据资产的创新性、专利价值等来计算其价值,例如,可以使用专利评估模型和技术竞争分析来评估技术数据资产的价值。以下血缘传播算法的步骤,数据成本识别:首先,需要识别和记录与数据资产相关的成本项目,这包括数据采集、存储、处理、维护等方面的成本,可以通过审查财务记录、与相关团队进行讨论以及分析数据生命周期等方式来确定成本项目;数据成本录入:将识别出的成本项目与相应的数据资产关联,并将成本数据录入计算器中,这可以通过手动录入或与财务系统的集成来完成;血缘图算法:基于血缘关系,使用血缘图算法来追踪数据资产的来源、流向和使用情况,血缘关系可以通过数据标签、元数据、日志记录等方式进行标识和捕获;成本分配计算:根据血缘图算法和成本数据,计算器可以进行成本分配计算,计算器根据数据资产的使用情况、流动路径等因素,将相关成本进行分配和计算,以确定各个数据资产的成本;价值评估:基于计算得到的数据资产成本,可以进行价值评估,这可以通过与数据资产的预期收益、市场价值等进行比较,以确定数据资产的价值。对数据生产成本进行全面梳理和归纳,对数据的各项成本归纳,将数据成本总结为硬件建设成本、软件建设成本、运维成本、资源成本、数据采购成本、数据咨询成本、人力支出、场地费用等方面。血缘传播用于数据成本评估。追踪数据成本,将可收集到的成本全部分摊给数据资产表的根节点。通过数据的血缘传递关系,沿着数据加工血缘传播数据成本。基于已有数据管理评分维度的总结,建立了数据管理评分体系,包括基础属性要素(基本要素)、数据质量要素(质量要素)以及使用要素。数据管理因素主要解释数据研发后的运维和运营情况,采用基础评价指标(对应于基本要素)、数据质量评价指标(对应于质量要素)和使用评价指标(对应于使用要素),并通过层次分析法对数据的非经济因素(数据管理要素)进行评价,从而得到单张数据资产表的数据管理分数。通过分析企业的经营情况得出整体的行业回报率,并将数据管理分数和成本信息拟合成每张表的数据资产回报率。企业的整体资产回报率与所处行业以及该行业的数字化建设能力相关。数据资产评估的成本录入是指将与数据资产评估相关的成本信息记录和管理的过程。它的目的是确保对数据资产评估的成本进行准确、全面和可追溯的记录,以便在需要时进行审计和分析。具体而言,成本录入的主要任务包括以下几个方面,识别和收集成本信息:通过与相关部门和团队协调,识别和收集与数据资产评估相关的成本信息,这些成本可能包括人力资源、硬件设备、软件工具、外部服务等;记录和分类成本:将收集到的成本信息进行记录和分类,这可以通过建立成本分类系统或使用现有的财务系统进行实现,常见的成本分类包括人员成本、设备成本、软件成本、培训成本等;分配成本:根据成本归属原则,将成本分配给相关的数据资产评估项目或活动,这可以根据项目的工作量、资源消耗、时间分配等进行分配;计算和汇总成本:根据成本分类和分配,计算和汇总各项成本,这可以通过使用电子表格或财务系统进行自动计算和汇总;审核和验证成本:定期进行成本审核和验证,确保成本的准确性和可靠性,这可以通过与财务部门合作进行财务审计或使用内部审计方法进行实现;分析和报告成本:根据需求进行成本分析和报告,这可以包括生成成本报告、制定成本预算、进行成本效益分析等,以支持管理决策和优化资源配置。数据资产评估的成本维护是指对数据资产评估过程中所涉及的成本信息进行更新和管理的活动,具体来说,它包括以下几个方面的工作,成本数据收集:收集与数据资产评估相关的所有成本数据,包括硬件设备、软件许可、人力资源、培训等方面的成本信息;成本分类和归档:将收集到的成本数据进行分类和归档,确保每个成本项目都有清晰的标识和分类,以便后续的成本分析和管理;成本更新和调整:根据实际情况,定期更新和调整成本数据,例如,当硬件设备的价格有所变动时,需要相应地更新成本数据;成本分析和报告:利用收集到的成本数据进行分析,评估数据资产的成本效益和投资回报率,同时,生成基于成本数据的报告,向决策者提供有关数据资产评估成本的信息;成本控制和优化:根据成本数据的分析结果,采取相应的控制措施,优化数据资产评估的成本效益,例如,通过节约软件许可费用或优化人力资源配置等方式,降低成本。In some embodiments, the data life cycle can be divided into four stages: "collection", "construction", "management" and "use" based on the link from production to consumption. "Collection" represents the whole process of gathering heterogeneous data, "construction" represents the completeness and coherence of data governance, "management" refers to the governance and management of all data assets, and "use" emphasizes the application and return of data assets. Establish an asset value assessment method based on the data life cycle, starting from the factors affecting data at different stages, and improve the data asset value assessment framework. Taking into full consideration all the factors that are meaningful to the value factors in the path of data assets to realize their value, data assets are analyzed and classified, and different technical means are used to calculate different types of factors. The data asset evaluation model shown in Figure 4 is complete and in line with the true attributes of data assets. In the acquisition stage, the whole starts with the acquisition of data assets (data assets to be evaluated) and the acquisition of data production costs (such as data cost identification and entry). In the construction stage, data assets are processed and based on the cost allocation "calculator" dominated by the bloodline propagation algorithm (bloodline graph algorithm). In the management stage, data management scores are scored according to the basic elements, quality elements, and usage elements of data management. In the use stage, the value of data assets is evaluated by combining data management scores and data production costs, and considering non-economic factors and economic factors. Combined with the data asset factor evaluation system and data application market value analysis, the value of data assets is evaluated to determine the rate of return on data assets. Different technical means can be used to calculate and evaluate the value of different types of data assets. The following are some commonly used technical means and corresponding factors: Quantitative analysis: calculation and evaluation through the quantity and attributes of data. For example, for sales data assets, data mining and statistical analysis methods can be used to calculate its sales, growth rate and other indicators; Financial analysis: calculation and evaluation through financial data. For example, for financial data assets, financial ratio analysis, financial models and other methods can be used to evaluate its value and potential benefits; Risk assessment: calculation of data assets by evaluating their risks and uncertainties. For example, for market data assets, risk models and probability analysis can be used to evaluate their market value and risks; Brand value assessment: calculation of brand data assets by evaluating their impact on corporate value. For example, brand evaluation models and market research data can be used to calculate brand value; Technology assessment: calculation of technology data assets by evaluating their innovation, patent value, etc. For example, patent evaluation models and technology competition analysis can be used to evaluate the value of technology data assets. The following are the steps of the lineage propagation algorithm: Data cost identification: First, it is necessary to identify and record the cost items related to data assets, which include the costs of data collection, storage, processing, maintenance, etc. The cost items can be determined by reviewing financial records, discussing with relevant teams, and analyzing the data life cycle; Data cost entry: Associate the identified cost items with the corresponding data assets, and enter the cost data into the calculator. This can be done by manual entry or integration with the financial system; Lineage graph algorithm: Based on lineage relationships, use the lineage graph algorithm to track the source, flow and usage of data assets. Lineage relationships can be identified and captured through data tags, metadata, log records, etc.; Cost allocation calculation: Based on the lineage graph algorithm and cost data, the calculator can perform cost allocation calculations. The calculator allocates and calculates relevant costs based on factors such as the usage of data assets and flow paths to determine the cost of each data asset; Value assessment: Based on the calculated data asset costs, value assessment can be performed, which can be compared with the expected returns and market value of the data assets to determine the value of the data assets. Comprehensively sort out and summarize the data production costs, summarize the various data costs, and summarize the data costs into hardware construction costs, software construction costs, operation and maintenance costs, resource costs, data procurement costs, data consulting costs, human expenditures, site costs, etc. Lineage transmission is used for data cost assessment. Track data costs and allocate all the collectible costs to the root node of the data asset table. Through the lineage transmission relationship of data, the data costs are propagated along the data processing lineage. Based on the summary of the existing data management scoring dimensions, a data management scoring system was established, including basic attribute elements (basic elements), data quality elements (quality elements) and usage elements. The data management factors mainly explain the operation and operation of data after R&D, using basic evaluation indicators (corresponding to basic elements), data quality evaluation indicators (corresponding to quality elements) and usage evaluation indicators (corresponding to usage elements), and the non-economic factors of data (data management elements) are evaluated through the hierarchical analysis method, so as to obtain the data management score of a single data asset table. The overall industry return rate is obtained by analyzing the business conditions of the enterprise, and the data management score and cost information are fitted into the data asset return rate of each table. The overall return on assets of an enterprise is related to the industry it is in and the digital construction capabilities of the industry. Cost entry of data asset evaluation refers to the process of recording and managing cost information related to data asset evaluation. Its purpose is to ensure that the cost of data asset evaluation is accurately, comprehensively and traceably recorded so that it can be audited and analyzed when necessary. Specifically, the main tasks of cost entry include the following aspects: Identify and collect cost information: identify and collect cost information related to data asset evaluation through coordination with relevant departments and teams. These costs may include human resources, hardware equipment, software tools, external services, etc.; record and classify costs: record and classify the collected cost information, which can be achieved by establishing a cost classification system or using an existing financial system. Common cost classifications include personnel costs, equipment costs, software costs, training costs, etc.; allocate costs: allocate costs to relevant data asset evaluation projects or activities according to the principle of cost attribution, which can be allocated according to the project's workload, resource consumption, time allocation, etc.; calculate and summarize costs: calculate and summarize various costs based on cost classification and allocation, which can be automatically calculated and summarized by using spreadsheets or financial systems; review and verify costs: conduct regular cost audits and verifications to ensure the accuracy and reliability of costs, which can be achieved by cooperating with the financial department to conduct financial audits or using internal audit methods; analyze and report costs: conduct cost analysis and reporting based on needs, which can include generating cost reports, formulating cost budgets, conducting cost-benefit analysis, etc., to support management decisions and optimize resource allocation. Cost maintenance of data asset evaluation refers to the activities of updating and managing the cost information involved in the data asset evaluation process. Specifically, it includes the following aspects of work: Cost data collection: Collect all cost data related to data asset evaluation, including cost information on hardware equipment, software licenses, human resources, training, etc.; Cost classification and archiving: Classify and archive the collected cost data to ensure that each cost item has a clear identification and classification for subsequent cost analysis and management; Cost update and adjustment: Update and adjust cost data regularly according to actual conditions. For example, when the price of hardware equipment changes, the cost data needs to be updated accordingly; Cost analysis and reporting: Use the collected cost data to analyze and evaluate the cost-effectiveness and return on investment of data assets. At the same time, generate reports based on cost data to provide decision makers with information on data asset evaluation costs; Cost control and optimization: Take corresponding control measures based on the analysis results of cost data to optimize the cost-effectiveness of data asset evaluation. For example, reduce costs by saving software licensing fees or optimizing human resource allocation.
在另一些实施例中,如图5所示,“采”、“建”、“管”、“用”四个阶段体现为数据资产盘点、数据资产治理、数据资产运营和数据资产评估四个阶段。数据资产盘点:开展数据资产盘点,优先对具有业务使用价值的重要数据进行盘点。对数据进行分类分级工作,并做好数据资产确权。数据资产治理:建立数据资产管理体系和配套制度、流程。规范开展数据标准检核、数据质量管理、数据安全等活动。数据资产运营:梳理现有数据资产,形成企业级数据资产目录。用户申请消费使用资产,并记录日常管理、会计处理、信息披露所需要的要素信息。数据资产评估:根据价值评估方案,选取数据要素影响因素,构建数据资产价值评估框架,进行数据资产定价与估值。数据资产盘点、数据资产治理和数据资产运营对数据资产评估非常有用。数据资产盘点:在评估数据资产之前,首先需要进行数据资产盘点,这包括收集和记录企业所拥有的所有数据资产,包括结构化数据(如数据库中的数据)和非结构化数据(如文件、文档等),通过数据资产盘点,可以建立一个清晰的数据资产清单,为后续的评估工作提供基础;数据资产治理:数据资产治理是指制定和执行数据管理策略,确保数据的质量、安全和合规性,在数据资产评估中,数据资产治理发挥重要作用,通过进行数据质量评估、数据安全风险评估和合规性评估,可以确定数据资产的可靠性和价值,并为评估提供相关的指标和指导;数据资产运营:数据资产运营是指利用数据资产实现业务目标的过程,在数据资产评估中,数据资产运营可以提供数据使用情况和效益的信息,通过分析数据资产的使用情况和业务价值,可以评估数据资产的回报率和潜在价值,并为决策提供依据。综上所述,数据资产盘点、数据资产治理和数据资产运营对数据资产评估非常有用,它们提供了评估数据资产的基础信息、质量和安全性评估,并帮助确定数据资产的回报率和潜在价值。In other embodiments, as shown in FIG5 , the four stages of “collection”, “construction”, “management” and “use” are embodied in the four stages of data asset inventory, data asset governance, data asset operation and data asset evaluation. Data asset inventory: Conduct data asset inventory, giving priority to important data with business use value. Classify and grade data, and do a good job in data asset confirmation. Data asset governance: Establish a data asset management system and supporting systems and processes. Standardize activities such as data standard verification, data quality management, and data security. Data asset operation: Sort out existing data assets and form an enterprise-level data asset catalog. Users apply for consumption and use of assets, and record the element information required for daily management, accounting processing, and information disclosure. Data asset evaluation: According to the value evaluation plan, select the influencing factors of data elements, build a data asset value evaluation framework, and perform data asset pricing and valuation. Data asset inventory, data asset governance, and data asset operation are very useful for data asset evaluation. Data asset inventory: Before evaluating data assets, you first need to conduct a data asset inventory, which includes collecting and recording all data assets owned by the enterprise, including structured data (such as data in databases) and unstructured data (such as files, documents, etc.). Through data asset inventory, you can establish a clear list of data assets to provide a basis for subsequent evaluation work; Data asset governance: Data asset governance refers to the formulation and implementation of data management strategies to ensure the quality, security and compliance of data. In data asset evaluation, data asset governance plays an important role. By conducting data quality assessments, data security risk assessments and compliance assessments, the reliability and value of data assets can be determined, and relevant indicators and guidance can be provided for the evaluation; Data asset operations: Data asset operations refer to the process of using data assets to achieve business goals. In data asset evaluation, data asset operations can provide information on data usage and benefits. By analyzing the usage and business value of data assets, the return on data assets and potential value can be evaluated, and a basis for decision-making can be provided. In summary, data asset inventory, data asset governance, and data asset operations are very useful for data asset evaluation. They provide basic information, quality, and security assessments for evaluating data assets, and help determine the return on investment and potential value of data assets.
在一些实施例中,在数据资产评估中,“采”、“建”、“管”、“用”四个阶段融合了大数据特有的数据仓库分层处理架构,具体包括以下几个层,数据采集层:这一层主要负责从各种数据源采集数据,其中可能包含结构化数据、非结构化数据、半结构化数据等,这些数据源可能包括各种业务系统、日志文件、第三方数据等;数据预处理层:对采集的原始数据进行清洗、转换、合并等预处理操作,以便下一步的处理,一般会使用ETL(Extract,Transform,Load)工具来完成这一步;数据仓库层:对预处理后的数据进行存储和管理,这一层通常包含一个或多个数据仓库,用于存储历史数据和支持复杂的分析查询;数据分析层:对存储在数据仓库中的数据进行分析和挖掘,提取有价值的信息和知识,这一层通常会使用一些统计分析、数据挖掘、机器学习等方法;数据应用层:将分析结果应用到业务中,如通过报表展示、预警提示、智能推荐等方式为决策提供支持。在整个过程中,我们会对数据的质量、完整性、一致性等进行评估,以确保数据的价值。融合大数据特有的数据仓库分层处理架构后,数据资产评估会有以下几个明显效果,提高数据处理效率:分层处理架构能够将大规模数据分解成更小、更易于管理的部分,这样不仅可以提高数据处理的速度,还可以减少数据处理过程中的错误;增强数据的可用性和可访问性:通过分层处理架构,可以将数据根据其重要性和用途进行分类,这使得用户可以更方便地获得他们需要的数据,从而提高数据的可用性和可访问性;提升数据的质量和准确性:在数据处理过程中,分层处理架构可以有效地防止数据重复,此外,它还可以通过在每个层级进行数据清洗和数据校验,从而提高数据的质量和准确性;更好地保护数据安全:通过分层处理架构,可以对不同层级的数据设置不同的访问权限,从而有效地保护数据的安全;辅助决策制定:对于企业而言,通过大数据特有的数据仓库分层处理架构处理后的数据,能够帮助企业更好地理解自身的业务状况,从而制定出更为科学和合理的决策。In some embodiments, in the data asset assessment, the four stages of "collection", "construction", "management" and "use" integrate the hierarchical processing architecture of the data warehouse unique to big data, which specifically includes the following layers: data collection layer: this layer is mainly responsible for collecting data from various data sources, which may include structured data, unstructured data, semi-structured data, etc. These data sources may include various business systems, log files, third-party data, etc.; data preprocessing layer: cleaning, conversion, merging and other preprocessing operations are performed on the collected raw data for the next step of processing. ETL (Extract, Transform, Load) tools are generally used to complete this step; data warehouse layer: storage and management of preprocessed data. This layer usually includes one or more data warehouses for storing historical data and supporting complex analytical queries; data analysis layer: analysis and mining of data stored in the data warehouse to extract valuable information and knowledge. This layer usually uses some statistical analysis, data mining, machine learning and other methods; data application layer: applying the analysis results to the business, such as providing support for decision-making through report display, early warning prompts, intelligent recommendations, etc. Throughout the process, we will evaluate the quality, integrity, consistency, etc. of the data to ensure the value of the data. After integrating the hierarchical processing architecture of data warehouse unique to big data, data asset assessment will have the following obvious effects, improving data processing efficiency: the hierarchical processing architecture can decompose large-scale data into smaller and more manageable parts, which can not only improve the speed of data processing, but also reduce errors in the data processing process; enhance data availability and accessibility: through the hierarchical processing architecture, data can be classified according to its importance and purpose, which makes it easier for users to obtain the data they need, thereby improving data availability and accessibility; improve data quality and accuracy: during the data processing process, the hierarchical processing architecture can effectively prevent data duplication. In addition, it can also improve data quality and accuracy by performing data cleaning and data verification at each level; better protect data security: through the hierarchical processing architecture, different access permissions can be set for data at different levels, thereby effectively protecting data security; assist in decision making: for enterprises, data processed by the hierarchical processing architecture of data warehouse unique to big data can help enterprises better understand their own business conditions and make more scientific and reasonable decisions.
本发明实施例具有如下技术效果:通过从获取数据资产、经多种数据加工步骤进行加工,再对待评估数据资产对应的数据生产成本进行分摊确定数据资产表的成本信息,通过层次分析法对数据管理要素进行分析,得到所述数据资产表的数据管理分数,结合成本信息和数据管理分数,确定数据资产价值,实现了从数据资产的数据源到使用数据资产的全生命周期的资产价值评估,达到简化复杂的评估步骤,更全面准确的对数据资产的价值进行评估的效果。通过对数据资产的深度刨析,以及对不同技术手段的探究,明确了对企业内部中所处“采、建、管、用”阶段数据表采用“以血缘分摊为核心、管理及交易要素为调节”的核心思路。通过血缘分析视角,完整、清晰的为企业揭示每一项数据从源端直至应用过程中成本价值积累至释放的详细过程。进而帮助企业有针对性地识别、分析高价值、高成本流程或节点,进一步优化企业内部的数据资源配置。通过建立基于资产全生命周期的价值评估指标体系,简化了复杂的价值评估流程,降低了企业进行资产价值评估的技术门槛。通过数据资产价值评估模型,完整、清晰的为企业记录每一项数据从源端直至应用全生命过程中成本价值积累至释放的详细过程。融合了大数据特有的数据仓库分层处理架构和数据血缘特性。完成对数据整体建设成本到单一数据表建设成本的分摊计算。The embodiment of the present invention has the following technical effects: by acquiring data assets, processing them through multiple data processing steps, and then allocating the data production costs corresponding to the data assets to be evaluated to determine the cost information of the data asset table, analyzing the data management elements through the hierarchical analysis method, obtaining the data management score of the data asset table, combining the cost information and the data management score, determining the value of the data asset, and realizing the asset value evaluation of the entire life cycle from the data source of the data asset to the use of the data asset, so as to simplify the complex evaluation steps and more comprehensively and accurately evaluate the value of the data asset. Through the in-depth analysis of data assets and the exploration of different technical means, it is clarified that the core idea of "taking blood relationship allocation as the core and management and transaction elements as the adjustment" is adopted for the data table in the "acquisition, construction, management, and use" stage within the enterprise. Through the perspective of blood relationship analysis, the detailed process of the accumulation and release of the cost value of each data from the source to the application process is completely and clearly revealed to the enterprise. Then it helps enterprises to identify and analyze high-value, high-cost processes or nodes in a targeted manner, and further optimize the data resource configuration within the enterprise. By establishing a value assessment indicator system based on the entire life cycle of assets, the complex value assessment process is simplified and the technical threshold for enterprises to conduct asset value assessment is lowered. Through the data asset value assessment model, the detailed process of cost value accumulation and release of each data from the source to the application of the entire life cycle is completely and clearly recorded for the enterprise. It integrates the data warehouse hierarchical processing architecture and data lineage characteristics unique to big data. Complete the calculation of the allocation of the overall data construction cost to the construction cost of a single data table.
进一步地,所述根据预设评估目标,收集待评估数据资产,针对所述待评估数据资产,通过多种数据加工步骤对所述待评估数据资产进行加工处理,得到所述待评估数据资产对应的至少一个数据资产表和数据生产成本,包括:Furthermore, the data assets to be evaluated are collected according to the preset evaluation target, and the data assets to be evaluated are processed through multiple data processing steps to obtain at least one data asset table and data production cost corresponding to the data assets to be evaluated, including:
根据预设评估目标,确定对应的待评估数据资产的范围,根据所述待评估数据资产的范围收集所述待评估数据资产;According to the preset evaluation target, determine the scope of the corresponding data assets to be evaluated, and collect the data assets to be evaluated according to the scope of the data assets to be evaluated;
对所述待评估数据资产进行数据清洗,得到清洗后数据,所述数据清洗包括:去除重复数据和修复数据错误;Performing data cleaning on the data assets to be evaluated to obtain cleaned data, wherein the data cleaning includes: removing duplicate data and repairing data errors;
对不同来源、格式和结构的清洗后数据进行数据整合,按预设的统一数据存储要求进行存储,得到存储数据;Integrate the cleaned data from different sources, formats and structures, store them according to the preset unified data storage requirements, and obtain stored data;
根据所述存储数据的性质、来源和用途对所述存储数据进行分类并标记标签;Classify and label the stored data according to the nature, source and purpose of the stored data;
将所述存储数据中的敏感信息替换为指定替代信息,得到所述存储数据对应的脱敏数据,对所述脱敏数据进行加密;Replacing the sensitive information in the stored data with specified replacement information to obtain desensitized data corresponding to the stored data, and encrypting the desensitized data;
将经脱敏和加密的存储数据按分类存储为至少一个数据资产表;storing the desensitized and encrypted stored data into at least one data asset table by category;
根据从所述待评估数据资产到所述至少一个数据资产表的数据加工步骤收集所述待评估数据资产对应的数据生产成本;Collecting data production costs corresponding to the data assets to be evaluated according to the data processing steps from the data assets to be evaluated to the at least one data asset table;
其中,所述数据生产成本包括:硬件建设成本、软件建设成本、运维成本、资源成本、数据采购成本、数据咨询成本、人力支出成本和场地费用成本。Among them, the data production cost includes: hardware construction cost, software construction cost, operation and maintenance cost, resource cost, data procurement cost, data consulting cost, manpower expenditure cost and site fee cost.
在一些实施例中,其中,根据预设评估目标,确定对应的待评估数据资产的范围,根据所述待评估数据资产的范围收集所述待评估数据资产,即“采”的阶段,该“采”的阶段具体包括以下一个或多个步骤:确定评估目标、收集数据资产信息、确定评估指标、确定评估目标、评估数据价值、识别数据关联性、设计数据汇聚方案、实施数据汇聚、验证数据汇聚结果和/或更新数据资产价值评估。其中,确定评估目标用于明确评估的目的和范围;例如,确定评估是否仅针对特定的数据资产或整个数据资产库;收集数据资产信息用于收集和整理有关数据资产的详细信息,包括数据源、数据类型、数据量、数据质量等;确定评估指标用于根据评估目标确定评估指标,这些指标可以是数据的商业价值、潜在风险、市场需求等;评估数据价值用于根据所选的评估指标对数据资产进行价值评估,这可以包括使用统计分析方法、市场调研、专家判断等;识别数据关联性用于分析数据资产之间的关联性和相互影响,以确定汇聚数据的必要性和价值;设计数据汇聚方案用于根据评估结果和数据关联性,设计数据汇聚方案。这可以包括确定数据汇聚的方式(例如ETL、API集成等),数据清洗和转换策略等;实施数据汇聚用于根据设计的方案,实施数据汇聚。这可能涉及到数据提取、转换、加载等步骤;验证数据汇聚结果用于验证汇聚后的数据的准确性和完整性,确保数据达到预期要求;更新数据资产价值评估用于根据实际的数据汇聚结果,更新数据资产的价值评估,以反映数据汇聚的影响。“建”阶段的包括以下一个或多个步骤:数据清洗、数据整合、数据分类与标签、数据脱敏与加密、数据质量监控、和/或数据治理策略制定。其中,数据清洗用于对收集到的原始数据进行预处理,包括去除重复数据、修复错误数据、统一数据格式等。这一步骤的目的是确保数据的质量和准确性,为后续的数据分析和应用奠定基础。数据整合用于在清洗完数据后,将来自不同来源、格式和结构的数据进行整合,形成一个统一的数据存储。这需要运用数据集成技术,如ETL(提取、转换、加载)等,以满足后续数据分析和应用的需求。数据分类与标签用于对整合后的数据进行分类,根据数据的性质、来源、用途等将其划分为不同的类别。同时,为数据添加相应的标签,以便于后续的数据管理和应用。数据脱敏与加密用于为了保护数据隐私,需要对数据进行脱敏处理,即将敏感信息替换为指定的替代值。此外,对数据进行加密,以防止数据在传输和存储过程中被非法访问。数据质量监控用于建立数据质量监控机制,对数据资产的质量指标进行持续监测,确保数据资产的准确、完整、及时和可靠。数据治理策略制定用于根据企业的业务需求和数据特点,制定相应的数据治理策略,包括数据管理、数据安全、数据合规等方面的规范。在“建”的过程中,数据资产价值评估的体现主要通过以下几个方面:数据质量:评估数据资产的价值时,需要关注数据的质量,包括数据的准确性、完整性、及时性等。高质量的数据资产更能体现其价值。数据价值挖掘:在数据整合和分类过程中,挖掘数据中的潜在价值,发现数据之间的关联性和规律,为企业的业务决策提供支持。数据安全与合规:评估数据资产的价值时,要考虑数据的安全性和合规性。确保数据资产在合规的前提下,为企业带来价值。数据应用场景:分析数据资产在不同业务场景下的应用潜力,评估其在各场景下的价值贡献。数据生命周期:关注数据从产生到消亡的全过程,分析数据资产在不同阶段的价值变化,以全面评估数据资产的价值。In some embodiments, according to the preset evaluation target, the scope of the corresponding data assets to be evaluated is determined, and the data assets to be evaluated are collected according to the scope of the data assets to be evaluated, that is, the "collection" stage, which specifically includes one or more of the following steps: determining the evaluation target, collecting data asset information, determining the evaluation index, determining the evaluation target, evaluating the data value, identifying data relevance, designing a data aggregation plan, implementing data aggregation, verifying the data aggregation results and/or updating the data asset value evaluation. Among them, determining the evaluation target is used to clarify the purpose and scope of the evaluation; for example, determining whether the evaluation is only for a specific data asset or the entire data asset library; collecting data asset information is used to collect and organize detailed information about the data assets, including data sources, data types, data volume, data quality, etc.; determining the evaluation index is used to determine the evaluation index according to the evaluation target, and these indicators can be the commercial value, potential risks, market demand, etc. of the data; evaluating the data value is used to evaluate the value of the data assets according to the selected evaluation index, which can include using statistical analysis methods, market research, expert judgment, etc.; identifying data relevance is used to analyze the relevance and mutual influence between data assets to determine the necessity and value of aggregating data; designing a data aggregation plan is used to design a data aggregation plan based on the evaluation results and data relevance. This can include determining the data aggregation method (such as ETL, API integration, etc.), data cleaning and transformation strategies, etc.; implementing data aggregation is used to implement data aggregation according to the designed plan. This may involve steps such as data extraction, transformation, and loading; verifying data aggregation results is used to verify the accuracy and completeness of the aggregated data to ensure that the data meets the expected requirements; updating the data asset value assessment is used to update the data asset value assessment based on the actual data aggregation results to reflect the impact of data aggregation. The "construction" stage includes one or more of the following steps: data cleaning, data integration, data classification and labeling, data desensitization and encryption, data quality monitoring, and/or data governance strategy formulation. Among them, data cleaning is used to pre-process the collected raw data, including removing duplicate data, repairing erroneous data, and unifying data formats. The purpose of this step is to ensure the quality and accuracy of the data and lay the foundation for subsequent data analysis and application. Data integration is used to integrate data from different sources, formats, and structures after cleaning the data to form a unified data storage. This requires the use of data integration technologies such as ETL (extraction, transformation, loading) to meet the needs of subsequent data analysis and application. Data classification and labeling are used to classify the integrated data and divide it into different categories according to the nature, source, purpose, etc. of the data. At the same time, corresponding labels are added to the data to facilitate subsequent data management and application. Data desensitization and encryption are used to protect data privacy. Data desensitization is required, that is, sensitive information is replaced with specified alternative values. In addition, data is encrypted to prevent illegal access to data during transmission and storage. Data quality monitoring is used to establish a data quality monitoring mechanism to continuously monitor the quality indicators of data assets to ensure the accuracy, completeness, timeliness and reliability of data assets. Data governance strategy formulation is used to formulate corresponding data governance strategies according to the business needs and data characteristics of the enterprise, including specifications in data management, data security, data compliance, etc. In the process of "building", the value assessment of data assets is mainly reflected in the following aspects: Data quality: When evaluating the value of data assets, it is necessary to pay attention to the quality of data, including the accuracy, completeness, timeliness, etc. of data. High-quality data assets can better reflect their value. Data value mining: In the process of data integration and classification, the potential value in the data is mined, the correlation and rules between data are discovered, and support is provided for the business decision-making of the enterprise. Data security and compliance: When evaluating the value of data assets, data security and compliance must be considered. Ensure that data assets bring value to the enterprise under the premise of compliance. Data application scenarios: Analyze the application potential of data assets in different business scenarios and evaluate their value contribution in each scenario. Data life cycle: Pay attention to the entire process from data generation to extinction, analyze the value changes of data assets at different stages, and comprehensively evaluate the value of data assets.
从数据资产全生命周期出发,数据资产总投入成本(数据生产成本)包括硬件建设成本、软件建设成本、运维成本、资源成本、数据采购成本、数据咨询成本、人力支出、场地费用等方面,覆盖了数据资产采集、购买、存储、计算、管理、应用等环节。各类成本具体成本子项和说明如表1所示。From the perspective of the entire life cycle of data assets, the total investment cost of data assets (data production cost) includes hardware construction costs, software construction costs, operation and maintenance costs, resource costs, data procurement costs, data consulting costs, human resources expenditures, site costs, etc., covering data asset collection, purchase, storage, calculation, management, application and other links. The specific cost sub-items and descriptions of each type of cost are shown in Table 1.
表1数据资产总投入成本要素说明Table 1 Description of the total cost elements of data assets
进一步地,所述针对每个数据资产表,根据血缘传播算法分摊所述待评估数据资产对应的数据生产成本,得到所述数据资产表的成本信息,包括:Furthermore, for each data asset table, the data production cost corresponding to the data asset to be evaluated is apportioned according to the lineage propagation algorithm to obtain the cost information of the data asset table, including:
针对每个数据资产表,根据血缘传播算法生成所述数据资产表对应的加工工序血缘关系;所述加工工序血缘关系表示数据加工步骤之间的父子关系;For each data asset table, a processing procedure blood relationship corresponding to the data asset table is generated according to a blood relationship propagation algorithm; the processing procedure blood relationship represents a parent-child relationship between data processing steps;
根据所述数据资产表对应的加工工序血缘关系,将所述待评估数据资产对应的数据生产成本,分摊为用于得到所述数据资产表的各数据加工步骤各自对应的数据生产成本;According to the blood relationship of the processing procedures corresponding to the data asset table, the data production cost corresponding to the data asset to be evaluated is apportioned into the data production costs corresponding to each data processing step used to obtain the data asset table;
根据所述加工工序血缘关系,累计用于得到所述数据资产表的各数据加工步骤各自对应的数据生产成本,得到所述数据资产表的成本信息。According to the blood relationship of the processing procedures, the data production costs corresponding to the respective data processing steps used to obtain the data asset table are accumulated to obtain the cost information of the data asset table.
在一些实施例中,建的阶段还包括针对每个数据资产表,根据血缘传播算法分摊所述待评估数据资产对应的数据生产成本,得到所述数据资产表的成本信息;追踪数据成本,将可收集到的数据生产成本全部分摊给数据资产表的根节点,通过数据的血缘传递关系,沿着数据加工血缘传播数据成本。根节点是指数据资产血缘传播图中的最顶层节点,也就是整个数据资产血缘关系的起点。在数据资产血缘传播中,根节点代表最初的数据源或者最早的数据生成点。它是整个数据流程的起点,没有其他节点可以追溯到它。通常情况下,根节点可以是原始数据的产生点,比如数据库中的表、文件系统中的文件等。在评估数据成本和分摊成本时,根节点的重要性在于它代表了整个数据流程的初始投入成本。通过将成本从根节点开始沿着数据血缘传播路径逐级分摊,可以更准确地评估每个数据资产的成本,并为每个数据资产分配适当的成本。这样可以更好地了解数据资产的价值和对业务的贡献,帮助做出更明智的决策和资源分配。总结来说,根节点是数据资产血缘传播图中的起点,代表最初的数据源或者最早的数据生成点。在评估数据成本和分摊成本时,根节点起到了重要的作用。In some embodiments, the construction stage also includes allocating the data production cost corresponding to the data asset to be evaluated according to the lineage propagation algorithm for each data asset table to obtain the cost information of the data asset table; tracking the data cost, allocating all the data production costs that can be collected to the root node of the data asset table, and propagating the data cost along the data processing lineage through the lineage transmission relationship of the data. The root node refers to the top-level node in the data asset lineage propagation diagram, that is, the starting point of the entire data asset lineage relationship. In the data asset lineage propagation, the root node represents the initial data source or the earliest data generation point. It is the starting point of the entire data process, and no other node can be traced back to it. Usually, the root node can be the point where the original data is generated, such as a table in a database, a file in a file system, etc. When evaluating data costs and allocating costs, the importance of the root node lies in that it represents the initial investment cost of the entire data process. By allocating the cost step by step along the data lineage propagation path starting from the root node, the cost of each data asset can be more accurately evaluated and an appropriate cost can be assigned to each data asset. This can better understand the value of data assets and their contribution to the business, and help make more informed decisions and resource allocation. In summary, the root node is the starting point in the data asset lineage propagation diagram, representing the initial data source or the earliest data generation point. The root node plays an important role in evaluating data costs and allocating costs.
在一些实施例中,通过数据的血缘传递关系,沿着数据加工血缘传播数据生产成本,可以按照以下步骤进行,确定数据资产的根节点:找到数据加工流程中的起始节点,即数据的来源或输入点,这可以是源系统、外部数据源或其他数据集;了解数据加工流程:详细了解数据在加工过程中的各个步骤和转换,包括数据清洗、转换、整合等;根据血缘传播关系标识影响数据成本的因素:在每个数据加工步骤中,确定对数据成本产生影响的因素,这可能包括计算资源、存储空间、人工成本等;跟踪数据血缘传播路径:通过记录数据加工流程中每个步骤的输入和输出关系,建立数据血缘传播路径,这可以通过元数据管理系统或数据血缘分析工具来实现;计算数据成本:根据每个数据加工步骤中确定的成本因素,按照数据血缘传播路径计算数据成本。这可以是一个简单的加总,也可以是根据不同因素进行权重计算;分析数据成本:根据计算得到的数据成本,进行评估和分析,可以识别出高成本的数据加工步骤,找到优化的机会,并提出相应的改进措施。In some embodiments, the data production cost can be propagated along the data processing lineage through the data lineage transmission relationship, and the root node of the data asset can be determined by the following steps: find the starting node in the data processing process, that is, the source or input point of the data, which can be the source system, external data source or other data set; understand the data processing process: understand the various steps and transformations of the data in the processing process in detail, including data cleaning, conversion, integration, etc.; identify the factors affecting the data cost according to the lineage transmission relationship: in each data processing step, determine the factors that affect the data cost, which may include computing resources, storage space, labor costs, etc.; track the data lineage transmission path: establish the data lineage transmission path by recording the input and output relationship of each step in the data processing process, which can be achieved through the metadata management system or data lineage analysis tool; calculate the data cost: calculate the data cost according to the data lineage transmission path based on the cost factors determined in each data processing step. This can be a simple summation or a weighted calculation based on different factors; analyze the data cost: evaluate and analyze the calculated data cost, identify the high-cost data processing steps, find optimization opportunities, and propose corresponding improvement measures.
血缘传播算法是基于图检索算法的一种算法,图检索算法是图算法中的一个领域,它采用树理论对图进行搜索,在找到终点后回溯这一分支,最后获得从起点到终点的路径,图检索算法如图3所示,左侧是图数据结构,右侧是基于左侧的图数据结构的检索路径,本发明采用广度优先搜索,这种算法层层推进,将根节点为起点,先找到距离该起点最近的下一节点(每个节点代表一个数据加工步骤),然后从内到外到达下一加工层,在下一加工层搜索记录路径,直至跳出该层。The bloodline propagation algorithm is an algorithm based on the graph retrieval algorithm, which is a field in graph algorithms. It uses tree theory to search the graph, traces back to the branch after finding the end point, and finally obtains the path from the starting point to the end point. The graph retrieval algorithm is shown in Figure 3. The left side is the graph data structure, and the right side is the retrieval path based on the graph data structure on the left. The present invention adopts breadth-first search. This algorithm advances layer by layer, taking the root node as the starting point, first finding the next node closest to the starting point (each node represents a data processing step), and then reaching the next processing layer from the inside to the outside, searching for the record path in the next processing layer until jumping out of the layer.
将收集到的成本分摊给数据资产表,可以按照以下步骤进行,确定数据资产表:首先,确定需要评估成本的数据资产表。这可以是数据库表、数据文件、数据集等。收集相关成本:收集与数据资产表相关的成本信息。这包括数据采集、存储、处理、维护等方面的成本。确定成本分摊因素:确定用于分摊成本的因素。这可以是数据资产表的行数、数据量、使用频率等。计算成本分摊比例:根据确定的分摊因素,计算每个数据资产表的成本分摊比例。可以使用简单的比例计算方法,将总成本按比例分配给各个数据资产表。将成本分摊给数据资产表:根据计算得到的成本分摊比例,将对应的成本分摊到每个数据资产表上。可以记录分摊后的成本信息,以便后续管理和分析。定期更新成本分摊:随着时间的推移,成本可能会发生变化。因此,需要定期更新成本分摊,以反映最新的成本情况。To allocate the collected costs to the data asset tables, you can follow the following steps to determine the data asset tables: First, determine the data asset tables that need to be evaluated for costs. This can be a database table, data file, data set, etc. Collect relevant costs: Collect cost information related to the data asset table. This includes costs for data collection, storage, processing, maintenance, etc. Determine cost allocation factors: Determine the factors used to allocate costs. This can be the number of rows, data volume, frequency of use, etc. of the data asset table. Calculate cost allocation ratio: Based on the determined allocation factors, calculate the cost allocation ratio for each data asset table. A simple ratio calculation method can be used to allocate the total cost to each data asset table in proportion. Allocate costs to data asset tables: Based on the calculated cost allocation ratio, allocate the corresponding costs to each data asset table. The allocated cost information can be recorded for subsequent management and analysis. Update cost allocation regularly: Costs may change over time. Therefore, cost allocation needs to be updated regularly to reflect the latest cost situation.
追踪数据成本并评估数据资产的血缘传播,具体可以包括:确定数据资产的血缘关系:首先,确定数据资产之间的血缘关系,即数据的来源、传递和消费关系。这可以通过分析数据流、数据转换和数据存储等过程来实现。标记数据资产的成本信息:为每个数据资产和相关的血缘关系,标记相应的成本信息。成本信息可以包括数据采集、数据传输、数据存储、数据处理等环节的成本数据。记录和更新成本信息:建立一个成本管理系统或数据库,记录和更新数据资产的成本信息。确保成本信息与实际情况保持同步,并及时更新。追踪数据成本的传播:通过分析数据资产之间的血缘关系和成本信息,可以追踪数据成本的传播路径。从数据的源头开始,沿着血缘关系链追踪数据的消费和相关成本。评估数据成本:根据数据成本的传播路径和相关成本信息,对数据资产的成本进行评估。可以计算数据资产在每个环节的成本,或者根据所需指标对数据资产进行整体成本评估。Tracking data costs and evaluating the lineage propagation of data assets can specifically include: Determining the lineage of data assets: First, determine the lineage between data assets, that is, the source, transmission and consumption relationship of data. This can be achieved by analyzing processes such as data flow, data conversion and data storage. Marking the cost information of data assets: For each data asset and related lineage, mark the corresponding cost information. Cost information can include cost data for data collection, data transmission, data storage, data processing and other links. Record and update cost information: Establish a cost management system or database to record and update the cost information of data assets. Ensure that the cost information is synchronized with the actual situation and updated in a timely manner. Tracking the propagation of data costs: By analyzing the lineage and cost information between data assets, the propagation path of data costs can be tracked. Starting from the source of the data, the consumption and related costs of data are tracked along the lineage chain. Evaluating data costs: Evaluate the cost of data assets based on the propagation path of data costs and related cost information. The cost of data assets at each link can be calculated, or the overall cost of data assets can be evaluated based on the required indicators.
在另一些实施例中,图检索算法可以应用到数据资产评估的过程中,帮助我们处理具体的资产数据。具体的处理步骤如下:构建数据资产图:将所有的数据资产以节点的形式表示,节点之间的关系以边的形式表示。这个数据资产图可以包括数据表、数据库、文件等。定义评估指标:根据评估的需求,确定评估指标,比如数据价值、数据可靠性、数据用途等。这些指标可以作为节点的属性。应用图检索算法:根据需要的评估指标,选择合适的图检索算法来搜索和遍历数据资产图。常用的图检索算法包括深度优先搜索(DFS)、广度优先搜索(BFS)、Dijkstra算法、最小生成树算法等。处理资产数据:在图检索的过程中,根据算法的结果,可以对具体的资产数据进行处理。比如计算数据价值、评估数据可靠性、确定数据用途等。输出评估结果:根据处理后的资产数据,生成评估结果报告,包括各个资产的评估指标值,以及整体的评估结论。通过应用图检索算法,我们可以从数据资产图中找到与评估指标相关的数据资产,并对其进行处理和评估,进而得到数据资产的评估结果。这样可以帮助我们更好地理解数据资产的价值和可用性,从而做出更好的决策。在图检索算法中,"表"通常指的是数据的存储形式,也可以理解为数据库中的表。在数据成本中的各个方面与表之间存在一定的关系。硬件建设成本:与表的存储相关,包括服务器、存储设备等硬件设备的购买和配置成本。软件建设成本:与表的数据库系统相关,包括数据库软件的购买和开发成本。运维成本:与表的维护和管理相关,包括数据库管理员的工资、服务器维护费用等。资源成本:与表的访问和使用相关,包括服务器资源的使用费用、网络带宽费用等。数据采购成本:与表的数据来源相关,包括从外部获取数据的成本。数据咨询成本:与表的数据分析和决策相关,包括咨询服务的费用。人力支出:与表的维护、开发、分析等相关,包括人员工资、培训等费用。具体的"表"指的是数据库中的数据表,它保存着特定类型的数据。多个表之间可以通过关联键(如主键和外键)建立关系,这样可以进行表之间的数据关联和查询。表的存储组织通常由数据库管理系统自动管理,可以使用不同的存储引擎来实现不同的存储机制(如B树、哈希等)。通过对表的存储组织和数据内容的管理,我们可以使用图检索算法从表中检索和分析数据。In other embodiments, the graph retrieval algorithm can be applied to the process of data asset evaluation to help us process specific asset data. The specific processing steps are as follows: Construct a data asset graph: All data assets are represented in the form of nodes, and the relationship between nodes is represented in the form of edges. This data asset graph may include data tables, databases, files, etc. Define evaluation indicators: According to the evaluation requirements, determine the evaluation indicators, such as data value, data reliability, data usage, etc. These indicators can be used as attributes of nodes. Apply graph retrieval algorithms: According to the required evaluation indicators, select appropriate graph retrieval algorithms to search and traverse the data asset graph. Commonly used graph retrieval algorithms include depth-first search (DFS), breadth-first search (BFS), Dijkstra algorithm, minimum spanning tree algorithm, etc. Process asset data: In the process of graph retrieval, specific asset data can be processed according to the results of the algorithm. For example, calculate data value, evaluate data reliability, determine data usage, etc. Output evaluation results: Generate an evaluation result report based on the processed asset data, including the evaluation indicator values of each asset and the overall evaluation conclusion. By applying graph retrieval algorithms, we can find data assets related to evaluation indicators from the data asset graph, process and evaluate them, and then obtain the evaluation results of data assets. This can help us better understand the value and availability of data assets and make better decisions. In graph retrieval algorithms, "table" usually refers to the storage form of data, which can also be understood as a table in a database. There is a certain relationship between various aspects of data cost and tables. Hardware construction cost: related to the storage of the table, including the purchase and configuration costs of hardware devices such as servers and storage devices. Software construction cost: related to the database system of the table, including the purchase and development costs of database software. Operation and maintenance cost: related to the maintenance and management of the table, including the salary of the database administrator, server maintenance costs, etc. Resource cost: related to the access and use of the table, including the use fee of server resources, network bandwidth fees, etc. Data procurement cost: related to the data source of the table, including the cost of obtaining data from the outside. Data consulting cost: related to the data analysis and decision-making of the table, including the cost of consulting services. Human resource expenditure: related to the maintenance, development, analysis, etc. of the table, including staff wages, training and other expenses. Specifically, "table" refers to a data table in a database, which stores a specific type of data. Multiple tables can be related through association keys (such as primary keys and foreign keys), so that data between tables can be associated and queried. The storage organization of the table is usually automatically managed by the database management system, and different storage engines can be used to implement different storage mechanisms (such as B-tree, hash, etc.). By managing the storage organization and data content of the table, we can use graph retrieval algorithms to retrieve and analyze data from the table.
在一些实施例中,数据的血缘传递关系可以通过以下几种方式获得,元数据管理系统:使用专门的元数据管理系统可以追踪和记录数据资产的血缘关系。这些系统可以跟踪数据的来源、转换和使用情况,并构建出完整的血缘传递关系图。数据采集工具:在数据采集过程中,可以通过使用数据采集工具来自动记录数据的血缘关系。这些工具可以捕获数据的来源、目标和转换过程,并将其关联起来形成血缘传递关系。数据库日志:一些数据库系统可以记录数据的操作日志,包括数据的读取、写入和修改等操作。通过分析这些数据库日志,可以还原数据的血缘传递关系。人工记录:在缺乏自动化工具的情况下,人工记录数据的血缘关系也是一种可行的方式。通过与相关人员交流和调查,可以了解数据的来源和使用情况,并手动绘制血缘传递关系图。In some embodiments, the lineage transmission relationship of data can be obtained in the following ways: Metadata management system: The lineage relationship of data assets can be tracked and recorded using a dedicated metadata management system. These systems can track the source, conversion, and usage of data, and build a complete lineage transmission relationship diagram. Data collection tools: During the data collection process, the lineage relationship of data can be automatically recorded by using data collection tools. These tools can capture the source, target, and conversion process of data, and associate them to form a lineage transmission relationship. Database log: Some database systems can record data operation logs, including operations such as reading, writing, and modifying data. By analyzing these database logs, the lineage transmission relationship of data can be restored. Manual recording: In the absence of automated tools, manually recording the lineage relationship of data is also a feasible way. By communicating and investigating with relevant personnel, you can understand the source and usage of the data and manually draw a lineage transmission relationship diagram.
在一些实施例中,完成数据整体建设成本到单一数据表建设成本的分摊计算,可以按照以下步骤进行,确定数据建设总成本(待评估数据资产对应的数据生产成本):数据建设总成本通常包括硬件设施投资、软件开发和购买、人力成本(包括数据采集、清洗、整理、维护等)等;计算每个数据表的权重:数据表的权重可以根据其大小(如记录数量)、其复杂性(如字段数量和类型)和其使用频率等因素来定义;分摊成本:可以根据每个数据表的权重,将总成本分摊到每个数据表上,即每个数据表的成本=数据建设总成本x该表权重;数据表成本=数据建设总成本x数据表权重,这种方法是一种近似方法,并且其精度取决于如何定义和计算权重。In some embodiments, the allocation calculation of the overall data construction cost to the construction cost of a single data table can be completed by following the steps below to determine the total data construction cost (the data production cost corresponding to the data asset to be evaluated): the total data construction cost usually includes hardware facility investment, software development and purchase, labor costs (including data collection, cleaning, organization, maintenance, etc.), etc.; calculate the weight of each data table: the weight of a data table can be defined based on factors such as its size (such as the number of records), its complexity (such as the number and type of fields), and its frequency of use; allocate costs: the total cost can be allocated to each data table based on the weight of each data table, that is, the cost of each data table = total data construction cost x the weight of the table; data table cost = total data construction cost x data table weight. This method is an approximate method, and its accuracy depends on how the weights are defined and calculated.
进一步地,所述针对每个数据资产表,通过层次分析法对所述数据资产表对应的数据管理要素进行评价,得到所述数据资产表的数据管理分数,包括:Furthermore, for each data asset table, the data management elements corresponding to the data asset table are evaluated by the analytic hierarchy process to obtain the data management score of the data asset table, including:
根据所述数据资产表对应的数据管理要素构建层次分析法的判断矩阵;Constructing a judgment matrix of the hierarchical analysis method according to the data management elements corresponding to the data asset table;
根据所述判断矩阵比较所述数据资产表对应的不同数据管理要素之间的相对重要性,确定所述数据资产表对应的各数据管理要素的权重;Comparing the relative importance of different data management elements corresponding to the data asset table according to the judgment matrix, and determining the weight of each data management element corresponding to the data asset table;
对所述判断矩阵的每一列进行归一化处理,计算所述数据资产表对应的各数据管理要素的权重向量;Normalizing each column of the judgment matrix and calculating the weight vector of each data management element corresponding to the data asset table;
根据所述数据资产表对应的各数据管理要素的权重向量计算所述数据资产表对应的各数据管理要素的综合权重;Calculating the comprehensive weight of each data management element corresponding to the data asset table according to the weight vector of each data management element corresponding to the data asset table;
根据数据资产表对应的各数据管理要素及其综合权重,确定所述数据资产表的数据管理分数;Determine the data management score of the data asset table according to the data management elements corresponding to the data asset table and their comprehensive weights;
所述数据管理要素包括基本要素、质量要素和使用要素;The data management elements include basic elements, quality elements and usage elements;
所述基本要素包括数据规模;The basic elements include data size;
所述质量要素包括:准确性、一致性、完整性、规范性、时效性;The quality elements include: accuracy, consistency, completeness, standardization, and timeliness;
所述使用要素包括:可访问度和使用热度。The usage factors include: accessibility and usage popularity.
层次分析法(Analytic Hierarchy Process,简称AHP)是一种用于多准则决策的方法,它能够帮助我们对不同因素进行权重排序。以下是层次分析法的具体步骤,确定层次结构:首先,我们需要确定决策问题的层次结构,即将问题分解成不同的因素层次,在数据资产评估中,我们可以将数据质量评价指标划分为准确性、一致性、完整性、规范性和时效性;构建判断矩阵:接下来,我们需要构建一个判断矩阵,用于比较不同因素之间的相对重要性,判断矩阵是一个方阵,其中每个元素表示各个因素之间的相对重要性比较,在判断矩阵中,我们需要对每个因素进行两两比较,根据其重要性给出相对权重,通常采用1到9的比较尺度进行评分;计算权重向量:在判断矩阵中,需要对每一列进行归一化处理,然后计算每个因素的权重向量,归一化处理可以通过将每个元素除以列之和来实现,最后,计算每行的平均值,得到每个因素的权重向量;一致性检验:为了确保判断矩阵的一致性,我们需要计算一致性指标(Consistency Index,简称CI)和一致性比率(Consistency Ratio,简称CR)。如果CR小于0.1,则认为判断矩阵是一致的,否则需要对比较矩阵进行调整;综合权重排序:根据每个因素的权重向量,可以计算出综合权重,用于对不同因素进行排序,综合权重可以通过对每个因素的权重向量进行加权求和得到。通过以上步骤,可以使用层次分析法对数据质量评价指标进行排序,并确定其相对重要性。这样可以在数据资产评估中进行准确的决策和优先排序。在数据资产评估中,非经济因素指的是与数据管理相关的因素,但不涉及经济方面的成本或收益。这些因素通常包括以下几个方面,数据质量:数据的准确性、完整性、一致性和可靠性等因素;数据安全性:保护数据免受未经授权的访问、修改、删除或泄露的能力;数据可用性:确保数据在需要时可随时访问和使用的能力;数据治理:数据管理的规则、流程和机制,以确保数据在组织内被正确管理和使用;数据合规性:确保数据管理符合法规、法律和行业标准的要求。The Analytic Hierarchy Process (AHP) is a method for multi-criteria decision-making, which can help us rank the weights of different factors. The following are the specific steps of the hierarchical analysis method. Determine the hierarchical structure: First, we need to determine the hierarchical structure of the decision-making problem, that is, decompose the problem into different factor levels. In data asset evaluation, we can divide the data quality evaluation indicators into accuracy, consistency, completeness, standardization and timeliness; Construct a judgment matrix: Next, we need to construct a judgment matrix to compare the relative importance of different factors. The judgment matrix is a square matrix in which each element represents the relative importance comparison of each factor. In the judgment matrix, we need to compare each factor pairwise and give a relative weight according to its importance. Usually, a comparison scale of 1 to 9 is used for scoring; Calculate the weight vector: In the judgment matrix, each column needs to be normalized, and then the weight vector of each factor is calculated. Normalization can be achieved by dividing each element by the sum of the columns. Finally, the average value of each row is calculated to obtain the weight vector of each factor; Consistency test: In order to ensure the consistency of the judgment matrix, we need to calculate the consistency index (Consistency Index, referred to as CI) and the consistency ratio (Consistency Ratio, referred to as CR). If CR is less than 0.1, the judgment matrix is considered consistent, otherwise the comparison matrix needs to be adjusted; Comprehensive weight sorting: According to the weight vector of each factor, the comprehensive weight can be calculated for sorting different factors. The comprehensive weight can be obtained by weighted summation of the weight vector of each factor. Through the above steps, the hierarchical analysis method can be used to sort the data quality evaluation indicators and determine their relative importance. This allows accurate decision-making and prioritization in data asset evaluation. In data asset evaluation, non-economic factors refer to factors related to data management, but do not involve economic costs or benefits. These factors usually include the following aspects: data quality: factors such as data accuracy, completeness, consistency and reliability; data security: the ability to protect data from unauthorized access, modification, deletion or leakage; data availability: the ability to ensure that data is accessible and available at any time when needed; data governance: the rules, processes and mechanisms of data management to ensure that data is properly managed and used within the organization; data compliance: ensuring that data management complies with regulations, laws and industry standards.
在一些实施例中,关于数据资产投入的软件建设成本、数据咨询成本、资源成本、硬件建设成本和运维成本之间的关系,也可以通过层次分析法进行评价。层次分析法是一种定量分析方法,用于比较和评估多个因素之间的相对重要性。在这种评价体系中,可以将软件建设成本、数据咨询成本、资源成本、硬件建设成本和运维成本视为经济因素,并使用层次分析法来确定它们之间的相对权重或重要性。通过对不同成本因素进行比较和评估,可以确定哪些因素在数据资产评估中具有较高的权重,从而指导决策者在资源配置和投资决策中做出更明智的选择。需要注意的是,经济因素和非经济因素在数据资产评估中是相互关联的。例如,数据质量和数据安全性的提升可能需要投入更多的资源和技术,这将涉及到经济成本。因此,在进行数据资产评估时,经济因素和非经济因素应该综合考虑,以实现数据管理的综合最优化。In some embodiments, the relationship between the software construction cost, data consulting cost, resource cost, hardware construction cost and operation and maintenance cost of data asset investment can also be evaluated by the analytic hierarchy process. The analytic hierarchy process is a quantitative analysis method used to compare and evaluate the relative importance of multiple factors. In this evaluation system, the software construction cost, data consulting cost, resource cost, hardware construction cost and operation and maintenance cost can be regarded as economic factors, and the analytic hierarchy process is used to determine the relative weight or importance between them. By comparing and evaluating different cost factors, it can be determined which factors have a higher weight in the data asset evaluation, thereby guiding decision makers to make more wise choices in resource allocation and investment decisions. It should be noted that economic factors and non-economic factors are interrelated in data asset evaluation. For example, the improvement of data quality and data security may require more resources and technology, which will involve economic costs. Therefore, when conducting data asset evaluation, economic factors and non-economic factors should be considered comprehensively to achieve comprehensive optimization of data management.
在一些实施例中,数据要素价值评估主要分为两个步骤:一是明确需要评估价值的数据要素,二是根据合适的方法进行估值计算。企业在具体选择数据管理要素指标集合时,主要受两大因素影响:一方面应结合企业当前的数据基础设施的成熟水平,确保所选取的要素指标均可被计算;另一方面,应结合企业当前的经营状况和行业属性,选取与数据资产价值高度关联的评价指标。In some embodiments, the data element value assessment is mainly divided into two steps: one is to identify the data elements that need to be assessed, and the other is to perform valuation calculations based on appropriate methods. When enterprises specifically select a set of data management element indicators, they are mainly affected by two factors: on the one hand, they should combine the maturity level of the enterprise's current data infrastructure to ensure that all selected element indicators can be calculated; on the other hand, they should combine the enterprise's current operating conditions and industry attributes to select evaluation indicators that are highly correlated with the value of data assets.
计算数据资产总投入成本(待评估数据资产对应的数据生产成本)选择要素指标的依据是根据具体环境和需求来确定,以下是一些常见的指标选择要素,成本分析目的:确定成本分析的目的是关键,例如确定投资回报率、成本效益、成本控制等;数据资产类型:不同类型的数据资产可能会有不同的成本要素指标,例如,对于硬件资产,可以考虑购买成本、维护费用等。对于软件资产,则可以考虑许可费用、更新和升级成本等;组织需求:根据组织的需求和优先级来选择适合的指标,例如,如果组织非常关注成本效益,可以选择与效益相关的指标,如成本节约比例或投资回报率;行业标准:参考行业标准和最佳实践来选择指标,行业标准可以提供一些常用的指标选择参考,确保选择的指标与其他同行组织相一致,以便进行比较和评估;数据可用性:考虑数据的可用性和可靠性,选择具有可靠数据源支持的指标,以确保成本分析的准确性和可信度。选择与数据资产价值高度关联的评估指标,需要根据企业的经营状况和行业特性来定制,以下是一种可能的方法,业务价值:根据数据在业务流程中的应用程度,以及数据对业务决策的影响程度评估;稀有性:根据数据的获取难度和唯一性评估;完整性:根据数据内容的完整程度评估;准确性:根据数据的质量评估,包括数据的准确性、一致性、及时性等;合规性:根据数据的合规性评估,包括数据的安全性、隐私性、合规性等。结合企业的经营状况可以考虑如下因素,对于销售为主的企业,销售数据、客户数据可能具有较高的价值;对于科技型企业,研发数据、专利数据可能具有较高的价值;行业属性也会影响数据的价值,例如:在金融行业,交易数据、信用数据可能具有较高的价值,在医疗行业,患者数据、药品数据可能具有较高的价值。以上所有因素都可以作为评估指标的参考,具体的指标需要根据企业的具体情况来定制。The basis for selecting factor indicators for calculating the total input cost of data assets (the data production cost corresponding to the data assets to be evaluated) is determined according to the specific environment and needs. The following are some common indicator selection factors: Purpose of cost analysis: It is key to determine the purpose of cost analysis, such as determining the return on investment, cost-effectiveness, cost control, etc.; Type of data assets: Different types of data assets may have different cost factor indicators. For example, for hardware assets, purchase costs, maintenance costs, etc. can be considered. For software assets, license fees, update and upgrade costs, etc. can be considered; Organizational needs: Select appropriate indicators based on the needs and priorities of the organization. For example, if the organization is very concerned about cost-effectiveness, it can select benefit-related indicators such as cost savings ratio or return on investment; Industry standards: Refer to industry standards and best practices to select indicators. Industry standards can provide some commonly used indicator selection references to ensure that the selected indicators are consistent with other peer organizations for comparison and evaluation; Data availability: Consider the availability and reliability of data, and select indicators supported by reliable data sources to ensure the accuracy and credibility of cost analysis. The selection of evaluation indicators that are highly correlated with the value of data assets needs to be customized according to the business conditions and industry characteristics of the enterprise. The following is a possible method: Business value: based on the degree of application of data in business processes and the degree of impact of data on business decisions; Rarity: based on the difficulty and uniqueness of data acquisition; Completeness: based on the completeness of data content; Accuracy: based on the quality of data, including data accuracy, consistency, timeliness, etc.; Compliance: based on the compliance of data, including data security, privacy, compliance, etc. The following factors can be considered in combination with the business conditions of the enterprise. For sales-oriented enterprises, sales data and customer data may have higher value; for technology-based enterprises, R&D data and patent data may have higher value; Industry attributes will also affect the value of data. For example, in the financial industry, transaction data and credit data may have higher value, and in the medical industry, patient data and drug data may have higher value. All of the above factors can be used as references for evaluation indicators, and specific indicators need to be customized according to the specific situation of the enterprise.
表2为数据管理要素评价体系表Table 2 is the data management element evaluation system table
表2数据管理要素评价体系表Table 2 Data management element evaluation system
表2的数据要素评价体系的表格记录内容用于评估和比较不同数据管理要素的性能和价值,包含了各个数据管理要素的指标和相关数据,用于帮助决策者做出合理的决策。使用这些表格记录时,可以按照不同的指标对数据管理要素进行打分和排序。根据具体的评价体系,每个指标可以有不同的权重和评分标准。评估者可以根据自己的需求和偏好,将不同指标的权重进行调整,以反映其对不同指标的重视程度。然后,根据每个数据管理要素在各个指标上的得分,计算出综合得分,用于比较和决策。这些表格记录的内容还可以用于数据管理要素的跟踪和监控。通过周期性地更新数据和重新评估指标,可以跟踪数据管理要素的变化和改进,并及时调整评估结果和决策。The table records of the data element evaluation system in Table 2 are used to evaluate and compare the performance and value of different data management elements. They contain the indicators and related data of each data management element to help decision makers make reasonable decisions. When using these table records, data management elements can be scored and ranked according to different indicators. Depending on the specific evaluation system, each indicator can have different weights and scoring criteria. Evaluators can adjust the weights of different indicators according to their needs and preferences to reflect the importance they attach to different indicators. Then, based on the scores of each data management element on each indicator, a comprehensive score is calculated for comparison and decision-making. The contents of these table records can also be used to track and monitor data management elements. By periodically updating data and re-evaluating indicators, changes and improvements in data management elements can be tracked, and evaluation results and decisions can be adjusted in a timely manner.
在一些实施例中,所述针对每个数据资产表,通过层次分析法对所述数据资产表对应的数据管理要素进行评价,得到所述数据资产表的数据管理分数,即管的阶段,在“管”的阶段中,数据资产价值评估的主要目标是确保全量数据资产得到有效的治理和管理,以实现数据资产的最大化价值。“管”的阶段可以包括以下一个或多个步骤,数据梳理用于对数据源进行整合和梳理,确保数据质量、准确性、及时性和完整性,这有助于提高数据资产的价值;数据分类与标签用于根据数据的特性和业务需求,对数据进行分类和打标签,这有助于提高数据的可发现性和易用性,从而增加数据资产的价值;数据治理用于实施数据治理策略,包括数据质量管理、数据安全管理、数据隐私保护和数据合规性检查,这有助于确保数据资产的可靠性和合规性,进一步提升数据资产价值;数据价值挖掘用于运用数据分析、数据挖掘等技术,发掘数据资产中的潜在价值和关联性,这有助于发现新的业务机会,提高数据资产的实用价值;数据资产目录用于建立和完善数据资产目录,记录数据资产的基本信息、使用方式和权限控制,这有助于提高数据资产的可查找性和可复用性;数据价值评估用于根据数据资产在各个阶段的影响因素,定期对数据资产价值进行评估,这有助于实时了解数据资产的价值变化,为数据资产的优化和调整提供依据;数据资产应用与回报用于推动数据资产在业务场景中的应用,实现数据资产价值的落地。同时,跟踪数据资产的应用效果,不断优化数据资产,以提高数据资产的价值。在“管”的过程中,数据资产价值评估的体现主要通过以下几个方面,提高数据质量:通过数据梳理、治理等手段,确保数据的准确性和完整性,从而提高数据资产的价值;提升数据可用性:通过数据分类、标签等方式,提高数据的可发现性和易用性,使数据资产更容易被业务部门使用;保障数据安全与合规:通过数据治理措施,确保数据资产的安全、合规性,降低潜在的风险;发现潜在价值:通过数据挖掘和分析,发掘数据资产中的潜在价值和关联性,为业务创新提供支持;提高数据资产的复用性:通过建立数据资产目录,记录数据资产的基本信息和使用方式,方便业务部门重复使用;优化数据资产:根据数据资产价值评估结果,对数据资产进行优化和调整,使其更好地满足业务需求;实现数据资产价值落地:推动数据资产在实际业务场景中的应用,转化为业务成果,实现数据资产价值的最大化。In some embodiments, for each data asset table, the data management elements corresponding to the data asset table are evaluated by the hierarchical analysis method to obtain the data management score of the data asset table, which is the management stage. In the "management" stage, the main goal of data asset value assessment is to ensure that all data assets are effectively governed and managed to maximize the value of data assets. The "management" stage may include one or more of the following steps: data combing is used to integrate and comb data sources to ensure data quality, accuracy, timeliness and completeness, which helps to improve the value of data assets; data classification and labeling are used to classify and label data according to data characteristics and business needs, which helps to improve data discoverability and ease of use, thereby increasing the value of data assets; data governance is used to implement data governance strategies, including data quality management, data security management, data privacy protection and data compliance checks, which helps to ensure the reliability and compliance of data assets and further enhance the value of data assets; data value mining is used to use data analysis , data mining and other technologies to explore the potential value and relevance of data assets, which helps to discover new business opportunities and improve the practical value of data assets; the data asset catalog is used to establish and improve the data asset catalog, record the basic information, usage and permission control of data assets, which helps to improve the findability and reusability of data assets; data value assessment is used to regularly assess the value of data assets based on the influencing factors of data assets at various stages, which helps to understand the value changes of data assets in real time and provide a basis for the optimization and adjustment of data assets; data asset application and return is used to promote the application of data assets in business scenarios and realize the implementation of data asset value. At the same time, track the application effect of data assets and continuously optimize data assets to increase the value of data assets. In the process of "management", the embodiment of data asset value assessment is mainly through the following aspects: improving data quality: ensuring the accuracy and completeness of data through data combing, governance and other means, thereby increasing the value of data assets; improving data availability: improving the discoverability and usability of data through data classification, labeling and other means, making data assets easier for business departments to use; ensuring data security and compliance: ensuring the security and compliance of data assets and reducing potential risks through data governance measures; discovering potential value: exploring the potential value and relevance in data assets through data mining and analysis, and providing support for business innovation; improving the reusability of data assets: recording the basic information and usage of data assets by establishing a data asset catalog to facilitate reuse by business departments; optimizing data assets: optimizing and adjusting data assets based on the results of data asset value assessment to better meet business needs; realizing the value of data assets: promoting the application of data assets in actual business scenarios, transforming them into business results, and maximizing the value of data assets.
在一些实施例中,基于数据基础属性要素和数据质量要素的数据管理评分体系包括以下维度,数据可用性:评估数据是否易于获取、共享和利用,是否具备足够的可靠性和稳定性;数据完整性:评估数据是否完整,是否包含所需的所有字段和记录,是否存在数据丢失或错误;数据准确性:评估数据的准确性和正确性,是否符合预期的标准和规范,是否存在数据冗余或不一致;数据一致性:评估数据在不同系统或数据源之间的一致性,是否存在数据差异或不匹配;数据安全性:评估数据的安全性和保密性,是否受到适当的保护措施,是否存在潜在的安全风险;数据可追溯性:评估数据的来源和变更历史是否可追溯,是否存在足够的审计和日志记录;数据文档化:评估数据的文档化程度和质量,是否存在清晰的数据定义、字段说明和数据字典。这些评分维度的目的是为了评估和衡量数据管理的质量和有效性。数据管理评分的维度可以通过定量和定性的方式来获取。定量的评分可以基于数据指标的度量值,例如数据缺失率、数据错误率等。定性的评分可以通过专家评估、用户反馈或调查问卷等方式获得。这些评分可以根据具体的评估方法和标准来计算和衡量,最终得出每个评分维度的分数。In some embodiments, the data management scoring system based on data basic attribute elements and data quality elements includes the following dimensions: data availability: assessing whether the data is easy to obtain, share and use, and whether it has sufficient reliability and stability; data integrity: assessing whether the data is complete, whether it contains all the required fields and records, and whether there is data loss or error; data accuracy: assessing the accuracy and correctness of the data, whether it meets the expected standards and specifications, and whether there is data redundancy or inconsistency; data consistency: assessing the consistency of data between different systems or data sources, whether there are data differences or mismatches; data security: assessing the security and confidentiality of the data, whether it is subject to appropriate protection measures, and whether there are potential security risks; data traceability: assessing whether the source and change history of the data are traceable, whether there are sufficient audit and log records; data documentation: assessing the degree and quality of data documentation, whether there are clear data definitions, field descriptions and data dictionaries. The purpose of these scoring dimensions is to evaluate and measure the quality and effectiveness of data management. The dimensions of data management scoring can be obtained in quantitative and qualitative ways. Quantitative scoring can be based on the measurement values of data indicators, such as data missing rate, data error rate, etc. Qualitative scoring can be obtained through expert evaluation, user feedback or questionnaires. These scores can be calculated and measured according to specific evaluation methods and standards, and finally a score for each scoring dimension is obtained.
在一些实施例中,基于数据资产评估下的数据管理评分体系中的基础评价指标和数据质量评价指标,可以通过以下方式使用,数据规模和数据量:这些指标用于衡量数据集的大小和数量。通过评估数据规模和数据量,可以确定数据管理所需的存储和处理资源,并为数据管理计划提供基准;增长率:增长率指标用于评估数据的增长速度,通过了解数据的增长率,可以预测未来的数据需求,并相应地进行规划和调整数据管理策略;准确性:准确性评价指标用于衡量数据的精确程度和正确性,这可以通过比较数据与真实情况的匹配程度来评估,通过提高数据的准确性,可以增强决策的可靠性和可信度;一致性:一致性评价指标用于评估数据在不同源头和不同时间点的一致性。这可以通过比较数据的值、格式和定义是否一致来进行评估,确保数据一致性可以减少错误和冲突,并提高数据的可信度和可靠性;完整性:完整性评价指标用于评估数据是否完整,是否包含所需的所有属性和记录,通过确保数据的完整性,可以避免信息缺失和数据不完整性引起的问题;规范性:规范性评价指标用于评估数据是否符合预定的标准、规范和约定。通过遵循规范,可以提高数据的一致性、可比性和可操作性;时效性:时效性评价指标用于评估数据的更新速度和及时性,通过确保数据及时更新,可以确保决策和分析基于最新的数据。In some embodiments, the basic evaluation indicators and data quality evaluation indicators in the data management scoring system under data asset evaluation can be used in the following ways: data scale and data volume: These indicators are used to measure the size and quantity of data sets. By evaluating the data scale and data volume, the storage and processing resources required for data management can be determined, and a benchmark can be provided for the data management plan; growth rate: The growth rate indicator is used to evaluate the growth rate of data. By understanding the growth rate of data, future data needs can be predicted, and data management strategies can be planned and adjusted accordingly; accuracy: The accuracy evaluation indicator is used to measure the accuracy and correctness of the data, which can be evaluated by comparing the degree of match between the data and the actual situation. By improving the accuracy of the data, the reliability and credibility of the decision can be enhanced; consistency: The consistency evaluation indicator is used to evaluate the consistency of data from different sources and at different time points. This can be evaluated by comparing whether the values, formats, and definitions of the data are consistent. Ensuring data consistency can reduce errors and conflicts and improve the credibility and reliability of the data; integrity: The integrity evaluation indicator is used to evaluate whether the data is complete and contains all the required attributes and records. By ensuring the integrity of the data, problems caused by missing information and incomplete data can be avoided; normativeness: The normative evaluation indicator is used to evaluate whether the data meets the predetermined standards, specifications, and conventions. By following the specifications, the consistency, comparability and operability of data can be improved; Timeliness: Timeliness evaluation indicators are used to evaluate the update speed and timeliness of data. By ensuring that data is updated in a timely manner, decisions and analysis can be based on the latest data.
进一步地,所述根据所有数据资产表的数据管理分数和成本信息,评估所述待评估数据资产的资产回报率,包括:Furthermore, evaluating the asset return rate of the data asset to be evaluated based on the data management scores and cost information of all data asset tables includes:
针对每个数据资产表,将所述数据资产表的数据管理分数和成本信息相加,得到所述数据资产表的总体价值;For each data asset table, add the data management score and cost information of the data asset table to obtain the overall value of the data asset table;
根据所述数据资产表的应用场景和贡献程度,确定所述数据资产表的总体价值对应的权重;Determine the weight corresponding to the overall value of the data asset table according to the application scenario and contribution degree of the data asset table;
根据所有数据资产表的各总体价值对应的权重,对所述所有数据资产表的总体价值进行加权平局,得到所述待评估数据资产的资产回报率。According to the weights corresponding to the overall values of all data asset tables, the overall values of all data asset tables are weighted and averaged to obtain the asset return rate of the data assets to be evaluated.
在一些实施例中,资产回报率是通过将数据管理分数与成本信息相加,然后根据数据应用价值进行加权平均得出的。以下是资产回报率的一种计算过程:收集待评估数据资产,并对待评估数据资产进行预处理得到待评估数据资产对应的至少一个数据资产表和数据生产成本;通过血缘传播算法计算出每个数据资产表的成本信息;利用层次分析法得到单张数据资产表的数据管理分数。将数据管理分数与每个数据资产表的成本信息相加,得到每个数据资产表的总体价值。针对每个数据资产表,根据其应用场景和贡献程度,赋予相应的权重。根据权重对每个数据资产表的总体价值进行加权平均,得到资产回报率。In some embodiments, the return on assets is obtained by adding the data management score to the cost information and then performing a weighted average based on the data application value. The following is a calculation process for the return on assets: collect the data assets to be evaluated, and pre-process the data assets to be evaluated to obtain at least one data asset table and data production cost corresponding to the data assets to be evaluated; calculate the cost information of each data asset table through the blood propagation algorithm; use the hierarchical analysis method to obtain the data management score of a single data asset table. Add the data management score to the cost information of each data asset table to obtain the overall value of each data asset table. For each data asset table, assign a corresponding weight according to its application scenario and contribution level. According to the weight, the overall value of each data asset table is weighted averaged to obtain the return on assets.
在一些实施例中,“用”阶段的具体步骤如下,数据应用规划:根据业务需求和数据特点,规划数据资产的应用场景和方式,这一步骤的目标是确保数据资产能够在合适的场景中产生价值;数据处理和分析:根据应用规划,对数据进行必要的处理和分析,以满足业务需求,这包括数据清洗、数据转换、数据挖掘等操作;数据资产化:将处理和分析后的数据转化为可重复使用的资产,为业务提供持续的价值,这包括建立数据模型、数据仓库、数据服务等;数据价值实现:通过数据应用,实现业务目标,提升业务效率或创造新的业务价值。在“用”的过程中,数据资产价值评估的体现主要包括以下几点,数据应用效果评估:评估数据资产在实际应用中的效果,如业务指标的改善、工作效率的提升等;数据价值挖掘程度评估:评估数据资产在业务场景中的挖掘程度,包括数据处理和分析的深度、广度等;数据资产重复利用率评估:评估数据资产在多次应用中的利用率,体现数据资产的可持续价值;数据资产对业务创新的贡献评估:评估数据资产在业务创新中的作用,包括新业务的开发、产品改进等;数据安全与合规性评估:评估数据资产在应用过程中的安全性和合规性,确保数据资产的稳定价值。综合以上因素,对数据资产在“用”的过程中的价值进行综合评估,从而为数据资产的管理和优化提供依据。In some embodiments, the specific steps of the "use" stage are as follows: data application planning: plan the application scenarios and methods of data assets based on business needs and data characteristics. The goal of this step is to ensure that data assets can generate value in appropriate scenarios; data processing and analysis: according to the application plan, perform necessary processing and analysis on the data to meet business needs, which includes data cleaning, data conversion, data mining and other operations; data assetization: converting processed and analyzed data into reusable assets to provide continuous value for the business, which includes establishing data models, data warehouses, data services, etc.; data value realization: achieving business goals, improving business efficiency or creating new business value through data applications. In the process of "using", the manifestation of data asset value assessment mainly includes the following points: data application effect assessment: assess the effect of data assets in actual applications, such as improvement of business indicators, improvement of work efficiency, etc.; data value mining degree assessment: assess the mining degree of data assets in business scenarios, including the depth and breadth of data processing and analysis; data asset reuse rate assessment: assess the utilization rate of data assets in multiple applications, reflecting the sustainable value of data assets; data asset contribution to business innovation assessment: assess the role of data assets in business innovation, including the development of new businesses, product improvements, etc.; data security and compliance assessment: assess the security and compliance of data assets in the application process to ensure the stable value of data assets. Based on the above factors, the value of data assets in the process of "using" is comprehensively assessed, so as to provide a basis for the management and optimization of data assets.
在一些实施例中,要具体分析企业的经营情况并得出整体的数据资产的行业回报率,可以按照以下步骤进行,收集经营数据:首先,收集企业的财务报表和经营数据,包括利润表、资产负债表和现金流量表等,这些数据可以提供有关企业经营状况和财务健康的关键信息;确定数据资产:识别和定量化企业的数据资产,数据资产可以包括客户数据库、市场调研数据、销售数据、供应链数据等,确保对数据资产的价值和重要性有清晰的认识;分析经营情况:利用收集到的经营数据,对企业的财务状况、盈利能力和运营效率进行分析,这可以包括计算财务比率和指标,如净利润率、资产回报率和流动比率等,以评估企业的经营情况;行业比较:进行行业比较分析,包括对竞争对手和同行业企业的财务指标进行比较,这可以帮助确定企业的相对竞争优势和行业平均水平;计算行业回报率:通过将企业的数据资产价值与行业平均水平进行对比,可以计算出数据资产的行业回报率。这可以通过将企业的数据资产收入与行业的总收入进行比较来计算。具体的分析方法和指标选择可能会根据不同的行业和企业而有所变化。在进行数据资产评估时,建议咨询专业的财务分析师或数据分析师,以确保准确性和可靠性。In some embodiments, to specifically analyze the business conditions of an enterprise and derive the industry return rate of the overall data assets, the following steps can be followed to collect business data: First, collect the financial statements and business data of the enterprise, including the income statement, balance sheet, and cash flow statement, etc., which can provide key information about the business conditions and financial health of the enterprise; determine data assets: identify and quantify the data assets of the enterprise, which can include customer databases, market research data, sales data, supply chain data, etc., to ensure a clear understanding of the value and importance of data assets; analyze business conditions: use the collected business data to analyze the financial status, profitability, and operating efficiency of the enterprise, which can include calculating financial ratios and indicators, such as net profit margin, return on assets, and current ratio, etc., to evaluate the business conditions of the enterprise; industry comparison: conduct industry comparative analysis, including comparing the financial indicators of competitors and companies in the same industry, which can help determine the relative competitive advantage of the enterprise and the industry average; calculate industry return rate: by comparing the value of the data assets of the enterprise with the industry average, the industry return rate of the data assets can be calculated. This can be calculated by comparing the data asset income of the enterprise with the total income of the industry. The specific analysis method and indicator selection may vary according to different industries and enterprises. When conducting data asset valuation, it is recommended to consult a professional financial analyst or data analyst to ensure accuracy and reliability.
在一些实施例中,要将数据管理分数拟合成每张表的数据资产回报率,可以按照以下步骤进行:首先,确定数据管理分数的计算方法,数据管理分数可以包括诸如数据准确性、数据完整性、数据可靠性等指标,可以根据实际情况确定这些指标的权重和得分规则;然后,确定每张表的数据资产回报率的计算方法,数据资产回报率可以包括数据使用频率、数据价值、数据质量等指标,同样,可以根据实际情况确定这些指标的权重和得分规则;接下来,对每张表进行数据管理分数和数据资产回报率的评估,根据定义的指标和得分规则,对每张表进行评分,可以使用一些工具或方法来自动化评估过程,例如数据质量管理工具或数据资产管理系统;将数据管理分数和数据资产回报率进行匹配,将每张表的数据管理分数和数据资产回报率进行对应,可以使用线性回归或其他相关的统计分析方法来拟合这两个指标之间的关系;最后,根据拟合的结果,可以预测其他表的数据资产回报率,通过应用拟合的模型,可以估计其他表的数据资产回报率,从而提供决策支持和优化数据管理策略。这个过程可能涉及到一些数据收集和数据分析的工作。同时,拟合的结果也只是预测值,实际的数据资产回报率可能受到其他因素的影响。因此,在实际应用中,需要不断监测和调整模型,以确保其准确性和有效性。In some embodiments, to fit the data management score to the data asset return rate of each table, the following steps can be followed: first, determine the calculation method of the data management score, which can include indicators such as data accuracy, data integrity, data reliability, etc., and the weights and scoring rules of these indicators can be determined according to actual conditions; then, determine the calculation method of the data asset return rate of each table, which can include indicators such as data usage frequency, data value, data quality, etc., and similarly, the weights and scoring rules of these indicators can be determined according to actual conditions; next, evaluate the data management score and data asset return rate for each table, and score each table according to the defined indicators and scoring rules. Some tools or methods can be used to automate the evaluation process, such as data quality management tools or data asset management systems; match the data management score and data asset return rate, and correspond the data management score and data asset return rate of each table. Linear regression or other related statistical analysis methods can be used to fit the relationship between the two indicators; finally, based on the fitting results, the data asset return rate of other tables can be predicted, and by applying the fitted model, the data asset return rate of other tables can be estimated, thereby providing decision support and optimizing data management strategies. This process may involve some data collection and data analysis work. At the same time, the fitting result is only a prediction value, and the actual return on data assets may be affected by other factors. Therefore, in practical applications, it is necessary to continuously monitor and adjust the model to ensure its accuracy and effectiveness.
另一方面,如图2所示,本发明实施例提供一种数据资产价值的评估装置,包括:On the other hand, as shown in FIG2 , an embodiment of the present invention provides a data asset value evaluation device, including:
数据资产采集单元200,用于根据预设评估目标,收集待评估数据资产,针对所述待评估数据资产,通过多种数据加工步骤对所述待评估数据资产进行加工处理,得到所述待评估数据资产对应的至少一个数据资产表和数据生产成本;The data asset collection unit 200 is used to collect data assets to be evaluated according to a preset evaluation target, and process the data assets to be evaluated through multiple data processing steps to obtain at least one data asset table and data production cost corresponding to the data assets to be evaluated;
数据资产成本传播单元201,用于针对每个数据资产表,根据血缘传播算法分摊所述待评估数据资产对应的数据生产成本,得到所述数据资产表的成本信息;The data asset cost propagation unit 201 is used to allocate the data production cost corresponding to the data asset to be evaluated according to the lineage propagation algorithm for each data asset table to obtain the cost information of the data asset table;
数据资产管理单元202,用于针对每个数据资产表,通过层次分析法对所述数据资产表对应的数据管理要素进行评价,得到所述数据资产表的数据管理分数;The data asset management unit 202 is used to evaluate the data management elements corresponding to each data asset table by using the analytic hierarchy process to obtain the data management score of the data asset table;
数据资产评估单元203,用于根据所有数据资产表的数据管理分数和成本信息,评估所述待评估数据资产的资产回报率。The data asset evaluation unit 203 is used to evaluate the asset return rate of the data asset to be evaluated based on the data management scores and cost information of all data asset tables.
进一步地,所述数据资产采集单元200,包括:Furthermore, the data asset collection unit 200 includes:
数据资产收集模块,用于根据预设评估目标,确定对应的待评估数据资产的范围,根据所述待评估数据资产的范围收集所述待评估数据资产;A data asset collection module, used to determine the scope of corresponding data assets to be evaluated according to a preset evaluation target, and collect the data assets to be evaluated according to the scope of the data assets to be evaluated;
数据资产清洗模块,用于对所述待评估数据资产进行数据清洗,得到清洗后数据,所述数据清洗包括:去除重复数据和修复数据错误;A data asset cleaning module is used to clean the data assets to be evaluated to obtain cleaned data, wherein the data cleaning includes: removing duplicate data and repairing data errors;
数据资产存储模块,用于对不同来源、格式和结构的清洗后数据进行数据整合,按预设的统一数据存储要求进行存储,得到存储数据;The data asset storage module is used to integrate cleaned data from different sources, formats and structures, store them according to preset unified data storage requirements, and obtain stored data;
数据资产分类模块,用于根据所述存储数据的性质、来源和用途对所述存储数据进行分类并标记标签;A data asset classification module, used to classify and label the stored data according to the nature, source and purpose of the stored data;
数据资产脱敏加密模块,用于将所述存储数据中的敏感信息替换为指定替代信息,得到所述存储数据对应的脱敏数据,对所述脱敏数据进行加密;A data asset desensitizing and encryption module, used to replace sensitive information in the stored data with specified replacement information, obtain desensitized data corresponding to the stored data, and encrypt the desensitized data;
数据资产表生成模块,用于将经脱敏和加密的存储数据按分类存储为至少一个数据资产表;A data asset table generation module, used to store the desensitized and encrypted storage data into at least one data asset table by category;
数据生产成本收集模块,用于根据从所述待评估数据资产到所述至少一个数据资产表的数据加工步骤收集所述待评估数据资产对应的数据生产成本;A data production cost collection module, configured to collect data production costs corresponding to the data asset to be evaluated according to the data processing steps from the data asset to be evaluated to the at least one data asset table;
其中,所述数据生产成本包括:硬件建设成本、软件建设成本、运维成本、资源成本、数据采购成本、数据咨询成本、人力支出成本和场地费用成本。Among them, the data production cost includes: hardware construction cost, software construction cost, operation and maintenance cost, resource cost, data procurement cost, data consulting cost, manpower expenditure cost and site fee cost.
进一步地,所述数据资产成本传播单元201,包括:Furthermore, the data asset cost propagation unit 201 includes:
加工工序血缘关系确定模块,用于针对每个数据资产表,根据血缘传播算法生成所述数据资产表对应的加工工序血缘关系;所述加工工序血缘关系表示数据加工步骤之间的父子关系;A processing procedure blood relationship determination module is used to generate, for each data asset table, a processing procedure blood relationship corresponding to the data asset table according to a blood relationship propagation algorithm; the processing procedure blood relationship represents a parent-child relationship between data processing steps;
成本分摊模块,用于根据所述数据资产表对应的加工工序血缘关系,将所述待评估数据资产对应的数据生产成本,分摊为用于得到所述数据资产表的各数据加工步骤各自对应的数据生产成本;A cost allocation module, for allocating the data production cost corresponding to the data asset to be evaluated into the data production costs corresponding to each data processing step for obtaining the data asset table according to the blood relationship of the processing steps corresponding to the data asset table;
成本信息确定模块,用于根据所述加工工序血缘关系,累计用于得到所述数据资产表的各数据加工步骤各自对应的数据生产成本,得到所述数据资产表的成本信息。The cost information determination module is used to accumulate the data production costs corresponding to each data processing step used to obtain the data asset table according to the processing process blood relationship, so as to obtain the cost information of the data asset table.
进一步地,所述数据资产管理单元202,包括:Furthermore, the data asset management unit 202 includes:
判断矩阵构建模块,用于根据所述数据资产表对应的数据管理要素构建层次分析法的判断矩阵;A judgment matrix construction module, used to construct a judgment matrix of the hierarchical analysis method according to the data management elements corresponding to the data asset table;
管理要素权重确定模块,用于根据所述判断矩阵比较所述数据资产表对应的不同数据管理要素之间的相对重要性,确定所述数据资产表对应的各数据管理要素的权重;A management element weight determination module, used to compare the relative importance of different data management elements corresponding to the data asset table according to the judgment matrix, and determine the weight of each data management element corresponding to the data asset table;
权重向量确定模块,用于对所述判断矩阵的每一列进行归一化处理,计算所述数据资产表对应的各数据管理要素的权重向量;A weight vector determination module, used to normalize each column of the judgment matrix and calculate the weight vector of each data management element corresponding to the data asset table;
综合权重确定模块,用于根据所述数据资产表对应的各数据管理要素的权重向量计算所述数据资产表对应的各数据管理要素的综合权重;A comprehensive weight determination module, used to calculate the comprehensive weight of each data management element corresponding to the data asset table according to the weight vector of each data management element corresponding to the data asset table;
数据管理分数确定单元,用于根据数据资产表对应的各数据管理要素及其综合权重,确定所述数据资产表的数据管理分数;A data management score determination unit, used to determine the data management score of the data asset table according to the data management elements corresponding to the data asset table and their comprehensive weights;
所述数据管理要素包括基本要素、质量要素和使用要素;The data management elements include basic elements, quality elements and usage elements;
所述基本要素包括数据规模;The basic elements include data size;
所述质量要素包括:准确性、一致性、完整性、规范性、时效性;The quality elements include: accuracy, consistency, completeness, standardization, and timeliness;
所述使用要素包括:可访问度和使用热度。The usage factors include: accessibility and usage popularity.
进一步地,所述数据资产评估单元203,包括:Furthermore, the data asset evaluation unit 203 includes:
总体价值确定模块,用于针对每个数据资产表,将所述数据资产表的数据管理分数和成本信息相加,得到所述数据资产表的总体价值;An overall value determination module, for each data asset table, adding the data management score and cost information of the data asset table to obtain the overall value of the data asset table;
总体价值权重确定模块,用于根据所述数据资产表的应用场景和贡献程度,确定所述数据资产表的总体价值对应的权重;An overall value weight determination module, used to determine the weight corresponding to the overall value of the data asset table according to the application scenario and contribution degree of the data asset table;
资产回报率确定模块,用于根据所有数据资产表的各总体价值对应的权重,对所述所有数据资产表的总体价值进行加权平局,得到所述待评估数据资产的资产回报率。The asset return rate determination module is used to perform weighted averaging on the overall values of all data asset tables according to the weights corresponding to the overall values of all data asset tables to obtain the asset return rate of the data assets to be evaluated.
本发明实施例是与前述的方法实施例一一对应的装置实施例,可根据前述方法实施例理解本发明实施例,在此不再赘述。The embodiments of the present invention are device embodiments that correspond one-to-one to the aforementioned method embodiments. The embodiments of the present invention can be understood based on the aforementioned method embodiments, and will not be described in detail here.
应该明白,公开的过程中的步骤的特定顺序或层次是示例性方法的实例。基于设计偏好,应该理解,过程中的步骤的特定顺序或层次可以在不脱离本公开的保护范围的情况下得到重新安排。所附的方法权利要求以示例性的顺序给出了各种步骤的要素,并且不是要限于所述的特定顺序或层次。It should be understood that the specific order or hierarchy of steps in the disclosed process is an example of an exemplary method. Based on design preferences, it should be understood that the specific order or hierarchy of steps in the process can be rearranged without departing from the scope of protection of the present disclosure. The attached method claims present the elements of the various steps in an exemplary order and are not intended to be limited to the specific order or hierarchy described.
在上述的详细描述中,各种特征一起组合在单个的实施方案中,以简化本公开。不应该将这种公开方法解释为反映了这样的意图,即,所要求保护的主题的实施方案需要比清楚地在每个权利要求中所陈述的特征更多的特征。相反,如所附的权利要求书所反映的那样,本发明处于比所公开的单个实施方案的全部特征少的状态。因此,所附的权利要求书特此清楚地被并入详细描述中,其中每项权利要求独自作为本发明单独的优选实施方案。In the above detailed description, various features are grouped together in a single embodiment to simplify the disclosure. This method of disclosure should not be interpreted as reflecting an intention that the embodiments of the claimed subject matter require more features than are clearly stated in each claim. On the contrary, as reflected in the appended claims, the invention is in a state of having less than all the features of the disclosed individual embodiments. Therefore, the appended claims are hereby expressly incorporated into the detailed description, with each claim standing on its own as a separate preferred embodiment of the invention.
为使本领域内的任何技术人员能够实现或者使用本发明,上面对所公开实施例进行了描述。对于本领域技术人员来说;这些实施例的各种修改方式都是显而易见的,并且本文定义的一般原理也可以在不脱离本公开的精神和保护范围的基础上适用于其它实施例。因此,本公开并不限于本文给出的实施例,而是与本申请公开的原理和新颖性特征的最广范围相一致。The disclosed embodiments are described above to enable any person skilled in the art to implement or use the present invention. Various modifications of these embodiments are obvious to those skilled in the art, and the general principles defined herein may also be applied to other embodiments without departing from the spirit and scope of the present disclosure. Therefore, the present disclosure is not limited to the embodiments given herein, but is consistent with the broadest scope of the principles and novel features disclosed in this application.
上文的描述包括一个或多个实施例的举例。当然,为了描述上述实施例而描述部件或方法的所有可能的结合是不可能的,但是本领域普通技术人员应该认识到,各个实施例可以做进一步的组合和排列。因此,本文中描述的实施例旨在涵盖落入所附权利要求书的保护范围内的所有这样的改变、修改和变型。此外,就说明书或权利要求书中使用的术语“包含”,该词的涵盖方式类似于术语“包括”,就如同“包括:”在权利要求中用作衔接词所解释的那样。此外,使用在权利要求书的说明书中的任何一个术语“或者”是要表示“非排它性的或者”。The above description includes examples of one or more embodiments. Of course, it is impossible to describe all possible combinations of components or methods for the purpose of describing the above embodiments, but it should be recognized by those skilled in the art that the various embodiments may be further combined and arranged. Therefore, the embodiments described herein are intended to cover all such changes, modifications and variations that fall within the scope of protection of the appended claims. In addition, with respect to the term "comprising" used in the specification or claims, the word is covered in a manner similar to the term "including", as explained by "including:" used as a transitional word in the claims. In addition, any term "or" used in the specification of the claims is intended to mean "non-exclusive or".
本领域技术人员还可以了解到本发明实施例列出的各种说明性逻辑块(illustrative logical block),单元,和步骤可以通过电子硬件、电脑软件,或两者的结合进行实现。为清楚展示硬件和软件的可替换性(interchangeability),上述的各种说明性部件(illustrative components),单元和步骤已经通用地描述了它们的功能。这样的功能是通过硬件还是软件来实现取决于特定的应用和整个系统的设计要求。本领域技术人员可以对于每种特定的应用,可以使用各种方法实现所述的功能,但这种实现不应被理解为超出本发明实施例保护的范围。Those skilled in the art may also understand that the various illustrative logical blocks, units, and steps listed in the embodiments of the present invention may be implemented by electronic hardware, computer software, or a combination of the two. In order to clearly demonstrate the interchangeability of hardware and software, the various illustrative components, units, and steps described above have generally described their functions. Whether such functions are implemented by hardware or software depends on the specific application and the design requirements of the entire system. Those skilled in the art may use various methods to implement the described functions for each specific application, but such implementation should not be understood as exceeding the scope of protection of the embodiments of the present invention.
本发明实施例中所描述的各种说明性的逻辑块,或单元都可以通过通用处理器,数字信号处理器,专用集成电路(ASIC),现场可编程门阵列或其它可编程逻辑装置,离散门或晶体管逻辑,离散硬件部件,或上述任何组合的设计来实现或操作所描述的功能。通用处理器可以为微处理器,可选地,该通用处理器也可以为任何传统的处理器、控制器、微控制器或状态机。处理器也可以通过计算装置的组合来实现,例如数字信号处理器和微处理器,多个微处理器,一个或多个微处理器联合一个数字信号处理器核,或任何其它类似的配置来实现。The various illustrative logic blocks or units described in the embodiments of the present invention can be implemented or operated by a general-purpose processor, a digital signal processor, an application-specific integrated circuit (ASIC), a field programmable gate array or other programmable logic device, a discrete gate or transistor logic, a discrete hardware component, or any combination of the above. The general-purpose processor can be a microprocessor, and optionally, the general-purpose processor can also be any conventional processor, controller, microcontroller or state machine. The processor can also be implemented by a combination of computing devices, such as a digital signal processor and a microprocessor, a plurality of microprocessors, one or more microprocessors combined with a digital signal processor core, or any other similar configuration.
本发明实施例中所描述的方法或算法的步骤可以直接嵌入硬件、处理器执行的软件模块、或者这两者的结合。软件模块可以存储于RAM存储器、闪存、ROM存储器、EPROM存储器、EEPROM存储器、寄存器、硬盘、可移动磁盘、CD-ROM或本领域中其它任意形式的存储媒介中。示例性地,存储媒介可以与处理器连接,以使得处理器可以从存储媒介中读取信息,并可以向存储媒介存写信息。可选地,存储媒介还可以集成到处理器中。处理器和存储媒介可以设置于ASIC中,ASIC可以设置于用户终端中。可选地,处理器和存储媒介也可以设置于用户终端中的不同的部件中。The steps of the method or algorithm described in the embodiments of the present invention can be directly embedded in hardware, a software module executed by a processor, or a combination of the two. The software module can be stored in a RAM memory, a flash memory, a ROM memory, an EPROM memory, an EEPROM memory, a register, a hard disk, a removable disk, a CD-ROM, or other storage media of any form in the art. Exemplarily, the storage medium can be connected to the processor so that the processor can read information from the storage medium and can write information to the storage medium. Optionally, the storage medium can also be integrated into the processor. The processor and the storage medium can be arranged in an ASIC, and the ASIC can be arranged in a user terminal. Optionally, the processor and the storage medium can also be arranged in different components in the user terminal.
在一个或多个示例性的设计中,本发明实施例所描述的上述功能可以在硬件、软件、固件或这三者的任意组合来实现。如果在软件中实现,这些功能可以存储与电脑可读的媒介上,或以一个或多个指令或代码形式传输于电脑可读的媒介上。电脑可读媒介包括电脑存储媒介和便于使得让电脑程序从一个地方转移到其它地方的通信媒介。存储媒介可以是任何通用或特殊电脑可以接入访问的可用媒体。例如,这样的电脑可读媒体可以包括但不限于RAM、ROM、EEPROM、CD-ROM或其它光盘存储、磁盘存储或其它磁性存储装置,或其它任何可以用于承载或存储以指令或数据结构和其它可被通用或特殊电脑、或通用或特殊处理器读取形式的程序代码的媒介。此外,任何连接都可以被适当地定义为电脑可读媒介,例如,如果软件是从一个网站站点、服务器或其它远程资源通过一个同轴电缆、光纤电缆、双绞线、数字用户线(DSL)或以例如红外、无线和微波等无线方式传输的也被包含在所定义的电脑可读媒介中。所述的碟片(disk)和磁盘(disc)包括压缩磁盘、镭射盘、光盘、DVD、软盘和蓝光光盘,磁盘通常以磁性复制数据,而碟片通常以激光进行光学复制数据。上述的组合也可以包含在电脑可读媒介中。In one or more exemplary designs, the above functions described in the embodiments of the present invention can be implemented in hardware, software, firmware or any combination of the three. If implemented in software, these functions can be stored on a computer-readable medium, or transmitted in the form of one or more instructions or codes on a computer-readable medium. Computer-readable media include computer storage media and communication media that facilitate the transfer of computer programs from one place to another. The storage medium can be any available medium that can be accessed by any general or special computer. For example, such computer-readable media can include but are not limited to RAM, ROM, EEPROM, CD-ROM or other optical disk storage, disk storage or other magnetic storage devices, or any other medium that can be used to carry or store program codes in the form of instructions or data structures and other forms that can be read by general or special computers, or general or special processors. In addition, any connection can be appropriately defined as a computer-readable medium, for example, if the software is transmitted from a website site, server or other remote resource through a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL) or wirelessly, such as infrared, wireless and microwave, it is also included in the defined computer-readable medium. The disk and disc include compact disk, laser disk, optical disk, DVD, floppy disk and blue-ray disk. Disks usually copy data magnetically, while discs usually copy data optically with lasers. The above combination can also be included in computer readable media.
以上所述的具体实施方式,对本发明的目的、技术方案和有益效果进行了进一步详细说明,所应理解的是,以上所述仅为本发明的具体实施方式而已,并不用于限定本发明的保护范围,凡在本发明的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The specific implementation methods described above further illustrate the objectives, technical solutions and beneficial effects of the present invention in detail. It should be understood that the above description is only a specific implementation method of the present invention and is not intended to limit the scope of protection of the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and principles of the present invention should be included in the scope of protection of the present invention.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410359786.XA CN118278962A (en) | 2024-03-27 | 2024-03-27 | Evaluation method and device for data asset value |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410359786.XA CN118278962A (en) | 2024-03-27 | 2024-03-27 | Evaluation method and device for data asset value |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN118278962A true CN118278962A (en) | 2024-07-02 |
Family
ID=91633032
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202410359786.XA Pending CN118278962A (en) | 2024-03-27 | 2024-03-27 | Evaluation method and device for data asset value |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN118278962A (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118941397A (en) * | 2024-10-15 | 2024-11-12 | 全拓科技(杭州)股份有限公司 | A method and system for evaluating the value of data assets |
| CN119598134A (en) * | 2025-02-10 | 2025-03-11 | 云镝智慧科技有限公司 | Full-period data management method for digital transformation of small and medium enterprises |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116341956A (en) * | 2023-02-24 | 2023-06-27 | 阿里云计算有限公司 | Data asset estimation method, system, device and data center |
| CN116934160A (en) * | 2023-07-25 | 2023-10-24 | 福州大学 | A data element asset value assessment method and system |
| CN117592873A (en) * | 2024-01-19 | 2024-02-23 | 数据空间研究院 | A data asset value assessment method based on data characteristics |
| CN117592822A (en) * | 2024-01-19 | 2024-02-23 | 数据空间研究院 | Comprehensive assessment method of data quality based on dynamic weights |
-
2024
- 2024-03-27 CN CN202410359786.XA patent/CN118278962A/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116341956A (en) * | 2023-02-24 | 2023-06-27 | 阿里云计算有限公司 | Data asset estimation method, system, device and data center |
| CN116934160A (en) * | 2023-07-25 | 2023-10-24 | 福州大学 | A data element asset value assessment method and system |
| CN117592873A (en) * | 2024-01-19 | 2024-02-23 | 数据空间研究院 | A data asset value assessment method based on data characteristics |
| CN117592822A (en) * | 2024-01-19 | 2024-02-23 | 数据空间研究院 | Comprehensive assessment method of data quality based on dynamic weights |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118941397A (en) * | 2024-10-15 | 2024-11-12 | 全拓科技(杭州)股份有限公司 | A method and system for evaluating the value of data assets |
| CN119598134A (en) * | 2025-02-10 | 2025-03-11 | 云镝智慧科技有限公司 | Full-period data management method for digital transformation of small and medium enterprises |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Adewumi et al. | A systematic literature review of open source software quality assessment models | |
| KR102286415B1 (en) | Online and offline information analysis service system by lifecycle according to product life cycle | |
| CN111008197A (en) | Data center design method for power marketing service system | |
| CN117787792B (en) | A method and system based on medical device quality and safety risk supervision | |
| Aljumaili et al. | Metadata-based data quality assessment | |
| EP2427861A2 (en) | Business information and innovation management | |
| CN118278962A (en) | Evaluation method and device for data asset value | |
| CN116701358B (en) | Data processing method and system | |
| CN119168749A (en) | A commodity information management method and system based on big data analysis | |
| CN120725468B (en) | Multi-dimensional data integration and dynamic risk assessment methods for enterprise procurement anomaly detection | |
| KR102491396B1 (en) | Method And Apparatus for Providing Multi-Dimensional Patent Analysis Service Based on Big Data | |
| Sargiotis | Data quality management: Ensuring accuracy and reliability | |
| CN119494463A (en) | A method for monitoring urban operation indicators based on multi-source heterogeneity | |
| KR100903726B1 (en) | Data Quality Management Maturity Assessment System | |
| CN118037469B (en) | Financial management system based on big data | |
| CN119151474A (en) | Lightweight real-time data system | |
| Li | Data quality and data cleaning in database applications | |
| CN119990536A (en) | Enterprise data asset evaluation method, system, electronic device and storage medium | |
| Sargiotis | Measuring the Impact of Data Governance: Metrics and Key Performance Indicators | |
| CN118394851A (en) | Data management and control method based on block chain | |
| McSweeney | Review of data management maturity models | |
| Krathu et al. | A framework for inter-organizational performance analysis from EDI messages | |
| CN120198168B (en) | A comprehensive data asset value assessment device | |
| US20250378412A1 (en) | Computer-automated systems and methods for generating software development metrics for use in diligence | |
| del Paso et al. | Making Cost Data Work for Public Financial Management |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20240702 |