[go: up one dir, main page]

CN116955377A - Data processing method and device, storage medium and computer equipment - Google Patents

Data processing method and device, storage medium and computer equipment Download PDF

Info

Publication number
CN116955377A
CN116955377A CN202310875897.1A CN202310875897A CN116955377A CN 116955377 A CN116955377 A CN 116955377A CN 202310875897 A CN202310875897 A CN 202310875897A CN 116955377 A CN116955377 A CN 116955377A
Authority
CN
China
Prior art keywords
data
database
target
database data
standardized conversion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310875897.1A
Other languages
Chinese (zh)
Inventor
夏斌
王博
石耀军
张超
李海波
郭嘉伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHAANXI SHANHUA COAL CHEMICAL INDUSTRY GROUP CO LTD
Original Assignee
SHAANXI SHANHUA COAL CHEMICAL INDUSTRY GROUP CO LTD
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHAANXI SHANHUA COAL CHEMICAL INDUSTRY GROUP CO LTD filed Critical SHAANXI SHANHUA COAL CHEMICAL INDUSTRY GROUP CO LTD
Priority to CN202310875897.1A priority Critical patent/CN116955377A/en
Publication of CN116955377A publication Critical patent/CN116955377A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明公开了一种数据处理方法及装置、存储介质、计算机设备,属于数据处理技术领域,主要在于解决独立的数据管理系统之间难以进行数据共享的问题,包括获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;基于所述标准化转换数据对所述第一数据库数据进行数据更新。

The invention discloses a data processing method and device, storage medium and computer equipment, which belongs to the field of data processing technology. It mainly aims at solving the problem of difficulty in data sharing between independent data management systems, including obtaining first database data and second database data. Database data, and perform data deduplication processing on the second database data based on the first database data to obtain the deduplicated second database data; obtain a data template, and perform data deduplication on the deduplicated third database data based on the data template. The second database data is subjected to standardized conversion to obtain standardized conversion data, so that the standardized conversion data and the first database data have the same data-related information; the first database data is updated based on the standardized conversion data.

Description

一种数据处理方法及装置、存储介质、计算机设备A data processing method and device, storage medium, and computer equipment

技术领域Technical field

本发明涉及数据处理技术领域,特别是涉及一种数据处理方法及装置、存储介质、计算机设备。The present invention relates to the field of data processing technology, and in particular, to a data processing method and device, storage medium, and computer equipment.

背景技术Background technique

随着大型企业业务的不断改革深化,孵化了不少子公司。由于子公司与母公司在经营上是独立的,所以在生产经营上采用独立的数据管理系统进行数据管理操作。独立的数据管理系统之间存在数据库差异性、操作系统异构性等特点,导致母公司与子公司之间的数据难以进行交互和共享,形成数据孤岛。随着企业的不断进步和发展,需要依赖数据进行趋势分析、业务质量分析等分析操作,由于各个独立的数据管理系统之间无法进行数据共享,加大了数据分析的难度,不能满足管理人员对现有数据资源决策分析的需要。With the continuous reform and deepening of the business of large enterprises, many subsidiaries have been hatched. Since the subsidiary and the parent company are operationally independent, independent data management systems are used for data management operations in production and operations. Independent data management systems have characteristics such as database differences and operating system heterogeneity, making it difficult to interact and share data between the parent company and its subsidiaries, forming data islands. With the continuous progress and development of enterprises, they need to rely on data for analysis operations such as trend analysis and business quality analysis. Since data cannot be shared between independent data management systems, data analysis is more difficult and cannot satisfy the needs of managers. The need for decision-making analysis of existing data resources.

发明内容Contents of the invention

有鉴于此,本发明提供一种数据处理方法及装置、存储介质、计算机设备,主要目的在于解决独立的数据管理系统之间难以进行数据共享的问题。In view of this, the present invention provides a data processing method and device, storage medium, and computer equipment. The main purpose is to solve the problem of difficulty in data sharing between independent data management systems.

依据本发明一个方面,提供了一种数据处理方法,包括:According to one aspect of the present invention, a data processing method is provided, including:

获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;Obtain the first database data and the second database data, and perform data deduplication processing on the second database data based on the first database data to obtain the deduplicated second database data;

获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;Obtain a data template, and perform standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data, so that the standardized conversion data and the first database data have the same data-related information;

基于所述标准化转换数据对所述第一数据库数据进行数据更新。Data updating is performed on the first database data based on the standardized conversion data.

进一步的,所述获取第一数据库数据和第二数据库数据之前,所述方法还包括:Further, before obtaining the first database data and the second database data, the method further includes:

将所述第一数据库数据和所述第二数据库数据划分为多个数据类别;classifying the first database data and the second database data into a plurality of data categories;

为具有相同数据类别的所述第一数据库数据和所述第二数据库数据添加相同的数据索引。The same data index is added to the first database data and the second database data having the same data category.

进一步的,所述基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据包括:Further, performing data deduplication processing on the second database data based on the first database data, and obtaining the deduplicated second database data includes:

获取所述第一数据库数据的第一数据索引,并获取所述第二数据库数据的第二数据索引;Obtain the first data index of the first database data, and obtain the second data index of the second database data;

对所述第一数据索引和所述第二数据索引进行匹配处理,并获取具有相同数据索引的目标第一数据库数据和目标第二数据库数据;Perform matching processing on the first data index and the second data index, and obtain target first database data and target second database data having the same data index;

基于所述目标第一数据库数据对所述目标第二数据库数据进行数据去重处理,得到去重后第二数据库数据。Perform data deduplication processing on the target second database data based on the target first database data to obtain the deduplicated second database data.

进一步的,所述获取数据模板之前,所述方法还包括:Further, before obtaining the data template, the method further includes:

获取所述第一数据库数据的数据相关信息,所述数据相关信息包括数据类别、数据类型、数据格式和关联数据;Obtain data-related information of the first database data, where the data-related information includes data categories, data types, data formats and associated data;

基于所述数据类别、所述数据类型、所述数据格式和所述关联数据构建与所述数据类别数量相同的数据模板;Construct data templates with the same number as the data categories based on the data category, the data type, the data format and the associated data;

在所述数据模板与所述数据类别之间建立关联关系。An association relationship is established between the data template and the data category.

进一步的,所述获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据包括:Further, obtaining a data template and performing standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data includes:

基于所述去重后第二数据库数据的数据类别,查询所述关联关系,确定目标数据模板;Based on the data category of the second database data after deduplication, query the association relationship and determine the target data template;

获取所述目标数据模板的目标数据类型、目标数据格式和目标关联数据;Obtain the target data type, target data format and target associated data of the target data template;

基于所述目标数据类型、所述目标数据格式和所述目标关联数据对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据。Standardized conversion is performed on the deduplicated second database data based on the target data type, the target data format and the target associated data to obtain standardized conversion data.

进一步的,所述基于所述标准化转换数据对所述第一数据库数据进行数据更新包括:Further, updating the first database data based on the standardized conversion data includes:

向第一数据库发送携带所述标准化转换数据的更新请求;以使得所述第一数据库基于所述更新请求进行审批操作,以及待所述更新请求审批通过后,将所述标准化转换数据更新至所述第一数据库中。Send an update request carrying the standardized conversion data to the first database; so that the first database performs an approval operation based on the update request, and after the update request is approved, update the standardized conversion data to the in the first database.

进一步的,所述方法还包括:Further, the method also includes:

接收多个客户端对第二数据库的数据访问,以使得所述客户端获取到所述标准化转换数据;所述客户端通过企业服务总线与所述第二数据库相连接;Receive data access to the second database from multiple clients, so that the clients obtain the standardized conversion data; the clients are connected to the second database through an enterprise service bus;

相对应的,接收多个所述客户端对所述第二数据库的数据更新操作,包括停用、新增和修改。Correspondingly, data update operations on the second database from multiple clients are received, including deactivation, addition and modification.

依据本发明另一个方面,提供了一种数据处理装置,包括:According to another aspect of the present invention, a data processing device is provided, including:

去重模块,用于获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;A deduplication module, used to obtain the first database data and the second database data, and perform data deduplication processing on the second database data based on the first database data to obtain the deduplicated second database data;

转换模块,用于获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;A conversion module, configured to obtain a data template, and perform standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data, so that the standardized conversion data has the same characteristics as the first database data. data related information;

更新模块,用于基于所述标准化转换数据对所述第一数据库数据进行数据更新。An update module, configured to update the first database data based on the standardized conversion data.

进一步的,所述装置还包括索引添加模块,用于:Further, the device also includes an index adding module for:

将所述第一数据库数据和所述第二数据库数据划分为多个数据类别;classifying the first database data and the second database data into a plurality of data categories;

为具有相同数据类别的所述第一数据库数据和所述第二数据库数据添加相同的数据索引。The same data index is added to the first database data and the second database data having the same data category.

进一步的,所述去重模块还用于:Furthermore, the deduplication module is also used to:

获取所述第一数据库数据的第一数据索引,并获取所述第二数据库数据的第二数据索引;Obtain the first data index of the first database data, and obtain the second data index of the second database data;

对所述第一数据索引和所述第二数据索引进行匹配处理,并获取具有相同数据索引的目标第一数据库数据和目标第二数据库数据;Perform matching processing on the first data index and the second data index, and obtain target first database data and target second database data having the same data index;

基于所述目标第一数据库数据对所述目标第二数据库数据进行数据去重处理,得到去重后第二数据库数据。Perform data deduplication processing on the target second database data based on the target first database data to obtain the deduplicated second database data.

进一步的,所述装置还包括模板创建模块,用于:Further, the device also includes a template creation module for:

获取所述第一数据库数据的数据相关信息,所述数据相关信息包括数据类别、数据类型、数据格式和关联数据;Obtain data-related information of the first database data, where the data-related information includes data categories, data types, data formats and associated data;

基于所述数据类别、所述数据类型、所述数据格式和所述关联数据构建与所述数据类别数量相同的数据模板;Construct data templates with the same number as the data categories based on the data category, the data type, the data format and the associated data;

在所述数据模板与所述数据类别之间建立关联关系。An association relationship is established between the data template and the data category.

进一步的,所述转换模块还用于:Further, the conversion module is also used to:

基于所述去重后第二数据库数据的数据类别,查询所述关联关系,确定目标数据模板;Based on the data category of the second database data after deduplication, query the association relationship and determine the target data template;

获取所述目标数据模板的目标数据类型、目标数据格式和目标关联数据;Obtain the target data type, target data format and target associated data of the target data template;

基于所述目标数据类型、所述目标数据格式和所述目标关联数据对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据。Standardized conversion is performed on the deduplicated second database data based on the target data type, the target data format and the target associated data to obtain standardized conversion data.

进一步的,所述更新模块还用于:Further, the update module is also used to:

向第一数据库发送携带所述标准化转换数据的更新请求;以使得所述第一数据库基于所述更新请求进行审批操作,以及待所述更新请求审批通过后,将所述标准化转换数据更新至所述第一数据库中。Send an update request carrying the standardized conversion data to the first database; so that the first database performs an approval operation based on the update request, and after the update request is approved, update the standardized conversion data to the in the first database.

进一步的,所述装置还包括接收及更新模块,用于:Further, the device also includes a receiving and updating module for:

接收多个客户端对第二数据库的数据访问,以使得所述客户端获取到所述标准化转换数据;所述客户端通过企业服务总线与所述第二数据库相连接;Receive data access to the second database from multiple clients, so that the clients obtain the standardized conversion data; the clients are connected to the second database through an enterprise service bus;

相对应的,接收多个所述客户端对所述第二数据库的数据更新操作,包括停用、新增和修改。Correspondingly, data update operations on the second database from multiple clients are received, including deactivation, addition and modification.

依据本发明的又一方面,提供了一种存储介质,所述存储介质中存储有至少一可执行指令,所述可执行指令使处理器执行如上述数据处理方法对应的操作。According to another aspect of the present invention, a storage medium is provided, and at least one executable instruction is stored in the storage medium. The executable instruction causes the processor to perform operations corresponding to the above-mentioned data processing method.

依据本发明另一个方面,提供了一种计算机设备,包括处理器、存储器、通信接口和通信总线,所述处理器、所述存储器和所述通信接口通过所述通信总线完成相互间的通信;According to another aspect of the present invention, a computer device is provided, including a processor, a memory, a communication interface, and a communication bus. The processor, the memory, and the communication interface complete communication with each other through the communication bus;

所述存储器用于存放至少一可执行指令,所述可执行指令使所述处理器执行如上述数据处理方法对应的操作。The memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform operations corresponding to the above data processing method.

借由上述技术方案,本发明实施例提供的技术方案至少具有下列优点:Through the above technical solutions, the technical solutions provided by the embodiments of the present invention have at least the following advantages:

本发明提供了一种数据处理方法及装置、存储介质、计算机设备,与现有技术相比,本发明通过获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;基于所述标准化转换数据对所述第一数据库数据进行数据更新,可以实现母公司与子公司之间的数据交互和共享,有效避免了数据孤岛的形成,进一步为数据资源决策分析提供重要的数据基础。The present invention provides a data processing method and device, storage medium, and computer equipment. Compared with the existing technology, the present invention obtains first database data and second database data, and processes the data based on the first database data. The second database data is subjected to data deduplication processing to obtain the deduplicated second database data; a data template is obtained, and the deduplicated second database data is standardized and converted based on the data template to obtain standardized conversion data, so that The standardized conversion data and the first database data have the same data-related information; data updating of the first database data based on the standardized conversion data can realize data interaction and sharing between the parent company and the subsidiary company. , effectively avoiding the formation of data islands, and further providing an important data basis for decision-making analysis of data resources.

上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solution of the present invention. In order to have a clearer understanding of the technical means of the present invention, it can be implemented according to the content of the description, and in order to make the above and other objects, features and advantages of the present invention more obvious and understandable. , the specific embodiments of the present invention are listed below.

附图说明Description of the drawings

通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are for the purpose of illustrating preferred embodiments only and are not to be construed as limiting the invention. Also throughout the drawings, the same reference characters are used to designate the same components. In the attached picture:

图1示出了本发明实施例提供的一种数据处理方法的流程示意图;Figure 1 shows a schematic flow chart of a data processing method provided by an embodiment of the present invention;

图2示出了本发明实施例提供的另一种数据处理方法的流程示意图;Figure 2 shows a schematic flow chart of another data processing method provided by an embodiment of the present invention;

图3示出了本发明实施例提供的又一种数据处理方法的流程示意图;Figure 3 shows a schematic flow chart of yet another data processing method provided by an embodiment of the present invention;

图4示出了本发明实施例提供的再一种数据处理方法的流程示意图;Figure 4 shows a schematic flow chart of yet another data processing method provided by an embodiment of the present invention;

图5示出了本发明实施例提供的一种数据处理装置的结构示意图;Figure 5 shows a schematic structural diagram of a data processing device provided by an embodiment of the present invention;

图6示出了本发明实施例提供的一种计算机设备的结构示意图。FIG. 6 shows a schematic structural diagram of a computer device provided by an embodiment of the present invention.

具体实施方式Detailed ways

下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be implemented in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided to provide a thorough understanding of the disclosure, and to fully convey the scope of the disclosure to those skilled in the art.

本发明实施例提供了一种数据处理方法,如图1所示,该方法包括:An embodiment of the present invention provides a data processing method, as shown in Figure 1. The method includes:

101、获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;101. Obtain the first database data and the second database data, and perform data deduplication processing on the second database data based on the first database data to obtain the deduplicated second database data;

本发明实施例中,当前执行端获取第一数据库数据和第二数据库数据,其中,第一数据库数据为母公司数据,母公司数据的数据标准可以作为集团内任何子公司数据的数据标准。其中,第二数据库数据为子公司数据,目前子公司数据尚未以母公司数据为数据标准进行统一得数据存储和管理,故各个子公司数据和母公司数据之间形成了数据孤岛。当前执行端在获取到第一数据库数据和第二数据库数据之后,基于第一数据库数据对第二数据库数据进行数据去重处理,得到去重后第二数据库数据。其中,数据去重处理用于表征从获取的第二数据库数据中删除与第一数据库数据相重复的数据,可以有效得避免在当前执行端中出现数据冗余的现象。In the embodiment of the present invention, the current execution end obtains the first database data and the second database data, wherein the first database data is the parent company data, and the data standard of the parent company data can be used as the data standard of any subsidiary data in the group. Among them, the second database data is subsidiary data. At present, subsidiary data has not yet been unified data storage and management based on parent company data as the data standard. Therefore, data islands are formed between each subsidiary data and parent company data. After acquiring the first database data and the second database data, the current execution end performs data deduplication processing on the second database data based on the first database data to obtain the deduplicated second database data. Among them, the data deduplication process is used to represent the deletion of data that is duplicated with the first database data from the acquired second database data, which can effectively avoid data redundancy in the current execution end.

102、获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;102. Obtain a data template, and perform standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data, so that the standardized conversion data and the first database data have the same data correlation. information;

本发明实施例中,当前执行端获取数据模板,所述数据模板为基于数据类别预先创建的,用于表征数据存储的标准化处理规则。当前执行端基于数据模板对去重后第二数据库数据进行标准化转换。其中,标准化转换包括数据类型的标准化转换、数据格式的标准化转换、关联数据的数据类型标准化转换、关联数据的数据格式标准化转换等,本发明实施例不做具体限定。经标准化转换之后得到标准化转换数据,得到的标准化转换数据应该与第一数据库数据具有相同的数据相关信息。In the embodiment of the present invention, the current execution end obtains a data template, which is pre-created based on data categories and used to represent standardized processing rules for data storage. The current execution end performs standardized conversion on the deduplicated second database data based on the data template. The standardized conversion includes standardized conversion of data types, standardized conversion of data formats, standardized conversion of data types of associated data, standardized conversion of data formats of associated data, etc., which are not specifically limited in the embodiment of the present invention. After the standardized conversion, the standardized converted data is obtained, and the obtained standardized converted data should have the same data-related information as the first database data.

103、基于所述标准化转换数据对所述第一数据库数据进行数据更新。103. Update the first database data based on the standardized conversion data.

本发明实施例中,当前执行端基于标准化转换数据对第一数据库数据进行数据更新,即当前执行端基于第一数据库的数据管理接口发送携带所述标准化转换数据的更新请求;当第一数据库的数据管理相关方接收到上述更新请求时,对所述更新请求进行审批操作,当检测到更新请求审批通过后,将标准化转换数据更新至第一数据库中。此外,若在一定的时间间隔内,未检测到审批通过信息,则由当前执行端向第二数据库的数据管理相关方发送数据更新失败信息,以使得所述数据管理相关方对标准化转换数据进行进一步核对,以保证标准化转换数据与第一数据库数据具有相同的数据相关信息。In the embodiment of the present invention, the current execution end performs data update on the first database data based on the standardized conversion data, that is, the current execution end sends an update request carrying the standardized conversion data based on the data management interface of the first database; when the first database When the data management related party receives the above update request, it performs an approval operation on the update request. When it is detected that the update request is approved, the standardized conversion data is updated into the first database. In addition, if no approval information is detected within a certain time interval, the current execution end sends data update failure information to the data management related party of the second database, so that the data management related party performs the standardized conversion data Further verification is performed to ensure that the standardized conversion data and the first database data have the same data-related information.

进一步的,作为上述实施例具体实施方式的细化和扩展,为了便于对数据进行分类管理和提高数据查询的效率,提供了另一种数据处理方法,步骤获取第一数据库数据和第二数据库数据之前,所述方法还包括:Further, as a refinement and expansion of the specific implementation of the above embodiment, in order to facilitate classification management of data and improve the efficiency of data query, another data processing method is provided, with the steps of obtaining the first database data and the second database data. Previously, the method also included:

将所述第一数据库数据和所述第二数据库数据划分为多个数据类别;classifying the first database data and the second database data into a plurality of data categories;

为具有相同数据类别的所述第一数据库数据和所述第二数据库数据添加相同的数据索引。The same data index is added to the first database data and the second database data having the same data category.

本发明实施例中,当前执行端将第一数据库数据和第二数据库数据划分为多个数据类别,如物料类数据、客商类数据、企业员工类数据、企业组织机构类数据等,本发明实施例不做具体限定。当前执行端为具有相同类别的第一数据库数据和第二数据库数据添加相同的数据索引,便于通过相同的数据索引对相同类别的第一数据库数据和第二数据库数据进行查询。In the embodiment of the present invention, the current execution end divides the first database data and the second database data into multiple data categories, such as material data, customer data, enterprise employee data, enterprise organization data, etc., the present invention implements The examples are not specifically limited. The current execution end adds the same data index to the first database data and the second database data of the same category, so as to facilitate querying of the first database data and the second database data of the same category through the same data index.

进一步的,作为上述实施例具体实施方式的细化和扩展,为了减少冗余数据,并且减少后期数据标准化转换的工作量,提供了另一种数据处理方法,如图2所示,步骤基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据包括:Further, as a refinement and expansion of the specific implementation of the above embodiment, in order to reduce redundant data and reduce the workload of later data standardization conversion, another data processing method is provided, as shown in Figure 2, the steps are based on The first database data performs data deduplication processing on the second database data, and the deduplicated second database data includes:

201、获取所述第一数据库数据的第一数据索引,并获取所述第二数据库数据的第二数据索引;201. Obtain the first data index of the first database data, and obtain the second data index of the second database data;

202、对所述第一数据索引和所述第二数据索引进行匹配处理,并获取具有相同数据索引的目标第一数据库数据和目标第二数据库数据;202. Perform matching processing on the first data index and the second data index, and obtain the target first database data and the target second database data having the same data index;

203、基于所述目标第一数据库数据对所述目标第二数据库数据进行数据去重处理,得到去重后第二数据库数据。203. Perform data deduplication processing on the target second database data based on the target first database data to obtain the deduplicated second database data.

本发明实施例中,当前执行端获取第一数据库数据的第一数据索引,并获取第二数据库数据的第二数据索引。当前执行端获取到第一数据索引和第二数据索引后,对第一数据索引和第二数据索引进行匹配处理,匹配到具有相同数据索引的数据,即从第一数据库数据和第二数据库数据中获取相同类别的数据,作为目标第一数据库数据和目标第二数据库数据。当前执行端基于目标第一数据库数据对目标第二数据库数据进行数据去重处理,得到去重后第二数据库数据。如,当前执行端获取具有数据索引A的目标第一数据库数据为客商类数据H,获取具有数据索引A的目标第二数据库数据为客商类数据M,当前执行端需要基于客商类数据H对客商类数据M进行去重处理,即将M中的客商名称或者客商唯一识别码与H中的客商名称或客商唯一识别码进行匹配,如匹配成功,则认为M中具有与H中相同的客商信息,需要将冗余的客商信息从M中删除,本发明实施例不做具体限定。In the embodiment of the present invention, the current execution end obtains the first data index of the first database data and obtains the second data index of the second database data. After the current execution end obtains the first data index and the second data index, it performs matching processing on the first data index and the second data index, and matches the data with the same data index, that is, from the first database data and the second database data Obtain data of the same category as target first database data and target second database data. The current execution end performs data deduplication processing on the target second database data based on the target first database data, and obtains the deduplicated second database data. For example, the current execution end obtains the target first database data with data index A as customer class data H, and obtains the target second database data with data index A as customer class data M. The current execution end needs to calculate customer data based on customer class data H. Class data M is deduplicated, that is, the merchant name or merchant unique identification code in M is matched with the merchant name or merchant unique identification code in H. If the match is successful, it is considered that M has the same merchant information as H. Redundant customer and merchant information needs to be deleted from M, which is not specifically limited in the embodiment of the present invention.

进一步的,作为上述实施例具体实施方式的细化和扩展,为了便于对第二数据库数据进行数据标准化处理,提供了另一种数据处理方法,如图3所示,步骤获取数据模板之前,所述方法还包括:Further, as a refinement and expansion of the specific implementation of the above embodiment, in order to facilitate data standardization processing of the second database data, another data processing method is provided. As shown in Figure 3, before the step of obtaining the data template, The above methods also include:

301、获取所述第一数据库数据的数据相关信息,所述数据相关信息包括数据类别、数据类型、数据格式和关联数据;301. Obtain data-related information of the first database data. The data-related information includes data category, data type, data format and associated data;

302、基于所述数据类别、所述数据类型、所述数据格式和所述关联数据构建与所述数据类别数量相同的数据模板;302. Construct data templates with the same number as the data categories based on the data category, the data type, the data format and the associated data;

303、在所述数据模板与所述数据类别之间建立关联关系。303. Establish an association between the data template and the data category.

本发明实施例中,当前执行端获取第一数据库数据的数据相关信息,所述数据相关信息包括数据类别、数据类型、数据格式和关联数据。如获取客商类数据的关联数据包括客商名称、客商识别码、客商联系地址、客商联系电话等,数据类型包括字符型、整数型、浮点型等,数据格式包括数值、字符、二进制等,本发明实施例不做具体限定。当前执行端基于上述数据相关信息中的数据类别、数据类型、数据格式和关联数据构建与数据类别数量相同的数据模板,即针对每个数据类别创建对应的数据模板。如客商类数据对应的数据模板必须包括客商名称、客商识别码、客商联系地址、客商联系电话等关联数据,各个关联数据的数据类型和数据格式都在模板中进行标准化限定。当前执行端在创建好数据模板之后,还在数据类别与对应的数据模板之间建立关联关系,便于后期基于数据类别对数据模板进行查询。In the embodiment of the present invention, the current execution end obtains data-related information of the first database data. The data-related information includes data category, data type, data format and associated data. For example, the associated data obtained for customer data includes customer name, customer identification code, customer contact address, customer contact number, etc. The data type includes character, integer, floating point, etc., and the data format includes numerical value, character, binary, etc., this document The embodiments of the invention are not specifically limited. The current execution end builds data templates with the same number as the data categories based on the data categories, data types, data formats and associated data in the above data-related information, that is, a corresponding data template is created for each data category. For example, the data template corresponding to merchant data must include associated data such as merchant name, merchant identification code, merchant contact address, merchant contact number, etc. The data type and data format of each associated data are standardized and limited in the template. After the current execution end creates the data template, it also establishes an association between the data category and the corresponding data template to facilitate subsequent querying of the data template based on the data category.

进一步的,作为上述实施例具体实施方式的细化和扩展,为了快速得对第二数据库数据进行数据标准化转换,提供了另一种数据处理方法,如图4所示,步骤获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据包括:Further, as a refinement and expansion of the specific implementation of the above embodiment, in order to quickly perform data standardization conversion on the second database data, another data processing method is provided. As shown in Figure 4, the steps are to obtain a data template, and Perform standardized conversion on the deduplicated second database data based on the data template, and obtain standardized conversion data including:

401、基于所述去重后第二数据库数据的数据类别,查询所述关联关系,确定目标数据模板;401. Based on the data category of the second database data after deduplication, query the association relationship and determine the target data template;

402、获取所述目标数据模板的目标数据类型、目标数据格式和目标关联数据;402. Obtain the target data type, target data format and target associated data of the target data template;

403、基于所述目标数据类型、所述目标数据格式和所述目标关联数据对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据。403. Perform standardized conversion on the deduplicated second database data based on the target data type, the target data format, and the target associated data to obtain standardized conversion data.

本发明实施例中,当前执行端分别基于去重后第二数据库数据的数据类别进行数据模板查询,如当前要转换的数据为客商类别数据,则查询关联关系,将与客商类别对应的数据模板确定为目标数据模板。当前执行端获取目标数据模板的目标关联数据,如客商名称、客商识别码、客商联系地址、客商联系电话等,将上述客商名称、客商识别码、客商联系地址、客商联系电话等数据均转换成目标数据模板所规定的目标数据类型和目标数据格式,即转换后得到的标准化转换数据与第一数据库数据具有相同的标准,便于后期进行数据共享和分析。In the embodiment of the present invention, the current execution end performs data template query based on the data category of the second database data after deduplication. If the data currently to be converted is customer category data, the association relationship is queried and the data template corresponding to the customer category is Determine the target data template. The current execution end obtains the target associated data of the target data template, such as customer name, customer identification code, customer contact address, customer contact number, etc., and converts the above customer name, customer identification code, customer contact address, customer contact number and other data into The target data type and target data format specified by the target data template, that is, the standardized conversion data obtained after conversion have the same standards as the first database data, which facilitates later data sharing and analysis.

进一步的,作为上述实施例具体实施方式的细化和扩展,为了在子公司内实现对第二数据库数据的访问及更新操纵,提供了另一种数据处理方法,所述方法还包括:Further, as a refinement and expansion of the specific implementation of the above embodiment, in order to achieve access and update manipulation of the second database data within the subsidiary, another data processing method is provided, and the method also includes:

接收多个客户端对第二数据库的数据访问,以使得所述客户端获取到所述标准化转换数据;所述客户端通过企业服务总线与所述第二数据库相连接;Receive data access to the second database from multiple clients, so that the clients obtain the standardized conversion data; the clients are connected to the second database through an enterprise service bus;

相对应的,接收多个所述客户端对所述第二数据库的数据更新操作,包括停用、新增和修改。Correspondingly, data update operations on the second database from multiple clients are received, including deactivation, addition and modification.

本发明实施例中,当前执行端接收多个客户端对第二数据库的数据访问请求,基于数据访问请求获取被访问数据,如客商数据、设备数据等,并将被访问数据发送至对应的客户端,以使得客户端获取到标准化转换后的标准化转换数据。其中,各个客户端为子公司为开展公司内部业务用于执行子公司各项服务的客户端,包括生产部门使用的与生产材料相关的客户端,财务部门使用的与公司财务相关的客户端,管理部门使用的与生产管理相关的客户端,物流部门使用的与物流相关的客户端等,本发明实施例不做具体限定。各个客户端通过企业服务总线ESB与第二数据库相连接。相对应的,当前执行端还接收多个客户端对第二数据库的数据更新操作,包括停用、新增、修改等,本发明实施例不做具体限定。In the embodiment of the present invention, the current execution end receives data access requests from multiple clients to the second database, obtains the accessed data based on the data access requests, such as customer data, equipment data, etc., and sends the accessed data to the corresponding client. end, so that the client can obtain the standardized conversion data after standardized conversion. Among them, each client is a client used by the subsidiary to perform various services of the subsidiary to carry out the company's internal business, including clients related to production materials used by the production department, and clients related to company finance used by the financial department. Clients related to production management used by the management department, clients related to logistics used by the logistics department, etc. are not specifically limited in the embodiment of the present invention. Each client is connected to the second database through the enterprise service bus ESB. Correspondingly, the current execution end also receives data update operations on the second database from multiple clients, including deactivation, addition, modification, etc., which are not specifically limited in the embodiment of the present invention.

需要说明的是,当前执行端在接收到客户端的数据更新指令为新增数据之后,先确定待新增数据的数据类别,然后基于数据类别查询数据类别与数据模板之间的关联关系,确定目标数据模板,再基于目标数据模板添加模板规定的标准化数据,包括标准化关联数据、标准化数据类型、标准化数据格式等,本发明实施例不做具体限定。It should be noted that after receiving the client's data update instruction for new data, the current execution end first determines the data category of the data to be added, and then queries the association between the data category and the data template based on the data category to determine the target. data template, and then add standardized data specified by the template based on the target data template, including standardized associated data, standardized data types, standardized data formats, etc., which are not specifically limited in the embodiment of the present invention.

本发明实施例提供了一种数据处理方法,与现有技术相比,本发明通过获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;基于所述标准化转换数据对所述第一数据库数据进行数据更新,可以实现母公司与子公司之间的数据交互和共享,有效避免了数据孤岛的形成,进一步为数据资源决策分析提供重要的数据基础。Embodiments of the present invention provide a data processing method. Compared with the existing technology, the present invention obtains first database data and second database data, and performs data processing on the second database data based on the first database data. Deduplication processing to obtain the second database data after deduplication; obtaining a data template, and performing standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data, so that the standardized conversion data is consistent with The first database data has the same data-related information; data updating of the first database data based on the standardized conversion data can realize data interaction and sharing between the parent company and the subsidiary company, effectively avoiding data islands. The formation further provides an important data basis for decision-making analysis of data resources.

作为对上述图1所示方法的实现,本发明实施例提供了一种数据处理装置,如图5所示,该装置包括:As an implementation of the method shown in Figure 1, an embodiment of the present invention provides a data processing device, as shown in Figure 5. The device includes:

去重模块51,用于获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;The deduplication module 51 is used to obtain the first database data and the second database data, and perform data deduplication processing on the second database data based on the first database data to obtain the deduplicated second database data;

转换模块52,用于获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;The conversion module 52 is used to obtain a data template, and perform standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data, so that the standardized conversion data has the same characteristics as the first database data. The same data-related information;

更新模块53,用于基于所述标准化转换数据对所述第一数据库数据进行数据更新。The update module 53 is configured to update the first database data based on the standardized conversion data.

进一步的,所述装置还包括索引添加模块,用于:Further, the device also includes an index adding module for:

将所述第一数据库数据和所述第二数据库数据划分为多个数据类别;classifying the first database data and the second database data into a plurality of data categories;

为具有相同数据类别的所述第一数据库数据和所述第二数据库数据添加相同的数据索引。The same data index is added to the first database data and the second database data having the same data category.

进一步的,所述去重模块51还用于:Further, the deduplication module 51 is also used to:

获取所述第一数据库数据的第一数据索引,并获取所述第二数据库数据的第二数据索引;Obtain the first data index of the first database data, and obtain the second data index of the second database data;

对所述第一数据索引和所述第二数据索引进行匹配处理,并获取具有相同数据索引的目标第一数据库数据和目标第二数据库数据;Perform matching processing on the first data index and the second data index, and obtain target first database data and target second database data having the same data index;

基于所述目标第一数据库数据对所述目标第二数据库数据进行数据去重处理,得到去重后第二数据库数据。Perform data deduplication processing on the target second database data based on the target first database data to obtain the deduplicated second database data.

进一步的,所述装置还包括模板创建模块,用于:Further, the device also includes a template creation module for:

获取所述第一数据库数据的数据相关信息,所述数据相关信息包括数据类别、数据类型、数据格式和关联数据;Obtain data-related information of the first database data, where the data-related information includes data categories, data types, data formats and associated data;

基于所述数据类别、所述数据类型、所述数据格式和所述关联数据构建与所述数据类别数量相同的数据模板;Construct data templates with the same number as the data categories based on the data category, the data type, the data format and the associated data;

在所述数据模板与所述数据类别之间建立关联关系。An association relationship is established between the data template and the data category.

进一步的,所述转换模块52还用于:Further, the conversion module 52 is also used to:

基于所述去重后第二数据库数据的数据类别,查询所述关联关系,确定目标数据模板;Based on the data category of the second database data after deduplication, query the association relationship and determine the target data template;

获取所述目标数据模板的目标数据类型、目标数据格式和目标关联数据;Obtain the target data type, target data format and target associated data of the target data template;

基于所述目标数据类型、所述目标数据格式和所述目标关联数据对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据。Standardized conversion is performed on the deduplicated second database data based on the target data type, the target data format and the target associated data to obtain standardized conversion data.

进一步的,所述更新模块53还用于:Further, the update module 53 is also used to:

向第一数据库发送携带所述标准化转换数据的更新请求;以使得所述第一数据库基于所述更新请求进行审批操作,以及待所述更新请求审批通过后,将所述标准化转换数据更新至所述第一数据库中。Send an update request carrying the standardized conversion data to the first database; so that the first database performs an approval operation based on the update request, and after the update request is approved, update the standardized conversion data to the in the first database.

进一步的,所述装置还包括接收及更新模块,用于:Further, the device also includes a receiving and updating module for:

接收多个客户端对第二数据库的数据访问,以使得所述客户端获取到所述标准化转换数据;所述客户端通过企业服务总线与所述第二数据库相连接;Receive data access to the second database from multiple clients, so that the clients obtain the standardized conversion data; the clients are connected to the second database through an enterprise service bus;

相对应的,接收多个所述客户端对所述第二数据库的数据更新操作,包括停用、新增和修改。Correspondingly, data update operations on the second database from multiple clients are received, including deactivation, addition and modification.

本发明实施例提供了一种数据处理装置,与现有技术相比,本发明通过获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;基于所述标准化转换数据对所述第一数据库数据进行数据更新,可以实现母公司与子公司之间的数据交互和共享,有效避免了数据孤岛的形成,进一步为数据资源决策分析提供重要的数据基础。Embodiments of the present invention provide a data processing device. Compared with the existing technology, the present invention obtains first database data and second database data, and performs data processing on the second database data based on the first database data. Deduplication processing to obtain the second database data after deduplication; obtaining a data template, and performing standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data, so that the standardized conversion data is consistent with The first database data has the same data-related information; data updating of the first database data based on the standardized conversion data can realize data interaction and sharing between the parent company and the subsidiary company, effectively avoiding data islands. The formation further provides an important data basis for decision-making analysis of data resources.

根据本发明一个实施例提供了一种存储介质,所述存储介质存储有至少一可执行指令,该计算机可执行指令可执行上述任意方法实施例中的数据处理方法。According to an embodiment of the present invention, a storage medium is provided. The storage medium stores at least one executable instruction. The computer-executable instruction can execute the data processing method in any of the above method embodiments.

图6示出了根据本发明一个实施例提供的一种计算机设备的结构示意图,本发明具体实施例并不对计算机设备的具体实现做限定。FIG. 6 shows a schematic structural diagram of a computer device according to an embodiment of the present invention. The specific embodiment of the present invention does not limit the specific implementation of the computer device.

如图6所示,该计算机设备可以包括:处理器(processor)602、通信接口(Communications Interface)604、存储器(memory)606、以及通信总线608。As shown in Figure 6, the computer device may include: a processor (processor) 602, a communications interface (Communications Interface) 604, a memory (memory) 606, and a communication bus 608.

其中:处理器602、通信接口604、以及存储器606通过通信总线608完成相互间的通信。Among them: the processor 602, the communication interface 604, and the memory 606 complete communication with each other through the communication bus 608.

通信接口604,用于与其它设备比如客户端或其它服务器等的网元通信。The communication interface 604 is used to communicate with network elements of other devices such as clients or other servers.

处理器602,用于执行程序610,具体可以执行上述数据处理方法的相关步骤。The processor 602 is used to execute the program 610, and specifically can execute the relevant steps of the above-mentioned data processing method.

具体地,程序610可以包括程序代码,该程序代码包括计算机操作指令。Specifically, program 610 may include program code including computer operating instructions.

处理器602可能是中央处理器CPU,或者是特定集成电路ASIC(ApplicationSpecific Integrated Circuit),或者是被配置成实施本发明实施例的一个或多个集成电路。计算机设备包括的一个或多个处理器,可以是同一类型的处理器,如一个或多个CPU;也可以是不同类型的处理器,如一个或多个CPU以及一个或多个ASIC。The processor 602 may be a central processing unit (CPU), an application specific integrated circuit (ASIC), or one or more integrated circuits configured to implement embodiments of the present invention. The one or more processors included in the computer device may be the same type of processor, such as one or more CPUs; or they may be different types of processors, such as one or more CPUs and one or more ASICs.

存储器606,用于存放程序610。存储器606可能包含高速RAM存储器,也可能还包括非易失性存储器(non-volatile memory),例如至少一个磁盘存储器。Memory 606 is used to store program 610. The memory 606 may include high-speed RAM memory, and may also include non-volatile memory (non-volatile memory), such as at least one disk memory.

程序610具体可以用于使得处理器602执行以下操作:The program 610 may be specifically used to cause the processor 602 to perform the following operations:

获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;Obtain the first database data and the second database data, and perform data deduplication processing on the second database data based on the first database data to obtain the deduplicated second database data;

获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;Obtain a data template, and perform standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data, so that the standardized conversion data and the first database data have the same data-related information;

基于所述标准化转换数据对所述第一数据库数据进行数据更新。Data updating is performed on the first database data based on the standardized conversion data.

显然,本领域的技术人员应该明白,上述的本发明的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本发明不限制于任何特定的硬件和软件结合。Obviously, those skilled in the art should understand that the above-mentioned modules or steps of the present invention can be implemented using general-purpose computing devices. They can be concentrated on a single computing device, or distributed across a network composed of multiple computing devices. , optionally, they may be implemented in program code executable by a computing device, such that they may be stored in a storage device for execution by the computing device, and in some cases, may be in a sequence different from that herein. The steps shown or described are performed either individually as individual integrated circuit modules, or as multiple modules or steps among them as a single integrated circuit module. As such, the invention is not limited to any specific combination of hardware and software.

以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包括在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention and are not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and changes. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and principles of the present invention shall be included in the protection scope of the present invention.

Claims (10)

1.一种数据处理方法,其特征在于,包括:1. A data processing method, characterized by including: 获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;Obtain the first database data and the second database data, and perform data deduplication processing on the second database data based on the first database data to obtain the deduplicated second database data; 获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;Obtain a data template, and perform standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data, so that the standardized conversion data and the first database data have the same data-related information; 基于所述标准化转换数据对所述第一数据库数据进行数据更新。Data updating is performed on the first database data based on the standardized conversion data. 2.根据权利要求1所述的方法,其特征在于,所述获取第一数据库数据和第二数据库数据之前,所述方法还包括:2. The method according to claim 1, characterized in that before obtaining the first database data and the second database data, the method further includes: 将所述第一数据库数据和所述第二数据库数据划分为多个数据类别;classifying the first database data and the second database data into a plurality of data categories; 为具有相同数据类别的所述第一数据库数据和所述第二数据库数据添加相同的数据索引。The same data index is added to the first database data and the second database data having the same data category. 3.根据权利要求1所述的方法,其特征在于,所述基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据包括:3. The method according to claim 1, characterized in that, performing data deduplication processing on the second database data based on the first database data, and obtaining the deduplicated second database data includes: 获取所述第一数据库数据的第一数据索引,并获取所述第二数据库数据的第二数据索引;Obtain the first data index of the first database data, and obtain the second data index of the second database data; 对所述第一数据索引和所述第二数据索引进行匹配处理,并获取具有相同数据索引的目标第一数据库数据和目标第二数据库数据;Perform matching processing on the first data index and the second data index, and obtain target first database data and target second database data having the same data index; 基于所述目标第一数据库数据对所述目标第二数据库数据进行数据去重处理,得到去重后第二数据库数据。Perform data deduplication processing on the target second database data based on the target first database data to obtain the deduplicated second database data. 4.根据权利要求1所述的方法,其特征在于,所述获取数据模板之前,所述方法还包括:4. The method according to claim 1, characterized in that before obtaining the data template, the method further includes: 获取所述第一数据库数据的数据相关信息,所述数据相关信息包括数据类别、数据类型、数据格式和关联数据;Obtain data-related information of the first database data, where the data-related information includes data categories, data types, data formats and associated data; 基于所述数据类别、所述数据类型、所述数据格式和所述关联数据构建与所述数据类别数量相同的数据模板;Construct data templates with the same number as the data categories based on the data category, the data type, the data format and the associated data; 在所述数据模板与所述数据类别之间建立关联关系。An association relationship is established between the data template and the data category. 5.根据权利要求4所述的方法,其特征在于,所述获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据包括:5. The method according to claim 4, characterized in that said obtaining a data template and performing standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data includes: 基于所述去重后第二数据库数据的数据类别,查询所述关联关系,确定目标数据模板;Based on the data category of the second database data after deduplication, query the association relationship and determine the target data template; 获取所述目标数据模板的目标数据类型、目标数据格式和目标关联数据;Obtain the target data type, target data format and target associated data of the target data template; 基于所述目标数据类型、所述目标数据格式和所述目标关联数据对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据。Standardized conversion is performed on the deduplicated second database data based on the target data type, the target data format and the target associated data to obtain standardized conversion data. 6.根据权利要求5所述的方法,其特征在于,所述基于所述标准化转换数据对所述第一数据库数据进行数据更新包括:6. The method of claim 5, wherein updating the first database data based on the standardized conversion data includes: 向第一数据库发送携带所述标准化转换数据的更新请求;以使得所述第一数据库基于所述更新请求进行审批操作,以及待所述更新请求审批通过后,将所述标准化转换数据更新至所述第一数据库中。Send an update request carrying the standardized conversion data to the first database; so that the first database performs an approval operation based on the update request, and after the update request is approved, update the standardized conversion data to the in the first database. 7.根据权利要求1~6任一项所述的方法,其特征在于,所述方法还包括:7. The method according to any one of claims 1 to 6, characterized in that the method further includes: 接收多个客户端对第二数据库的数据访问,以使得所述客户端获取到所述标准化转换数据;所述客户端通过企业服务总线与所述第二数据库相连接;Receive data access to the second database from multiple clients, so that the clients obtain the standardized conversion data; the clients are connected to the second database through an enterprise service bus; 相对应的,接收多个所述客户端对所述第二数据库的数据更新操作,包括停用、新增和修改。Correspondingly, data update operations on the second database from multiple clients are received, including deactivation, addition and modification. 8.一种数据处理装置,其特征在于,包括:8. A data processing device, characterized in that it includes: 去重模块,用于获取第一数据库数据和第二数据库数据,并基于所述第一数据库数据对所述第二数据库数据进行数据去重处理,得到去重后第二数据库数据;A deduplication module, used to obtain the first database data and the second database data, and perform data deduplication processing on the second database data based on the first database data to obtain the deduplicated second database data; 转换模块,用于获取数据模板,并基于所述数据模板对所述去重后第二数据库数据进行标准化转换,得到标准化转换数据,以使得所述标准化转换数据与所述第一数据库数据具有相同的数据相关信息;A conversion module, configured to obtain a data template, and perform standardized conversion on the deduplicated second database data based on the data template to obtain standardized conversion data, so that the standardized conversion data has the same characteristics as the first database data. data related information; 更新模块,用于基于所述标准化转换数据对所述第一数据库数据进行数据更新。An update module, configured to update the first database data based on the standardized conversion data. 9.一种存储介质,所述存储介质中存储有至少一可执行指令,所述可执行指令执行如权利要求1~7中任一项所述的数据处理方法对应的操作。9. A storage medium, wherein at least one executable instruction is stored in the storage medium, and the executable instruction executes an operation corresponding to the data processing method according to any one of claims 1 to 7. 10.一种计算机设备,包括处理器、存储器、通信接口和通信总线,所述处理器、所述存储器和所述通信接口通过所述通信总线完成相互间的通信;10. A computer device, including a processor, a memory, a communication interface and a communication bus. The processor, the memory and the communication interface complete communication with each other through the communication bus; 所述存储器用于存放至少一可执行指令,所述可执行指令使所述处理器执行如权利要求1~7中任一项所述的数据处理方法对应的操作。The memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform operations corresponding to the data processing method according to any one of claims 1 to 7.
CN202310875897.1A 2023-07-17 2023-07-17 Data processing method and device, storage medium and computer equipment Pending CN116955377A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310875897.1A CN116955377A (en) 2023-07-17 2023-07-17 Data processing method and device, storage medium and computer equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310875897.1A CN116955377A (en) 2023-07-17 2023-07-17 Data processing method and device, storage medium and computer equipment

Publications (1)

Publication Number Publication Date
CN116955377A true CN116955377A (en) 2023-10-27

Family

ID=88443900

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310875897.1A Pending CN116955377A (en) 2023-07-17 2023-07-17 Data processing method and device, storage medium and computer equipment

Country Status (1)

Country Link
CN (1) CN116955377A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017193739A1 (en) * 2016-05-10 2017-11-16 中兴通讯股份有限公司 Data sharing method and apparatus
CN112817990A (en) * 2021-01-28 2021-05-18 北京百度网讯科技有限公司 Data processing method and device, electronic equipment and readable storage medium
CN113111101A (en) * 2021-03-18 2021-07-13 武汉中地众思科技有限公司 Database conversion method
CN115577045A (en) * 2022-10-14 2023-01-06 内蒙古科技大学包头医学院 A database synchronization method and system across networks

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017193739A1 (en) * 2016-05-10 2017-11-16 中兴通讯股份有限公司 Data sharing method and apparatus
CN112817990A (en) * 2021-01-28 2021-05-18 北京百度网讯科技有限公司 Data processing method and device, electronic equipment and readable storage medium
CN113111101A (en) * 2021-03-18 2021-07-13 武汉中地众思科技有限公司 Database conversion method
CN115577045A (en) * 2022-10-14 2023-01-06 内蒙古科技大学包头医学院 A database synchronization method and system across networks

Similar Documents

Publication Publication Date Title
CN111488393B (en) virtual blockchain
CN110674152B (en) Data synchronization method and device, storage medium and electronic equipment
CN110070359B (en) Blockchain-based data verification system, method, computing device and storage medium
CN106776780B (en) Data exchange and sharing method and system in cloud environment
US12530339B2 (en) Distributed platform for computation and trusted validation
US20140344327A1 (en) Content interchange bus
WO2018201887A1 (en) Data response method, apparatus, terminal device, and medium
CN104205099A (en) A method and system for centralized issue tracking
CN114546982B (en) Data processing method, device, computer readable medium and electronic device
CN113094560A (en) Data label library construction method, device, equipment and medium based on data middlebox
CN107194794A (en) A kind of corporate tax declaration management system and method
US20220050825A1 (en) Block chain based management of auto regressive database relationships
CN113688418A (en) Engineering order settlement method and system based on block chain data storage
CN116527697B (en) Block chain and IPFS public opinion sharing method and system applied to network system management
CN116028484A (en) Digital asset management method and related products
CN110046155A (en) The update of property data base, data characteristics determination method, device and equipment
CN113269538B (en) Method, device, equipment and storage medium for forwarding parallel approval nodes
CN113973003B (en) Verification method for cloud cluster scheduling storage information based on block chain
CN112699118B (en) Data synchronization method and corresponding device, system and storage medium
CN114896569A (en) Blockchain-based code copyright registration system, method and platform
CN116955377A (en) Data processing method and device, storage medium and computer equipment
CN111984631A (en) Production data migration method, device, computer equipment and storage medium
CN115657901B (en) Service changing method and device based on unified parameters
CN111966666A (en) Data integration management and development method and platform
CN108062277B (en) Electronic credential data access method, device and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination