CN109947751B - Medical data processing method and device, readable medium and electronic equipment - Google Patents
Medical data processing method and device, readable medium and electronic equipment Download PDFInfo
- Publication number
- CN109947751B CN109947751B CN201811636104.6A CN201811636104A CN109947751B CN 109947751 B CN109947751 B CN 109947751B CN 201811636104 A CN201811636104 A CN 201811636104A CN 109947751 B CN109947751 B CN 109947751B
- Authority
- CN
- China
- Prior art keywords
- medical data
- data
- keywords
- medical
- management system
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 18
- 238000013523 data management Methods 0.000 claims abstract description 85
- 238000012545 processing Methods 0.000 claims abstract description 55
- 238000000034 method Methods 0.000 claims abstract description 37
- 238000004140 cleaning Methods 0.000 claims abstract description 35
- 238000013499 data model Methods 0.000 claims abstract description 20
- 230000014509 gene expression Effects 0.000 claims description 11
- 230000008520 organization Effects 0.000 claims description 9
- 238000006243 chemical reaction Methods 0.000 claims description 5
- 238000012544 monitoring process Methods 0.000 claims description 5
- 238000007726 management method Methods 0.000 claims description 4
- 238000000605 extraction Methods 0.000 claims description 2
- 238000007405 data analysis Methods 0.000 abstract description 9
- 238000004458 analytical method Methods 0.000 abstract description 8
- 230000008569 process Effects 0.000 description 7
- 239000002184 metal Substances 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 230000002452 interceptive effect Effects 0.000 description 3
- 238000004590 computer program Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Landscapes
- Medical Treatment And Welfare Office Work (AREA)
Abstract
The invention discloses a medical data processing method, a device, a readable medium and electronic equipment, wherein the method comprises the following steps: acquiring medical data under a medical data management system; performing data cleaning on the medical data according to at least one preset label name; and carrying out structural processing on the medical data subjected to data cleaning according to a preset data model. According to the technical scheme, when big data analysis is performed on medical data under different medical data management systems, the analysis difficulty is reduced, and the analysis efficiency is improved.
Description
Technical Field
The invention relates to the technical field of computers, in particular to a medical data processing method, a medical data processing device, a readable medium and electronic equipment.
Background
With the development of computer application technology, a medical institution usually deploys a plurality of different medical data management systems to implement corresponding services respectively. In order to realize big data analysis of medical data, medical data under each medical data management system needs to be collected.
Currently, when medical data in a medical data management system is collected, medical data displayed on each application program interface of the medical data management system is mainly collected, and the actually collected medical data is usually a file with a certain label structure.
Medical data respectively acquired from different medical data management systems usually have different data structures, and the acquired medical data may include a large amount of data without referential significance, so that the difficulty in performing big data analysis on the medical data under each medical data management system is high, and the big data analysis efficiency is influenced.
Disclosure of Invention
The invention provides a medical data processing method, a medical data processing device, a readable medium and electronic equipment, which are beneficial to reducing the analysis difficulty and improving the analysis efficiency when large data analysis is carried out on medical data under different medical data management systems.
In a first aspect, the present invention provides a medical data processing method, including:
acquiring medical data under a medical data management system;
performing data cleaning on the medical data according to at least one preset label name;
and carrying out structural processing on the medical data subjected to data cleaning according to a preset data model.
Preferably, ,
further comprising: and coding the content value under at least one target keyword in the medical data subjected to the structured processing.
Preferably, the first and second electrodes are formed of a metal,
further comprising: the medical data which is coded is sent to an external Field Programmable Gate Array (FPGA) accelerator card, so that the FPGA accelerator card compresses the medical data which is coded to form compressed data, and receives the compressed data sent by the FPGA accelerator card.
Preferably, the first and second electrodes are formed of a metal,
further comprising: storing the compressed data; and/or sending the compressed data to an external big data management platform.
Preferably, the first and second electrodes are formed of a metal,
the medical data under the medical data acquisition management system comprises:
determining a request rule of a data request which can be responded to by the medical data management system;
simulating a client program according to the request rule;
and forming a target data request through the simulated client program, sending the target data request to the medical data management system, and receiving the medical data provided by the medical data management system according to the target data request.
Preferably, ,
the request rule for determining the data request which can be responded by the medical data management system comprises the following steps:
monitoring a message sent by a designated terminal which can communicate with a medical data management system;
analyzing the message to obtain a current data request which is sent to the medical data management system by the appointed terminal and carries appointed keywords;
determining the request rule of the current data request as the request rule of the data request which can be received by the medical data management system.
Preferably, the first and second electrodes are formed of a metal,
the specified keywords include any one or more of a patient number and/or a patient name.
In a second aspect, the present invention provides a medical data processing apparatus comprising:
the data acquisition module is used for acquiring medical data under the medical data management system;
the data cleaning module is used for cleaning the medical data according to at least one preset label name;
and the structural processing module is used for carrying out structural processing on the medical data subjected to data cleaning according to a preset data model.
In a third aspect, the invention provides a readable medium comprising executable instructions which, when executed by a processor of an electronic device, cause the electronic device to perform the method according to any one of the first aspect.
In a fourth aspect, the present invention provides an electronic device, comprising a processor and a memory storing execution instructions, wherein when the processor executes the execution instructions stored in the memory, the processor performs the method according to any one of the first aspect.
The invention provides a medical data processing method, a device, a readable medium and electronic equipment, wherein the acquired medical data is usually a file with a certain label structure, and one or more label names and data models can be set by combining with the actual data acquisition requirement; after medical data under the medical data management system is acquired, on one hand, the acquired medical data is firstly subjected to data cleaning according to one or more preset label names, so that all data under each set label name in the acquired medical data are removed, namely, the data without reference meaning in the acquired medical data are removed, and the data volume of the acquired medical data is reduced; on the other hand, the medical data which is cleaned according to the preset data model is subjected to structured processing, so that the medical data acquired from different medical data management systems can be ensured to have the same or similar data structures; the medical data which is subjected to data cleaning and structured processing is used for big data analysis in the subsequent process, so that the analysis difficulty is reduced, and the analysis efficiency is improved.
Further effects of the above-mentioned unconventional preferred modes will be described below in conjunction with specific embodiments.
Drawings
In order to more clearly illustrate the embodiments or the prior art solutions of the present invention, the drawings used in the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments described in the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flowchart of a medical data processing method according to an embodiment of the present invention;
FIG. 2 is a flow chart of another method for processing medical data according to an embodiment of the invention;
fig. 3 is a schematic structural diagram of a medical data processing apparatus according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention will be clearly and completely described below with reference to specific embodiments and corresponding drawings. It is to be understood that the described embodiments are merely exemplary of the invention, and not restrictive of the full scope of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, an embodiment of the present invention provides a medical data processing method, including the following steps:
102, performing data cleaning on the medical data according to at least one preset label name;
and 103, carrying out structural processing on the medical data subjected to data cleaning according to a preset data model.
In the embodiment shown in fig. 1, the acquired medical data is usually a file with a certain tag structure, and one or more tag names and data models can be set according to actual data acquisition requirements; after medical data under the medical data management system is acquired, on one hand, data cleaning is firstly carried out on the acquired medical data according to one or more preset label names, all data under the set label names in the acquired medical data are removed, namely, data without reference meaning in the acquired medical data are removed, and therefore the data volume of the acquired medical data is reduced; on the other hand, the medical data which is cleaned according to the preset data model is subjected to structured processing, so that the medical data acquired from different medical data management systems can be ensured to have the same or similar data structures; the medical data which is subjected to data cleaning and structured processing is used for big data analysis in the subsequent process, so that the analysis difficulty is reduced, and the analysis efficiency is improved.
Here, the medical data management system may be a hospital information system, a laboratory information management system, a radiology information management system, an electronic medical record system, and a medical image archiving and communication system.
The data model can be set by a user according to actual business requirements, and the set data model is mainly used for defining conversion rules for keywords with the same meaning and content values with the same meaning in different medical data.
The process of performing structured processing on the medical data subjected to data cleaning according to the set data model specifically comprises the following steps: firstly, extracting keywords and content values under the keywords from medical data which are subjected to data cleaning, then converting the keywords and the content values according to conversion rules defined by a data model, and finally establishing an incidence relation between the keywords and the content values to form a plurality of key value pairs. For example, a content value corresponding to a keyword 'visit period' is extracted from medical data which is subjected to data cleaning to be 'X week', a data model with preset values can limit the unit of the keyword 'visit period' to be 'day', namely, a conversion rule of the content value corresponding to the keyword 'visit period' is limited, so that the 'X week' can be correspondingly converted into 'Y day' during structuring processing, a key value pair can be formed by the keyword 'visit period' and the content value 'Y day' corresponding to the keyword, and obviously, the medical data which is subjected to structuring processing is formed by a plurality of key value pairs; in this way, different medical data are structured according to the same data model, so that different medical data can be ensured to have the same or similar data structure.
Specifically, the extraction of the keywords and the content values under the keywords from the medical data after completing the data cleansing may be realized by any one of the following two implementation manners.
In the implementation mode 1, when the acquired medical data is an XML (Extensible Markup Language) file or an HTML (HyperText Markup Language) file, the XML file or the HTML file subjected to data cleaning is dynamically sniffed to extract keywords and content values under the keywords. Specifically, an HTML file for completing data cleansing can be described by an XPath having a tree structure, a list node in the XPath is calculated by automatically retrieving the XPath according to a tag structure of the HTML file, and then a node representing a field name (also referred to as a keyword) and a node representing a content value (also referred to as data content) in the XPath are further determined and extracted according to the determined list node.
In the implementation mode 2, one or more regular expressions with corresponding organization structures are set in advance according to various organization structures possibly existing in the medical data under the medical data management system; after the medical data (XML files or HTML files) are cleaned, fuzzy matching is carried out on the set regular expressions and the XML files or HTML files cleaned, text data matched with any regular expression is extracted from the XML files or HTML files cleaned, and the text data which are the same as the set regular expressions in the organization structure are extracted from the files to be analyzed (namely, the original medical data with a certain organization structure in the extracted files are extracted); and analyzing each extracted text data again according to the organization structure of the regular expression matched with the text data so as to extract key words and content values from the text data.
Specifically, the data cleaning of the acquired medical data according to the preset at least one label name can be realized in any one of the following two ways.
In the implementation mode a, all data under each tag name in the medical data (XML file or HTML file) is removed according to each tag name set in advance.
In the implementation B, all data other than the data under each tag name in the medical data (XML file or HTML file) is removed according to each tag name set in advance.
The medical data may include sensitive patient information such as a patient identification number, a mobile phone number and the like, and the medical data which is subjected to structured processing needs to be transmitted to a big data management platform for big data analysis. In order to avoid that the patient information in the medical data is directly leaked to the intruder in the transmission process, in an embodiment of the present invention, the method further includes: and encoding the content value under at least one target keyword in the medical data subjected to the structured processing.
Obviously, a user can set one or more target keywords corresponding to more sensitive patient information to be encoded in a customized manner in combination with an actual service scenario, for example, the target keywords may include "contact phone call" and "identification number", and the encoding method used in encoding processing can be set in a customized manner, and it is only necessary to ensure that the big data management platform can decode the content value under the corresponding target keyword in the encoded medical data according to the decoding method corresponding to the encoding method, so as to obtain the original content value.
In order to save computing resources, storage resources, and network resources, an embodiment of the present invention further includes: and sending the medical data subjected to coding processing to an external FPGA accelerator card, so that the FPGA accelerator card compresses the medical data subjected to coding processing to form compressed data, and receiving the compressed data sent by the FPGA accelerator card.
In the embodiment, the external FPGA accelerator card is used for compressing the medical data subjected to coding processing, so that the computing resources of the electronic equipment for executing the data acquisition task, the data cleaning task, the structured processing task and the coding processing task can be saved, the data volume of the formed compressed data is far smaller than that of the medical data subjected to coding processing, and the storage resources and the network resources of the corresponding electronic equipment can be saved when the compressed data is stored in the subsequent process or sent to the big data management platform.
Obviously, the structured medical data or the encoded medical data may also be derived in other data formats, such as XML, HTML, EXCEL, PDF (Portable Document Format), SWF (Shock Wave Flash), or JSON (JavaScript object notation).
In an embodiment of the present invention, the acquisition of medical data under the medical data management system can be specifically realized through the following steps A1 to A3:
a1, determining a request rule of a data request which can be responded by a medical data management system;
a2, simulating a client program according to the request rule;
and A3, forming a target data request through the simulated client program, sending the target data request to the medical data management system, and receiving medical data provided by the medical data management system according to the target data request.
In this embodiment, the data request that can be received by the medical data management system may specifically be an HTTP (HyperText Transfer Protocol) request; after a request rule of a data request which can be responded by the medical data management system is determined, a client program is simulated according to the request rule, a target data request which can be responded by the medical data management system can be formed and sent by the simulated client program, namely medical data which is provided by the medical data management system according to the target data request and is usually a JSON file, an XML file or an HTML file can be received by the client program, and then corresponding data cleaning and structuring processing can be carried out on the medical data according to file characteristics. Therefore, the medical data under each medical data management system can be directly acquired under the conditions that the original system code is not changed and coordination with a supplier of the medical data management system is not needed, and the method is favorable for quickly acquiring the medical data under a large number of medical data management systems.
In one embodiment of the present invention, step A1 may be implemented by steps a11 to a13 as follows:
a11, monitoring a message sent by a designated terminal which can communicate with a medical data management system;
a12, analyzing the message to obtain a current data request which is sent to the medical data management system by the appointed terminal and carries appointed keywords;
and A13, determining the request rule of the current data request as the request rule of the data request which can be received by the medical data management system.
In this embodiment, a message sent by an appointed terminal can be intercepted by monitoring a network card of the appointed terminal capable of communicating with the medical data management system, a current data request carrying an appointed keyword, sent from the appointed terminal to the medical data management system, can be acquired from the intercepted message by analyzing the intercepted message, a request rule of the current data request can be obtained by analyzing the current data request, and the request rule of the current data request can be determined as a request rule of a data request which can be received by the medical data management system.
It should be noted that the functional module executing a11 may be deployed on a designated terminal, and the functional modules executing other tasks may be independently deployed on another electronic device, and the electronic device may be simultaneously connected to the designated terminals corresponding to the multiple data management systems, so that each functional module deployed on the electronic device executes the same processing flow for the message monitored by each functional module deployed on each designated terminal, thereby implementing parallel acquisition of medical data under multiple medical data management systems.
It should be noted that the specified keywords include, but are not limited to, any one or more of the patient number and/or the patient name.
In order to more clearly illustrate the technical solution and advantages of the present invention, an embodiment of the present invention further provides another medical data processing method, please refer to fig. 2, which may specifically include the following steps.
Here, the specified keyword may be input into the application program interface displayed by the browser and further triggered, or the specified keyword in the application program interface displayed by the browser is directly triggered, so that the specified terminal sends the current data request carrying the specified keyword to the medical data management system in a form of a message through the network card of the specified terminal.
And 204, analyzing the intercepted message to acquire a current data request carrying the specified keywords, which is sent to the medical data management system by the specified terminal.
Here, when the network card of the designated terminal is monitored, all messages sent by the designated terminal through the network card can be monitored, but each message sent by the network card is not completely a current data request sent to the medical data management system, and here, the current data request carrying the designated keyword sent by the designated terminal to the medical data management system can be obtained by analyzing each intercepted message.
In step 205, the obtained current data request is analyzed to determine a request rule of the current data request, and the obtained request rule is used as a request rule of a data request which can be received by the medical data management system.
It will be understood by those skilled in the art that the current data request may specifically be an HTTP request, and the request rules specifically include a request method, a protocol version, a language used by the browser, a body length, and the like specified in the HTTP request.
Here, the target data request has the same request rule as the current data request; meanwhile, the target data request may carry a specified keyword carried by the current data request.
Here, the medical data may be an HTML file, an XML file, or a JSON file.
And step 208, performing data cleaning on the medical data according to at least one preset label name.
And step 209, performing structured processing on the medical data subjected to data cleaning according to a preset data model.
And step 210, coding the content value of each target keyword in the structured medical data.
For example, the target keyword may include "contact phone number" and "identification card number", and a specific phone number and identification card number in the medical data may be encoded in a user-defined encoding manner, so that the phone number and the identification card are prevented from being directly revealed to an intruder.
And step 211, sending the medical data subjected to the coding processing to an external FPGA accelerator card, so that the FPGA accelerator card compresses the medical data subjected to the coding processing to form compressed data, and receiving the compressed data sent by the FPGA accelerator card.
Then, the big data management platform can decompress each received compressed data and decode corresponding content values, so that medical data which are relatively small in data volume and have the same data structure and acquired from each medical data management system can be subjected to de-duplication processing, association among data is established, fusion of medical data under different medical data management systems is achieved, and then big data analysis is carried out.
It should be noted that, in order to more completely acquire medical data under different medical data management systems, the foregoing step 201 to step 210 may be repeatedly performed for each medical data management system, then the electronic device performing the step 201 to step 210 directly performs deduplication processing on a large amount of medical data subjected to structured processing and establishes associations among data, and compresses the medical data subjected to deduplication processing and established associations among data through the FPGA accelerator card.
Referring to fig. 3, based on the same concept as the method embodiment of the present invention, an embodiment of the present invention further provides a medical data management apparatus, including:
the data acquisition module 301 is used for acquiring medical data under the medical data management system;
a data washing module 302, configured to perform data washing on the medical data according to at least one preset tag name;
and the structural processing module 303 is configured to perform structural processing on the medical data subjected to data cleaning according to a preset data model.
In one embodiment of the present invention, the method further comprises: a coding processing module; wherein,
and the coding processing module is used for coding the content value under at least one target keyword in the medical data subjected to the structured processing.
In one embodiment of the present invention, the method further comprises: an interactive processing module; wherein,
the interactive processing module is used for sending the medical data which are coded to an external Field Programmable Gate Array (FPGA) accelerator card, so that the FPGA accelerator card compresses the medical data which are coded to form compressed data, and receives the compressed data sent by the FPGA accelerator card.
In one embodiment of the present invention, the method further comprises: a storage processing module; the storage processing module is used for storing the compressed data.
In an embodiment of the present invention, the interactive processing module is further configured to send the compressed data to an external big data management platform.
In an embodiment of the present invention, the data acquisition module 301 is specifically configured to execute the following steps A1 to A3:
a1, determining a request rule of a data request which can be responded by a medical data management system;
a2, simulating a client program according to the request rule;
and A3, forming a target data request through the simulated client program, sending the target data request to the medical data management system, and receiving medical data provided by the medical data management system according to the target data request.
In an embodiment of the present invention, the data acquisition module 301 is specifically configured to execute the following steps a11 to a13:
a11, monitoring a message sent by a designated terminal which can communicate with a medical data management system;
a12, analyzing the message to obtain a current data request which is sent to the medical data management system by the appointed terminal and carries appointed keywords;
and A13, determining the request rule of the current data request as the request rule of the data request which can be received by the medical data management system.
For convenience of description, the above device embodiments are described with functions divided into various units or modules, and the functions of the units or modules may be implemented in one or more software and/or hardware when implementing the present invention.
Fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention. On the hardware level, the electronic device comprises a processor, and optionally an internal bus, a network interface and a memory. The Memory may include a Memory, such as a Random-Access Memory (RAM), and may further include a non-volatile Memory, such as at least 1 disk Memory. Of course, the electronic device may also include hardware required for other services.
The processor, the network interface, and the memory may be connected to each other via an internal bus, which may be an ISA (Industry standard architecture) bus, a PCI (Peripheral Component Interconnect) bus, an EISA (Extended Industry standard architecture) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one double-headed arrow is shown in FIG. 4, but that does not indicate only one bus or one type of bus.
And the memory is used for storing the execution instruction. In particular, a computer program that can be executed by executing instructions. The memory may include both memory and non-volatile storage and provides execution instructions and data to the processor.
In a possible implementation manner, the processor reads the corresponding execution instruction from the nonvolatile memory into the memory and then executes the execution instruction, and the corresponding execution instruction can also be obtained from other equipment so as to form the medical data processing device on a logic level. The processor executes the execution instructions stored in the memory, so that the medical data processing method provided by any embodiment of the invention is realized through the executed execution instructions.
The method performed by the medical data processing device according to the embodiment of the invention shown in fig. 3 may be applied to or implemented by a processor. The processor may be an integrated circuit chip having signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuits of hardware in a processor or instructions in the form of software. The Processor may be a general-purpose Processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; but also Digital Signal Processors (DSPs), application Specific Integrated Circuits (ASICs), field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. The various methods, steps, and logic blocks disclosed in the embodiments of the present invention may be implemented or performed. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The steps of the method disclosed in connection with the embodiments of the present invention may be directly implemented by a hardware decoding processor, or implemented by a combination of hardware and software modules in the decoding processor. The software modules may be located in ram, flash, rom, prom, or eprom, registers, etc. as is well known in the art. The storage medium is located in a memory, and a processor reads information in the memory and completes the steps of the method in combination with hardware of the processor.
An embodiment of the present invention further provides a readable storage medium, which stores execution instructions, and when the stored execution instructions are executed by a processor of an electronic device, the electronic device can be caused to execute the medical data processing method provided in any embodiment of the present invention, and is specifically configured to execute the method shown in fig. 1.
The electronic device described in the foregoing embodiments may be a computer.
It should be appreciated by those skilled in the art that embodiments of the present invention may be provided as a method or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects.
All the embodiments in the invention are described in a progressive manner, and the same and similar parts among the embodiments can be referred to each other, and each embodiment focuses on the differences from other embodiments. In particular, for the apparatus embodiment, since it is substantially similar to the method embodiment, the description is relatively simple, and reference may be made to the partial description of the method embodiment for relevant points.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrases "comprising a," "8230," "8230," or "comprising" does not exclude the presence of other like elements in a process, method, article, or apparatus comprising the element.
The above description is only an example of the present invention and is not intended to limit the present invention. Various modifications and alterations to this invention will become apparent to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the scope of the claims of the present invention.
Claims (10)
1. A medical data processing method, comprising:
acquiring medical data under a medical data management system, wherein the medical data is a file with a label structure;
performing data cleaning on the medical data according to at least one preset label name;
carrying out structural processing on the medical data subjected to data cleaning according to a preset data model, and specifically comprising the following steps of:
extracting keywords and content values under the keywords from the medical data subjected to data cleaning;
converting the keywords and the content values according to a conversion rule defined by the data model;
establishing an incidence relation between the key words and the content values to form a plurality of key value pairs;
the medical data is an extensible markup language XML file or a hypertext markup language HTML file, and the extraction of keywords and content values under the keywords from the medical data after data cleaning comprises the following steps:
dynamically sniffing the XML file or HTML file subjected to data cleaning to extract keywords and content values under the keywords; or,
extracting keywords and content values under the keywords from XML files or HTML files subjected to data cleaning through a preset regular expression; the preset regular expressions comprise one or more regular expressions with corresponding organization structures, which are arranged according to various organization structures of medical data existing under the medical data management system.
2. The medical data processing method according to claim 1,
and encoding the content value under at least one target keyword in the medical data subjected to the structured processing.
3. The medical data processing method according to claim 2,
the medical data which are coded are sent to an external Field Programmable Gate Array (FPGA) accelerator card, so that the FPGA accelerator card compresses the medical data which are coded to form compressed data, and the compressed data sent by the FPGA accelerator card are received.
4. The medical data processing method according to claim 3,
further comprising: storing the compressed data; and/or sending the compressed data to an external big data management platform.
5. The medical data processing method according to claim 1,
the medical data under the medical data acquisition management system comprises:
determining a request rule of a data request which can be responded to by the medical data management system;
simulating a client program according to the request rule;
and forming a target data request through the simulated client program, sending the target data request to the medical data management system, and receiving the medical data provided by the medical data management system according to the target data request.
6. The medical data processing method according to claim 5,
the request rule for determining the data request which can be responded by the medical data management system comprises the following steps:
monitoring a message sent by a designated terminal which can communicate with a medical data management system;
analyzing the message to obtain a current data request which is sent to the medical data management system by the appointed terminal and carries appointed keywords;
determining the request rule of the current data request as the request rule of the data request which can be received by the medical data management system.
7. The method of claim 6,
the specified keywords include any one or more of a patient number and/or a patient name.
8. A medical data processing apparatus, characterized by comprising:
the data acquisition module is used for acquiring medical data under the medical data management system, and the medical data is a file with a label structure;
the data cleaning module is used for cleaning the medical data according to at least one preset label name;
the structural processing module is used for carrying out structural processing on the medical data which is cleaned according to a preset data model, and specifically comprises the following steps: extracting keywords and content values under the keywords from the medical data subjected to data cleaning; converting the key words and the content values according to a conversion rule defined by the data model; establishing an incidence relation between the key words and the content values to form a plurality of key value pairs; the method for extracting keywords and content values under the keywords from medical data after data cleaning is completed comprises the following steps: dynamically sniffing the XML file or HTML file subjected to data cleaning to extract keywords and content values under the keywords; or extracting keywords and content values under the keywords from the XML file or the HTML file subjected to data cleaning through a preset regular expression; the preset regular expressions comprise one or more regular expressions with corresponding organization structures, which are arranged according to various organization structures of medical data existing under the medical data management system.
9. A readable medium comprising executable instructions which, when executed by a processor of an electronic device, cause the electronic device to perform the method of any one of claims 1 to 7.
10. An electronic device comprising a processor and a memory storing execution instructions, the processor performing the method of any of claims 1-7 when the processor executes the execution instructions stored by the memory.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811636104.6A CN109947751B (en) | 2018-12-29 | 2018-12-29 | Medical data processing method and device, readable medium and electronic equipment |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811636104.6A CN109947751B (en) | 2018-12-29 | 2018-12-29 | Medical data processing method and device, readable medium and electronic equipment |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN109947751A CN109947751A (en) | 2019-06-28 |
| CN109947751B true CN109947751B (en) | 2023-04-07 |
Family
ID=67006475
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201811636104.6A Active CN109947751B (en) | 2018-12-29 | 2018-12-29 | Medical data processing method and device, readable medium and electronic equipment |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN109947751B (en) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111276231B (en) * | 2020-02-27 | 2024-12-06 | 深圳平安医疗健康科技服务有限公司 | Medical data monitoring method, device, computer equipment and storage medium |
| CN113764110B (en) * | 2021-01-29 | 2024-09-24 | 北京京东拓先科技有限公司 | Data processing method, device, electronic equipment and storage medium |
| CN115579118A (en) * | 2022-11-04 | 2023-01-06 | 零氪科技(北京)有限公司 | A medical data management method, system and storage medium based on data fusion |
Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2008111314A1 (en) * | 2007-03-14 | 2008-09-18 | Justsystems Corporation | Data processing apparatus and data processing method |
| CN103034700A (en) * | 2012-12-05 | 2013-04-10 | 北京奇虎科技有限公司 | Rich text content processing method and system |
| CN105808712A (en) * | 2016-03-07 | 2016-07-27 | 陈宽 | Intelligent system and method for converting text type medical reports into structured data |
| CN106845061A (en) * | 2016-11-02 | 2017-06-13 | 百度在线网络技术(北京)有限公司 | Intelligent interrogation system and method |
| CN107704601A (en) * | 2017-10-13 | 2018-02-16 | 中国人民解放军第三军医大学第附属医院 | Big data search method and system, computer-readable storage medium and electronic equipment |
| CN108182963A (en) * | 2017-12-14 | 2018-06-19 | 山东浪潮云服务信息科技有限公司 | A kind of medical data processing method and processing device |
| CN108711443A (en) * | 2018-05-07 | 2018-10-26 | 成都智信电子技术有限公司 | The text data analysis method and device of electronic health record |
| CN108984542A (en) * | 2017-05-31 | 2018-12-11 | 北京京东尚科信息技术有限公司 | Distribution type data collection method and system |
| CN109036576A (en) * | 2018-07-23 | 2018-12-18 | 无锡慧方科技有限公司 | Electronic health record data analysis method, device, computer and readable storage medium storing program for executing |
Family Cites Families (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103473375A (en) * | 2013-09-29 | 2013-12-25 | 方正国际软件有限公司 | Data cleaning method and data cleaning system |
| US20150100602A1 (en) * | 2013-10-03 | 2015-04-09 | Amekc Llc | System and method for third party remote access to personal medical records |
| CN104750813A (en) * | 2015-03-30 | 2015-07-01 | 浪潮集团有限公司 | Data cleaning method based on data reduction model |
| CN106227885A (en) * | 2016-08-08 | 2016-12-14 | 星河互联集团有限公司 | Processing method, device and the terminal of a kind of big data |
| CN106354770A (en) * | 2016-08-22 | 2017-01-25 | 成都众易通科技有限公司 | Data analysis system |
| CN106484828B (en) * | 2016-09-29 | 2020-01-21 | 西南科技大学 | Distributed internet data rapid acquisition system and acquisition method |
| CN106815336A (en) * | 2016-12-31 | 2017-06-09 | 深圳市易特科信息技术有限公司 | Medical data inquiry system and method based on big data associated storage |
| CN106897357B (en) * | 2017-01-04 | 2023-07-18 | 北京京拍档科技股份有限公司 | Method for intelligent crawling network information with verification function |
| CN108320779A (en) * | 2017-01-16 | 2018-07-24 | 医渡云(北京)技术有限公司 | Medical data processing method and processing device |
| US10514827B2 (en) * | 2017-02-10 | 2019-12-24 | Microsoft Technology Licensing, Llc | Resequencing actionable task structures for transforming data |
| CN107731269B (en) * | 2017-10-25 | 2020-06-26 | 山东众阳软件有限公司 | Disease coding method and system based on original diagnosis data and medical record file data |
| CN107766574A (en) * | 2017-11-13 | 2018-03-06 | 天津开心生活科技有限公司 | Data query method and device, date storage method and device |
| CN108334791A (en) * | 2018-02-07 | 2018-07-27 | 冼钇冰 | Medical data storage method, apparatus based on Internet of Things and computer storage media |
-
2018
- 2018-12-29 CN CN201811636104.6A patent/CN109947751B/en active Active
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2008111314A1 (en) * | 2007-03-14 | 2008-09-18 | Justsystems Corporation | Data processing apparatus and data processing method |
| CN103034700A (en) * | 2012-12-05 | 2013-04-10 | 北京奇虎科技有限公司 | Rich text content processing method and system |
| CN105808712A (en) * | 2016-03-07 | 2016-07-27 | 陈宽 | Intelligent system and method for converting text type medical reports into structured data |
| CN106845061A (en) * | 2016-11-02 | 2017-06-13 | 百度在线网络技术(北京)有限公司 | Intelligent interrogation system and method |
| CN108984542A (en) * | 2017-05-31 | 2018-12-11 | 北京京东尚科信息技术有限公司 | Distribution type data collection method and system |
| CN107704601A (en) * | 2017-10-13 | 2018-02-16 | 中国人民解放军第三军医大学第附属医院 | Big data search method and system, computer-readable storage medium and electronic equipment |
| CN108182963A (en) * | 2017-12-14 | 2018-06-19 | 山东浪潮云服务信息科技有限公司 | A kind of medical data processing method and processing device |
| CN108711443A (en) * | 2018-05-07 | 2018-10-26 | 成都智信电子技术有限公司 | The text data analysis method and device of electronic health record |
| CN109036576A (en) * | 2018-07-23 | 2018-12-18 | 无锡慧方科技有限公司 | Electronic health record data analysis method, device, computer and readable storage medium storing program for executing |
Non-Patent Citations (4)
| Title |
|---|
| Li Z 等.Research on, and development of, data extraction and data cleaning technology based on the internet of things.《2017 IEEE International Conference on Computational Science and Engineering (CSE) and IEEE International Conference on Embedded and Ubiquitous Computing (EUC)》.2017,332-341. * |
| 李昊旻.电子病历的标准化结构化方法研究及实践.《中国博士学位论文全文数据库医药卫生科技辑》.2008,(第09期),E053-8. * |
| 谭亚竹.基于XML数据清洗的应用研究.《中国优秀博硕士学位论文全文数据库 (硕士)》.2007,(第05期),I138-345. * |
| 马连浩.Web文本聚类技术及聚类结果可视化研究.《中国优秀硕士学位论文全文数据库信息科技辑》.2009,(第06期),I138-1114. * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN109947751A (en) | 2019-06-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN109947751B (en) | Medical data processing method and device, readable medium and electronic equipment | |
| CN109857992A (en) | Medical data structuring analytic method, device, readable medium and electronic equipment | |
| CN112395251A (en) | Intelligent analysis method and device for data file, electronic equipment and storage medium | |
| EP3480704A1 (en) | Method and apparatus for information interaction | |
| CN111125605B (en) | Page element acquisition method and device | |
| CN112631924A (en) | Automatic testing method and device, computer equipment and storage medium | |
| CN111130987A (en) | Automatic acquisition method and device for AWR report, electronic equipment and storage medium | |
| CN110598135A (en) | Network request processing method and device, computer readable medium and electronic equipment | |
| CN112256557B (en) | Program regression testing method, device, system, computer equipment and storage medium | |
| CN113961632A (en) | Message data processing method, device and electronic device | |
| CN111291288B (en) | Webpage link extraction method and system | |
| CN113239290A (en) | Data analysis method and device for public opinion monitoring and electronic device | |
| CN116821456A (en) | Webpage data extraction method and system based on browser | |
| CN112788090A (en) | Network resource transmission method, device and system | |
| CN112307386A (en) | Information monitoring method, system, electronic device and computer readable storage medium | |
| CN112965767A (en) | Job processing method and apparatus, electronic device, and storage medium | |
| CN111459792A (en) | WEB front-end performance monitoring method, device, electronic device and storage medium | |
| CN114692050A (en) | Page parsing method and device, computer readable medium and electronic device | |
| CN113918434A (en) | Statistical method, device and electronic device for abnormal occurrence rate of application program | |
| CN116521170A (en) | Conversion method, device, equipment and medium of client native application | |
| CN117376852B (en) | Intelligent short message generation and processing method and device, electronic equipment and medium | |
| CN117742900B (en) | Method, device, equipment and storage medium for constructing service call graph | |
| CN111340672A (en) | Method and device for acquiring user real name information and electronic equipment | |
| CN113190305B (en) | Short message processing method and device and terminal equipment | |
| CN115454800B (en) | Log data verification methods, devices, electronic equipment, storage media and products |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |