[go: up one dir, main page]

CN108519965A - Data analysis method and device - Google Patents

Data analysis method and device Download PDF

Info

Publication number
CN108519965A
CN108519965A CN201810214338.5A CN201810214338A CN108519965A CN 108519965 A CN108519965 A CN 108519965A CN 201810214338 A CN201810214338 A CN 201810214338A CN 108519965 A CN108519965 A CN 108519965A
Authority
CN
China
Prior art keywords
field
parsing
analysis mode
target data
analytic parameter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201810214338.5A
Other languages
Chinese (zh)
Inventor
张如松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lianjia Beijing Technology Co Ltd
Original Assignee
Lianjia Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lianjia Beijing Technology Co Ltd filed Critical Lianjia Beijing Technology Co Ltd
Priority to CN201810214338.5A priority Critical patent/CN108519965A/en
Publication of CN108519965A publication Critical patent/CN108519965A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/131Fragmentation of text files, e.g. creating reusable text-blocks; Linking to fragments, e.g. using XInclude; Namespaces

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A kind of data analysis method of present invention offer and device, this method include:The analysis mode parsed to target data is analytically chosen in mode database;Parsing window is called according to the analysis mode;Obtain the analytic parameter corresponding to the analysis mode inputted on the parsing window and the target data;The target data is parsed according to the analytic parameter, generate field result and is shown.A kind of data analysis method and device provided in an embodiment of the present invention, parsing window is called out by choosing the analysis mode parsed to target data in analytically mode database, and obtain the analytic parameter corresponding to the analysis mode inputted on the parsing window and the target data, the target data is parsed further according to the analytic parameter, it generates field result and shows, reach and target data is parsed from multiple analysis modes, the parsing demand for meeting different user, facilitates configuration and maintenance.

Description

Data analysis method and device
Technical field
The present invention relates to technical field of data processing more particularly to a kind of data analysis methods and device.
Background technology
The process that data cleansing to data examine and verify again, it is therefore intended that significant figure is filtered from mass data According to.User extracts required data from data source, by data cleansing, finally according to the data warehouse mould pre-defined Type loads data into data warehouse.And data parsing be cleaning basis, only by source data according to it is certain rule into After row parsing, it could be cleaned.
The data analysis mode of the prior art is realized by regular expression.Regular expression has general, efficient The characteristics of, but the requirement to user is relatively high, needs to have regular expression and grasps and understand well.On the other hand, Other staff in team are when safeguarding resolution rules, it would be desirable to be able to understand the expression formula for understanding original author, and can accurately repair Change, these all acquire a certain degree of difficulty.
Invention content
A kind of data analysis method of offer of the embodiment of the present invention and device, for solving in the prior art, data parsing can not The problem of meeting user demand.
In a first aspect, the embodiment of the present invention provides a kind of data analysis method, including:
The analysis mode parsed to target data is analytically chosen in mode database;
Parsing window is called according to the analysis mode;
Obtain the analytic parameter corresponding to the analysis mode inputted on the parsing window and the target data;
The target data is parsed according to the analytic parameter, generate field result and is shown.
Preferably, if the analysis mode is separator analysis mode, the analytic parameter includes field name, field Serial number in the data of type and field after parsing.
Preferably, if the analysis mode be regular expression analysis mode, the analytic parameter include field name, The regular expression of field type and field.
Preferably, further include:
Either field is chosen from the field result as secondary parsing field, and is chosen to the secondary parsing field The analysis mode parsed;
Obtain the analytic parameter corresponding to the analysis mode;
The target data is parsed according to the analytic parameter, generate secondary field result and is shown.
Second aspect, the embodiment of the present invention provide a kind of data analysis device, including:
Module is chosen, for choosing the analysis mode parsed to target data in analytically mode database;
Display module is called, for calling parsing window according to the analysis mode;
Acquisition module, for obtaining the analytic parameter corresponding to the analysis mode inputted on the parsing window and institute State target data;
Display module is parsed, for being parsed to the target data according to the analytic parameter, generates field result And it shows.
Preferably, if the analysis mode is separator analysis mode, the analytic parameter includes field name, field Serial number in the data of type and field after parsing.
Preferably, if the analysis mode be regular expression analysis mode, the analytic parameter include field name, The regular expression of field type and field.
Preferably, further include secondary parsing module, be used for:
Either field is chosen from the field result as secondary parsing field, and is chosen to the secondary parsing field The analysis mode parsed;
Obtain the analytic parameter corresponding to the analysis mode;
The target data is parsed according to the analytic parameter, generate secondary field result and is shown.
As shown from the above technical solution, a kind of data analysis method and device provided in an embodiment of the present invention, by from solution The analysis mode parsed to target data is chosen in analysis mode database and calls out parsing window, and obtains the parsing window The analytic parameter corresponding to the analysis mode inputted on mouth and the target data, further according to the analytic parameter to described Target data is parsed, and is generated field result and is shown, reach and parsed from multiple analysis modes to target data, meets The parsing demand of different user, facilitates configuration and maintenance.
Description of the drawings
Fig. 1 is the flow diagram for the data analysis method that one embodiment of the invention provides;
Fig. 2 is that the analysis mode that one embodiment of the invention provides chooses interface;
Fig. 3 is the parsing window interface for the separator analysis mode that one embodiment of the invention provides;
Fig. 4 is the parsing window interface for the regular expression analysis mode that one embodiment of the invention provides;
Fig. 5 is the flow diagram for the data analysis method that one embodiment of the invention provides;
Fig. 6 is the resolution interface one for the secondary parsing that one embodiment of the invention provides;
Fig. 7 is the resolution interface two for the secondary parsing that one embodiment of the invention provides;
Fig. 8 is the analysis result interface that one embodiment of the invention provides;
Fig. 9 is the structural schematic diagram for the data analysis device that one embodiment of the invention provides;
Figure 10 is the structural schematic diagram for the data analysis device that one embodiment of the invention provides.
Specific implementation mode
With reference to the accompanying drawings and examples, the specific implementation mode of the present invention is described in further detail.Implement below Example is not limited to the scope of the present invention for illustrating the present invention.
Fig. 1 shows that one embodiment of the invention provides a kind of data analysis method, including:
S11, the analysis mode parsed to target data is analytically chosen in mode database;
S12, parsing window is called according to the analysis mode;
S13, the analytic parameter corresponding to the analysis mode inputted on the parsing window and the number of targets are obtained According to;
S14, the target data is parsed according to the analytic parameter, generate field result and shown.
For above-mentioned steps S11- steps S14, it should be noted that in embodiments of the present invention, in the embodiment of the present invention In, the analysis mode of data is stored in analysis mode database.System is mounted in terminal, can operation display interface.Such as Shown in Fig. 2, user operates in operation interface, to which system can be allowed to get the analysis mode selected by user.At this In inventive embodiments, analysis mode may include separator analysis mode and regular expression analysis mode.
After system gets the analysis mode parsed to target data, system can call corresponding parsing window simultaneously Display.User can input analytic parameter and target data in parsing window, and system is according to the analytic parameter to the target Data are parsed, and are generated field result and are shown.
If as shown in figure 3, the analysis mode be separator analysis mode, the analytic parameter include field name, Serial number in the data of field type and field after parsing.
If the target data of input is 2017-12-31 12:21\thttp://lianjia.com/a/cFrom=uc t400\t10.0.0.12
Separator be t.
Field is respectively time fields and url fields, and field type is the representation of field.
It sorts in the data of serial number field after parsing.
If as shown in figure 4, the analysis mode is regular expression analysis mode, the analytic parameter includes field name Claim, the regular expression of field type and field.
If the target data of input is 2017-12-31 12:21\thttp://lianjia.com/a/cFrom=uc t400\t10.0.0.12
Field is respectively time fields and url fields, and field type is the representation of field.
The corresponding expression formula of each field.
System parses target data, and generate field knot according to the analytic parameter obtained in parsing window input Fruit is simultaneously shown on window.
A kind of data analysis method provided in an embodiment of the present invention, by being chosen to number of targets in analytically mode database Call out parsing window according to the analysis mode parsed, and obtain it is described parsing window on input correspond to the parsing side The analytic parameter of formula and the target data parse the target data further according to the analytic parameter, generate field As a result it and shows, reaches and target data is parsed from multiple analysis modes, meet the parsing demand of different user, conveniently match It sets and safeguards.
Fig. 5 shows a kind of data analysis method provided in an embodiment of the present invention, including:
S21, the analysis mode parsed to target data is analytically chosen in mode database;
S22, parsing window is called according to the analysis mode;
S23, the analytic parameter corresponding to the analysis mode inputted on the parsing window and the number of targets are obtained According to;
S24, the target data is parsed according to the analytic parameter, generate field result and shown;
S25, either field is chosen from the field result as secondary parsing field, and choose to the secondary parsing The analysis mode that field is parsed;Obtain the analytic parameter corresponding to the analysis mode;According to the analytic parameter to institute It states target data to be parsed, generate secondary field result and shows.
It is identical in principle as the step S11- steps S14 of above-described embodiment for above-mentioned steps S21- step S24, This is repeated no more.
For step S25, it should be noted that as shown in Figure 6 and Figure 7, for being obtained through the S24 parsings of step S21- steps The field obtained is as a result, if desired further parsing, then system can call corresponding parsing window.It is input in parsing window Analytic parameter, field, source field, analysis mode, separator, expression formula etc. as shown in Fig. 6 and Fig. 7, system can be according to solution Analysis parameter parses secondary parsing field, obtains final analysis result (as shown in Figure 8).
A kind of data analysis method provided in an embodiment of the present invention, by being chosen to number of targets in analytically mode database Call out parsing window according to the analysis mode parsed, and obtain it is described parsing window on input correspond to the parsing side The analytic parameter of formula and the target data parse the target data further according to the analytic parameter, generate field As a result it and shows, while field result is subjected to secondary parsing, reach and no matter Preliminary Analysis or be contacted again, from multiple solutions Analysis mode parses target data, meets the parsing demand of different user, facilitates configuration and maintenance.
Fig. 9 shows a kind of data analysis device that one embodiment of the invention provides, including chooses module 31, calls display Module 32, acquisition module 33 and parsing display module 34, wherein:
Module 31 is chosen, for choosing the analysis mode parsed to target data in analytically mode database;
Display module 32 is called, for calling parsing window according to the analysis mode;
Acquisition module 33, for obtain the analytic parameter corresponding to the analysis mode inputted on the parsing window with The target data;
Display module 34 is parsed, for being parsed to the target data according to the analytic parameter, generates field knot Fruit simultaneously shows.
Since described device of the embodiment of the present invention is identical as the principle of above-described embodiment the method, for more detailed Explain that details are not described herein for content.
It should be noted that can be by hardware processor (hardware processor) come real in the embodiment of the present invention Existing related function module.
A kind of data analysis device provided in an embodiment of the present invention, by being chosen to number of targets in analytically mode database Call out parsing window according to the analysis mode parsed, and obtain it is described parsing window on input correspond to the parsing side The analytic parameter of formula and the target data parse the target data further according to the analytic parameter, generate field As a result it and shows, reaches and target data is parsed from multiple analysis modes, meet the parsing demand of different user, conveniently match It sets and safeguards.
Figure 10 shows that a kind of data analysis device that one embodiment of the invention provides, including selection module 41, calling are shown Show module 42, acquisition module 43 and parsing display module 44, wherein:
Module 41 is chosen, for choosing the analysis mode parsed to target data in analytically mode database;
Display module 42 is called, for calling parsing window according to the analysis mode;
Acquisition module 43, for obtain the analytic parameter corresponding to the analysis mode inputted on the parsing window with The target data;
Display module 44 is parsed, for being parsed to the target data according to the analytic parameter, generates field knot Fruit simultaneously shows;
Secondary parsing module 45, is used for:
Either field is chosen from the field result as secondary parsing field, and is chosen to the secondary parsing field The analysis mode parsed;
Obtain the analytic parameter corresponding to the analysis mode;
The target data is parsed according to the analytic parameter, generate secondary field result and is shown.
Since described device of the embodiment of the present invention is identical as the principle of above-described embodiment the method, for more detailed Explain that details are not described herein for content.
It should be noted that can be by hardware processor (hardware processor) come real in the embodiment of the present invention Existing related function module.
A kind of data analysis device provided in an embodiment of the present invention, by being chosen to number of targets in analytically mode database Call out parsing window according to the analysis mode parsed, and obtain it is described parsing window on input correspond to the parsing side The analytic parameter of formula and the target data parse the target data further according to the analytic parameter, generate field As a result it and shows, while field result is subjected to secondary parsing, reach and no matter Preliminary Analysis or be contacted again, from multiple solutions Analysis mode parses target data, meets the parsing demand of different user, facilitates configuration and maintenance.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments In included certain features rather than other feature, but the combination of the feature of different embodiments means in of the invention Within the scope of and form different embodiments.For example, in the following claims, embodiment claimed is appointed One of meaning mode can use in any combination.
It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and ability Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference mark between bracket should not be configured to limitations on claims.Word "comprising" does not exclude the presence of not Element or step listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of several different elements and being come by means of properly programmed computer real It is existing.In the unit claims listing several devices, several in these devices can be by the same hardware branch To embody.The use of word first, second, and third does not indicate that any sequence.These words can be explained and be run after fame Claim.
One of ordinary skill in the art will appreciate that:The above embodiments are only used to illustrate the technical solution of the present invention., and It is non-that it is limited;Although present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art It should be understood that:Its still can with technical scheme described in the above embodiments is modified, either to which part or All technical features carries out equivalent replacement;And these modifications or replacements, it does not separate the essence of the corresponding technical solution this hair Bright claim limited range.

Claims (8)

1. a kind of data analysis method, which is characterized in that including:
The analysis mode parsed to target data is analytically chosen in mode database;
Parsing window is called according to the analysis mode;
Obtain the analytic parameter corresponding to the analysis mode inputted on the parsing window and the target data;
The target data is parsed according to the analytic parameter, generate field result and is shown.
If 2. according to the method described in claim 1, it is characterized in that, the analysis mode be separator analysis mode, institute It includes the serial number in the data of field name, field type and field after parsing to state analytic parameter.
If 3. according to the method described in claim 1, it is characterized in that, the analysis mode be regular expression analysis mode, Then the analytic parameter includes the regular expression of field name, field type and field.
4. according to the method described in claim 1, it is characterized in that, further including:
Either field is chosen from the field result as secondary parsing field, and chooses and the secondary parsing field is carried out The analysis mode of parsing;
Obtain the analytic parameter corresponding to the analysis mode;
The target data is parsed according to the analytic parameter, generate secondary field result and is shown.
5. a kind of data analysis device, which is characterized in that including:
Module is chosen, for choosing the analysis mode parsed to target data in analytically mode database;
Display module is called, for calling parsing window according to the analysis mode;
Acquisition module, for obtaining the analytic parameter corresponding to the analysis mode inputted on the parsing window and the mesh Mark data;
Display module is parsed, for parsing, generating field result and showing to the target data according to the analytic parameter Show.
6. device according to claim 6, which is characterized in that if the analysis mode is separator analysis mode, institute It includes the serial number in the data of field name, field type and field after parsing to state analytic parameter.
7. device according to claim 6, which is characterized in that if the analysis mode is regular expression analysis mode, Then the analytic parameter includes the regular expression of field name, field type and field.
8. device according to claim 6, which is characterized in that further include secondary parsing module, be used for:
Either field is chosen from the field result as secondary parsing field, and chooses and the secondary parsing field is carried out The analysis mode of parsing;
Obtain the analytic parameter corresponding to the analysis mode;
The target data is parsed according to the analytic parameter, generate secondary field result and is shown.
CN201810214338.5A 2018-03-15 2018-03-15 Data analysis method and device Withdrawn CN108519965A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810214338.5A CN108519965A (en) 2018-03-15 2018-03-15 Data analysis method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810214338.5A CN108519965A (en) 2018-03-15 2018-03-15 Data analysis method and device

Publications (1)

Publication Number Publication Date
CN108519965A true CN108519965A (en) 2018-09-11

Family

ID=63433735

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810214338.5A Withdrawn CN108519965A (en) 2018-03-15 2018-03-15 Data analysis method and device

Country Status (1)

Country Link
CN (1) CN108519965A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114679504A (en) * 2022-05-27 2022-06-28 成都数联云算科技有限公司 UDP message parsing method and device and computer equipment
US20250156411A1 (en) * 2023-11-13 2025-05-15 Beijing Baidu Netcom Science Technology Co., Ltd. Large language model-based question processing method and apparatus, electronic device, and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102098235A (en) * 2011-01-18 2011-06-15 南京邮电大学 Fishing mail inspection method based on text characteristic analysis
CN106021554A (en) * 2016-05-30 2016-10-12 北京奇艺世纪科技有限公司 Log analysis method and device
CN106709368A (en) * 2016-12-15 2017-05-24 天津交控科技有限公司 Data analysis method and device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102098235A (en) * 2011-01-18 2011-06-15 南京邮电大学 Fishing mail inspection method based on text characteristic analysis
CN106021554A (en) * 2016-05-30 2016-10-12 北京奇艺世纪科技有限公司 Log analysis method and device
CN106709368A (en) * 2016-12-15 2017-05-24 天津交控科技有限公司 Data analysis method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陈小玲 等: "《信息检索与利用》", 31 May 2016 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114679504A (en) * 2022-05-27 2022-06-28 成都数联云算科技有限公司 UDP message parsing method and device and computer equipment
US20250156411A1 (en) * 2023-11-13 2025-05-15 Beijing Baidu Netcom Science Technology Co., Ltd. Large language model-based question processing method and apparatus, electronic device, and storage medium

Similar Documents

Publication Publication Date Title
US20090271351A1 (en) Rules engine test harness
CN106095673A (en) Automated testing method based on WEB interface and system
CN109684209A (en) A kind of method for generating test case, device and electronic equipment
EP1796005A3 (en) A method of configuring a product
CA2909634A1 (en) Testing system
US6938026B2 (en) System and method for implementing quality control rules formulated in accordance with a quality control rule grammar
CN109389518A (en) Association analysis method and device
CN104036187A (en) Method and system for determining computer virus types
CN108829884A (en) data mapping method and device
Harrer et al. Open source versus proprietary software in service-orientation: the case of BPEL engines
CN108519965A (en) Data analysis method and device
CN109102245A (en) A kind of processing method of approval process, system and device
CN107908684A (en) enterprise information searching method and device
WO2018222182A1 (en) Risk assessment system and method
Serna-Montoya et al. Optimal microgrid protection coordination for directional overcurrent relays through mixed-integer linear optimization
CN106130929B (en) The service message automatic processing method and system of internet insurance field based on graph-theoretical algorithm
CN109800255A (en) A kind of generation method of Visual Report Forms, system, equipment and medium
CA2423619A1 (en) Improved system and method for case-based reasoning
CN110019315A (en) A kind of method and apparatus for the parsing of data blood relationship
CN108229907B (en) Rule changing method and system based on expression engine
CN109857654A (en) A kind of method, apparatus and system of the timing flow chart automatically generating test case
CN105553767B (en) Website backdoor file detection method and device
CN113313615A (en) Method and device for quantitatively grading and grading enterprise judicial risks
US7389340B2 (en) Execution sets for generated logs
CN110287118B (en) Test data generation method and software test method of test case

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20180911