[go: up one dir, main page]

CN104486154B - The introduction method and device of data - Google Patents

The introduction method and device of data Download PDF

Info

Publication number
CN104486154B
CN104486154B CN201410773630.2A CN201410773630A CN104486154B CN 104486154 B CN104486154 B CN 104486154B CN 201410773630 A CN201410773630 A CN 201410773630A CN 104486154 B CN104486154 B CN 104486154B
Authority
CN
China
Prior art keywords
parameter
url
imported
alias
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410773630.2A
Other languages
Chinese (zh)
Other versions
CN104486154A (en
Inventor
赖华贵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201410773630.2A priority Critical patent/CN104486154B/en
Publication of CN104486154A publication Critical patent/CN104486154A/en
Application granted granted Critical
Publication of CN104486154B publication Critical patent/CN104486154B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Computer And Data Communications (AREA)

Abstract

The invention discloses a kind of introduction method of data and device.Wherein, the introduction method of the data includes:Obtain data to be imported;Judge that, with the presence or absence of agreement parameter in the URL parameter that packet to be imported contains, agreement parameter is the parameter being added in uniform resource position mark URL that data providing is arranged with data importing side;If it is judged that agreement parameter in the URL parameter that packet to be imported contains be present, then the URL parameter value contained packet to be imported is as final argument value;If it is judged that agreement parameter is not present in the URL parameter that packet to be imported contains, then the final argument value for the URL parameter value that packet to be imported contains is determined according to the ginseng mechanism really that presets;And data to be imported are imported according to final argument value.By the present invention, solve importing side in correlation technique and, when importing data according to make an appointment plus ginseng rule, may result in the problem of having imported wrong data.

Description

The introduction method and device of data
Technical field
The present invention relates to internet arena, in particular to the introduction method and device of a kind of data.
Background technology
At present, the parameter that existing data traffic monitoring system is relied primarily in monitoring site URL carrys out the number of tracking website According to flow.This can represent different flow attributions because different URL parameter value, such as, if it is paid for flow, payment Source of flow etc., therefore, data traffic can be accurately tracked by by URL parameter value.
For the data traffic that analyze data flow monitoring system traces into, in the related art, additionally provide corresponding Data traffic import system.Wherein, data traffic monitoring system can be used as data traffic provider, be data traffic introgressive line System provides corresponding data traffic, and data traffic import system can be used as data traffic importing side, and importing comes from data flow Measure the data traffic of monitoring system.
When implementing, for the ease of data analysis, in the related art, importing side can typically make an appointment with provider URL adds the rule of parameter, and specifically, importings side can typically arrange the corresponding URL of data of hope importing parameter with other side Title.For example, importing side can arrange with other side:Show the source of data traffic using utm_source, so, there is provided side is pressed The parameters such as utm_source=baidu can be added in the URL of website as agreed to show that the source of data traffic is " hundred Degree ".And when importing side imports data, then the source of each visit capacity (that is, data traffic) can be determined according to agreement.
However, in actual applications, because the page of website is too many, therefore it can not ensure correct to the addition of each page The parameter appointed.For example, due to the error of staff, utm_source=may not added according to agreement This parameter of baidu, but this parameter of source=baidu is with the addition of, now, if it is desired to be revised as appointing Specification utm_source parameters, then cost is larger.Also, arrange due to provider while from multiple importing sides different Add ginseng rule, same parameters may be caused to have been assigned different meanings, and then data may be caused to import mistake.For example, such as The Tab phenolphthaleinum side of entering A provides the source of utm_source parameter value representative of data flow amounts, and the side of importing B provides utm_source parameters Value represents other meanings, then between two sets of agreements, there is provided side can only take one to use.Based on the above situation, in correlation In technology, it may result in importing side and import wrong data.
For importing side in correlation technique when importing data according to make an appointment plus ginseng rule, importing may result in The problem of wrong data, not yet propose effective solution at present.
The content of the invention
It is a primary object of the present invention to provide the introduction method and device of a kind of data, to solve to import in correlation technique Side may result in the problem of having imported wrong data when importing data according to make an appointment plus ginseng rule.
To achieve these goals, according to an aspect of the invention, there is provided a kind of introduction method of data.This method Including:Obtain data to be imported, above-mentioned packet parameter containing uniform resource position mark URL to be imported and URL URL parameter value, above-mentioned URL parameter and above-mentioned URL parameter value are corresponding;Judge the above-mentioned URL ginsengs that above-mentioned packet to be imported contains With the presence or absence of agreement parameter in number, above-mentioned agreement parameter is that data providing is added to unified resource with what data importing side arranged Parameter in finger URL URL;If it is judged that above-mentioned agreement ginseng in the above-mentioned URL parameter that above-mentioned packet to be imported contains be present Number, then the above-mentioned URL parameter value contained above-mentioned packet to be imported is as final argument value;If it is judged that above-mentioned number to be imported According to comprising above-mentioned URL parameter above-mentioned agreement parameter is not present, then according to preset really ginseng mechanism determine it is above-mentioned to be imported The final argument value for the above-mentioned URL parameter value that packet contains;And above-mentioned data to be imported are imported according to above-mentioned final argument value.
Further, above-mentioned really ginseng mechanism is the alias configuration mechanism of parameter, joins mechanism determination really according to presetting The final argument value for the above-mentioned URL parameter value that above-mentioned packet to be imported contains includes:According to the alias configuration mechanism of above-mentioned parameter Obtain the alias parameter for the above-mentioned URL parameter that above-mentioned packet to be imported contains;Judge the above-mentioned URL that above-mentioned packet to be imported contains It whether there is above-mentioned alias parameter in parameter;And if it is judged that deposited in the above-mentioned URL parameter that above-mentioned packet to be imported contains In above-mentioned alias parameter, then above-mentioned alias parameter is defined as new parameter, and parameter value corresponding to above-mentioned alias parameter is determined For final argument value.
Further, above-mentioned alias parameter includes the first alias parameter for covering above-mentioned agreement parameter, is judging above-mentioned treat Import with the presence or absence of before arranging parameter in the above-mentioned URL parameter that packet contains, above-mentioned introduction method also includes:Judge above-mentioned treat Import in the above-mentioned URL parameter that packet contains and whether there is above-mentioned first alias parameter;If it is judged that above-mentioned data to be imported Comprising above-mentioned URL parameter in above-mentioned first alias parameter be present, then above-mentioned first alias parameter is defined as new parameter, and will Above-mentioned first alias parameter is defined as final argument value as parameter value corresponding to above-mentioned first alias parameter;And if it is determined that Go out and above-mentioned first alias parameter is not present in the above-mentioned URL parameter that above-mentioned packet to be imported contains, then perform and judge above-mentioned to wait to lead The step of entering in the above-mentioned URL parameter that packet contains with the presence or absence of agreement parameter.
Further, the alias parameter of above-mentioned URL parameter is determined in the following manner:Obtain all URL agreement ginseng Number;Each agreement parameter definition one or more alias parameter in respectively above-mentioned all URL agreement parameter;It is determined that The corresponding relation of above-mentioned each agreement parameter and said one or more alias parameters;It is and by said one or more indivedual Name parameter is stored in alias parameter list according to above-mentioned corresponding relation.
Further, before above-mentioned data to be imported are imported according to above-mentioned final argument value, above-mentioned introduction method also wraps Include:Obtain the final argument value of above-mentioned URL parameter value;And judge to obtain above-mentioned URL parameter value final argument value whether into Work(, wherein, if it is judged that obtaining the final argument value failure of above-mentioned URL parameter value, then imported according to null value above-mentioned to be imported Data, if it is judged that obtaining the final argument value success of above-mentioned URL parameter value, then perform and imported according to above-mentioned final argument value The step of above-mentioned data to be imported.
To achieve these goals, according to another aspect of the present invention, there is provided a kind of gatherer of data.The device Including:First acquisition unit, for obtaining data to be imported, above-mentioned packet parameter containing uniform resource position mark URL to be imported With uniform resource position mark URL parameter value, above-mentioned URL parameter and above-mentioned URL parameter value are corresponding;First judging unit, is used for Judge to provide for data with the presence or absence of agreement parameter, above-mentioned agreement parameter in the above-mentioned URL parameter that above-mentioned packet to be imported contains The parameter being added in uniform resource position mark URL that side arranges with data importing side;First determining unit, for judging When above-mentioned agreement parameter be present in the above-mentioned URL parameter that above-mentioned packet to be imported contains, above-mentioned packet to be imported is contained upper URL parameter value is stated as final argument value;Second determining unit, for if it is judged that above-mentioned packet to be imported contain it is above-mentioned When above-mentioned agreement parameter is not present in URL parameter, it is upper to determine that above-mentioned packet to be imported contains according to the ginseng mechanism really that presets State the final argument value of URL parameter value;And data import unit, it is above-mentioned to be imported for being imported according to above-mentioned final argument value Data.
Further, above-mentioned really ginseng mechanism is the alias configuration mechanism of parameter, and above-mentioned second determining unit includes:First obtains Modulus block, for obtaining the other of above-mentioned URL parameter that above-mentioned packet to be imported contains according to the alias configuration mechanism of above-mentioned parameter Name parameter;Judge module, join with the presence or absence of above-mentioned alias in the above-mentioned URL parameter contained for judging above-mentioned packet to be imported Number;And first determining module, for above-mentioned alias be present in above-mentioned URL parameter that above-mentioned packet to be imported contains is judged During parameter, above-mentioned alias parameter is defined as new parameter, and parameter value corresponding to above-mentioned alias parameter is defined as final argument Value.
Further, above-mentioned alias parameter includes the first alias parameter for covering above-mentioned agreement parameter, above-mentioned gatherer Also include:Second judging unit, in above-mentioned URL parameter that above-mentioned packet to be imported contains is judged with the presence or absence of agreement ginseng Before number, judge to whether there is above-mentioned first alias parameter in the above-mentioned URL parameter that above-mentioned packet to be imported contains;3rd determines Unit, will be upper for when above-mentioned first alias parameter be present in judging above-mentioned URL parameter that above-mentioned packet to be imported contains State the first alias parameter and be defined as new parameter, and using above-mentioned first alias parameter as parameter corresponding to above-mentioned first alias parameter Value is defined as final argument value;And if it is judged that it is not present in the above-mentioned URL parameter that above-mentioned packet to be imported contains above-mentioned First alias parameter, then perform the function of above-mentioned first judging unit.
Further, above-mentioned gatherer also includes:4th determining unit, for determining above-mentioned URL in the following manner The alias parameter of parameter, above-mentioned 4th determining unit have:Second acquisition module, for obtaining all URL agreement parameter; Definition module, for be respectively above-mentioned all URL agreement parameter in each agreement parameter definition one or more is other Name parameter;Second determining module, for it is determined that pair of above-mentioned each agreement parameter and said one or more alias parameters It should be related to;And memory module, for said one or more alias parameters to be stored in into alias according to above-mentioned corresponding relation In parameter list.
Further, above-mentioned gatherer also includes:Second acquisition unit, for being imported according to above-mentioned final argument value Before above-mentioned data to be imported, the final argument value of above-mentioned URL parameter value is obtained;And the 3rd judging unit, for judging to obtain Take whether the final argument value of above-mentioned URL parameter value succeeds, wherein, above-mentioned data import unit is additionally operable on acquisition is judged When stating the final argument value failure of URL parameter value, above-mentioned data to be imported are imported according to null value, are judging to obtain above-mentioned URL During the final argument value success of parameter value, the function of above-mentioned data import unit is performed.
Pass through the present invention, using acquisition data to be imported, packet parameter containing uniform resource position mark URL to be imported and system One URLs URL parameter value, URL parameter and URL parameter value are corresponding;Judge in the URL parameter that packet to be imported contains With the presence or absence of agreement parameter, agreement parameter is that data providing is added to URL with what data importing side arranged Parameter in URL;If it is judged that agreement parameter in the URL parameter that packet to be imported contains be present, then by packet to be imported The URL parameter value contained is as final argument value;If it is judged that agreement parameter is not present in the URL parameter that packet to be imported contains, The final argument value for the URL parameter value that packet to be imported contains then is determined according to the ginseng mechanism really that presets;And according to most End condition value imports data to be imported, solves importing side in correlation technique and is importing data according to make an appointment plus ginseng rule When, it may result in the problem of having imported wrong data, and then improve the effect of accuracy that data import.
Brief description of the drawings
The accompanying drawing for forming the part of the application is used for providing a further understanding of the present invention, schematic reality of the invention Apply example and its illustrate to be used to explain the present invention, do not form inappropriate limitation of the present invention.In the accompanying drawings:
Fig. 1 is the flow chart of the introduction method of data according to embodiments of the present invention;And
Fig. 2 is the schematic diagram of the gatherer of data according to embodiments of the present invention.
Embodiment
It should be noted that in the case where not conflicting, the feature in embodiment and embodiment in the application can phase Mutually combination.Describe the present invention in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
In order that those skilled in the art more fully understand application scheme, below in conjunction with the embodiment of the present application Accompanying drawing, the technical scheme in the embodiment of the present application is clearly and completely described, it is clear that described embodiment is only The embodiment of the application part, rather than whole embodiments.Based on the embodiment in the application, ordinary skill people The every other embodiment that member is obtained under the premise of creative work is not made, it should all belong to the model of the application protection Enclose.
It should be noted that term " first " in the description and claims of this application and above-mentioned accompanying drawing, " Two " etc. be for distinguishing similar object, without for describing specific order or precedence.It should be appreciated that so use Data can exchange in the appropriate case, so as to embodiments herein described herein.In addition, term " comprising " and " tool Have " and their any deformation, it is intended that cover it is non-exclusive include, for example, containing series of steps or unit Process, method, system, product or equipment are not necessarily limited to those steps clearly listed or unit, but may include without clear It is listing to Chu or for the intrinsic other steps of these processes, method, product or equipment or unit.
According to an embodiment of the invention, there is provided a kind of introduction method of data.The introduction method of the data can be run On computer-processing equipment.
Fig. 1 is the flow chart of the introduction method of data according to embodiments of the present invention.As shown in figure 1, this method is included such as Under step S102 to step S110:
Step S102, obtain data to be imported.
Packet parameter containing uniform resource position mark URL to be imported and uniform resource position mark URL parameter value, URL parameter It is corresponding with URL parameter value.Wherein, URL parameter and URL parameter value can be in advance added to the parameter on the URL of website, example Such as, utm_source=baidu, or source=baidu etc. can be added on certain URL, wherein, utm_source and Source be added to the parameter on URL, and baidu be then added to the Parameters Parameter Value on URL, utm_source with Baidu is corresponding, or source corresponding with baidu.
Step S104, judge in the URL parameter that packet to be imported contains with the presence or absence of agreement parameter.
Agreement parameter is the ginseng being added in uniform resource position mark URL that data providing is arranged with data importing side Number, that is, after agreement parameter is added in URL, data providing is with data importing side to the meaning representated by agreement parameter Reason and good sense solution is consistent.
Step S106, if it is judged that agreement parameter in the URL parameter that packet to be imported contains be present, then by number to be imported According to comprising URL parameter value as final argument value.
Step S108, if it is judged that agreement parameter is not present in the URL parameter that packet to be imported contains, then basis is set in advance The fixed mechanism of ginseng really determines the final argument value for the URL parameter value that packet to be imported contains.
Step S110, data to be imported are imported according to final argument value.
For example, it is assumed that the agreement parameter for representing data source is utm_source, then when data providing is by utm_ When source=baidu is added on corresponding URL, then it represents that the access corresponding to this URL comes from " Baidu ";And when data are led When the side of entering imports data, utm_source=baidu is found, it is determined that the access corresponding to this URL comes from " Baidu ".Work as data Represent that the access corresponding to this URL comes from " Baidu " source=baidu is added on corresponding URL by provider by mistake when When, when data importing side imports data, source=baidu is found, then will be considered that the access corresponding to this URL is not from " Baidu ", so, when importing data to be imported, the access that with the addition of corresponding to source=baidu URL would not be returned For the access from Baidu.
By the embodiment of the present invention, because before data to be imported are imported, the URL parameter contained to packet to be imported is done Corresponding judgement, only exist arrange parameter when just directly according to original parameter value importing data, and in the absence of about When determining parameter, then the parameter value determined according to true ginseng mechanism imports data, therefore has reached and improved the accuracy that data import Effect.
Preferably, in embodiments of the present invention, really ginseng mechanism can be parameter alias configuration mechanism, step S108 according to Preset ginseng mechanism really and determine that the final argument value for the URL parameter value that packet to be imported contains can include:
S2, the alias parameter for the URL parameter that packet to be imported contains is obtained according to the alias configuration mechanism of parameter;
S4, judge to whether there is alias parameter in the URL parameter that packet to be imported contains;And
S6, if it is judged that alias parameter in the URL parameter that packet to be imported contains be present, then alias parameter is defined as New parameter, and parameter value corresponding to alias parameter is defined as final argument value.
During implementation, can all URL be included about in each website traffic data that data importing side obtains from data providing Determine parameter and its corresponding parameter value, or the alias parameter of agreement parameter and its corresponding parameter value, therefore import data When, for each website traffic data, data importing side can all search alias parameter in alias parameter list, and according to alias Parameter value corresponding to parameter imports data.
It is further preferred that in embodiments of the present invention, alias parameter can include the first alias of covering agreement parameter Parameter, before during S4 judges the URL parameter that packet to be imported contains with the presence or absence of agreement parameter, the introduction method can also wrap Include:
S6, judge to whether there is the first alias parameter in the URL parameter that packet to be imported contains;
S8, if it is judged that the first alias parameter in the URL parameter that packet to be imported contains be present, then the first alias is joined Number is defined as new parameter, and is defined as final argument value using the first alias parameter as parameter value corresponding to the first alias parameter; And
S10, if it is judged that the first alias parameter is not present in the URL parameter that packet to be imported contains, then perform judgement The step of in the URL parameter that packet to be imported contains with the presence or absence of agreement parameter.
During implementation, data importing side determines each website stream from data providing pulling data, and according to following mechanism Measure each parameter value of data:The URL parameter included for each website traffic data, search the alias ginseng of URL parameter Number table, detect whether to cover the alias parameter of original agreement parameter, if so, then each using this alias parameter testing Data, and final argument value is used as using parameter value corresponding to this alias parameter;If it is not, using it is original agreement parameter and Other all alias parameter testings preferentially arrange parameter testing per a data per a data using original, and with this If parameter value corresponding to original agreement parameter, without original agreement parameter, is examined in alias ginseng as final argument value Number, when getting parameter value corresponding to an alias parameter, final argument value is used as using this parameter value.At above-mentioned mechanism Reason data, the data meeting of importing and agreement import consistent.
Preferably, in embodiments of the present invention, the alias parameter of URL parameter can be determined in the following manner:
S12, obtain all URL agreement parameter;
Each one or more alias of agreement parameter definition ginseng in S14, respectively all URL agreement parameter Number;
S16, it is determined that each corresponding relation of agreement parameter and one or more alias parameter;And
S18, one or more alias parameter is stored in alias parameter list according to corresponding relation.
When implementing, the list of all URL parameters can be obtained from existing plus ginseng agreement, and be each URL Parameter specifies one or more alias, can not also specify any alias, wherein, in all alias specified, at most may be used The attribute of original agreement parameter is covered to there is an alias to have.For example, it is assumed that the agreement parameter for representing data source is Utm_source, then can be that it specifies following alias:Source and utm_source1, wherein, utm_source1 be with Cover original agreement parameter utm_source attribute.
It should be noted that alias parameter can be stored in alias parameter list after specifying, and so, Ke Yitong Cross the alias parameter that the mode tabled look-up searches the URL parameter that packet to be imported contains.
Preferably, in embodiments of the present invention, should before step S110 imports data to be imported according to final argument value Introduction method can also include:
S22, obtain the final argument value of URL parameter value;And
Whether S24, the final argument value for judging to obtain URL parameter value succeed,
Wherein, if it is judged that obtaining the final argument value failure of URL parameter value, then number to be imported is imported according to null value According to if it is judged that obtaining the final argument value success of URL parameter value, then execution imports data to be imported according to final argument value The step of.
The present invention is elaborated with a specific embodiment below:
Assuming that data importing side and data providing agreement are as shown in table 1:
Table 1
Arrange the title of parameter Arrange the meaning that the title of parameter represents
utm_source The source of data on flows
utm_term The keyword of data on flows
utm_content The ID of advertisement corresponding to data on flows
And in actual applications, arrange parameter for above-mentioned the first two, do not added according to above-mentioned agreement, for example, It is possible to be omitted prefix utm_;And for utm_term, it is understood that there may be another addition form utm_keyword;Separately Outside, for utm_content parameters, it is likely to be used for representing other information, and actually represents advertisement corresponding to data on flows ID parameter is probably utm_content2.In these situations, can formulate such as the mapping ruler of table 2, i.e. alias configuration mechanism:
Table 2
During implementation, it is desirable to obtain the ginseng corresponding to utm_source, utm_term, utm_content of data to be imported Numerical value, and import data import system, it is assumed that website traffic monitoring data (that is, the data to be imported) such as table 3 got:
Table 3
Flow ID utm_source source utm_term term utm_keyword utm_content2
1 baidu Buy fresh flower 1000
2 baidu google Rose 1001
3 baidu Send florist service 1002
4 google Buy fresh flower 1003
According to above-mentioned data, when implementing of the invention, the final ginseng of each parameter of each flow can be determined respectively Numerical value is as follows:
For utm_source, by reading the mapping relations in alias configuration rule, utm_ is found if searched There is alias source in source parameters, and this alias does not cover original agreement parameter.Therefore, if a data on flows Utm_source parameters are contained, then final argument value are directly used as using parameter value corresponding to this parameter;Otherwise, source is joined Parameter value corresponding to number is as final argument value.It is regular according to these, then it can obtain the utm_ of each data on flows The final argument value of source parameters such as table 4:
Table 4
Flow ID The final argument value of utm_source parameters
1 baidu
2 baidu
3 baidu
4 google
For utm_term, by reading the mapping relations in alias configuration rule, utm_source is found if searched There is alias utm_keyword and term in parameter, and these alias do not cover original agreement parameter.Therefore, principle is same (1) the final argument value such as table 5 of the utm_term parameters of each data on flows can, then be obtained:
Table 5
Flow ID The final argument value of utm_term parameters
1 Buy fresh flower
2 Rose
3 Send florist service
4 Buy fresh flower
For utm_content, by reading the mapping relations in alias configuration rule, if it is all to search discovery Utm_content parameters are all replaced with alias utm_content2, and this alias covers original agreement parameter.Therefore, it is former Reason is same as above, then can obtain the final argument value such as table 6 of the utm_content parameters of each data on flows:
Table 6
Flow ID The final argument value of utm_content parameters
1 1000
2 1001
3 1002
4 1003
For all parameters, final argument value such as table 7:
Table 7
Flow ID utm_source utm_term utm_content
1 baidu Buy fresh flower 1000
2 baidu Rose 1001
3 baidu Send florist service 1002
4 google Buy fresh flower 1003
So, the parameter value consistent with arranging just has been got, and these data have been imported into data importing side.
By the embodiment of the present invention, URL can be added the nonstandard data of ginseng to be mapped on the treaty rule of specification, and then Correct data are directed into appointing system, in such manner, it is possible to effectively handle URL parameter addition mistake or simultaneously using more Conflict caused by set URL parameter addition scheme.
It should be noted that can be in such as one group of computer executable instructions the flow of accompanying drawing illustrates the step of Performed in computer system, although also, show logical order in flow charts, in some cases, can be with not The order being same as herein performs shown or described step.
According to an embodiment of the invention, there is provided a kind of gatherer of data.It should be noted that the embodiment of the present invention The introduction method of the data provided can be performed by the gatherer of the data of the embodiment of the present invention, the embodiment of the present invention Data gatherer can be used for perform the embodiment of the present invention data introduction method.
Fig. 2 is the schematic diagram of the gatherer of data according to embodiments of the present invention.As shown in Fig. 2 the device includes:The One acquiring unit 10, the first judging unit 20, the first determining unit 30, the second determining unit 40 and data import unit 50.
First acquisition unit 10 is used to obtain data to be imported.
Packet parameter containing uniform resource position mark URL to be imported and uniform resource position mark URL parameter value, URL parameter It is corresponding with URL parameter value.Wherein, URL parameter and URL parameter value can be in advance added to the parameter on the URL of website, example Such as, utm_source=baidu, or source=baidu etc. can be added on certain URL, wherein, utm_source and Source be added to the parameter on URL, and baidu be then added to the Parameters Parameter Value on URL, utm_source with Baidu is corresponding, or source corresponding with baidu.
First judging unit 20 is used to judge in the URL parameter that packet to be imported contains with the presence or absence of agreement parameter.
Agreement parameter is the ginseng being added in uniform resource position mark URL that data providing is arranged with data importing side Number, that is, after agreement parameter is added in URL, data providing is with data importing side to the meaning representated by agreement parameter Reason and good sense solution is consistent.
First determining unit 30 is used for when agreement parameter in judging URL parameter that packet to be imported contains be present, will The URL parameter value that packet to be imported contains is as final argument value.
Second determining unit 40 is used for if it is judged that when agreement parameter is not present in the URL parameter that contains of packet to be imported, The final argument value for the URL parameter value that packet to be imported contains is determined according to the ginseng mechanism really that presets.
Data import unit 50 is used to import data to be imported according to final argument value.
For example, it is assumed that the agreement parameter for representing data source is utm_source, then when data providing is by utm_ When source=baidu is added on corresponding URL, then it represents that the access corresponding to this URL comes from " Baidu ";And when data are led When the side of entering imports data, utm_source=baidu is found, it is determined that the access corresponding to this URL comes from " Baidu ".Work as data Represent that the access corresponding to this URL comes from " Baidu " source=baidu is added on corresponding URL by provider by mistake when When, when data importing side imports data, source=baidu is found, then will be considered that the access corresponding to this URL is not from " Baidu ", so, when importing data to be imported, the access that with the addition of corresponding to source=baidu URL would not be returned For the access from Baidu.
By the embodiment of the present invention, because before data to be imported are imported, the URL parameter contained to packet to be imported is done Corresponding judgement, only exist arrange parameter when just directly according to original parameter value importing data, and in the absence of about When determining parameter, then the parameter value determined according to true ginseng mechanism imports data, therefore has reached and improved the accuracy that data import Effect.
Preferably, in embodiments of the present invention, really ginseng mechanism is the alias configuration mechanism of parameter, and the second determining unit can be with Including:First acquisition module, judge module and the first determining module.
First acquisition module is used for the other of the URL parameter contained according to the alias configuration mechanism of parameter acquisition packet to be imported Name parameter;Judge module is used to judge to whether there is alias parameter in the URL parameter that packet to be imported contains;And first determine Module is used to, when alias parameter be present in judging URL parameter that packet to be imported contains, alias parameter be defined as newly joining Number, and parameter value corresponding to alias parameter is defined as final argument value.
During implementation, can all URL be included about in each website traffic data that data importing side obtains from data providing Determine parameter and its corresponding parameter value, or the alias parameter of agreement parameter and its corresponding parameter value, therefore import data When, for each website traffic data, data importing side can all search alias parameter in alias parameter list, and according to alias Parameter value corresponding to parameter imports data.
It is further preferred that in embodiments of the present invention, alias parameter includes the first alias parameter of covering agreement parameter, The gatherer can also include:Second judging unit and the 3rd determining unit.
Second judging unit be used in judging URL parameter that packet to be imported contains with the presence or absence of before arranging parameter, Judge to whether there is the first alias parameter in the URL parameter that packet to be imported contains;3rd determining unit is used to judge to treat Import when the first alias parameter be present in the URL parameter that contains of packet, the first alias parameter is defined as new parameter, and by first Alias parameter is defined as final argument value as parameter value corresponding to the first alias parameter;And if it is judged that data to be imported Comprising URL parameter in the first alias parameter is not present, then perform the function of the first judging unit.
During implementation, data importing side determines each website stream from data providing pulling data, and according to following mechanism Measure each parameter value of data:The URL parameter included for each website traffic data, search the alias ginseng of URL parameter Number table, detect whether to cover the alias parameter of original agreement parameter, if so, then each using this alias parameter testing Data, and final argument value is used as using parameter value corresponding to this alias parameter;If it is not, using it is original agreement parameter and Other all alias parameter testings preferentially arrange parameter testing per a data per a data using original, and with this If parameter value corresponding to original agreement parameter, without original agreement parameter, is examined in alias ginseng as final argument value Number, when getting parameter value corresponding to an alias parameter, final argument value is used as using this parameter value.At above-mentioned mechanism Reason data, the data meeting of importing and agreement import consistent.
Preferably, in embodiments of the present invention, the gatherer can also include:4th determining unit.4th determines list Member is used for the alias parameter for determining URL parameter in the following manner, wherein, the 4th determining unit has:Second acquisition module, determine Adopted module, the second determining module and memory module.
Second acquisition module is used for the agreement parameter for obtaining all URL;Definition module is for respectively all URL's Arrange each agreement parameter definition one or more alias parameter in parameter;Second determining module is used for it is determined that each about Determine the corresponding relation of parameter and one or more alias parameter;And memory module is used for one or more alias parameter It is stored according to corresponding relation in alias parameter list.
When implementing, the list of all URL parameters can be obtained from existing plus ginseng agreement, and be each URL Parameter specifies one or more alias, can not also specify any alias, wherein, in all alias specified, at most may be used The attribute of original agreement parameter is covered to there is an alias to have.For example, it is assumed that the agreement parameter for representing data source is Utm_source, then can be that it specifies following alias:Source and utm_source1, wherein, utm_source1 be with Cover original agreement parameter utm_source attribute.
It should be noted that alias parameter can be stored in alias parameter list after specifying, and so, Ke Yitong Cross the alias parameter that the mode tabled look-up searches the URL parameter that packet to be imported contains.
Preferably, in embodiments of the present invention, the gatherer can also include:Second acquisition unit and the 3rd judges list Member.Second acquisition unit is used for before data to be imported are imported according to final argument value, obtains the final argument of URL parameter value Value;And the 3rd judging unit be used for judge obtain URL parameter value final argument value whether succeed, wherein, data import singly Member is additionally operable to, when judging to obtain the final argument value failure of URL parameter value, import data to be imported according to null value, judging When going out to obtain the final argument value success of URL parameter value, the function of data import unit is performed.
By the embodiment of the present invention, URL can be added the nonstandard data of ginseng to be mapped on the treaty rule of specification, and then Correct data are directed into appointing system, in such manner, it is possible to effectively handle URL parameter addition mistake or simultaneously using more Conflict caused by set URL parameter addition scheme.
Obviously, those skilled in the art should be understood that above-mentioned each module of the invention or each step can be with general Computing device realize that they can be concentrated on single computing device, or be distributed in multiple computing devices and formed Network on, alternatively, they can be realized with the program code that computing device can perform, it is thus possible to they are stored Performed in the storage device by computing device, either they are fabricated to respectively each integrated circuit modules or by they In multiple modules or step be fabricated to single integrated circuit module to realize.So, the present invention is not restricted to any specific Hardware and software combines.
The preferred embodiments of the present invention are the foregoing is only, are not intended to limit the invention, for the skill of this area For art personnel, the present invention can have various modifications and variations.Within the spirit and principles of the invention, that is made any repaiies Change, equivalent substitution, improvement etc., should be included in the scope of the protection.

Claims (10)

  1. A kind of 1. introduction method of data, it is characterised in that including:
    Obtain data to be imported, the packet parameter containing uniform resource position mark URL to be imported and uniform resource position mark URL Parameter value, the URL parameter and the URL parameter value are corresponding;
    Judge that the agreement parameter is data with the presence or absence of agreement parameter in the URL parameter that the packet to be imported contains The parameter being added in uniform resource position mark URL that provider arranges with data importing side;
    If it is judged that the agreement parameter in the URL parameter that the packet to be imported contains be present, then wait to lead by described Enter the URL parameter value that packet contains as final argument value;
    If it is judged that the agreement parameter is not present in the URL parameter that the packet to be imported contains, then basis is set in advance The fixed mechanism of ginseng really determines the final argument value for the URL parameter value that the packet to be imported contains, wherein, the really ginseng machine It is made as the alias configuration mechanism of parameter;And
    The data to be imported are imported according to the final argument value.
  2. 2. the introduction method of data according to claim 1, it is characterised in that the really ginseng mechanism is matched somebody with somebody for the alias of parameter Mechanism is put, the final argument for the URL parameter value that the packet to be imported contains is determined according to the ginseng mechanism really that presets Value includes:
    The alias parameter for the URL parameter that the packet to be imported contains is obtained according to the alias configuration mechanism of the parameter;
    Judge to whether there is the alias parameter in the URL parameter that the packet to be imported contains;And
    If it is judged that the alias parameter in the URL parameter that the packet to be imported contains be present, then by the alias Parameter is defined as new parameter, and parameter value corresponding to the alias parameter is defined as into final argument value.
  3. 3. the introduction method of data according to claim 2, it is characterised in that it is described about that the alias parameter includes covering The first alias parameter of parameter is determined, with the presence or absence of agreement parameter in the URL parameter that the packet to be imported contains is judged Before, the introduction method also includes:
    Judge to whether there is the first alias parameter in the URL parameter that the packet to be imported contains;
    If it is judged that the first alias parameter in the URL parameter that the packet to be imported contains be present, then by described in First alias parameter is defined as new parameter, and using the first alias parameter as parameter value corresponding to the first alias parameter It is defined as final argument value;And
    If it is judged that the first alias parameter is not present in the URL parameter that the packet to be imported contains, then perform The step of judging in the URL parameter that the packet to be imported contains with the presence or absence of agreement parameter.
  4. 4. the introduction method of data according to claim 2, it is characterised in that determine the URL ginsengs in the following manner Several alias parameters:
    Obtain all URL agreement parameter;
    Each agreement parameter definition one or more alias parameter in respectively described all URL agreement parameter;
    Determine the corresponding relation of each agreement parameter and one or more of alias parameters;And
    One or more of alias parameters are stored in alias parameter list according to the corresponding relation.
  5. 5. the introduction method of data according to claim 1, it is characterised in that institute is being imported according to the final argument value Before stating data to be imported, the introduction method also includes:
    Obtain the final argument value of the URL parameter value;And
    Whether the final argument value for judging to obtain the URL parameter value succeeds,
    Wherein, if it is judged that obtaining the final argument value failure of the URL parameter value, then imported according to null value described to be imported Data, if it is judged that obtaining the final argument value success of the URL parameter value, then perform and imported according to the final argument value The step of data to be imported.
  6. A kind of 6. gatherer of data, it is characterised in that including:
    First acquisition unit, for obtaining data to be imported, the packet parameter containing uniform resource position mark URL to be imported and Uniform resource position mark URL parameter value, the URL parameter and the URL parameter value are corresponding;
    First judging unit, with the presence or absence of agreement parameter, institute in the URL parameter contained for judging the packet to be imported It is the parameter being added in uniform resource position mark URL that data providing is arranged with data importing side to state agreement parameter;
    First determining unit, for the agreement ginseng in the URL parameter that the packet to be imported contains is judged be present During number, the URL parameter value that the packet to be imported is contained is as final argument value;
    Second determining unit, for if it is judged that the agreement is not present in the URL parameter that the packet to be imported contains During parameter, the final argument for the URL parameter value that the packet to be imported contains is determined according to the ginseng mechanism really that presets Value, wherein, the really ginseng mechanism is the alias configuration mechanism of parameter;And
    Data import unit, for importing the data to be imported according to the final argument value.
  7. 7. the gatherer of data according to claim 6, it is characterised in that the really ginseng mechanism is matched somebody with somebody for the alias of parameter Mechanism is put, second determining unit includes:
    First acquisition module, described in being contained according to the alias configuration mechanism of the parameter acquisition packet to be imported The alias parameter of URL parameter;
    Judge module, it whether there is the alias parameter in the URL parameter contained for judging the packet to be imported;With And
    First determining module, for the alias ginseng in the URL parameter that the packet to be imported contains is judged be present During number, the alias parameter is defined as new parameter, and parameter value corresponding to the alias parameter is defined as final argument value.
  8. 8. the gatherer of data according to claim 7, it is characterised in that it is described about that the alias parameter includes covering Determine the first alias parameter of parameter, the gatherer also includes:
    Second judging unit, in the URL parameter that the packet to be imported contains is judged with the presence or absence of agreement parameter Before, judge to whether there is the first alias parameter in the URL parameter that the packet to be imported contains;
    3rd determining unit, it is other for having described first in the URL parameter that the packet to be imported contains is judged During name parameter, the first alias parameter is defined as new parameter, and using the first alias parameter as first alias Parameter value corresponding to parameter is defined as final argument value;And
    If it is judged that the first alias parameter is not present in the URL parameter that the packet to be imported contains, then perform The function of first judging unit.
  9. 9. the gatherer of data according to claim 7, it is characterised in that the gatherer also includes:4th is true Order member, for determining the alias parameter of the URL parameter in the following manner, the 4th determining unit has:
    Second acquisition module, for obtaining all URL agreement parameter;
    Definition module, for be respectively all URL agreement parameter in each agreement parameter definition one or more Alias parameter;
    Second determining module, for it is determined that each agreement parameter pass corresponding with one or more of alias parameters System;And
    Memory module, for one or more of alias parameters to be stored in into alias parameter list according to the corresponding relation In.
  10. 10. the gatherer of data according to claim 6, it is characterised in that the gatherer also includes:
    Second acquisition unit, for before the data to be imported are imported according to the final argument value, obtaining the URL ginsengs The final argument value of numerical value;And
    Whether the 3rd judging unit, the final argument value for judging to obtain the URL parameter value succeed,
    Wherein, the data import unit is additionally operable to, when judging to obtain the final argument value failure of the URL parameter value, press The data to be imported are imported according to null value, when judging to obtain the final argument value success of the URL parameter value, described in execution The function of data import unit.
CN201410773630.2A 2014-12-12 2014-12-12 The introduction method and device of data Active CN104486154B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410773630.2A CN104486154B (en) 2014-12-12 2014-12-12 The introduction method and device of data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410773630.2A CN104486154B (en) 2014-12-12 2014-12-12 The introduction method and device of data

Publications (2)

Publication Number Publication Date
CN104486154A CN104486154A (en) 2015-04-01
CN104486154B true CN104486154B (en) 2017-12-19

Family

ID=52760661

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410773630.2A Active CN104486154B (en) 2014-12-12 2014-12-12 The introduction method and device of data

Country Status (1)

Country Link
CN (1) CN104486154B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1574750A (en) * 2003-06-20 2005-02-02 西门子医疗健康服务公司 System supporting communication between a web enabled application and another application
CN101576885A (en) * 2008-05-08 2009-11-11 韩露 Technical scheme for extracting dynamic generation web page contents
CN103294781A (en) * 2013-05-14 2013-09-11 百度在线网络技术(北京)有限公司 Method and equipment used for processing page data
CN103886024A (en) * 2014-02-24 2014-06-25 上海上讯信息技术股份有限公司 Database auditing method and system based on multilayer business association

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101426464B1 (en) * 2007-12-17 2014-08-06 삼성전자주식회사 Method and apparatus for extracting service quality information parameters from a mobile communication device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1574750A (en) * 2003-06-20 2005-02-02 西门子医疗健康服务公司 System supporting communication between a web enabled application and another application
CN101576885A (en) * 2008-05-08 2009-11-11 韩露 Technical scheme for extracting dynamic generation web page contents
CN103294781A (en) * 2013-05-14 2013-09-11 百度在线网络技术(北京)有限公司 Method and equipment used for processing page data
CN103886024A (en) * 2014-02-24 2014-06-25 上海上讯信息技术股份有限公司 Database auditing method and system based on multilayer business association

Also Published As

Publication number Publication date
CN104486154A (en) 2015-04-01

Similar Documents

Publication Publication Date Title
JP6511455B2 (en) Method and system for extracting features of user behavior and personalizing recommendations
US7827166B2 (en) Handling dynamic URLs in crawl for better coverage of unique content
RU2648966C2 (en) Apparatus and method for processing multiple open api
CN109241415A (en) Item recommendation method, device, computer equipment and storage medium
WO2016107523A1 (en) Access path analysis method and apparatus for website
CN104408102B (en) For network hot word and the data processing method and device of the degree of association of object
CN103631957B (en) Visitor's behavioral data statistical method and device
CN103839172B (en) Method of Commodity Recommendation and system
JP2013519134A (en) Search query processing
US20160306887A1 (en) Methods, apparatuses and systems for linked and personalized extended search
WO2016045567A1 (en) Webpage data analysis method and device
CN106547749B (en) Webpage data acquisition method and device
CN102323955A (en) Private cloud searching system and implement method thereof
CN110020161B (en) Data processing method, log processing method and terminal
CN106446113A (en) Mobile big data analysis method and device
CN112532489A (en) Internet of things equipment identification method and system and storage medium
CN103593444B (en) Internet Keyword identifying processing method and apparatus
CN104408180A (en) Stored data inquiring method and device
CN106354622B (en) Test the methods of exhibiting and device of webpage
CN104391953B (en) Detect the method and device of webpage renewal
US10643178B1 (en) Asynchronous real-time procurement system
CN103778156A (en) Method and device for searching for data and server for data search
US11314795B2 (en) User navigation in a target portal
CN104486154B (en) The introduction method and device of data
CN103902715B (en) IP range lookup method and apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Data lead-in method and device

Effective date of registration: 20190531

Granted publication date: 20171219

Pledgee: Shenzhen Black Horse World Investment Consulting Co.,Ltd.

Pledgor: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd.

Registration number: 2019990000503

PE01 Entry into force of the registration of the contract for pledge of patent right
CP02 Change in the address of a patent holder

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Patentee after: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd.

Address before: 100086 Beijing city Haidian District Shuangyushu Area No. 76 Zhichun Road cuigongfandian 8 layer A

Patentee before: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd.

CP02 Change in the address of a patent holder
PP01 Preservation of patent right

Effective date of registration: 20240604

Granted publication date: 20171219

PP01 Preservation of patent right