Disclosure of Invention
The invention aims to provide a method for predicting future technical knowledge flow, which can efficiently and accurately predict the interactive relation and development trend among the future technical fields.
To achieve the above object, the present invention provides a method of predicting a future technical knowledge flow, the method comprising:
Step 1, acquiring the issued patents and technical classifications to which the issued patents belong in each time period and the cited patent literature of each patent;
step 2, aggregating the references among the patents into flows among technologies, and constructing a technical knowledge flow TKF network of each year;
Step 3, calculating the patent data to obtain initial feature vectors of the technical field in the diffusion capacity and the absorption capacity in each time period;
and 4, inputting the initial feature vectors of the technologies in each time period into a technology flow prediction framework to obtain feature vectors of the technology diffusion capacity and the absorption capacity in the next time period, and matching the diffusion capacity and the absorption capacity of different technologies to obtain the flow trend among the final technologies.
Preferably, the technique diffusivity initial feature vector d in step 3 includes:
The technology growth rate, namely the ratio of the authorized quantity of a technology to the total authorized quantity of the technology in the last years;
the amount of technology spread, i.e., the amount of other technical fields to which the knowledge contained in a technology is spread;
The technology diffusivity, i.e., the proportion of the diffusion of a technology to the total diffusion of all technologies.
Preferably, the initial feature vector a of the technical absorption capacity in step3 includes:
The technology growth rate, namely the ratio of the authorized quantity of a technology to the total authorized quantity of the technology in the last years;
the technical absorption, i.e. the amount of knowledge that a technology absorbs comes from other technical fields;
Technical absorptivity, i.e. the proportion of the absorption of a technology to the total absorption of all technologies.
Preferably, step 4 comprises:
Inputting an initial feature vector d of the technical diffusion capability and an initial feature vector a of the technical absorption capability of the technology in each time period into a high-order interaction module HOI of a technical flow prediction framework, and calculating to obtain a feature vector d o of the diffusion capability and a feature vector a o of the absorption capability after surrounding high-order neighbors are polymerized in the technical field;
Inputting the feature vectors d o and a o into a hierarchical transfer module, and obtaining technical diffusion capacity and absorption capacity feature vectors containing hierarchical structure information specific to a technical system through calculation;
Inputting the characteristic vectors of the technical diffusion capacity and the absorption capacity of each time period into a technical flow tracking module to obtain characteristic vectors of the diffusion capacity and the absorption capacity of the technical field in the next time period;
Matching the diffusion capacity and the absorption capacity of the technology in the future time period to obtain the flow probability between two different technical fields;
and inputting the obtained probability and the true value into a loss function, and carrying out gradient update.
Preferably, the calculation formula of the high-order interaction module in step 4 is:
Wherein, AndThe representation technique U i embeds the diffuse and absorptive representations of the k-th layer neighbors,AndRepresenting the outgoing and incoming neighbor sets of technique U i, respectively, after aggregating the higher-order neighbor features of the K-layer, obtaining an updated technique diffusivity representationAnd absorbency representation
Wherein, alpha k is equal to or greater than 0, which represents the importance of the k-th layer neighbor, namely the influence of the k-th layer neighbor on the technology U i.
Preferably, the hierarchical delivery module in step 4 includes:
The upward convergence module is used for converging the information of the lower nodes to the upper nodes, and the module adopts an attention mechanism to automatically learn the weights of the lower nodes to represent the importance of different lower nodes, and the formula is as follows:
Where w ij denotes the weight of the lower node j relative to the upper node i, Is a transformation matrix which is a function of the transformation matrix,Is the offset vector, |g i | is the number of all lower-level technology nodes contained in the upper-level technology node g i, | ij is the normalized weight,Is the final representation of the upper node i;
a downward updating module for updating the upper node Is updated to the lower nodeThe calculation formula is as follows:
Wherein, Is an updated representation of the lower level node,Is a transformation matrix which is a function of the transformation matrix,Is the offset vector and σ is the activation function.
Preferably, in the technical flow tracking module of step 4, a bidirectional long and short time memory model BiLSTM is used for modeling the dynamic characteristics, and the specific formula is as follows:
The diffusion characteristic d= [ d 1,d2,...,dT ] over the continuous period T is input BiLSTM,
dt★=ht
Obtaining a diffusion profile over a period of T+1
According to the technical scheme, initial diffusion characteristics and absorption characteristics of each technical field in a continuous time period are firstly extracted, wherein the initial diffusion characteristics and absorption characteristics comprise the growth rate of the technology, the diffusion capacity and absorption capacity of the technology and the diffusion rate and absorption rate of the technology, then the initial diffusion characteristics and the absorption characteristics are input into a preset technology flow frame, the characteristics sequentially pass through a high-order interaction module and a hierarchical transmission module of the technology to obtain characteristic vectors of the technology in diffusion and absorption aspects in each time period, then the diffusion capacity vectors and absorption capacity vectors of the technology in each time period are input into a technology flow tracking module to obtain diffusion capacity and absorption capacity vectors of the technology in future time periods, and finally the flow probability between two different technical fields is obtained by matching the diffusion capacity and the absorption capacity of the technology. The hierarchical structure property of the existing patent classification system and the high-order flow among technologies are fully utilized, a technology flow network based on patent citation is constructed by using a method of a graph neural network, and the future technology flow direction is predicted according to the dynamic characteristics of the technology. The method overcomes the defects that the prior method mainly depends on expert experience and needs to consume a large amount of manpower, and the characteristics of technical flow and the characteristics of patent hierarchy structure cannot be fully utilized, and can efficiently and accurately predict the future technical flow direction.
Additional features and advantages of the invention will be set forth in the detailed description which follows.
Detailed Description
The following describes specific embodiments of the present invention in detail with reference to the drawings. It should be understood that the detailed description and specific examples, while indicating and illustrating the invention, are not intended to limit the invention.
Referring to fig. 2, the present invention provides a method of predicting future technical knowledge flow, comprising:
Step 1, acquiring the issued patents and technical classifications to which the issued patents belong in each time period and the cited patent literature of each patent;
step 2, aggregating the references among the patents into flows among technologies, and constructing a technical knowledge flow TKF network of each year;
Step 3, calculating the patent data to obtain initial feature vectors of the technical field in the diffusion capacity and the absorption capacity in each time period;
and 4, inputting the initial feature vectors of the technologies in each time period into a technology flow prediction framework to obtain feature vectors of the technology diffusion capacity and the absorption capacity in the next time period, and matching the diffusion capacity and the absorption capacity of different technologies to obtain the flow trend among the final technologies.
Wherein, the technique diffusivity initial feature vector d in step 3 includes:
The technology growth rate, namely the ratio of the authorized quantity of a technology to the total authorized quantity of the technology in the last years;
the amount of technology spread, i.e., the amount of other technical fields to which the knowledge contained in a technology is spread;
The technology diffusivity, i.e., the proportion of the diffusion of a technology to the total diffusion of all technologies.
The initial feature vector a of the technical absorption capacity in step 3 includes:
The technology growth rate, namely the ratio of the authorized quantity of a technology to the total authorized quantity of the technology in the last years;
the technical absorption, i.e. the amount of knowledge that a technology absorbs comes from other technical fields;
Technical absorptivity, i.e. the proportion of the absorption of a technology to the total absorption of all technologies.
Specifically, step 4 includes:
Inputting an initial feature vector d of the technical diffusion capability and an initial feature vector a of the technical absorption capability of the technology in each time period into a high-order interaction module HOI of a technical flow prediction framework, and calculating to obtain a feature vector d o of the diffusion capability and a feature vector a o of the absorption capability after surrounding high-order neighbors are polymerized in the technical field;
Inputting the feature vectors d o and a o into a hierarchical transfer module, and obtaining technical diffusion capacity and absorption capacity feature vectors containing hierarchical structure information specific to a technical system through calculation;
Inputting the characteristic vectors of the technical diffusion capacity and the absorption capacity of each time period into a technical flow tracking module to obtain characteristic vectors of the diffusion capacity and the absorption capacity of the technical field in the next time period;
Matching the diffusion capacity and the absorption capacity of the technology in the future time period to obtain the flow probability between two different technical fields;
and inputting the obtained probability and the true value into a loss function, and carrying out gradient update.
The calculation formula of the high-order interaction module in the step 4 is as follows:
Wherein, AndThe representation technique U i embeds the diffuse and absorptive representations of the k-th layer neighbors,AndRepresenting the outgoing and incoming neighbor sets of technique U i, respectively, after aggregating the higher-order neighbor features of the K-layer, obtaining an updated technique diffusivity representationAnd absorbency representation
Wherein, alpha k is equal to or greater than 0, which represents the importance of the k-th layer neighbor, namely the influence of the k-th layer neighbor on the technology U i.
The hierarchical delivery module in step 4 includes:
The upward convergence module is used for converging the information of the lower nodes to the upper nodes, and the module adopts an attention mechanism to automatically learn the weights of the lower nodes to represent the importance of different lower nodes, and the formula is as follows:
Where w ij denotes the weight of the lower node j relative to the upper node i, Is a transformation matrix which is a function of the transformation matrix,Is the offset vector, |g i | is the number of all lower-level technology nodes contained in the upper-level technology node g i, | ij is the normalized weight,Is the final representation of the upper node i;
a downward updating module for updating the upper node Is updated to the lower nodeThe calculation formula is as follows:
Wherein, Is an updated representation of the lower level node,Is a transformation matrix which is a function of the transformation matrix,Is the offset vector and σ is the activation function.
In addition, in the technical flow tracking module in step 4, a bidirectional long-short-time memory model BiLSTM is used for modeling of dynamic characteristics, and a specific formula is as follows:
The diffusion characteristic d= [ d 1,d2,...,dT ] over the continuous period T is input BiLSTM,
dt★=ht
Obtaining a diffusion profile over a period of T+1
Specifically, as shown in fig. 1, in one embodiment of the present invention, the following steps are sequentially performed:
S101, acquiring the issued patents and technical classifications to which the issued patents belong in each time period, and the cited patent literature of each patent.
In the method provided by the embodiment of the invention, according to the patent data provided by USPTO of the U.S. patent and trademark office and the technical classification provided by CPC of the united patent classification system, the new patents and the technical classification of the new patents and the technical classification and the cited other patents are extracted every year according to years.
S102, aggregating the references among the patents into inter-technology flow, and constructing a technology knowledge flow TKF network of each year.
In the method provided by the embodiment of the invention, if the patent contained in the technology A refers to the patent contained in the technology B for n times, it is considered that there may be a knowledge flow from the technology B to the technology A, wherein n represents the intensity of the flow. Clearly, the larger n, the more knowledge flows between technology a and technology B. In order to remove noise effects and limit the size of the technical flow network, N is thresholded such that the corresponding knowledge flow can only act as a side of the technical knowledge flow network if N > N. The value of N is set according to the specific requirement in the implementation process, and the magnitude of the specific value is determined according to the knowledge flow strength requirement between the technology a and the technology B.
And S103, calculating the patent data to obtain initial characteristic vectors of the technical field in the diffusion capacity and the absorption capacity in each time period.
In the method provided by the embodiment of the invention, the initial eigenvectors of the diffusion capacity and the absorption capacity respectively comprise three values, namely a technical growth rate, a technical diffusion quantity, a technical diffusion rate, a technical growth rate, a technical absorption quantity and a technical absorption rate. Order theAndRepresenting the diffusion and absorption characteristics, respectively, of the technique U i over time t, willAndAs input to the subsequent module.
S104, inputting the initial feature vector d and a of the technology in each time period into a high-order interaction module HOI of a technology flow prediction framework, and calculating to obtain a diffusion capability feature vector d o and an absorption capability feature vector a o after surrounding high-order neighbors are aggregated in the technical field.
In the method provided by the embodiment of the invention, the edges in the TKF network graph can be expressed as the interaction of the source node diffusion capability and the target node absorption capability, so that the two characteristics of the technology are updated by adopting the idea of the interactive graph neural network. In particular, the diffusion characteristics of a technical field are updated by aggregating the absorption characteristics of adjacent technical fields, and vice versa. A simple weighted sum operation is chosen here as operator, taking into account efficiency and effect. The polymerization process is as follows:
Wherein, AndThe representation technique U i embeds the diffuse and absorptive representations of the k-th layer neighbors,AndRepresenting the outgoing and incoming neighbor sets of technology U i, respectively. After aggregating the higher-order neighbor features of the K-layer, an updated technology diffusivity representation is obtainedAnd absorbency representation
Wherein alpha k is equal to or greater than 0, which represents the importance of the k-th layer neighbor, namely the influence of the second rolling neighbor on the technology U i.
S105, inputting the feature vectors into a hierarchical transfer module, and obtaining the feature vectors of the technical diffusion capability and the absorption capability containing the hierarchical structure information specific to the technical system through calculation.
In the method provided by the embodiment of the invention, the technology can be connected through the hierarchical structure of the CPC system. Thus, a more comprehensive technical representation can be generated by using the hierarchical relationship between technologies, and more accurate TKF prediction is realized. Since the above category has an inclusive relationship with the below category, the above category can be represented by aggregating the information of the below category. On the other hand, the information of the higher category can also supplement incomplete information in the lower category, and a hierarchical delivery module is constructed for the purpose of describing the interaction between the upper category and the lower category. The hierarchical transfer module consists of two parts, namely an upward transfer module and a downward update module. In this embodiment, each Section of the CPC may form a dataset, as shown in fig. 2, focusing mainly on the hierarchical structure between the three levels, subgroup, groups and subsection, respectively.
The upward passing module aims to aggregate information from a lower level to a higher level, i.e., subgroup to group, group to subsection. Here, taking the upward pass from subgroup to group as an example, the diffusion characteristics for a group level node are represented by aggregating its diffusion characteristics for its children at the subgroup level. To distinguish the importance of different child nodes, an attention mechanism is applied to automatically learn the weights of the child nodes. The weights representing node j of the subgroup hierarchy to node i of the group hierarchy, denoted by w ij, can be calculated as:
Wherein, Is a transformation matrix which is a function of the transformation matrix,Is the offset vector, |g i | is the number of all lower-level technology nodes contained in the upper-level technology node g i, | ij is the normalized weight,Is an updated diffusion characteristic representation of group level node i. Likewise, the absorption features of technique i at the group level can be obtainedThen, through the upward transfer process from the group level to the subsection level, the diffusion characterization of the technology i at the subsection level is obtainedAnd absorption characterization
In the downward update phase, the lower nodes, i.e., subsection to group and group to subgroup, need to be updated with the information of the upper nodes. Here, taking the subsection level to group level update process as an example, to update the diffusion representation of the nodes in the group level, the diffusion feature of the current node and the diffusion feature of its parent node in the subsection level are spliced, and then nonlinear transformation is adopted to obtain the enhanced diffusion representation:
Wherein, Is a node representation of the updated group hierarchy,Is a transformation matrix which is a function of the transformation matrix,Is the offset vector and σ is the activation function. Likewise, through the information updating from the group level to the subgroup level, the diffusion characteristics of the technology j updated after passing through the level transmission module on the subgroup level can be obtainedAnd absorption features
S106, inputting the characteristic vectors of the technical diffusion capacity and the absorption capacity of each time period into a technical flow tracking module to obtain the characteristic vectors of the diffusion capacity and the absorption capacity of the technical field in the next time period.
In the embodiment of the application, a bidirectional long-short-time memory model BiLSTM is used for modeling dynamic characteristics, and a diffusion characteristic d= [ d 1,d2,...,dT ] in a continuous time period T is input BiLSTM, wherein the specific formula is as follows:
dt★=ht
Obtaining a diffusion profile over a period of T+1
And S107, matching the diffusion capacity and the absorption capacity of the technology in the future time period to obtain the flow probability between two different technical fields.
In this embodiment, the diffusion characteristic at time T+1 is setAnd absorption featuresAnd performing inner product operation, wherein the formula is as follows: The probability of technology i flowing to technology j can be finally obtained for the t+1 time period.
S108, inputting the obtained probability and the true value into a loss function, and carrying out gradient update.
In this embodiment, because the completely new technical flows are difficult to predict, they have never been presented before, and therefore a well-designed objective function is still needed to train the model. In order to more accurately predict new edges without degrading accuracy, a loss function is designed for edges that occur for each time window, with the following formula:
Wherein, Representing a loss function that predicts entirely new edges,Representing a loss function that predicts all edges that occur,Is the total loss function, alpha representsAndAt the position ofThe specific gravity of the mixture is calculated.
The method comprises the steps of firstly extracting initial diffusion characteristics and absorption characteristics of each technical field in a continuous time period, wherein the initial diffusion characteristics and absorption characteristics comprise the growth rate of the technology, the diffusion capacity and absorption capacity of the technology and the diffusion capacity and absorption capacity of the technology, then inputting the initial diffusion characteristics and absorption characteristics into a preset technology flow frame, sequentially passing through a high-order interaction module and a hierarchical transfer module of the technology to obtain characteristic vectors of the technology in diffusion and absorption aspects in each time period, then inputting the diffusion capacity vectors and absorption capacity vectors of the technology in each time period into a technology flow tracking module to obtain diffusion capacity and absorption capacity vectors of the technology in future time periods, and finally obtaining flow probability between two different technical fields by matching the diffusion capacity and the absorption capacity of the technology. The hierarchical structure property of the existing patent classification system and the high-order flow among technologies are fully utilized, a technology flow network based on patent citation is constructed by using a method of a graph neural network, and the future technology flow direction is predicted according to the dynamic characteristics of the technology. The method overcomes the defects that the prior method mainly depends on expert experience and needs to consume a large amount of manpower, and the characteristics of technical flow and the characteristics of patent hierarchy structure cannot be fully utilized, and can efficiently and accurately predict the future technical flow direction.
The preferred embodiments of the present invention have been described in detail above with reference to the accompanying drawings, but the present invention is not limited to the specific details of the above embodiments, and various simple modifications can be made to the technical solution of the present invention within the scope of the technical concept of the present invention, and all the simple modifications belong to the protection scope of the present invention.
In addition, the specific features described in the above embodiments may be combined in any suitable manner, and in order to avoid unnecessary repetition, various possible combinations are not described further.
Moreover, any combination of the various embodiments of the invention can be made without departing from the spirit of the invention, which should also be considered as disclosed herein.