CN113989075B

CN113989075B - Methods for predicting future technological knowledge flows

Info

Publication number: CN113989075B
Application number: CN202111186872.8A
Authority: CN
Inventors: 刘慧杰; 陈恩红; 刘淇; 武晗; 张乐; 于润龙; 刘烨
Original assignee: University of Science and Technology of China USTC
Current assignee: University of Science and Technology of China USTC
Priority date: 2021-10-12
Filing date: 2021-10-12
Publication date: 2025-01-28
Anticipated expiration: 2041-10-12
Also published as: CN113989075A

Abstract

The present invention discloses a method for predicting the future flow of technological knowledge, including: step 1, obtaining the authorized patents and their technical classifications in each time period, as well as the cited patent documents of each patent; step 2, aggregating the citations between the patents into the flow between technologies, and constructing the technical knowledge flow TKF network of each year; step 3, calculating the patent data to obtain the initial feature vectors of the diffusion ability and absorptive ability of the technical field in each time period; step 4, inputting the initial feature vectors of the technology in each time period into the technology flow prediction framework, obtaining the feature vectors of the technology diffusion ability and the feature vectors of the absorptive ability in the next time period, and matching the diffusion ability and absorptive ability of different technologies to obtain the final flow trend between technologies. The method can efficiently and accurately predict the interactive relationship and development trend between future technical fields.

Description

Method for predicting future technical knowledge flow

Technical Field

The invention relates to the technical field of big data analysis, in particular to a method for predicting future technical knowledge flow.

Background

Technical knowledge flow (Technological Knowledge Flow, TKF for short) refers to the directional flow of knowledge from one technical field to another, reflecting one interaction between two different technical fields. In the trend of economic globalization, many high and new technical enterprises face severe innovation pressure, and in order to maintain competitive advantage, they must grasp the technical development trend, and are in progress. In this respect, an effective method is to predict the flow between different technical fields in the future, thereby grasping the development trend of the technology and establishing effective research and development strategy.

TKF analysis is often performed in large-scale scientific literature, where patent mining plays an important role. In the patent arts, each patent often references an already issued patent and at the time of each patent application is assigned one or more technical classification codes according to the patent classification system. Taking the most commonly used joint patent classification system (CPC) at present as an example, each classification code can be regarded as a technical field, and different technologies can be connected in two ways, one is a hierarchical structure inherent to CPC classification, and the other is connected by patent citation. Thus, with the CPC taxonomy and patent citation, we can construct a annual technical knowledge flow graph and predict future technical knowledge flow trends.

Disclosure of Invention

The invention aims to provide a method for predicting future technical knowledge flow, which can efficiently and accurately predict the interactive relation and development trend among the future technical fields.

To achieve the above object, the present invention provides a method of predicting a future technical knowledge flow, the method comprising:

Step 1, acquiring the issued patents and technical classifications to which the issued patents belong in each time period and the cited patent literature of each patent;

step 2, aggregating the references among the patents into flows among technologies, and constructing a technical knowledge flow TKF network of each year;

Step 3, calculating the patent data to obtain initial feature vectors of the technical field in the diffusion capacity and the absorption capacity in each time period;

and 4, inputting the initial feature vectors of the technologies in each time period into a technology flow prediction framework to obtain feature vectors of the technology diffusion capacity and the absorption capacity in the next time period, and matching the diffusion capacity and the absorption capacity of different technologies to obtain the flow trend among the final technologies.

Preferably, the technique diffusivity initial feature vector d in step 3 includes:

The technology growth rate, namely the ratio of the authorized quantity of a technology to the total authorized quantity of the technology in the last years;

the amount of technology spread, i.e., the amount of other technical fields to which the knowledge contained in a technology is spread;

The technology diffusivity, i.e., the proportion of the diffusion of a technology to the total diffusion of all technologies.

Preferably, the initial feature vector a of the technical absorption capacity in step3 includes:

the technical absorption, i.e. the amount of knowledge that a technology absorbs comes from other technical fields;

Technical absorptivity, i.e. the proportion of the absorption of a technology to the total absorption of all technologies.

Preferably, step 4 comprises:

Inputting an initial feature vector d of the technical diffusion capability and an initial feature vector a of the technical absorption capability of the technology in each time period into a high-order interaction module HOI of a technical flow prediction framework, and calculating to obtain a feature vector d ^o of the diffusion capability and a feature vector a ^o of the absorption capability after surrounding high-order neighbors are polymerized in the technical field;

Inputting the feature vectors d ^o and a ^o into a hierarchical transfer module, and obtaining technical diffusion capacity and absorption capacity feature vectors containing hierarchical structure information specific to a technical system through calculation;

Inputting the characteristic vectors of the technical diffusion capacity and the absorption capacity of each time period into a technical flow tracking module to obtain characteristic vectors of the diffusion capacity and the absorption capacity of the technical field in the next time period;

Matching the diffusion capacity and the absorption capacity of the technology in the future time period to obtain the flow probability between two different technical fields;

and inputting the obtained probability and the true value into a loss function, and carrying out gradient update.

Preferably, the calculation formula of the high-order interaction module in step 4 is:

Wherein, AndThe representation technique U _i embeds the diffuse and absorptive representations of the k-th layer neighbors,AndRepresenting the outgoing and incoming neighbor sets of technique U _i, respectively, after aggregating the higher-order neighbor features of the K-layer, obtaining an updated technique diffusivity representationAnd absorbency representation

Wherein, alpha _k is equal to or greater than 0, which represents the importance of the k-th layer neighbor, namely the influence of the k-th layer neighbor on the technology U _i.

Preferably, the hierarchical delivery module in step 4 includes:

The upward convergence module is used for converging the information of the lower nodes to the upper nodes, and the module adopts an attention mechanism to automatically learn the weights of the lower nodes to represent the importance of different lower nodes, and the formula is as follows:

Where w _ij denotes the weight of the lower node j relative to the upper node i, Is a transformation matrix which is a function of the transformation matrix,Is the offset vector, |g _i | is the number of all lower-level technology nodes contained in the upper-level technology node g _i, | _ij is the normalized weight,Is the final representation of the upper node i;

a downward updating module for updating the upper node Is updated to the lower nodeThe calculation formula is as follows:

Wherein, Is an updated representation of the lower level node,Is a transformation matrix which is a function of the transformation matrix,Is the offset vector and σ is the activation function.

Preferably, in the technical flow tracking module of step 4, a bidirectional long and short time memory model BiLSTM is used for modeling the dynamic characteristics, and the specific formula is as follows:

The diffusion characteristic d= [ d ¹,d²,...,d^T ] over the continuous period T is input BiLSTM,

d^t★＝h^t

Obtaining a diffusion profile over a period of T+1

According to the technical scheme, initial diffusion characteristics and absorption characteristics of each technical field in a continuous time period are firstly extracted, wherein the initial diffusion characteristics and absorption characteristics comprise the growth rate of the technology, the diffusion capacity and absorption capacity of the technology and the diffusion rate and absorption rate of the technology, then the initial diffusion characteristics and the absorption characteristics are input into a preset technology flow frame, the characteristics sequentially pass through a high-order interaction module and a hierarchical transmission module of the technology to obtain characteristic vectors of the technology in diffusion and absorption aspects in each time period, then the diffusion capacity vectors and absorption capacity vectors of the technology in each time period are input into a technology flow tracking module to obtain diffusion capacity and absorption capacity vectors of the technology in future time periods, and finally the flow probability between two different technical fields is obtained by matching the diffusion capacity and the absorption capacity of the technology. The hierarchical structure property of the existing patent classification system and the high-order flow among technologies are fully utilized, a technology flow network based on patent citation is constructed by using a method of a graph neural network, and the future technology flow direction is predicted according to the dynamic characteristics of the technology. The method overcomes the defects that the prior method mainly depends on expert experience and needs to consume a large amount of manpower, and the characteristics of technical flow and the characteristics of patent hierarchy structure cannot be fully utilized, and can efficiently and accurately predict the future technical flow direction.

Additional features and advantages of the invention will be set forth in the detailed description which follows.

Drawings

The accompanying drawings are included to provide a further understanding of the invention, and are incorporated in and constitute a part of this specification, illustrate the invention and together with the description serve to explain, without limitation, the invention. In the drawings:

FIG. 1 is a flow chart of a method of predicting future technical knowledge flow in accordance with the present invention;

fig. 2 is a schematic diagram of a method for predicting future technical knowledge flow according to the present invention.

Detailed Description

The following describes specific embodiments of the present invention in detail with reference to the drawings. It should be understood that the detailed description and specific examples, while indicating and illustrating the invention, are not intended to limit the invention.

Referring to fig. 2, the present invention provides a method of predicting future technical knowledge flow, comprising:

Wherein, the technique diffusivity initial feature vector d in step 3 includes:

The initial feature vector a of the technical absorption capacity in step 3 includes:

Specifically, step 4 includes:

The calculation formula of the high-order interaction module in the step 4 is as follows:

The hierarchical delivery module in step 4 includes:

In addition, in the technical flow tracking module in step 4, a bidirectional long-short-time memory model BiLSTM is used for modeling of dynamic characteristics, and a specific formula is as follows:

d^t★＝h^t

Obtaining a diffusion profile over a period of T+1

Specifically, as shown in fig. 1, in one embodiment of the present invention, the following steps are sequentially performed:

S101, acquiring the issued patents and technical classifications to which the issued patents belong in each time period, and the cited patent literature of each patent.

In the method provided by the embodiment of the invention, according to the patent data provided by USPTO of the U.S. patent and trademark office and the technical classification provided by CPC of the united patent classification system, the new patents and the technical classification of the new patents and the technical classification and the cited other patents are extracted every year according to years.

S102, aggregating the references among the patents into inter-technology flow, and constructing a technology knowledge flow TKF network of each year.

In the method provided by the embodiment of the invention, if the patent contained in the technology A refers to the patent contained in the technology B for n times, it is considered that there may be a knowledge flow from the technology B to the technology A, wherein n represents the intensity of the flow. Clearly, the larger n, the more knowledge flows between technology a and technology B. In order to remove noise effects and limit the size of the technical flow network, N is thresholded such that the corresponding knowledge flow can only act as a side of the technical knowledge flow network if N > N. The value of N is set according to the specific requirement in the implementation process, and the magnitude of the specific value is determined according to the knowledge flow strength requirement between the technology a and the technology B.

And S103, calculating the patent data to obtain initial characteristic vectors of the technical field in the diffusion capacity and the absorption capacity in each time period.

In the method provided by the embodiment of the invention, the initial eigenvectors of the diffusion capacity and the absorption capacity respectively comprise three values, namely a technical growth rate, a technical diffusion quantity, a technical diffusion rate, a technical growth rate, a technical absorption quantity and a technical absorption rate. Order theAndRepresenting the diffusion and absorption characteristics, respectively, of the technique U _i over time t, willAndAs input to the subsequent module.

S104, inputting the initial feature vector d and a of the technology in each time period into a high-order interaction module HOI of a technology flow prediction framework, and calculating to obtain a diffusion capability feature vector d ^o and an absorption capability feature vector a ^o after surrounding high-order neighbors are aggregated in the technical field.

In the method provided by the embodiment of the invention, the edges in the TKF network graph can be expressed as the interaction of the source node diffusion capability and the target node absorption capability, so that the two characteristics of the technology are updated by adopting the idea of the interactive graph neural network. In particular, the diffusion characteristics of a technical field are updated by aggregating the absorption characteristics of adjacent technical fields, and vice versa. A simple weighted sum operation is chosen here as operator, taking into account efficiency and effect. The polymerization process is as follows:

Wherein, AndThe representation technique U _i embeds the diffuse and absorptive representations of the k-th layer neighbors,AndRepresenting the outgoing and incoming neighbor sets of technology U _i, respectively. After aggregating the higher-order neighbor features of the K-layer, an updated technology diffusivity representation is obtainedAnd absorbency representation

Wherein alpha _k is equal to or greater than 0, which represents the importance of the k-th layer neighbor, namely the influence of the second rolling neighbor on the technology U _i.

S105, inputting the feature vectors into a hierarchical transfer module, and obtaining the feature vectors of the technical diffusion capability and the absorption capability containing the hierarchical structure information specific to the technical system through calculation.

In the method provided by the embodiment of the invention, the technology can be connected through the hierarchical structure of the CPC system. Thus, a more comprehensive technical representation can be generated by using the hierarchical relationship between technologies, and more accurate TKF prediction is realized. Since the above category has an inclusive relationship with the below category, the above category can be represented by aggregating the information of the below category. On the other hand, the information of the higher category can also supplement incomplete information in the lower category, and a hierarchical delivery module is constructed for the purpose of describing the interaction between the upper category and the lower category. The hierarchical transfer module consists of two parts, namely an upward transfer module and a downward update module. In this embodiment, each Section of the CPC may form a dataset, as shown in fig. 2, focusing mainly on the hierarchical structure between the three levels, subgroup, groups and subsection, respectively.

The upward passing module aims to aggregate information from a lower level to a higher level, i.e., subgroup to group, group to subsection. Here, taking the upward pass from subgroup to group as an example, the diffusion characteristics for a group level node are represented by aggregating its diffusion characteristics for its children at the subgroup level. To distinguish the importance of different child nodes, an attention mechanism is applied to automatically learn the weights of the child nodes. The weights representing node j of the subgroup hierarchy to node i of the group hierarchy, denoted by w _ij, can be calculated as:

Wherein, Is a transformation matrix which is a function of the transformation matrix,Is the offset vector, |g _i | is the number of all lower-level technology nodes contained in the upper-level technology node g _i, | _ij is the normalized weight,Is an updated diffusion characteristic representation of group level node i. Likewise, the absorption features of technique i at the group level can be obtainedThen, through the upward transfer process from the group level to the subsection level, the diffusion characterization of the technology i at the subsection level is obtainedAnd absorption characterization

In the downward update phase, the lower nodes, i.e., subsection to group and group to subgroup, need to be updated with the information of the upper nodes. Here, taking the subsection level to group level update process as an example, to update the diffusion representation of the nodes in the group level, the diffusion feature of the current node and the diffusion feature of its parent node in the subsection level are spliced, and then nonlinear transformation is adopted to obtain the enhanced diffusion representation:

Wherein, Is a node representation of the updated group hierarchy,Is a transformation matrix which is a function of the transformation matrix,Is the offset vector and σ is the activation function. Likewise, through the information updating from the group level to the subgroup level, the diffusion characteristics of the technology j updated after passing through the level transmission module on the subgroup level can be obtainedAnd absorption features

S106, inputting the characteristic vectors of the technical diffusion capacity and the absorption capacity of each time period into a technical flow tracking module to obtain the characteristic vectors of the diffusion capacity and the absorption capacity of the technical field in the next time period.

In the embodiment of the application, a bidirectional long-short-time memory model BiLSTM is used for modeling dynamic characteristics, and a diffusion characteristic d= [ d ¹,d²,...,d^T ] in a continuous time period T is input BiLSTM, wherein the specific formula is as follows:

d^t★＝h^t

Obtaining a diffusion profile over a period of T+1

And S107, matching the diffusion capacity and the absorption capacity of the technology in the future time period to obtain the flow probability between two different technical fields.

In this embodiment, the diffusion characteristic at time T+1 is setAnd absorption featuresAnd performing inner product operation, wherein the formula is as follows: The probability of technology i flowing to technology j can be finally obtained for the t+1 time period.

S108, inputting the obtained probability and the true value into a loss function, and carrying out gradient update.

In this embodiment, because the completely new technical flows are difficult to predict, they have never been presented before, and therefore a well-designed objective function is still needed to train the model. In order to more accurately predict new edges without degrading accuracy, a loss function is designed for edges that occur for each time window, with the following formula:

Wherein, Representing a loss function that predicts entirely new edges,Representing a loss function that predicts all edges that occur,Is the total loss function, alpha representsAndAt the position ofThe specific gravity of the mixture is calculated.

The method comprises the steps of firstly extracting initial diffusion characteristics and absorption characteristics of each technical field in a continuous time period, wherein the initial diffusion characteristics and absorption characteristics comprise the growth rate of the technology, the diffusion capacity and absorption capacity of the technology and the diffusion capacity and absorption capacity of the technology, then inputting the initial diffusion characteristics and absorption characteristics into a preset technology flow frame, sequentially passing through a high-order interaction module and a hierarchical transfer module of the technology to obtain characteristic vectors of the technology in diffusion and absorption aspects in each time period, then inputting the diffusion capacity vectors and absorption capacity vectors of the technology in each time period into a technology flow tracking module to obtain diffusion capacity and absorption capacity vectors of the technology in future time periods, and finally obtaining flow probability between two different technical fields by matching the diffusion capacity and the absorption capacity of the technology. The hierarchical structure property of the existing patent classification system and the high-order flow among technologies are fully utilized, a technology flow network based on patent citation is constructed by using a method of a graph neural network, and the future technology flow direction is predicted according to the dynamic characteristics of the technology. The method overcomes the defects that the prior method mainly depends on expert experience and needs to consume a large amount of manpower, and the characteristics of technical flow and the characteristics of patent hierarchy structure cannot be fully utilized, and can efficiently and accurately predict the future technical flow direction.

The preferred embodiments of the present invention have been described in detail above with reference to the accompanying drawings, but the present invention is not limited to the specific details of the above embodiments, and various simple modifications can be made to the technical solution of the present invention within the scope of the technical concept of the present invention, and all the simple modifications belong to the protection scope of the present invention.

In addition, the specific features described in the above embodiments may be combined in any suitable manner, and in order to avoid unnecessary repetition, various possible combinations are not described further.

Moreover, any combination of the various embodiments of the invention can be made without departing from the spirit of the invention, which should also be considered as disclosed herein.

Claims

1. A method for predicting future technical knowledge flows, characterized in that the method comprises:

Step 1: Obtain the authorized patents and their technical classifications in each time period, as well as the cited patent documents of each patent;

Step 2: Aggregate the citations between the patents into technology flows, and construct a technical knowledge flow TKF network for each year;

Step 3: Calculate the patent data to obtain the initial feature vectors of diffusion capacity and absorption capacity of the technical field in each time period;

Step 4: Input the initial feature vector d of the technology diffusion capability and the initial feature vector a of the technology absorption capability of each time period into the high-order interaction module HOI of the technology flow prediction framework, and calculate the diffusion capability feature vector d ^o and the absorption capability feature vector a ^o after aggregating the surrounding high-order neighbors of the technology field; the calculation formula of the high-order interaction module is:

in, and represents the diffusion and absorption embedding representation of the k-th layer neighbors of technology U _i , and Represent the outflow and inflow neighbor sets of technology U _i respectively; after aggregating the high-order neighbor features of the K layer, the updated technology diffusion capability is obtained and absorptive capacity

Among them, α _k ≥ 0 represents the importance of the k-th layer neighbors, that is, the influence of the k-th layer neighbors on the technology U _i ;

Input the characteristic vectors d ^o and a ^o into the hierarchical transfer module, and obtain the characteristic vectors of technology diffusion capability and absorptive capability containing the hierarchical structure information unique to the technology system through calculation;

Input the characteristic vectors of technology diffusion capacity and absorptive capacity of each time period into the technology flow tracking module to obtain the characteristic vectors of diffusion capacity and absorptive capacity of the technology field in the next time period;

Match the diffusion capacity and absorption capacity of technology in the future time period to obtain the flow probability between two different technological fields;

The obtained probability and true value are input into the loss function for gradient update.

2. The method for predicting future technological knowledge flows according to claim 1, characterized in that the initial characteristic vector d of technological diffusion capability in step 3 comprises:

Technology growth rate, which is the ratio of the number of licenses for a technology to the total number of licenses for that technology in the past few years;

The amount of technological diffusion, that is, the amount of knowledge contained in a technology that spreads to other technological fields;

The technology diffusion rate is the ratio of the diffusion of a technology to the total diffusion of all technologies.

3. The method for predicting future technological knowledge flows according to claim 1, characterized in that the initial characteristic vector a of the technological absorption capacity in step 3 comprises:

Technology absorption, that is, the amount of knowledge absorbed by a technology from other technological fields;

The technology absorption rate is the ratio of the absorption of one technology to the total absorption of all technologies.

4. The method for predicting future technical knowledge flow according to claim 1, wherein the hierarchical transfer module in step 4 comprises:

The upward convergence module is used to aggregate the information of the lower-level nodes to the upper-level nodes. This module uses the attention mechanism to automatically learn the weights of the lower-level nodes to indicate the importance of different lower-level nodes. The formula is as follows:

Among them, w _ij represents the weight of the lower layer node j relative to the upper layer node i, is the transformation matrix, is the offset vector, | _gi | is the number of all lower-level technology nodes contained in the upper-level technology node _gi , _βij is the standardized weight, is the final representation of the upper-level node i;

The downward update module is used to update the upper node The information is updated to the lower nodes The calculation formula is as follows:

in, is the updated representation of the lower-level nodes, is the transformation matrix, is the bias vector and σ is the activation function.

5. The method for predicting future technological knowledge flow according to claim 1 is characterized in that, in the technology flow tracking module of step 4, a bidirectional long short-term memory model BiLSTM is used for modeling dynamic features, and the specific formula is as follows:

Input the diffusion features d = [d ¹ , d ² , ..., d ^T ] in the continuous time period T into BiLSTM,

d ^t★ =h ^t to obtain the diffusion characteristics in the T+1 time period