
CN114936296B - Indexing method, system and computer equipment for super-large-scale knowledge map storage - Google Patents


Info

Publication number
CN114936296B
CN114936296B (application CN202210874965.8A)
Authority
CN
China
Prior art keywords
input
vector
entity
knowledge graph
triple
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210874965.8A
Other languages
Chinese (zh)
Other versions
CN114936296A (en)
Inventor
王文广 (Wang Wenguang)
陈运文 (Chen Yunwen)
纪达麒 (Ji Daqi)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Daguan Data Chengdu Co ltd
Original Assignee
Daguan Data Chengdu Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Daguan Data Chengdu Co ltd filed Critical Daguan Data Chengdu Co ltd
Priority to CN202210874965.8A
Publication of CN114936296A
Application granted
Publication of CN114936296B
Legal status: Active
Anticipated expiration


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F 16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F 16/367 Ontology
    • G06F 16/31 Indexing; Data structures therefor; Storage structures
    • G06F 16/316 Indexing structures
    • G06F 16/325 Hash tables
    • G06F 16/33 Querying
    • G06F 16/3331 Query processing
    • G06F 16/334 Query execution
    • G06F 16/3344 Query execution using natural language analysis
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/06 Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N 3/061 Physical realisation using biological neurons, e.g. biological neurons connected to an integrated circuit
    • G06N 3/08 Learning methods
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D 10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Neurology (AREA)
  • Animal Behavior & Ethology (AREA)
  • Microelectronics & Electronic Packaging (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to an indexing method for super-large-scale knowledge graph storage, which specifically comprises the following steps: dividing the input of the index into three types (entity, relation triple and attribute triple); encoding the three types of input with a BERT-compatible model and outputting a vector representation for each; regressing, with a multilayer perceptron, the starting position of data storage and the physical storage length from the received vector representation; and accessing and maintaining the knowledge graph data on the physical storage device according to the starting position and physical storage length, realizing intelligent indexing of super-large-scale knowledge graph storage. The invention also relates to an indexing system and computer equipment for intelligent super-large-scale knowledge graph storage. The indexing method, system and computer equipment are suited to intelligent indexing of large-scale semantic knowledge graphs, improving retrieval efficiency and providing more convenient service for knowledge-graph-based intelligent reasoning.

Description

Indexing method, system and computer equipment for super-large scale knowledge graph storage
Technical Field
The invention relates to the field of artificial intelligence, and in particular to an indexing method, system and computer equipment for super-large-scale knowledge graph storage.
Background
As knowledge graph applications widen and deepen, large-scale enterprises are dedicated to constructing huge knowledge graphs from ubiquitous knowledge and providing knowledge-based applications in different scenarios. The entities of these knowledge graphs can number in the billions, while the relation and attribute triples can scale to hundreds of billions, or even trillions. In such ultra-large-scale knowledge graph storage, performing efficient retrieval is a huge challenge. Real-time entity retrieval, online multi-hop query and relationship analysis, and second-level complex analysis are urgent needs of super-large-scale knowledge graph engineering practice and industrial application.
Conventional knowledge graph storage typically employs graph databases or relational databases, whose physical models typically employ B+ trees or hash algorithms with simple arithmetic mapping relationships. For a small-scale knowledge graph, the existing common indexing modes are practical enough, and an intelligent indexing method is not needed. For an ultra-large-scale knowledge graph, however, the existing indexing methods are inefficient or even infeasible, so a more practical, intelligent indexing mode is needed.
Disclosure of Invention
To this end, the invention provides a super-large-scale knowledge graph intelligent indexing method and system based on deep-learning intelligent hashing. The method and system are suited to intelligent indexing of large-scale semantic knowledge graphs, improving retrieval efficiency and providing more convenient service for knowledge-graph-based intelligent reasoning.
To achieve the purpose of the invention, the technical scheme provided by this patent is as follows:
The invention first provides an indexing method for super-large-scale knowledge graph storage, where "super-large-scale" means that the number of triples in the knowledge graph reaches billions, hundreds of billions or even trillions. During indexing of the stored knowledge graph, hash calculation is realized on the basis of a deep learning model to obtain the starting position and storage length of the physical storage. The method specifically comprises the following steps:
First, dividing the input of the index into three types (entity, relation triple and attribute triple), and designing an intelligent hash algorithm based on these three input types, the intelligent hash algorithm structurally comprising a BERT-compatible model, an aggregation network and a multilayer perceptron;
Second, encoding and learning each of the three types of input with the BERT-compatible model, and sending the learned vectors to the aggregation network;
Third, in the aggregation network: for an entity, aggregating all of its adjacent vertices and associated edges and outputting the vector representation of the entity; for relation triples and attribute triples, learning the triple and outputting the vector representation of the relation triple or attribute triple respectively;
Fourth, feeding the vector representations obtained from the aggregation network into the multilayer perceptron, which regresses the starting position of data storage and the physical storage length;
Fifth, accessing and maintaining the knowledge graph data on the physical storage device according to the output starting position and physical storage length, realizing intelligent indexing of super-large-scale knowledge graph storage.
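The five steps can be sketched end to end as follows. This is an illustrative outline only: the function names and toy computations are assumptions standing in for the trained models described in the patent.

```python
# Illustrative sketch of the five-step intelligent hash pipeline.
# encode / aggregate / regress_pos / regress_len are assumed placeholder
# names, not APIs from the patent.

def encode(tokens):
    # Stand-in for the BERT-compatible encoder: map each lemma to a
    # small deterministic integer feature.
    return [sum(ord(c) for c in t) % 7 for t in tokens]

def aggregate(vec):
    # Stand-in for the aggregation network (identity here).
    return vec

def regress_pos(vec):
    # Stand-in for the position multilayer perceptron.
    return sum(vec) * 10

def regress_len(vec):
    # Stand-in for the length multilayer perceptron.
    return len(vec) * 4

def intelligent_hash(tokens):
    v = aggregate(encode(tokens))
    return regress_pos(v), regress_len(v)  # (pos, len) of physical storage

pos, length = intelligent_hash(["head", "relation", "tail"])
pos_start, pos_end = pos, pos + length     # pos_end = pos + len, as in the text
```

The real system would access the byte range [pos_start, pos_end) on the physical storage device in the fifth step.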
In the method for super-large-scale knowledge graph intelligent index storage, in the first step, each index input of the three input types is expressed in the form (h, r, t), specifically:
if it is an entity, the input is h, and r and t are empty;
if it is a relation triple, h is the head entity, r is the relation, and t is the tail entity;
if it is an attribute triple, h is the entity, r is the attribute name, and t is the attribute value.
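As a sketch, the three input forms can be modelled as plain (h, r, t) tuples with empty slots marked None; the helper name and example values are illustrative assumptions.

```python
# Minimal sketch of the three index-input types as (h, r, t) tuples,
# following the convention in the text; None marks an empty slot.

def make_index_input(kind, h, r=None, t=None):
    """Build an (h, r, t) input. kind is 'entity', 'relation' or 'attribute'."""
    if kind == "entity":
        return (h, None, None)   # r and t are empty for a bare entity
    if kind == "relation":
        return (h, r, t)         # head entity, relation, tail entity
    if kind == "attribute":
        return (h, r, t)         # entity, attribute name, attribute value
    raise ValueError(f"unknown input kind: {kind}")

e = make_index_input("entity", "Beijing")
rt = make_index_input("relation", "Beijing", "capital_of", "China")
at = make_index_input("attribute", "Beijing", "population", "21,893,095")
```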
In the indexing method for super-large-scale knowledge graph storage, in the second step, the encoding process of the BERT-compatible model is as follows:
S21, segmenting the text corresponding to the entity or relation into a sequence of lemmas: Chinese input is split character by character, and English words are split directly on whitespace;
S22, adding position information to the lemma sequence, i.e. the serial number of each lemma in the sequence; if the input also carries upper/lower-sentence codes, these are set to 0;
S23, for each input, obtaining the respective vector representations by embedding, and summing the vectors to obtain the input vector of the model;
S24, the model performs representation learning on the input vector, and the learned vector is finally taken from the model's [CLS] position and recorded as v.
In the indexing method for super-large-scale knowledge graph storage, if the input of the BERT-compatible model is an entity, the output v is the vector representation of that entity; if the input is a relation, the output v is the vector representation of the corresponding relation. The output vector v serves as the input to the aggregation network in the next step.
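A toy sketch of steps S21–S24 under stated assumptions: random embedding tables stand in for a trained BERT-compatible model, the vocabulary lookup is a hash of character codes, and the vector is read off the [CLS] position without running any transformer layers.

```python
import numpy as np

def tokenize(text):
    # Per the text: Chinese is split character by character; inputs
    # containing English words are split on whitespace.
    return text.split() if " " in text else list(text)

rng = np.random.default_rng(0)
DIM, VOCAB, MAX_LEN = 8, 97, 16
tok_emb = rng.normal(size=(VOCAB, DIM))    # lemma (token) embeddings
pos_emb = rng.normal(size=(MAX_LEN, DIM))  # position embeddings
seg_emb = np.zeros((2, DIM))               # upper/lower-sentence ids fixed to 0

def encode(text):
    toks = ["[CLS]"] + tokenize(text)
    ids = [sum(map(ord, t)) % VOCAB for t in toks]  # toy vocabulary lookup
    x = tok_emb[ids] + pos_emb[:len(ids)] + seg_emb[0]
    # A trained model would now run transformer layers over x; here we
    # simply read the vector off the [CLS] position, as in step S24.
    return x[0]

v = encode("intelligent index")
```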
In the indexing method for super-large-scale knowledge graph storage, in the third step, for an entity, the aggregation network aggregates the information of all adjacent vertices and associated edges to realize deep semantic learning. Given v_e, the vector representation of the entity (vertex) e obtained by the model, the aggregation is:

v̄_e = (1/|N(e)|) · Σ_{e_i ∈ N(e)} (v_e + v_{r_i} + v_{e_i})

where N(e) denotes the set of all adjacent vertices of e, |N(e)| denotes the number of adjacent vertices, e_i denotes a vertex adjacent to e, and r_i denotes the relation between e and e_i. The final output v̄_e is the vector representation of the corresponding entity in the output of the aggregation network.
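The entity aggregation can be sketched numerically as a mean over adjacent (relation, vertex) pairs. The exact combination of the entity's own vector with the neighbour and edge vectors is an assumption reconstructed from the surrounding description, not a formula confirmed by the source.

```python
import numpy as np

def aggregate_entity(v_e, neighbours):
    """Aggregate an entity vector with its neighbourhood.

    v_e        -- vector of the entity e itself
    neighbours -- list of (v_relation, v_neighbour) pairs, one per
                  adjacent vertex e_i in N(e)
    """
    acc = np.zeros_like(v_e)
    for v_r, v_n in neighbours:
        acc += v_e + v_r + v_n        # combine entity, edge and neighbour
    return acc / len(neighbours)      # divide by |N(e)|

v_e = np.array([1.0, 0.0])
nbrs = [(np.array([0.0, 1.0]), np.array([1.0, 1.0])),
        (np.array([2.0, 0.0]), np.array([0.0, 2.0]))]
v_bar = aggregate_entity(v_e, nbrs)
```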
In the indexing method for super-large-scale knowledge graph storage, in the third step, for triples, the mean of the triple's component vectors is computed directly:

v = (v_h + v_r + v_t) / 3

where, for a relation triple, v_h is the vector representation of the head entity h, v_r is the vector representation of the relation r, and v_t is the vector representation of the tail entity; for an attribute triple, v_h is the vector representation of the entity h, v_r is the vector representation of the attribute name r, and v_t is the vector representation of the attribute value.
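The triple case is a plain element-wise mean of the three component vectors:

```python
import numpy as np

# The triple vector is the mean of its three component vectors, for
# relation triples (head, relation, tail) and attribute triples
# (entity, attribute name, attribute value) alike.

def triple_vector(v_h, v_r, v_t):
    return (v_h + v_r + v_t) / 3.0

v = triple_vector(np.array([3.0, 0.0]),
                  np.array([0.0, 3.0]),
                  np.array([3.0, 3.0]))
```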
In the indexing method for super-large-scale knowledge graph storage, the vector representations obtained from the aggregation network are fed into a position multilayer perceptron and a length multilayer perceptron, which respectively regress the starting position and the length of data storage.
The invention also relates to an intelligent indexing system for super-large-scale knowledge graph storage, in which the super-large-scale knowledge graph is stored on a physical storage device. The system passes input data through a deep learning model to obtain the starting position pos of the physical storage and the physical storage length len of the data, so that the required knowledge graph data is read from the starting position pos_start = pos to the ending position pos_end = pos + len. The system comprises a BERT-compatible model, an aggregation network module and a multilayer perceptron, wherein:
the BERT-compatible model encodes each index input (one of the three types: entity, relation triple, attribute triple), obtains its vector representation, and sends it to the aggregation network;
the aggregation network module, according to the characteristics of the knowledge graph, aggregates the information of all adjacent vertices and associated edges of an entity, realizing deep semantic learning and outputting the vector representation of the entity; for a triple, it computes the mean of the triple's component vectors, obtaining the vector representation of the relation triple or attribute triple respectively;
the multilayer perceptron takes the vector representations obtained from the aggregation network and regresses the starting position and length of data storage, which serve as the basis for accessing, reading and storing the knowledge graph data on the physical storage device.
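Reading a record once pos and len are known reduces to an ordinary seek-and-read over the byte range [pos_start, pos_end); the in-memory buffer below stands in for a disk or SSD file, and the sample record content is illustrative.

```python
import io

def read_record(storage, pos, length):
    # pos_start = pos; reading `length` bytes ends at pos_end = pos + len.
    storage.seek(pos)
    return storage.read(length)

# Demonstration with an in-memory stand-in for a physical storage device.
device = io.BytesIO(b"....(Beijing,capital_of,China)....")
data = read_record(device, 4, 26)
```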
Through practical application, the deep-learning-based intelligent hashing method and system for super-large-scale knowledge graph indexing obtain the following technical advantages:
1. The method and system are suited to intelligent indexing of large-scale semantic knowledge graphs, improving retrieval efficiency and providing more convenient service for knowledge-graph-based intelligent reasoning.
2. By adopting the intelligent hash algorithm and fully utilizing deep learning's understanding of semantics, the method and system can realize extremely efficient retrieval in super-large-scale knowledge graph storage, including simple retrieval, complex multi-hop retrieval, and complex analysis with tasks such as knowledge reasoning.
3. The designed intelligent hash algorithm architecture, through the BERT-compatible model and the aggregation network module, realizes fast, efficient response to index inputs under super-large-scale knowledge graph storage, greatly improving indexing efficiency.
Drawings
FIG. 1 is a schematic diagram of the indexing method for super-large-scale knowledge graph storage according to the present invention.
FIG. 2 is a schematic diagram of the implementation of the intelligent hash algorithm in the indexing method for super-large-scale knowledge graph storage according to the present invention.
FIG. 3 is a schematic diagram of the BERT-compatible encoding process in the indexing method for super-large-scale knowledge graph storage of the present invention.
Detailed Description
The present invention will now be described in further detail with reference to the drawings and embodiments, so that the structural composition of the intelligent indexing system for super-large-scale knowledge graph storage and the working process of the intelligent indexing method can be understood more clearly; the scope of the invention, however, should not be limited thereby.
The scheme provided by the invention is directed at super-large-scale knowledge graphs. A knowledge graph is a multi-relational graph formed by entities (nodes) and relations (edges of different types); each edge connects a head entity and a tail entity and is usually represented as an SPO triple (subject, predicate, object), called a fact. "Super-large-scale" means that the knowledge graph contains billions, hundreds of billions or even trillions of triples. For a small-scale knowledge graph, the existing common indexing modes are practical enough, and an intelligent indexing method is not needed; for a super-large-scale knowledge graph, the existing indexing methods are inefficient or even infeasible, so an intelligent indexing method is required. In such super-large-scale knowledge graph storage, performing efficient retrieval is a huge challenge; real-time entity retrieval, online multi-hop query and relationship analysis, and second-level complex analysis are urgent needs of super-large-scale knowledge graph engineering practice and industrial application.
As a brand-new intelligent indexing method for super-large-scale knowledge graph storage, the invention realizes hash calculation on the basis of a deep learning model during indexing, computing the starting position and storage length of the physical storage so that the required knowledge graph data can be quickly retrieved from the physical storage device. In selecting the deep learning algorithm, the data characteristics and application characteristics of the knowledge graph are fully considered, so as to provide applications such as highly efficient storage retrieval and complex analysis.
The intelligent hash system architecture provided by the invention is shown in Fig. 1. The invention adopts a deep learning model to realize the hash calculation: the input data is passed through the deep learning model to obtain the starting position pos of the physical storage and the physical storage length len of the data, giving the starting position pos_start = pos and the ending position pos_end = pos + len; based on the starting position and the physical storage length, the knowledge graph data on the physical storage device is then accessed and maintained, realizing intelligent indexing of super-large-scale knowledge graph storage. The intelligent hash algorithm makes full use of deep learning's understanding of semantics, and can realize highly efficient retrieval in super-large-scale knowledge graph storage, including simple retrieval, complex multi-hop retrieval, and complex analysis with tasks such as knowledge reasoning.
The method specifically comprises the following steps:
First, the index input is divided into three types (entity, relation triple and attribute triple), and the intelligent hash algorithm, structurally comprising a BERT-compatible model, an aggregation network and a multilayer perceptron, is designed based on these three input types. Each index input is expressed in the form (h, r, t), specifically:
if it is an entity, the input is h, and r and t are empty;
if it is a relation triple, h is the head entity, r is the relation, and t is the tail entity;
if it is an attribute triple, h is the entity, r is the attribute name, and t is the attribute value.
Second, the BERT-compatible model encodes and learns each of the three types of input, and the learned vectors are sent to the aggregation network.
Third, in the aggregation network: for an entity, all of its adjacent vertices and associated edges are aggregated and the vector representation of the entity is output; for relation triples and attribute triples, the triple is learned and the vector representation of the relation triple or attribute triple is output respectively.
Fourth, the vector representations obtained from the aggregation network are fed into the multilayer perceptron, which regresses the starting position of data storage and the physical storage length. Specifically, the vector representations are fed into the position multilayer perceptron, which regresses the starting position of data storage, and the length multilayer perceptron, which regresses the physical storage length.
Fifth, the knowledge graph data on the physical storage device is accessed and maintained according to the output starting position and physical storage length, realizing intelligent indexing of super-large-scale knowledge graph storage.
The core of the intelligent index is the design of the intelligent hash algorithm architecture, which comprises a BERT-compatible model, an aggregation network and a multilayer perceptron. The architecture combines the latest deep learning results with the characteristics of the knowledge graph. The three different types of index input are expressed in the form (h, r, t): if it is an entity, the input is h, and r and t are empty; if it is a relation triple, h is the head entity, r is the relation, and t is the tail entity; if it is an attribute triple, h is the entity, r is the attribute name, and t is the attribute value. The intelligent hash algorithm makes full use of deep learning's understanding of semantics, so that extremely efficient retrieval can be realized in super-large-scale knowledge graph storage, including simple retrieval, complex multi-hop retrieval, and complex analysis with tasks such as knowledge reasoning.
In the method for super-large-scale knowledge graph intelligent index storage, the three kinds of input are encoded with a BERT-compatible model, selected according to the available computing power. When computing power is abundant, BERT or a similar large model can be chosen for better effect; when computing power is tight, BERT-tiny or a similar small model can be chosen, saving computing resources while still obtaining acceptable results. The BERT-compatible model is not specifically limited in this patent; newly developed models may be used in the future in place of the models currently in wide use.
Specifically, as shown in Fig. 3, the encoding process of the BERT-compatible model is as follows:
S21, the text corresponding to the entity or relation ("intelligent index" in Fig. 3) is segmented into a sequence of lemmas ("intelligent" and "index" in Fig. 3); Chinese input is split character by character, and English words are split directly on whitespace.
S22, position information is added to the lemma sequence, i.e. the serial number of each lemma in the sequence; if the BERT-compatible input also carries upper/lower-sentence codes, these are set to 0.
S23, for each input, the respective vector representations are obtained by embedding, and the vectors are summed to obtain the input vector of the model, i.e. the inputs at the [CLS] position and the lemma positions in Fig. 3.
S24, the model performs representation learning on the input vector, and the learned vector is finally taken from the model's [CLS] position and recorded as v. In this way, each of the three input types yields input data carrying lemma, upper/lower-sentence and position information.
In the method for super-large-scale knowledge graph intelligent index storage, in the second step, if the input of the BERT-compatible model is an entity, the output vector v is the vector representation of that entity; if the input is a triple, the output v is the vector representation of the corresponding relation triple or attribute triple. The output vector v serves as the input to the aggregation network in the next step.
In the method for super-large-scale knowledge graph intelligent index storage, the aggregation network module is designed to make full use of the characteristics of the knowledge graph to learn more appropriate vector representations. Specifically, for an entity, the aggregation network aggregates the information of all adjacent vertices and associated edges to realize deep semantic learning; for relation triples and attribute triples, only the triple itself is learned, to reduce the amount of computation.
For an entity, according to the characteristics of the knowledge graph, the aggregation network aggregates the information of all adjacent vertices and associated edges, realizing deep semantic learning. Given v_e, the vector representation of the entity (vertex) e obtained by the model, the aggregation is:

v̄_e = (1/|N(e)|) · Σ_{e_i ∈ N(e)} (v_e + v_{r_i} + v_{e_i})

where N(e) denotes the set of all adjacent vertices of e, |N(e)| denotes the number of adjacent vertices, e_i denotes a vertex adjacent to e, and r_i denotes the relation between e and e_i. The final output v̄_e is the vector representation of the corresponding entity in the output of the aggregation network.
For a triple, the vector values of the triple are directly averaged by the following formula:

h = (v_1 + v_2 + v_3) / 3

where, for a relation triple, v_1 is the vector representation of the head entity, v_2 is the vector representation of the relation, and v_3 is the vector representation of the tail entity; for an attribute triple, v_1 is the vector representation of the entity, v_2 is the vector representation of the attribute name, and v_3 is the vector representation of the attribute value.
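The averaging step for triples is simple enough to show directly (the function and variable names are illustrative only):

```python
import numpy as np

def triple_vector(v1, v2, v3):
    """Average the three component vectors of a relation or attribute triple.

    v1: head entity (or entity) vector
    v2: relation (or attribute name) vector
    v3: tail entity (or attribute value) vector
    """
    return (v1 + v2 + v3) / 3.0

v = triple_vector(np.array([3.0, 0.0]),
                  np.array([0.0, 3.0]),
                  np.array([3.0, 3.0]))
# v == [2.0, 2.0]
```

Because no neighborhood traversal is involved, this path is much cheaper than the entity aggregation, which is exactly the computation-saving point the text makes.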
In the intelligent-index storage method for the super-large-scale knowledge graph, the convergence network is followed by a simple multilayer perceptron (MLP), which regresses the starting position pos of data storage and the length len of physical storage. The multilayer perceptron is among the simplest neural networks. Besides the input layer and the output layer, an MLP may have several hidden layers in between; the simplest MLP contains exactly one hidden layer, i.e., a three-layer structure in which the lowest layer is the input layer, the middle layer is the hidden layer, and the last layer is the output layer.
The layers of the multilayer perceptron are fully connected, i.e., every neuron in one layer is connected to all neurons in the next layer. The input layer receives an N-dimensional vector and therefore has N neurons. Assuming the input is the vector X, the output of the hidden layer is f(W1·X + b1), where W1 is a weight matrix (also called connection coefficients), b1 is a bias, and f may be a commonly used sigmoid or tanh function. The mapping from the hidden layer to the output layer can be regarded as multi-class logistic regression, i.e., softmax regression, so the output of the output layer is softmax(W2·X1 + b2), where X1 denotes the hidden-layer output f(W1·X + b1). Summarized as a single formula, with the function G being softmax, the three-layer MLP is:
Figure 82793DEST_PATH_IMAGE029
Therefore, the parameters of the MLP are the connection weights and biases between the layers: W1, b1, W2, and b2. Solving for these parameters is an optimization problem; the simplest approach is stochastic gradient descent (SGD): all parameters are first initialized randomly and then trained iteratively, with gradients computed and parameters updated continuously until a stopping condition is met. These are conventional techniques in artificial-intelligence algorithms, are not the innovative point of this patent, and are not elaborated here. During training, the whole network learns pos and len simultaneously in a multi-task learning manner; at inference time, the shared backbone network is fully exploited, so pos and len can be computed simultaneously and very efficiently. The core of the invention is the output of pos and len: once the starting position pos of physical storage and the physical storage length len of the data are obtained, the start position pos_start = pos and the end position pos_end = pos + len follow, and a mature file-system API can be used to access the knowledge-graph data stored on the physical device (such as a disk, SSD, memory, or even tape).
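Once pos and len are regressed, the final lookup is an ordinary seek-and-read on the file system. A hedged sketch (the record layout and file name are invented for the demonstration; a temporary file stands in for the physical store):

```python
import os
import tempfile

def read_record(path, pos, length):
    """Read `length` bytes starting at offset `pos`, i.e. the half-open
    range [pos_start, pos_end) produced by the model, using only
    standard file-system calls."""
    pos_start = pos
    pos_end = pos + length  # pos_end = pos + len, as in the text
    with open(path, "rb") as f:
        f.seek(pos_start)
        return f.read(pos_end - pos_start)

# Toy demonstration: a record embedded at a known offset.
tmp = tempfile.NamedTemporaryFile(delete=False)
tmp.write(b"....<entity:alpha>....")
tmp.close()
record = read_record(tmp.name, 4, 14)  # bytes 4..18
os.unlink(tmp.name)
```

Because only `seek` and `read` are needed, any device exposed through a file-system API (disk, SSD, memory, tape) can back the store without changing the index.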
The invention also provides an intelligent-index storage system for the super-large-scale knowledge graph, in which the knowledge graph is stored on a physical storage device. The system passes the input data through a deep learning model to obtain the starting position pos of physical storage and the physical storage length len of the data, giving the start position pos_start = pos and the end position pos_end = pos + len. The system comprises a BERT compatible model, a convergence network module, and a multilayer perceptron. The BERT compatible model encodes the three types of input (entity, relation triple, and attribute triple; the input of each index is necessarily one of these three), obtains the vector representation of the corresponding type, and sends it to the convergence network. The convergence network module aggregates, for an entity, the information of all adjacent vertices and associated edges according to the characteristics of the knowledge graph, thereby realizing deep semantic learning and outputting the vector representation of the entity; for a triple, it averages the component vectors of the triple, obtaining the vector representation of the relation triple or the attribute triple respectively. The multilayer perceptron takes the vector representations obtained by the convergence network as input and regresses the starting position of data storage and the length of physical storage. In the convergence network module, the characteristics of the knowledge graph are fully exploited to learn more suitable vector representations: for an entity, all adjacent vertices and associated edges are aggregated; for relation triples and attribute triples, only the triple itself is learned, so as to reduce the amount of computation.
The core of the invention is the output starting position pos and physical storage length len. Once these are obtained, a mature file-system API can be used to access the data stored on the physical device (such as a disk, SSD, memory, or even tape), thereby realizing intelligent indexing of the super-large-scale knowledge-graph storage.
On the basis of the above method and system, a computer device is provided with the intelligent indexing system. The intelligent indexing system executes the intelligent hash algorithm described above, realizing intelligent indexing of the super-large-scale knowledge graph stored on the physical storage device, so that the knowledge graph can be indexed efficiently and in a short time within the data on the physical device (such as a disk, SSD, memory, or even tape).
The method and system provided by the invention are suited to intelligent indexing of large-scale semantic knowledge graphs and are applicable to all fields. Their core is to provide an intelligent index for the super-large-scale knowledge graph, so as to improve retrieval efficiency and provide more convenient service for knowledge-graph-based intelligent reasoning.

Claims (9)

1. A method for indexing super-large-scale knowledge-graph storage, characterized in that, during indexing of the stored super-large-scale knowledge graph, a hash computation is realized on the basis of a deep learning model to obtain the starting position of physical storage and the storage length, specifically comprising the following steps:
firstly, dividing the input of the index into three types, namely entity, relation triple, and attribute triple, and designing an intelligent hash algorithm based on the three input types, the intelligent hash algorithm structurally comprising a BERT compatible model, a convergence network, and a multilayer perceptron;
secondly, encoding and learning the three types of input respectively with the BERT compatible model, and sending the learned vectors to the convergence network;
thirdly, in the convergence network, for an entity, aggregating all of the entity's adjacent vertices and associated edges and outputting the vector representation of the corresponding entity; for relation triples and attribute triples, learning the triple itself and outputting the vector representation of the corresponding relation triple or attribute triple respectively;
fourthly, inputting the vector representations obtained by the convergence network into the multilayer perceptron and regressing the starting position of data storage and the length of physical storage;
and fifthly, accessing and maintaining the knowledge-graph data on the physical storage device according to the output starting position and physical storage length, realizing intelligent indexing of the super-large-scale knowledge-graph storage.
2. The method of claim 1, wherein in the first step, each index input is expressed with three slots x1, x2, and x3, filled according to the three input types as follows:
if the input is an entity, the input is x1, and x2 and x3 are empty;
if it is a relation triple, x1 is the head entity, x2 is the relation, and x3 is the tail entity;
if it is an attribute triple, x1 is the entity, x2 is the attribute name, and x3 is the attribute value.
3. The method of claim 1, wherein in the second step, if the input of the BERT compatible model is an entity, the output v is the vector representation of the corresponding entity; if the input is a relation, the output v is the vector representation of the corresponding relation; the output vector v serves as the input to the convergence network in the next step.
4. The method of claim 3, wherein in the second step, the BERT compatible model performs encoding as follows:
S21, segmenting the text corresponding to the entity or relation into a token sequence, Chinese input being segmented character by character, and any English words in the input being segmented directly on whitespace;
S22, adding position information to the token sequence, namely the serial number of each token within the sequence, and if the input also includes upper/lower-sentence encodings, setting the upper/lower-sentence input to 0;
S23, for each input, obtaining its vector representation by embedding, and summing the vectors to obtain the input vector of the model;
S24, performing representation learning on the input vector with the model, and finally obtaining the learned vector from the [CLS] position of the model output, recorded as v.
5. The method of claim 1, wherein in the third step, for an entity e, the convergence network aggregates the information of all adjacent vertices and associated edges to achieve deep semantic learning, h_e denoting the vector representation of the entity obtained by the model:
h_e = (1 / |N(e)|) · Σ_{e_i ∈ N(e)} (v_{e_i} + v_{r_i})
where N(e) denotes the set of all vertices adjacent to e, |N(e)| denotes the number of adjacent vertices, e_i is a vertex adjacent to e, and r_i denotes the relation between e and e_i; the final output h_e is the vector representation of the corresponding entity in the output of the convergence network.
6. The method for indexing super-large-scale knowledge-graph storage according to claim 1, wherein in the third step, for a triple, the vector values of the triple are directly averaged by the following formula:
h = (v_1 + v_2 + v_3) / 3
where, for a relation triple, v_1 is the vector representation of the head entity, v_2 is the vector representation of the relation, and v_3 is the vector representation of the tail entity; for an attribute triple, v_1 is the vector representation of the entity, v_2 is the vector representation of the attribute name, and v_3 is the vector representation of the attribute value.
7. The method for indexing super-large-scale knowledge-graph storage according to claim 1, wherein in the fourth step, the vector representations obtained by the convergence network are input into a position multilayer perceptron and a length multilayer perceptron respectively, which respectively regress the starting position and the length of the data in physical storage.
8. An indexing system for super-large-scale knowledge-graph storage, characterized in that the super-large-scale knowledge graph is stored on a physical storage device, and the system computes, through a deep learning model applied to the input data, the starting position pos of physical storage and the physical storage length len of the data, thereby reading the required knowledge graph according to the start position pos_start = pos and the end position pos_end = pos + len; the system comprises a BERT compatible model, a convergence network module, and a multilayer perceptron, wherein,
the BERT compatible model encodes the input of the index, obtains its vector representation, and sends it to the convergence network, each index input being one of three types: entity, relation triple, or attribute triple;
the convergence network module, for an entity, aggregates the information of all adjacent vertices and associated edges, thereby realizing deep semantic learning and outputting the vector representation of the entity; for a triple, it averages the component vectors of the triple, obtaining the vector representation of the relation triple or the attribute triple respectively;
and the multilayer perceptron takes the vector representation obtained by the convergence network as input and regresses the starting position of data storage and the length of physical storage, the starting position and physical storage length serving as the basis for accessing and maintaining the knowledge-graph data on the physical storage device.
9. A computer device, characterized in that the computer device is provided with an intelligent indexing system which executes the method of claim 1, realizing intelligent indexing of the super-large-scale knowledge graph stored on the physical storage device.
CN202210874965.8A 2022-07-25 2022-07-25 Indexing method, system and computer equipment for super-large-scale knowledge map storage Active CN114936296B (en)


Publications (2)

Publication Number Publication Date
CN114936296A CN114936296A (en) 2022-08-23
CN114936296B (en) 2022-11-08

Family

ID=82869128


Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111339313A (en) * 2020-02-18 2020-06-26 北京航空航天大学 Knowledge base construction method based on multi-mode fusion
CN113094449A (en) * 2021-04-09 2021-07-09 天津大学 Large-scale knowledge map storage scheme based on distributed key value library
CN114064931A (en) * 2021-11-29 2022-02-18 新疆大学 A method and system for first aid knowledge question answering based on multimodal knowledge graph
CN114625830A (en) * 2022-03-16 2022-06-14 中山大学·深圳 Chinese dialogue semantic role labeling method and system

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109376864A (en) * 2018-09-06 2019-02-22 电子科技大学 A Knowledge Graph Relational Reasoning Algorithm Based on Stacked Neural Networks
US11687733B2 (en) * 2020-06-25 2023-06-27 Sap Se Contrastive self-supervised machine learning for commonsense reasoning
US11755838B2 (en) * 2020-09-14 2023-09-12 Smart Information Flow Technologies, LLC Machine learning for joint recognition and assertion regression of elements in text
US20220129621A1 (en) * 2020-10-26 2022-04-28 Adobe Inc. Bert-based machine-learning tool for predicting emotional response to text
CN112765991B (en) * 2021-01-14 2023-10-03 中山大学 Knowledge enhancement-based deep dialogue semantic role labeling method and system
CN113378573B (en) * 2021-06-24 2025-01-10 北京华成智云软件股份有限公司 Small sample relationship extraction method and device for content big data
CN113988079B (en) * 2021-09-28 2025-03-14 浙江大学 A dynamic enhanced multi-hop text reading recognition processing method for low-data
CN114064926B (en) * 2021-11-24 2025-02-11 国家电网有限公司大数据中心 Multimodal power knowledge graph construction method, device, equipment and storage medium
CN114168749A (en) * 2021-12-06 2022-03-11 北京航空航天大学 Question generation system based on knowledge graph and question word drive




Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant