Disclosure of Invention
The present invention aims to provide a multimodal DIKW content multi-semantic analysis method oriented to essential computing, so as to overcome or at least partially solve the above problems in the prior art.
An essential computing-oriented multi-modal DIKW content multi-semantic analysis method comprises the following steps:
s1, obtaining type resources to carry out semantic recognition, and judging whether multiple semantics exist according to the recognition result, wherein the type resources comprise data resources DDIKWInformation resource IDIKWAnd knowledge resources KDIKW;
S2, when multiple meanings exist, analyzing the reason for forming the multiple meanings;
and S3, based on the reason of multi-semantic formation, converting the original type resource into a new type resource by adopting a corresponding strategy, and acquiring a final semantic recognition result.
Further, the multiple semantic formation reasons include type resource missing and type resource redundancy.
Further, in step S2, when the reason for forming multiple meanings is analyzed as the type resource missing, it is determined that the type resource missing attribute is data resource missing or information resource missing.
Further, in step S3, when the reason for forming multiple meanings is a type resource loss, the adopted strategy specifically includes:
known data resource D0And information resources I0With respect to KDIKWCombined to obtain new information resource I with different purposesnew1And Inew2;
Searching data resource D related to data chart in data chartrelated;
Will DrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew3;
Determination of Inew3And Inew1And Inew2By keeping the relationship of (I)new3Supported IDIKWDelete other IDIKW;
Will remain IDIKWAnd setting as a final semantic recognition result.
Further, in step S3, when the reason for forming multiple meanings is a type resource loss, the adopted strategy specifically includes:
known data resource D0And information resources I0With respect to KDIKWCombined to obtain new information resource I with different purposesnew1And Inew2;
Searching related information resources I related to information in information chartrelated;
Will IrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew3;
Determination of Inew3And Inew1And Inew2By keeping the relationship of (I)new3Supported IDIKWDelete other IDIKW;
Will remain IDIKWAnd setting as a final semantic recognition result.
Further, in step S2, when the reason for forming multiple meanings is analyzed as the type resource redundancy, it is determined that the type resource redundancy attribute is data resource redundancy or information resource redundancy.
Further, in step S3, when the reason for forming multiple meanings is redundancy of information resources, the adopted policy specifically includes:
finding conflicting information resources I01And I02;
Searching data resource D related to data chart in data chartrelated;
Will DrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew;
Determination of InewAnd I01And I02The association relationship between them is preserved InewThe supported semantic recognition result deletes other semantic recognition results;
will InewAnd setting the supported semantic recognition result as a final semantic recognition result.
Further, in step S3, when the reason for forming multiple meanings is redundancy of information resources, the adopted policy specifically includes:
finding conflicting information resources I01And I02;
Searching information source I related to information chartrelated;
Will IrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew;
Determination of InewAnd I01And I02The association relationship between them is preserved InewThe supported semantic recognition result deletes other semantic recognition results;
will InewAnd setting the supported semantic recognition result as a final semantic recognition result.
Further, in step S3, when the reason for forming multiple meanings is data resource redundancy, the adopted policy specifically includes:
finding conflicting data resources D01And D02;
Searching data resource D related to data chart in data chartrelated;
Will DrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Dnew;
Determination of DnewAnd D01And D02The association relationship between them, reserve DnewThe supported semantic recognition result deletes other semantic recognition results;
will DnewAnd setting the supported semantic recognition result as a final semantic recognition result.
Further, in step S3, when the reason for forming multiple meanings is data resource redundancy, the adopted policy specifically includes:
finding conflicting data resources D01And D02;
Searching related information resources I related to information in information chartrelated;
Will IrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew;
Determination of InewAnd D01And D02The relation between, retention InewThe supported semantic recognition result deletes other semantic recognition results;
will InewAnd setting the supported result as a final semantic recognition result.
Compared with the prior art, the invention has the beneficial effects that:
according to the multimode DIKW content multi-semantic analysis method for essential computation, when a multi-semantic result appears in a semantic recognition process based on type resources, original type resources are converted into new type resources by analyzing multi-semantic forming reasons and adopting corresponding strategies according to different forming reasons to be applied to semantic recognition, redundant recognition results are eliminated, and a final semantic recognition result is obtained, so that an artificial intelligent system is helped to improve the language text recognition efficiency, and the recognition result accuracy is improved.
Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, the illustrated embodiments are provided to illustrate the invention and not to limit the scope of the invention.
Referring to fig. 1, the present invention provides a multimodal DIKW contents multi-semantic analysis method oriented to essential computing, the method comprising the steps of:
s1, obtaining type resources to carry out semantic recognition, and judging whether multiple semantics exist according to the recognition result, wherein the type resources comprise data resources DDIKWInformation resource IDIKWAnd knowledge resources KDIKW。
Where a data resource is a discrete element resulting from direct observation, without context, having no meaning, and not being associated with some particular purpose of a human being. The data resource expresses the attribute content of a single entity, and the simplest expression is 'is _ a', and (X | a) expresses the attribute (a) of the entity X.
The information resource records human behaviors and is used for mining, analyzing and expressing the interaction relation between two entities, wherein the entities can be another person or things which exist objectively. The information resource is related to a specific purpose of a human being and is used for deducing the relationship between two entities through the purpose, and the simplest expression of the information resource is 'has _ a' and is used for representing the relationship between the entities E1 and E2.
The knowledge resources are obtained by deduction of data resources and information resources in a structured form, the entity relationship is further improved on the basis of the information resources, and the knowledge resources perform abstract induction summary on the entity relationship on the basis of the information resources.
S2, when multiple meanings exist, the reason for forming multiple meanings is analyzed.
And S3, based on the reason of multi-semantic formation, converting the original type resource into a new type resource by adopting a corresponding strategy, and acquiring a final semantic recognition result.
Wherein the multiple semantic formation reasons comprise type resource missing and type resource redundancy. Missing type resource TR in semantic contentDIKWResulting in a reduction in the limitation of the content understanding range. By increasing the restriction on content understanding, the scope of content understanding can be narrowed, so that only one derived I for different purposes is reservedDIKWAnd further solve the multi-semantic problem. The type resource missing includes data resource missing or information resource missing, so in step S2, when the analysis result indicates that the type resource missing is caused by multiple semantic formation, it needs to further determine that the type resource missing attribute is data resource missing or information resource missing.
In an embodiment of the present invention, when the reason for forming multiple meanings in step S3 is a type resource loss, the adopted strategy specifically includes:
a1, data resource D to be known0And information resources I0With respect to KDIKWCombined to obtain new information resource I with different purposesnew1And Inew2。
A2, searching the data chart for the corresponding phaseRelated data resource Drelated。
A3, DrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew3。
A4, determination of Inew3And Inew1And Inew2By keeping the relationship of (I)new3Supported IDIKWDelete other IDIKW。
A5, and I remainingDIKWAnd setting as a final semantic recognition result.
As an example, assuming that the semantic content is "at night in summer, user a stays in the study room", the system may be ambiguous in understanding the content because the semantic content lacks the information resource of "what user a does in the study room". For example, data resource D01"user A stays in the study room" and knowledge resources K1In combination with "study room is the place of study", it can be inferred that the purpose of user a staying in the study room is to study. In conjunction with data resources D02Night and knowledge resources K2"people are sleeping often night," it can be inferred that user A may be sleeping in the study room. Both of these derivation methods are correct in the absence of other related types of resources, but the purpose of the information resources they generate is different, leading to multiple senses.
Obtaining related data resources D on the basis of the data resources D1'air conditioner in bedroom is broken' and D2"the air condition of study room is good", combine data resource D03Summer and knowledge resources K3"summer hot" can be inferred as the temperature in the bedroom is higher and the temperature in the study room is lower. These data resources increase the constraints on the study room environment, narrowing the scope of content understanding to the temperature-related domain. Combining knowledge resources K4"people like to sleep in a cool place", it can be concluded that the temperature of the study room is low, suitable for sleeping, and supports IDIKW"the user A stays asleep in the study room", thereby solving the multi-semantic problem.
In another embodiment of the present invention, when the reason for forming multiple meanings in step S3 is a type resource loss, the adopted strategy specifically includes:
b1, data resource D to be known0And information resources I0With respect to KDIKWCombined to obtain new information resource I with different purposesnew1And Inew2。
B2, searching related information resources I related to information in information chartrelated。
B3, mixing IrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew3。
B4, determination of Inew3And Inew1And Inew2By keeping the relationship of (I)new3Supported IDIKWDelete other IDIKW。
B5, I remainingDIKWAnd setting as a final semantic recognition result.
The embodiment is used for solving the multi-semantic problem that the semantic content is 'at night in summer and user A stays in study room', and the related information resource I is obtained1"user A dislikes learning", in combination with D02Night and knowledge resources K5"people who like to learn may learn at night," it can be inferred that user A is unlikely to learn in the study room at this time. This IDIKWExclude I fromDIKW"user A stays in the learning room for learning", and the only I leftDIKW"user a stays in the study room for sleep" is the final result, thus solving the multi-meaning problem.
Redundancy of type resources refers to D in semantic contentDIKWOr IDIKWAn understanding of the many different purposes arises on the same issue. The type resource redundancy includes data resource redundancy or information resource redundancy, and therefore, in step S2, when the reason for the multi-meaning formation is analyzed as the type resource redundancy, it is necessary to further determine that the type resource redundancy attribute is the data resource redundancy or the information resource redundancy.
In an embodiment of the present invention, when the reason for forming multiple meanings in step S3 is redundancy of information resources, the adopted policy specifically includes:
c1 finding conflicting information resources I01And I02。
C2, searching the data chart for the data resource D related to the data chartrelated。
C3, mixing DrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew。
C4, determination of InewAnd I01And I02The association relationship between them is preserved InewAnd (4) deleting other semantic recognition results according to the supported semantic recognition results.
C5, mixing InewAnd setting the supported semantic recognition result as a final semantic recognition result.
As an example, when the semantic content is "user A likes playing basketball and user A does not like sports", resource I can be extracted01"user A likes playing basketball" and I02"the user A hates the sport", obtain knowledge resource K1"playing basketball is a sport", K2"relationship 'dislike' and relationship 'like' contradict", from I02"user A disagrees with sports" and K1The new information resource I can be deduced by' playing basketball belongs to a sportnew1"user A hates playing basketball". Due to K2Knowing I01And Inew1Contradictory, therefore, for the question "attitude of user A to play basketball", I01And I02With different understanding, represent IDIKWRedundancy in semantic content.
When the embodiment is applied to solve the multi-meaning problem, the spatial data resource D related to the basketball court of the user A is obtained1. Having related knowledge resources K3"the main activities of the basketball court are playing basketball" and "people who play basketball regularly like playing basketball". Binding of D1And K3It can be deduced that user a is often present on a basketball court and thus user a is often playing basketball. Binding K4"user A plays basketball often, and people who play basketball often are likely to like basketball", this indicates that user A is likely to like basketballBasketball, support I01. When information resource I01With support of DDIKWBut information resource I02Unsupported DDIKWWhen it is, it tends to judge I01Correct, I02And errors, thereby solving the multi-semantic problem.
In another embodiment of the present invention, when the reason for forming multiple meanings is redundancy of information resources in step S3, the adopted policy specifically includes:
d1 finding conflicting information resources I01And I02;
D2, searching the information chart for the information resource I related to the information chartrelated;
D3, mixing IrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew;
D4, determination of InewAnd I01And I02The association relationship between them is preserved InewThe supported semantic recognition result deletes other semantic recognition results;
d5, mixing InewAnd setting the supported semantic recognition result as a final semantic recognition result.
The embodiment is used for solving the multi-semantic problem that the semantic content is 'user A likes playing basketball and user A does not like sports', and the related information resource I is obtained1"user A is a member of school basketball team", has related knowledge resources K5"members of the basketball team play basketball regularly". Binding of I1And K5It can be deduced that user a is a member of the school basketball team, so user a plays basketball often. Binding K4"user A plays basketball often, people who play basketball often probably like playing basketball", which means that user A probably likes playing basketball, supports information resource I01. When information resource I01With supported IDIKWBut information resource I02Unsupported IDIKWThe system tends to determine I01Is correct, I02Is erroneous.
In an embodiment of the present invention, when the reason for forming multiple meanings in step S3 is data resource redundancy, the adopted policy specifically includes:
e1 finding conflicting data resources D01And D02;
E2, search data D related to it in data chartrelated;
E3, mixing DrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Dnew;
E4, determination of DnewAnd D01And D02The association relationship between them, reserve DnewThe supported semantic recognition result deletes other semantic recognition results;
e5, mixing DnewAnd setting the supported semantic recognition result as a final semantic recognition result.
As an example, when the semantic content contains a data resource D01"today's temperature is 30 degrees" and D02When the temperature of today is 20 degrees, aiming at the problem of the temperature of today, the data resource D01And D02Is contradictory, which indicates that in redundant data resource D01And D02There must be an error in between.
When the embodiment is applied to solve the multi-meaning problem of 'temperature of today', the data resource D is obtained1Summer and D2"Hainan", and knowledge resource K1"Hainan summer temperature is higher". Binding of D1、D2And K1It can be deduced that today's temperature should be high, supporting data resource D01. When data resource D01With support of DDIKWBut data resource D02Unsupported DDIKWThe system tends to judge D01Correct but D02And errors, thereby solving the multi-semantic problem.
In another embodiment of the present invention, in step S3, when the reason for forming multiple meanings is data resource redundancy, the adopted policy specifically includes:
f1 finding conflicting data resources D01And D02;
F2, searching related information resource I related to information in information chartrelated;
F3, mixing IrelatedTo related IDIKWAnd KDIKWCombining to further obtain new information resource Inew;
F4, determination of InewAnd D01And D02The relation between, retention InewThe supported semantic recognition result deletes other semantic recognition results;
f5, mixing InewAnd setting the supported result as a final semantic recognition result.
When this embodiment is used to solve the aforementioned multi-semantic problem of "temperature today", information resource I is obtained1"data resource D01From the weather bureau' and information resources I2"data resource D02From the internet ", and knowledge resources K2"data from professional organizations is more reliable than data from the internet". In connection with information resources I1、I2And knowledge resources K2Data resources D can be derived01Ratio data resource D02And is more reliable. Thus, the system tends to determine D01Is true, D02Is erroneous and thus solves the multi-lingual problem.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.