CN112199936B - Intelligent analysis method and storage medium for repeated declaration of scientific research projects - Google Patents
Intelligent analysis method and storage medium for repeated declaration of scientific research projects Download PDFInfo
- Publication number
- CN112199936B CN112199936B CN202011258000.3A CN202011258000A CN112199936B CN 112199936 B CN112199936 B CN 112199936B CN 202011258000 A CN202011258000 A CN 202011258000A CN 112199936 B CN112199936 B CN 112199936B
- Authority
- CN
- China
- Prior art keywords
- project
- reviewed
- technical
- historical
- declaration
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000011160 research Methods 0.000 title claims abstract description 23
- 238000004458 analytical method Methods 0.000 title claims abstract description 16
- 238000005516 engineering process Methods 0.000 claims abstract description 28
- 238000000034 method Methods 0.000 claims abstract description 26
- 238000012790 confirmation Methods 0.000 claims description 34
- 238000012550 audit Methods 0.000 claims description 19
- 238000012552 review Methods 0.000 claims description 11
- 238000004364 calculation method Methods 0.000 claims description 7
- 238000004590 computer program Methods 0.000 claims description 6
- 230000005540 biological transmission Effects 0.000 claims description 5
- 238000004891 communication Methods 0.000 claims description 5
- 230000009466 transformation Effects 0.000 claims description 5
- 238000010248 power generation Methods 0.000 claims description 4
- 230000006872 improvement Effects 0.000 abstract description 7
- 238000011156 evaluation Methods 0.000 abstract description 4
- 238000007726 management method Methods 0.000 abstract description 4
- 230000008569 process Effects 0.000 description 6
- 238000011161 development Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000000739 chaotic effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/194—Calculation of difference between files
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
- G06Q10/103—Workflow collaboration or project management
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y04—INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
- Y04S—SYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
- Y04S10/00—Systems supporting electrical power generation, transmission or distribution
- Y04S10/50—Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Strategic Management (AREA)
- Human Resources & Organizations (AREA)
- Physics & Mathematics (AREA)
- Entrepreneurship & Innovation (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Operations Research (AREA)
- General Business, Economics & Management (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Economics (AREA)
- Marketing (AREA)
- General Health & Medical Sciences (AREA)
- Quality & Reliability (AREA)
- Tourism & Hospitality (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention relates to an intelligent analysis method and a storage medium for repeated declaration of scientific research projects, wherein the method comprises the following steps: receiving a declaration material electronic document of a technical project to be reviewed, and extracting text information to obtain a plurality of text information to be reviewed in different dimensions; acquiring declaration material electronic documents of historical science and technology projects in a science and technology project database, and extracting text information to obtain historical text information with a plurality of different dimensions; calculating the similarity of the technical project to be reviewed and the current historical technical project in a plurality of different dimensions according to the text information to be reviewed and the historical text information, and calculating the similarity of the technical project to be reviewed and the current historical technical project according to the similarity in a plurality of different dimensions; and judging whether the technical project to be reviewed is repeatedly declared or not according to the similarity between the technical project to be reviewed and the current historical technical project. The invention can intelligently assist in the evaluation of the stands, avoid repeated stands, and ensure the quality improvement and efficiency improvement of the stand management work.
Description
Technical Field
The invention relates to the technical field of software information, in particular to an intelligent analysis method and a storage medium for repeated declaration of scientific research projects.
Background
With the continuous deep development of electric power reform and continuous development of science and technology, scientific and technical research projects in each professional field of power grid companies are reviewed more and more, and in order to avoid repeated reporting of similar projects, similarity examination needs to be conducted on reporting materials of the scientific and technical research projects. In general, science and technology project reporting materials are large texts, and at present, the similarity judging mode of the science and technology projects needs to rely on professional manual reading and screening comparison, so that each science and technology project reporting material needs to be manually compared with mass prior science and technology project reporting materials in a database, a large amount of manpower and time cost is consumed, and the high efficiency and accuracy of similarity judgment are difficult to ensure.
Disclosure of Invention
The invention aims to provide an intelligent analysis method and a storage medium for repeated declaration of scientific research projects, so as to realize intelligent auxiliary stand review, avoid repeated stands and ensure the quality improvement and efficiency improvement of stand management work.
According to a first aspect, an embodiment of the present invention provides an intelligent analysis method for repeated declaration of scientific research projects, including:
step S1, receiving a declaration material electronic document of a technical project to be reviewed, and extracting text information from the declaration material electronic document of the technical project to be reviewed to obtain a plurality of text information to be reviewed in different dimensions;
s2, acquiring declaration material electronic documents of historical science and technology projects in a science and technology project database, and extracting text information from the acquired declaration material electronic documents of the current historical science and technology projects to obtain historical text information with a plurality of different dimensions;
s3, calculating the similarity of the technical project to be reviewed and the current historical technical project in a plurality of different dimensions according to the text information to be reviewed and the historical text information in the plurality of different dimensions, and calculating the similarity of the technical project to be reviewed and the current historical technical project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions;
and S4, judging whether the technical project to be reviewed is repeatedly declared or not according to the similarity between the technical project to be reviewed and the current historical technical project and the comparison result of a preset similarity threshold value.
Optionally, the formats of the electronic documents of the reporting materials of the technical project to be reviewed and the historical technical project are the same, and the electronic documents comprise preset text information with a plurality of different dimensions; the preset multiple different dimensions at least comprise project titles, project abstracts, project expected targets, project technical routes and project research contents.
Optionally, the method further includes step S51, and the step S51 includes:
when the judgment result of the step S4 is a repeated declaration, executing a step S511;
when the judgment result of the step S4 is that the report is not repeated, executing a step S512;
wherein the step S511 includes: outputting first prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the first prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project;
wherein the step S512 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting a second prompt message to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology project in the science and technology project database, and extracting text information from the declaration material electronic document of the next history science and technology project to acquire a plurality of corresponding history text information with different dimensions; and then proceeds to steps S3 to S51.
Optionally, the step S511 specifically includes:
after the first prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S512 is performed.
Optionally, the second prompt information includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
Optionally, the preset plurality of different dimensions further includes a project classification including at least one of a relay automation group, a transmission group, a distribution group, a generation group, a communication and information group, a metering marketing group, a system operation and intelligent power grid group, a transformation group.
The step S2 specifically includes:
according to the item classification of the technical items to be reviewed, acquiring declaration material electronic documents of historical technical items corresponding to the item classification in the technical item database, and extracting text information from the acquired declaration material electronic documents of the historical technical items to obtain historical text information with different dimensions.
Optionally, the method further includes step S52, and the step S52 includes:
when the judgment result of the step S4 is a repeated declaration, executing a step S521;
when the judgment result of the step S4 is that the report is not repeated, executing a step S522;
wherein the step S521 includes: outputting third prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the third prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project;
wherein the step S522 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting fourth prompt information to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology item of the corresponding item type in the science and technology item database, and extracting text information from the declaration material electronic document of the next history science and technology item to acquire corresponding history text information with a plurality of different dimensions; and then proceeds to steps S3 to S52.
Optionally, the step S521 specifically includes:
after the third prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S522 is performed.
Optionally, the fourth prompting information includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
According to a second aspect, an embodiment of the present invention proposes a computer readable storage medium having stored thereon a computer program, which when executed by a processor, implements the above-mentioned intelligent analysis method for repeated declaration of scientific research projects.
The embodiment of the invention provides an intelligent analysis method and a storage medium for repeated declaration of scientific research projects, which do not need to rely on professional manual reading, screening and comparison, are time-consuming and labor-consuming, and the embodiment of the invention provides a method for comprehensively considering the similarity of text information in a plurality of different dimensions, setting corresponding weight coefficients in advance according to the influence of corresponding repeated items of the text information in the different dimensions, and calculating the similarity of the scientific and technological project to be reviewed and the historical scientific and technological project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions; and finally, judging whether the technical project to be reviewed is repeatedly declared according to the similarity between the technical project to be reviewed and the historical technical project and the comparison result of a preset similarity threshold value, ensuring the high efficiency and accuracy of similarity judgment, realizing intelligent auxiliary stand review, avoiding repeated stands, and ensuring the quality improvement and efficiency of stand management work.
Additional features and advantages of the invention will be set forth in the detailed description which follows.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flowchart of a method for repeatedly reporting intelligent analysis of a scientific research project according to a first embodiment of the invention.
Fig. 2 is a flowchart of step S51 in the second embodiment of the present invention.
Fig. 3 is a flowchart of step S52 in the third embodiment of the present invention.
Detailed Description
Various exemplary embodiments, features and aspects of the disclosure will be described in detail below with reference to the drawings. Furthermore, in the following detailed description, numerous specific details are set forth in order to provide a better illustration of the invention. It will be understood by those skilled in the art that the present invention may be practiced without some of these specific details. In some instances, well known means have not been described in detail in order to not obscure the present invention.
Example 1
An embodiment of the present invention provides a method for repeatedly reporting intelligent analysis of scientific research projects, fig. 1 is a schematic flow chart of a method of the embodiment of the present invention, and referring to fig. 1, the method of the embodiment of the present invention includes the following steps:
step S1, receiving a declaration material electronic document of a technical project to be reviewed, and extracting text information from the declaration material electronic document of the technical project to be reviewed to obtain a plurality of text information to be reviewed in different dimensions;
specifically, the format of the electronic document of the declaration material is docx document, a declaration material template is preset, and the declaration material of each review submitted technical project is filled in according to the declaration material template, wherein the template comprises text information with different dimensions; illustratively, the preset plurality of different dimensions includes at least project title, project abstract, project intended goal, project technical route, project study content, and the like.
S2, acquiring declaration material electronic documents of historical science and technology projects in a science and technology project database, and extracting text information from the acquired declaration material electronic documents of the current historical science and technology projects to obtain historical text information with a plurality of different dimensions;
specifically, the formats of the declaration material electronic documents of the technical project to be reviewed and the historical technical project are the same, and the declaration material electronic documents comprise preset text information with a plurality of different dimensions.
S3, calculating the similarity of the technical project to be reviewed and the current historical technical project in a plurality of different dimensions according to the text information to be reviewed and the historical text information in the plurality of different dimensions, and calculating the similarity of the technical project to be reviewed and the current historical technical project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions;
illustratively, the calculated similarities of the two in project title, project abstract, project expected goal, project technical route, project research content are respectively: x1, X2, X3, X4, X5; correspondingly, the weight coefficients of the project title, the project abstract, the project expected target, the project technical route and the project research content are K1, K2, K3, K4 and K5 respectively; the overall similarity X between the technical project to be reviewed and the current historical technical project is: x=x1×k1+x2×k2+x3×k3+x4×k4+x5×k5.
Preferably, K1< K2< K3< K4< K5.
And S4, judging whether the technical project to be reviewed is repeatedly declared or not according to the similarity between the technical project to be reviewed and the current historical technical project and the comparison result of a preset similarity threshold value.
Specifically, the preset similarity threshold T in the present embodiment is preferably but not limited to 90%; and when X is greater than or equal to T, judging that the technical project to be reviewed is repeatedly declared if the technical project to be reviewed is similar to the project content of the current historical technical project. Otherwise, when X is smaller than T, the content of the technical project to be reviewed is dissimilar to that of the current historical technical project, and the technical project to be reviewed is judged not to be repeatedly declared.
Still further exemplary, the method further comprises calculating a weight coefficient for each dimension, each dimension representing an index, specifically using information entropy, the information entropy formula being as follows:
wherein y is j Represents the j-th metric, m represents the number of subjects (i.e. how many subjects are) of the statistical training data, y ij The j-th normalized evaluation index value representing the i-th dimension text information has the following calculation formula:
n represents the number of evaluation indexes, in general, the greater the uncertainty degree of a certain index value in the comprehensive evaluation indexes is, the greater the information entropy is, the greater the information quantity provided by the indexes is, and the greater the weight coefficient is; conversely, the smaller the weight coefficient of the index. Therefore, the weight coefficient-entropy weight of each index can be calculated by utilizing the information entropy according to the chaotic degree of each index. The specific calculation formula is as follows:
wherein w is j For the corresponding weight of the j-th index, G j =1-E j (1. Ltoreq.j. Ltoreq.n), indicating the degree of difference of the indexes, E j =H(y j ) /lnm, referred to as entropy.
Optionally, the formats of the electronic documents of the reporting materials of the technical project to be reviewed and the historical technical project are the same, and the electronic documents comprise preset text information with a plurality of different dimensions; the preset multiple different dimensions at least comprise project titles, project abstracts, project expected targets, project technical routes and project research contents.
Example two
An embodiment II of the present invention is an optimization scheme based on the method described in the above embodiment I, and includes, in addition to steps S1 to S4 of the method described in the embodiment I, a step S51, where the step S51 includes:
when the judgment result of the step S4 is a repeated declaration, executing a step S511;
when the judgment result of the step S4 is that the report is not repeated, executing a step S512;
wherein the step S511 includes: outputting first prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the first prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project; specifically, the prompting mode of the first prompting information is to output and display a prompt.
Wherein the step S512 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting a second prompt message to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology project in the science and technology project database, and extracting text information from the declaration material electronic document of the next history science and technology project to acquire a plurality of corresponding history text information with different dimensions; and then proceeds to steps S3 to S51.
Optionally, the step S511 specifically includes:
after the first prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S512 is performed.
Specifically, the purpose of outputting the first prompt information in step S511 is to facilitate the final manual confirmation of the reporting material of the technical project to be reviewed according to the reporting material electronic document of the output history technical project; suspending the intelligent audit flow of the declaration material of the current technical project to be reviewed while outputting the first prompt information; then, receiving confirmation information input by the user, wherein: if the user determines that the repeated declaration is not performed according to the first prompt information, inputting a confirmation message of yes, and ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; if the user determines that the repeated declaration is not performed according to the first prompt information, the confirmation information is input as no. The user in this embodiment refers to a reviewer.
In this embodiment, because the purpose of this embodiment is to repeat the reporting problem of the intelligent audit project, in this embodiment, when the situation of repeating the reporting is found, the reporting of the material intelligent audit process is suspended, and corresponding prompt information is output to the prompt device to prompt, for example, in a manner of displaying the prompt device, so as to request the panel to perform manual confirmation, if the confirmation is repeated reporting, the reporting of the material intelligent audit process is ended, and if the confirmation is not repeated reporting, the process continues to traverse other historical scientific and technological projects in the scientific and technological project database, thereby not only saving the time of intelligent audit, but also effectively avoiding the audit error problem caused by the system fault.
Optionally, the second prompt information includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
It should be noted that, because the similarity between the declared material electronic documents of the part of the historical technological projects and the declared material electronic documents of the technological projects to be reviewed is very close to the preset similarity threshold value, but is only slightly smaller than the similarity threshold value, because any intelligent system may have errors, the declared material electronic documents of the highest 5 historical technological projects need to be output for the reviewer to carry out final manual comparison, and an accurate review result is given.
Example III
In addition to steps S1 to S4 of the method according to the first embodiment, the second embodiment of the present invention is an optimization scheme based on the method according to the first embodiment, and further includes step S52.
The preset multiple different dimensions further comprise item classification, wherein the item classification comprises a relay protection automation group, a power transmission group, a power distribution group, a power generation group, a communication and information group, a metering marketing group, a system operation and intelligent power grid group and a power transformation group.
Specifically, according to the item classification, the declaration material electronic documents of the historical technical items stored in the technical item database are also stored in a classified manner according to a relay automation group, a power transmission group, a power distribution group, a power generation group, a communication and information group, a metering marketing group, a system operation and intelligent power grid group, a power transformation group, and it can be understood that the technical item database comprises a plurality of sub databases which are arranged according to the item classification and are respectively used for storing the declaration material electronic documents of the technical items of different types such as the relay automation group, the power transmission group, the power distribution group, the power generation group, the communication and information group, the metering marketing group, the system operation and intelligent power grid group, the power transformation group, and the like.
The step S2 in the third method specifically includes:
according to the item classification of the technical items to be reviewed, acquiring declaration material electronic documents of historical technical items corresponding to the item classification in the technical item database, and extracting text information from the acquired declaration material electronic documents of the historical technical items to obtain historical text information with different dimensions.
Wherein, the step S52 includes:
when the judgment result of the step S4 is a repeated declaration, executing a step S521;
when the judgment result of the step S4 is that the report is not repeated, executing a step S522;
wherein the step S521 includes: outputting third prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the third prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project; specifically, the prompting mode of the third prompting information is to output and display a prompt.
Wherein the step S522 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting fourth prompt information to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology item of the corresponding item type in the science and technology item database, and extracting text information from the declaration material electronic document of the next history science and technology item to acquire corresponding history text information with a plurality of different dimensions; and then proceeds to steps S3 to S52.
Optionally, the step S521 specifically includes:
after the third prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S522 is performed.
Specifically, the purpose of outputting the third prompt information in step S511 is to facilitate the final manual confirmation of the reporting material of the technical project to be reviewed according to the reporting material electronic document of the output history technical project; suspending the intelligent audit flow of the declaration material of the current technical project to be reviewed while outputting the third prompt information; then, receiving confirmation information input by the user, wherein: if the user determines that the repeated declaration is not performed according to the third prompt information, inputting a confirmation message of yes, and ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; and if the user determines that the repeated declaration is not performed according to the third prompt information, inputting a confirmation message as no. The user in this embodiment refers to a reviewer.
In this embodiment, because the purpose of this embodiment is to repeat the reporting problem of the intelligent audit project, in this embodiment, when the situation of repeating the reporting is found, the reporting of the material intelligent audit process is suspended, and corresponding prompt information is output to the prompt device to prompt, for example, in a manner of displaying the prompt device, so as to request the panel to perform manual confirmation, if the confirmation is repeated reporting, the reporting of the material intelligent audit process is ended, and if the confirmation is not repeated reporting, the process continues to traverse other historical scientific and technological projects in the scientific and technological project database, thereby not only saving the time of intelligent audit, but also effectively avoiding the audit error problem caused by the system fault.
Optionally, the fourth prompting information includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
It should be noted that, because the similarity between the declared material electronic documents of the part of the historical technological projects and the declared material electronic documents of the technological projects to be reviewed is very close to the preset similarity threshold value, but is only slightly smaller than the similarity threshold value, because any intelligent system may have errors, the declared material electronic documents of the highest 5 historical technological projects need to be output for the reviewer to carry out final manual comparison, and an accurate review result is given.
Example IV
The fourth embodiment of the present invention further provides a computer readable storage medium, on which a computer program is stored, where the computer program when executed by a processor implements the intelligent analysis method for repeatedly declaring according to the above-mentioned scientific research project.
Illustratively, the computer-readable storage medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), an electrical carrier signal, a telecommunications signal, a software distribution medium, and so forth.
In summary, the embodiment of the invention provides an intelligent analysis method and a storage medium for repeated declaration of scientific research projects, which do not need to rely on professional manual reading, discrimination and comparison, are time-consuming and labor-consuming, and the embodiment of the invention provides a method for comprehensively considering the similarity of text information in a plurality of different dimensions, setting corresponding weight coefficients in advance according to the influence of corresponding repeated standing of the text information in the different dimensions, and calculating the similarity of the scientific and technological project to be reviewed and the historical scientific and technological project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions; and finally, judging whether the technical project to be reviewed is repeatedly declared according to the similarity between the technical project to be reviewed and the historical technical project and the comparison result of a preset similarity threshold value, ensuring the high efficiency and accuracy of similarity judgment, realizing intelligent auxiliary stand review, avoiding repeated stands, and ensuring the quality improvement and efficiency of stand management work.
The foregoing description of embodiments of the invention has been presented for purposes of illustration and description, and is not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the various embodiments described. The terminology used herein was chosen in order to best explain the principles of the embodiments, the practical application, or the technical improvements in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.
Claims (4)
1. The intelligent analysis method for repeated declaration of scientific research projects is characterized by comprising the following steps:
step S1, receiving a declaration material electronic document of a technical project to be reviewed, and extracting text information from the declaration material electronic document of the technical project to be reviewed to obtain a plurality of text information to be reviewed in different dimensions; the format of the declaration material electronic document of the technical project to be reviewed is the same as that of the declaration material electronic document of the historical technical project, and the declaration material electronic document comprises a plurality of preset text messages with different dimensions; the preset multiple different dimensions comprise project titles, project abstracts, project expected targets, project technical routes, project research contents and project classifications, wherein the project classifications comprise at least one of relay protection automation groups, power transmission groups, power distribution groups, power generation groups, communication and information groups, metering marketing groups, system operation and intelligent power grid groups and power transformation groups;
step S2, acquiring declaration material electronic documents of historical technological projects corresponding to project classification in a technological project database according to the project classification of the technological projects to be reviewed, and extracting text information from the acquired declaration material electronic documents of the current historical technological projects to obtain historical text information with different dimensions;
s3, calculating the similarity of the technical project to be reviewed and the current historical technical project in a plurality of different dimensions according to the text information to be reviewed and the historical text information in the plurality of different dimensions, and calculating the similarity of the technical project to be reviewed and the current historical technical project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions;
s4, judging whether the technical project to be reviewed is repeatedly declared or not according to the similarity between the technical project to be reviewed and the current historical technical project and a comparison result of a preset similarity threshold;
step S51, executing step S511 when the judgment result of the step S4 is repeated reporting; when the judgment result of the step S4 is that the report is not repeated, executing a step S512;
wherein the step S511 includes: outputting first prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the first prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project;
wherein the step S512 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting a second prompt message to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology project in the science and technology project database, and extracting text information from the declaration material electronic document of the next history science and technology project to acquire a plurality of corresponding history text information with different dimensions; and then proceeds to steps S3 to S51.
2. The intelligent analysis method for repeated declaration of scientific research projects according to claim 1, wherein the step S511 specifically includes:
after the first prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S512 is performed.
3. The intelligent analysis method for repeated declaration of scientific research projects according to claim 2, wherein the second prompt message includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
4. A computer-readable storage medium having stored thereon a computer program, characterized by: the computer program, when executed by a processor, implements the intelligent analysis method for repeated declaration of scientific research projects according to any one of claims 1 to 3.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011258000.3A CN112199936B (en) | 2020-11-12 | 2020-11-12 | Intelligent analysis method and storage medium for repeated declaration of scientific research projects |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011258000.3A CN112199936B (en) | 2020-11-12 | 2020-11-12 | Intelligent analysis method and storage medium for repeated declaration of scientific research projects |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112199936A CN112199936A (en) | 2021-01-08 |
CN112199936B true CN112199936B (en) | 2024-01-23 |
Family
ID=74033396
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011258000.3A Active CN112199936B (en) | 2020-11-12 | 2020-11-12 | Intelligent analysis method and storage medium for repeated declaration of scientific research projects |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112199936B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113421026A (en) * | 2021-07-19 | 2021-09-21 | 首都医科大学附属北京儿童医院 | Hospital scientific research project application management method and system |
CN113793666B (en) * | 2021-09-16 | 2023-10-27 | 中国人民解放军空军军医大学 | Method and system for processing compound mode neuron information |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004066086A2 (en) * | 2003-01-23 | 2004-08-05 | Verdasys, Inc. | Identifying similarities and history of modification within large collections of unstructured data |
US7194471B1 (en) * | 1998-04-10 | 2007-03-20 | Ricoh Company, Ltd. | Document classification system and method for classifying a document according to contents of the document |
WO2011056196A1 (en) * | 2009-11-09 | 2011-05-12 | Projectionworks, Inc. | Systems and methods for optically projecting three-dimensional text, images and/or symbols onto three-dimensional objects |
CN109886845A (en) * | 2019-01-08 | 2019-06-14 | 平安科技(深圳)有限公司 | Smart contract auditing method, device, computer equipment and storage medium |
CN110020026A (en) * | 2017-07-19 | 2019-07-16 | 上海互宝能源科技有限责任公司 | The duplicate checking system and method for project application data |
CN110928985A (en) * | 2019-10-14 | 2020-03-27 | 广西壮族自治区科学技术情报研究所 | Scientific and technological project duplicate checking method for automatically extracting near-meaning words based on deep learning algorithm |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7971180B2 (en) * | 2007-06-13 | 2011-06-28 | International Business Machines Corporation | Method and system for evaluating multi-dimensional project plans for implementing packaged software applications |
US10741093B2 (en) * | 2017-06-09 | 2020-08-11 | Act, Inc. | Automated determination of degree of item similarity in the generation of digitized examinations |
-
2020
- 2020-11-12 CN CN202011258000.3A patent/CN112199936B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7194471B1 (en) * | 1998-04-10 | 2007-03-20 | Ricoh Company, Ltd. | Document classification system and method for classifying a document according to contents of the document |
WO2004066086A2 (en) * | 2003-01-23 | 2004-08-05 | Verdasys, Inc. | Identifying similarities and history of modification within large collections of unstructured data |
WO2011056196A1 (en) * | 2009-11-09 | 2011-05-12 | Projectionworks, Inc. | Systems and methods for optically projecting three-dimensional text, images and/or symbols onto three-dimensional objects |
CN110020026A (en) * | 2017-07-19 | 2019-07-16 | 上海互宝能源科技有限责任公司 | The duplicate checking system and method for project application data |
CN109886845A (en) * | 2019-01-08 | 2019-06-14 | 平安科技(深圳)有限公司 | Smart contract auditing method, device, computer equipment and storage medium |
CN110928985A (en) * | 2019-10-14 | 2020-03-27 | 广西壮族自治区科学技术情报研究所 | Scientific and technological project duplicate checking method for automatically extracting near-meaning words based on deep learning algorithm |
Also Published As
Publication number | Publication date |
---|---|
CN112199936A (en) | 2021-01-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111401777B (en) | Enterprise risk assessment method, enterprise risk assessment device, terminal equipment and storage medium | |
CN111159533B (en) | Intelligent charging service recommendation method and system based on user image | |
CN102819772B (en) | Power matching network builds material requirements Forecasting Methodology and device | |
CN112199936B (en) | Intelligent analysis method and storage medium for repeated declaration of scientific research projects | |
CN112199937B (en) | Short text similarity analysis method and system, computer equipment and medium thereof | |
CN112199938A (en) | Scientific and technological project similarity analysis method, computer equipment and storage medium | |
US20020004790A1 (en) | Questionnaire analysis system | |
CN105488019B (en) | A kind of equipment for monitoring power quality Fulfill testing report automatically method | |
CN112214986B (en) | An intelligent analysis device for repeated declaration of scientific research projects | |
Daim et al. | Technology diffusion: forecasting with bibliometric analysis and Bass model | |
CN115809887A (en) | Method and device for determining main business range of enterprise based on invoice data | |
CN113268614B (en) | Label system updating method and device, electronic equipment and readable storage medium | |
CN117114596A (en) | Contract project data examination method and system | |
CN113505273B (en) | Data sorting method, device, equipment and medium based on repeated data screening | |
CN112132690A (en) | Foreign exchange product information pushing method and device, computer equipment and storage medium | |
CN103279549B (en) | A method and device for acquiring target data of a target object | |
CN109471871A (en) | Bus management method and device | |
CN117094688A (en) | Digital control method and system for power supply station | |
CN113537519A (en) | Method and device for identifying abnormal equipment | |
CN117973530A (en) | Inference calculation method, device, equipment and storage medium based on large language model | |
CN117371856A (en) | Data quality monitoring method and device, storage medium and computer equipment | |
CN113626605B (en) | Information classification method, device, electronic equipment and readable storage medium | |
CN116795995A (en) | Knowledge graph construction method, knowledge graph construction device, computer equipment and storage medium | |
Zhang et al. | A discrete Jaya algorithm for vehicle routing problems with uncertain demands | |
CN112906723A (en) | Feature selection method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |