[go: up one dir, main page]

CN112199936B - Intelligent analysis method and storage medium for repeated declaration of scientific research projects - Google Patents

Intelligent analysis method and storage medium for repeated declaration of scientific research projects Download PDF

Info

Publication number
CN112199936B
CN112199936B CN202011258000.3A CN202011258000A CN112199936B CN 112199936 B CN112199936 B CN 112199936B CN 202011258000 A CN202011258000 A CN 202011258000A CN 112199936 B CN112199936 B CN 112199936B
Authority
CN
China
Prior art keywords
project
reviewed
technical
historical
declaration
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011258000.3A
Other languages
Chinese (zh)
Other versions
CN112199936A (en
Inventor
汪伟
章彬
汪桢子
何维
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Power Supply Bureau Co Ltd
Original Assignee
Shenzhen Power Supply Bureau Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Power Supply Bureau Co Ltd filed Critical Shenzhen Power Supply Bureau Co Ltd
Priority to CN202011258000.3A priority Critical patent/CN112199936B/en
Publication of CN112199936A publication Critical patent/CN112199936A/en
Application granted granted Critical
Publication of CN112199936B publication Critical patent/CN112199936B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/194Calculation of difference between files
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Operations Research (AREA)
  • General Business, Economics & Management (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • General Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to an intelligent analysis method and a storage medium for repeated declaration of scientific research projects, wherein the method comprises the following steps: receiving a declaration material electronic document of a technical project to be reviewed, and extracting text information to obtain a plurality of text information to be reviewed in different dimensions; acquiring declaration material electronic documents of historical science and technology projects in a science and technology project database, and extracting text information to obtain historical text information with a plurality of different dimensions; calculating the similarity of the technical project to be reviewed and the current historical technical project in a plurality of different dimensions according to the text information to be reviewed and the historical text information, and calculating the similarity of the technical project to be reviewed and the current historical technical project according to the similarity in a plurality of different dimensions; and judging whether the technical project to be reviewed is repeatedly declared or not according to the similarity between the technical project to be reviewed and the current historical technical project. The invention can intelligently assist in the evaluation of the stands, avoid repeated stands, and ensure the quality improvement and efficiency improvement of the stand management work.

Description

Intelligent analysis method and storage medium for repeated declaration of scientific research projects
Technical Field
The invention relates to the technical field of software information, in particular to an intelligent analysis method and a storage medium for repeated declaration of scientific research projects.
Background
With the continuous deep development of electric power reform and continuous development of science and technology, scientific and technical research projects in each professional field of power grid companies are reviewed more and more, and in order to avoid repeated reporting of similar projects, similarity examination needs to be conducted on reporting materials of the scientific and technical research projects. In general, science and technology project reporting materials are large texts, and at present, the similarity judging mode of the science and technology projects needs to rely on professional manual reading and screening comparison, so that each science and technology project reporting material needs to be manually compared with mass prior science and technology project reporting materials in a database, a large amount of manpower and time cost is consumed, and the high efficiency and accuracy of similarity judgment are difficult to ensure.
Disclosure of Invention
The invention aims to provide an intelligent analysis method and a storage medium for repeated declaration of scientific research projects, so as to realize intelligent auxiliary stand review, avoid repeated stands and ensure the quality improvement and efficiency improvement of stand management work.
According to a first aspect, an embodiment of the present invention provides an intelligent analysis method for repeated declaration of scientific research projects, including:
step S1, receiving a declaration material electronic document of a technical project to be reviewed, and extracting text information from the declaration material electronic document of the technical project to be reviewed to obtain a plurality of text information to be reviewed in different dimensions;
s2, acquiring declaration material electronic documents of historical science and technology projects in a science and technology project database, and extracting text information from the acquired declaration material electronic documents of the current historical science and technology projects to obtain historical text information with a plurality of different dimensions;
s3, calculating the similarity of the technical project to be reviewed and the current historical technical project in a plurality of different dimensions according to the text information to be reviewed and the historical text information in the plurality of different dimensions, and calculating the similarity of the technical project to be reviewed and the current historical technical project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions;
and S4, judging whether the technical project to be reviewed is repeatedly declared or not according to the similarity between the technical project to be reviewed and the current historical technical project and the comparison result of a preset similarity threshold value.
Optionally, the formats of the electronic documents of the reporting materials of the technical project to be reviewed and the historical technical project are the same, and the electronic documents comprise preset text information with a plurality of different dimensions; the preset multiple different dimensions at least comprise project titles, project abstracts, project expected targets, project technical routes and project research contents.
Optionally, the method further includes step S51, and the step S51 includes:
when the judgment result of the step S4 is a repeated declaration, executing a step S511;
when the judgment result of the step S4 is that the report is not repeated, executing a step S512;
wherein the step S511 includes: outputting first prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the first prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project;
wherein the step S512 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting a second prompt message to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology project in the science and technology project database, and extracting text information from the declaration material electronic document of the next history science and technology project to acquire a plurality of corresponding history text information with different dimensions; and then proceeds to steps S3 to S51.
Optionally, the step S511 specifically includes:
after the first prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S512 is performed.
Optionally, the second prompt information includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
Optionally, the preset plurality of different dimensions further includes a project classification including at least one of a relay automation group, a transmission group, a distribution group, a generation group, a communication and information group, a metering marketing group, a system operation and intelligent power grid group, a transformation group.
The step S2 specifically includes:
according to the item classification of the technical items to be reviewed, acquiring declaration material electronic documents of historical technical items corresponding to the item classification in the technical item database, and extracting text information from the acquired declaration material electronic documents of the historical technical items to obtain historical text information with different dimensions.
Optionally, the method further includes step S52, and the step S52 includes:
when the judgment result of the step S4 is a repeated declaration, executing a step S521;
when the judgment result of the step S4 is that the report is not repeated, executing a step S522;
wherein the step S521 includes: outputting third prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the third prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project;
wherein the step S522 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting fourth prompt information to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology item of the corresponding item type in the science and technology item database, and extracting text information from the declaration material electronic document of the next history science and technology item to acquire corresponding history text information with a plurality of different dimensions; and then proceeds to steps S3 to S52.
Optionally, the step S521 specifically includes:
after the third prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S522 is performed.
Optionally, the fourth prompting information includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
According to a second aspect, an embodiment of the present invention proposes a computer readable storage medium having stored thereon a computer program, which when executed by a processor, implements the above-mentioned intelligent analysis method for repeated declaration of scientific research projects.
The embodiment of the invention provides an intelligent analysis method and a storage medium for repeated declaration of scientific research projects, which do not need to rely on professional manual reading, screening and comparison, are time-consuming and labor-consuming, and the embodiment of the invention provides a method for comprehensively considering the similarity of text information in a plurality of different dimensions, setting corresponding weight coefficients in advance according to the influence of corresponding repeated items of the text information in the different dimensions, and calculating the similarity of the scientific and technological project to be reviewed and the historical scientific and technological project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions; and finally, judging whether the technical project to be reviewed is repeatedly declared according to the similarity between the technical project to be reviewed and the historical technical project and the comparison result of a preset similarity threshold value, ensuring the high efficiency and accuracy of similarity judgment, realizing intelligent auxiliary stand review, avoiding repeated stands, and ensuring the quality improvement and efficiency of stand management work.
Additional features and advantages of the invention will be set forth in the detailed description which follows.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flowchart of a method for repeatedly reporting intelligent analysis of a scientific research project according to a first embodiment of the invention.
Fig. 2 is a flowchart of step S51 in the second embodiment of the present invention.
Fig. 3 is a flowchart of step S52 in the third embodiment of the present invention.
Detailed Description
Various exemplary embodiments, features and aspects of the disclosure will be described in detail below with reference to the drawings. Furthermore, in the following detailed description, numerous specific details are set forth in order to provide a better illustration of the invention. It will be understood by those skilled in the art that the present invention may be practiced without some of these specific details. In some instances, well known means have not been described in detail in order to not obscure the present invention.
Example 1
An embodiment of the present invention provides a method for repeatedly reporting intelligent analysis of scientific research projects, fig. 1 is a schematic flow chart of a method of the embodiment of the present invention, and referring to fig. 1, the method of the embodiment of the present invention includes the following steps:
step S1, receiving a declaration material electronic document of a technical project to be reviewed, and extracting text information from the declaration material electronic document of the technical project to be reviewed to obtain a plurality of text information to be reviewed in different dimensions;
specifically, the format of the electronic document of the declaration material is docx document, a declaration material template is preset, and the declaration material of each review submitted technical project is filled in according to the declaration material template, wherein the template comprises text information with different dimensions; illustratively, the preset plurality of different dimensions includes at least project title, project abstract, project intended goal, project technical route, project study content, and the like.
S2, acquiring declaration material electronic documents of historical science and technology projects in a science and technology project database, and extracting text information from the acquired declaration material electronic documents of the current historical science and technology projects to obtain historical text information with a plurality of different dimensions;
specifically, the formats of the declaration material electronic documents of the technical project to be reviewed and the historical technical project are the same, and the declaration material electronic documents comprise preset text information with a plurality of different dimensions.
S3, calculating the similarity of the technical project to be reviewed and the current historical technical project in a plurality of different dimensions according to the text information to be reviewed and the historical text information in the plurality of different dimensions, and calculating the similarity of the technical project to be reviewed and the current historical technical project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions;
illustratively, the calculated similarities of the two in project title, project abstract, project expected goal, project technical route, project research content are respectively: x1, X2, X3, X4, X5; correspondingly, the weight coefficients of the project title, the project abstract, the project expected target, the project technical route and the project research content are K1, K2, K3, K4 and K5 respectively; the overall similarity X between the technical project to be reviewed and the current historical technical project is: x=x1×k1+x2×k2+x3×k3+x4×k4+x5×k5.
Preferably, K1< K2< K3< K4< K5.
And S4, judging whether the technical project to be reviewed is repeatedly declared or not according to the similarity between the technical project to be reviewed and the current historical technical project and the comparison result of a preset similarity threshold value.
Specifically, the preset similarity threshold T in the present embodiment is preferably but not limited to 90%; and when X is greater than or equal to T, judging that the technical project to be reviewed is repeatedly declared if the technical project to be reviewed is similar to the project content of the current historical technical project. Otherwise, when X is smaller than T, the content of the technical project to be reviewed is dissimilar to that of the current historical technical project, and the technical project to be reviewed is judged not to be repeatedly declared.
Still further exemplary, the method further comprises calculating a weight coefficient for each dimension, each dimension representing an index, specifically using information entropy, the information entropy formula being as follows:
wherein y is j Represents the j-th metric, m represents the number of subjects (i.e. how many subjects are) of the statistical training data, y ij The j-th normalized evaluation index value representing the i-th dimension text information has the following calculation formula:
n represents the number of evaluation indexes, in general, the greater the uncertainty degree of a certain index value in the comprehensive evaluation indexes is, the greater the information entropy is, the greater the information quantity provided by the indexes is, and the greater the weight coefficient is; conversely, the smaller the weight coefficient of the index. Therefore, the weight coefficient-entropy weight of each index can be calculated by utilizing the information entropy according to the chaotic degree of each index. The specific calculation formula is as follows:
wherein w is j For the corresponding weight of the j-th index, G j =1-E j (1. Ltoreq.j. Ltoreq.n), indicating the degree of difference of the indexes, E j =H(y j ) /lnm, referred to as entropy.
Optionally, the formats of the electronic documents of the reporting materials of the technical project to be reviewed and the historical technical project are the same, and the electronic documents comprise preset text information with a plurality of different dimensions; the preset multiple different dimensions at least comprise project titles, project abstracts, project expected targets, project technical routes and project research contents.
Example two
An embodiment II of the present invention is an optimization scheme based on the method described in the above embodiment I, and includes, in addition to steps S1 to S4 of the method described in the embodiment I, a step S51, where the step S51 includes:
when the judgment result of the step S4 is a repeated declaration, executing a step S511;
when the judgment result of the step S4 is that the report is not repeated, executing a step S512;
wherein the step S511 includes: outputting first prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the first prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project; specifically, the prompting mode of the first prompting information is to output and display a prompt.
Wherein the step S512 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting a second prompt message to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology project in the science and technology project database, and extracting text information from the declaration material electronic document of the next history science and technology project to acquire a plurality of corresponding history text information with different dimensions; and then proceeds to steps S3 to S51.
Optionally, the step S511 specifically includes:
after the first prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S512 is performed.
Specifically, the purpose of outputting the first prompt information in step S511 is to facilitate the final manual confirmation of the reporting material of the technical project to be reviewed according to the reporting material electronic document of the output history technical project; suspending the intelligent audit flow of the declaration material of the current technical project to be reviewed while outputting the first prompt information; then, receiving confirmation information input by the user, wherein: if the user determines that the repeated declaration is not performed according to the first prompt information, inputting a confirmation message of yes, and ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; if the user determines that the repeated declaration is not performed according to the first prompt information, the confirmation information is input as no. The user in this embodiment refers to a reviewer.
In this embodiment, because the purpose of this embodiment is to repeat the reporting problem of the intelligent audit project, in this embodiment, when the situation of repeating the reporting is found, the reporting of the material intelligent audit process is suspended, and corresponding prompt information is output to the prompt device to prompt, for example, in a manner of displaying the prompt device, so as to request the panel to perform manual confirmation, if the confirmation is repeated reporting, the reporting of the material intelligent audit process is ended, and if the confirmation is not repeated reporting, the process continues to traverse other historical scientific and technological projects in the scientific and technological project database, thereby not only saving the time of intelligent audit, but also effectively avoiding the audit error problem caused by the system fault.
Optionally, the second prompt information includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
It should be noted that, because the similarity between the declared material electronic documents of the part of the historical technological projects and the declared material electronic documents of the technological projects to be reviewed is very close to the preset similarity threshold value, but is only slightly smaller than the similarity threshold value, because any intelligent system may have errors, the declared material electronic documents of the highest 5 historical technological projects need to be output for the reviewer to carry out final manual comparison, and an accurate review result is given.
Example III
In addition to steps S1 to S4 of the method according to the first embodiment, the second embodiment of the present invention is an optimization scheme based on the method according to the first embodiment, and further includes step S52.
The preset multiple different dimensions further comprise item classification, wherein the item classification comprises a relay protection automation group, a power transmission group, a power distribution group, a power generation group, a communication and information group, a metering marketing group, a system operation and intelligent power grid group and a power transformation group.
Specifically, according to the item classification, the declaration material electronic documents of the historical technical items stored in the technical item database are also stored in a classified manner according to a relay automation group, a power transmission group, a power distribution group, a power generation group, a communication and information group, a metering marketing group, a system operation and intelligent power grid group, a power transformation group, and it can be understood that the technical item database comprises a plurality of sub databases which are arranged according to the item classification and are respectively used for storing the declaration material electronic documents of the technical items of different types such as the relay automation group, the power transmission group, the power distribution group, the power generation group, the communication and information group, the metering marketing group, the system operation and intelligent power grid group, the power transformation group, and the like.
The step S2 in the third method specifically includes:
according to the item classification of the technical items to be reviewed, acquiring declaration material electronic documents of historical technical items corresponding to the item classification in the technical item database, and extracting text information from the acquired declaration material electronic documents of the historical technical items to obtain historical text information with different dimensions.
Wherein, the step S52 includes:
when the judgment result of the step S4 is a repeated declaration, executing a step S521;
when the judgment result of the step S4 is that the report is not repeated, executing a step S522;
wherein the step S521 includes: outputting third prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the third prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project; specifically, the prompting mode of the third prompting information is to output and display a prompt.
Wherein the step S522 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting fourth prompt information to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology item of the corresponding item type in the science and technology item database, and extracting text information from the declaration material electronic document of the next history science and technology item to acquire corresponding history text information with a plurality of different dimensions; and then proceeds to steps S3 to S52.
Optionally, the step S521 specifically includes:
after the third prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S522 is performed.
Specifically, the purpose of outputting the third prompt information in step S511 is to facilitate the final manual confirmation of the reporting material of the technical project to be reviewed according to the reporting material electronic document of the output history technical project; suspending the intelligent audit flow of the declaration material of the current technical project to be reviewed while outputting the third prompt information; then, receiving confirmation information input by the user, wherein: if the user determines that the repeated declaration is not performed according to the third prompt information, inputting a confirmation message of yes, and ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; and if the user determines that the repeated declaration is not performed according to the third prompt information, inputting a confirmation message as no. The user in this embodiment refers to a reviewer.
In this embodiment, because the purpose of this embodiment is to repeat the reporting problem of the intelligent audit project, in this embodiment, when the situation of repeating the reporting is found, the reporting of the material intelligent audit process is suspended, and corresponding prompt information is output to the prompt device to prompt, for example, in a manner of displaying the prompt device, so as to request the panel to perform manual confirmation, if the confirmation is repeated reporting, the reporting of the material intelligent audit process is ended, and if the confirmation is not repeated reporting, the process continues to traverse other historical scientific and technological projects in the scientific and technological project database, thereby not only saving the time of intelligent audit, but also effectively avoiding the audit error problem caused by the system fault.
Optionally, the fourth prompting information includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
It should be noted that, because the similarity between the declared material electronic documents of the part of the historical technological projects and the declared material electronic documents of the technological projects to be reviewed is very close to the preset similarity threshold value, but is only slightly smaller than the similarity threshold value, because any intelligent system may have errors, the declared material electronic documents of the highest 5 historical technological projects need to be output for the reviewer to carry out final manual comparison, and an accurate review result is given.
Example IV
The fourth embodiment of the present invention further provides a computer readable storage medium, on which a computer program is stored, where the computer program when executed by a processor implements the intelligent analysis method for repeatedly declaring according to the above-mentioned scientific research project.
Illustratively, the computer-readable storage medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), an electrical carrier signal, a telecommunications signal, a software distribution medium, and so forth.
In summary, the embodiment of the invention provides an intelligent analysis method and a storage medium for repeated declaration of scientific research projects, which do not need to rely on professional manual reading, discrimination and comparison, are time-consuming and labor-consuming, and the embodiment of the invention provides a method for comprehensively considering the similarity of text information in a plurality of different dimensions, setting corresponding weight coefficients in advance according to the influence of corresponding repeated standing of the text information in the different dimensions, and calculating the similarity of the scientific and technological project to be reviewed and the historical scientific and technological project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions; and finally, judging whether the technical project to be reviewed is repeatedly declared according to the similarity between the technical project to be reviewed and the historical technical project and the comparison result of a preset similarity threshold value, ensuring the high efficiency and accuracy of similarity judgment, realizing intelligent auxiliary stand review, avoiding repeated stands, and ensuring the quality improvement and efficiency of stand management work.
The foregoing description of embodiments of the invention has been presented for purposes of illustration and description, and is not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the various embodiments described. The terminology used herein was chosen in order to best explain the principles of the embodiments, the practical application, or the technical improvements in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.

Claims (4)

1. The intelligent analysis method for repeated declaration of scientific research projects is characterized by comprising the following steps:
step S1, receiving a declaration material electronic document of a technical project to be reviewed, and extracting text information from the declaration material electronic document of the technical project to be reviewed to obtain a plurality of text information to be reviewed in different dimensions; the format of the declaration material electronic document of the technical project to be reviewed is the same as that of the declaration material electronic document of the historical technical project, and the declaration material electronic document comprises a plurality of preset text messages with different dimensions; the preset multiple different dimensions comprise project titles, project abstracts, project expected targets, project technical routes, project research contents and project classifications, wherein the project classifications comprise at least one of relay protection automation groups, power transmission groups, power distribution groups, power generation groups, communication and information groups, metering marketing groups, system operation and intelligent power grid groups and power transformation groups;
step S2, acquiring declaration material electronic documents of historical technological projects corresponding to project classification in a technological project database according to the project classification of the technological projects to be reviewed, and extracting text information from the acquired declaration material electronic documents of the current historical technological projects to obtain historical text information with different dimensions;
s3, calculating the similarity of the technical project to be reviewed and the current historical technical project in a plurality of different dimensions according to the text information to be reviewed and the historical text information in the plurality of different dimensions, and calculating the similarity of the technical project to be reviewed and the current historical technical project according to the similarity in the plurality of different dimensions and the weight coefficients in the plurality of different dimensions;
s4, judging whether the technical project to be reviewed is repeatedly declared or not according to the similarity between the technical project to be reviewed and the current historical technical project and a comparison result of a preset similarity threshold;
step S51, executing step S511 when the judgment result of the step S4 is repeated reporting; when the judgment result of the step S4 is that the report is not repeated, executing a step S512;
wherein the step S511 includes: outputting first prompt information to inform a user that repeated reporting problems exist in the technological project to be reviewed; the first prompt information comprises a judgment result of repeated reporting, the similarity of the technical project to be reviewed and the current historical technical project, and a reporting material electronic document of the technical project to be reviewed and the current historical technical project;
wherein the step S512 includes: judging whether the technological project to be reviewed has been subjected to similarity calculation with all historical technological projects in a technological project database; if yes, outputting a second prompt message to inform the user that the technical project to be reviewed does not have repeated reporting problems, and ending the intelligent review flow of the reporting material of the current technical project to be reviewed; if not, returning to the step S2 to continuously acquire the declaration material electronic document of the next history science and technology project in the science and technology project database, and extracting text information from the declaration material electronic document of the next history science and technology project to acquire a plurality of corresponding history text information with different dimensions; and then proceeds to steps S3 to S51.
2. The intelligent analysis method for repeated declaration of scientific research projects according to claim 1, wherein the step S511 specifically includes:
after the first prompt information is output, receiving confirmation information input by a user; the confirmation information comprises yes and no; when the confirmation information is yes, ending the intelligent audit flow of the declaration material of the current technical project to be reviewed; when the confirmation information is no, the step S512 is performed.
3. The intelligent analysis method for repeated declaration of scientific research projects according to claim 2, wherein the second prompt message includes: the method comprises the steps of judging results of non-repeated declaration, declaration material electronic documents of the technical projects to be reviewed, declaration material electronic documents of 5 historical technical projects with highest similarity with the technical projects to be reviewed, and 5 similarity data between the 5 historical technical projects with highest similarity and the technical projects to be reviewed.
4. A computer-readable storage medium having stored thereon a computer program, characterized by: the computer program, when executed by a processor, implements the intelligent analysis method for repeated declaration of scientific research projects according to any one of claims 1 to 3.
CN202011258000.3A 2020-11-12 2020-11-12 Intelligent analysis method and storage medium for repeated declaration of scientific research projects Active CN112199936B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011258000.3A CN112199936B (en) 2020-11-12 2020-11-12 Intelligent analysis method and storage medium for repeated declaration of scientific research projects

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011258000.3A CN112199936B (en) 2020-11-12 2020-11-12 Intelligent analysis method and storage medium for repeated declaration of scientific research projects

Publications (2)

Publication Number Publication Date
CN112199936A CN112199936A (en) 2021-01-08
CN112199936B true CN112199936B (en) 2024-01-23

Family

ID=74033396

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011258000.3A Active CN112199936B (en) 2020-11-12 2020-11-12 Intelligent analysis method and storage medium for repeated declaration of scientific research projects

Country Status (1)

Country Link
CN (1) CN112199936B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113421026A (en) * 2021-07-19 2021-09-21 首都医科大学附属北京儿童医院 Hospital scientific research project application management method and system
CN113793666B (en) * 2021-09-16 2023-10-27 中国人民解放军空军军医大学 Method and system for processing compound mode neuron information

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004066086A2 (en) * 2003-01-23 2004-08-05 Verdasys, Inc. Identifying similarities and history of modification within large collections of unstructured data
US7194471B1 (en) * 1998-04-10 2007-03-20 Ricoh Company, Ltd. Document classification system and method for classifying a document according to contents of the document
WO2011056196A1 (en) * 2009-11-09 2011-05-12 Projectionworks, Inc. Systems and methods for optically projecting three-dimensional text, images and/or symbols onto three-dimensional objects
CN109886845A (en) * 2019-01-08 2019-06-14 平安科技(深圳)有限公司 Smart contract auditing method, device, computer equipment and storage medium
CN110020026A (en) * 2017-07-19 2019-07-16 上海互宝能源科技有限责任公司 The duplicate checking system and method for project application data
CN110928985A (en) * 2019-10-14 2020-03-27 广西壮族自治区科学技术情报研究所 Scientific and technological project duplicate checking method for automatically extracting near-meaning words based on deep learning algorithm

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7971180B2 (en) * 2007-06-13 2011-06-28 International Business Machines Corporation Method and system for evaluating multi-dimensional project plans for implementing packaged software applications
US10741093B2 (en) * 2017-06-09 2020-08-11 Act, Inc. Automated determination of degree of item similarity in the generation of digitized examinations

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7194471B1 (en) * 1998-04-10 2007-03-20 Ricoh Company, Ltd. Document classification system and method for classifying a document according to contents of the document
WO2004066086A2 (en) * 2003-01-23 2004-08-05 Verdasys, Inc. Identifying similarities and history of modification within large collections of unstructured data
WO2011056196A1 (en) * 2009-11-09 2011-05-12 Projectionworks, Inc. Systems and methods for optically projecting three-dimensional text, images and/or symbols onto three-dimensional objects
CN110020026A (en) * 2017-07-19 2019-07-16 上海互宝能源科技有限责任公司 The duplicate checking system and method for project application data
CN109886845A (en) * 2019-01-08 2019-06-14 平安科技(深圳)有限公司 Smart contract auditing method, device, computer equipment and storage medium
CN110928985A (en) * 2019-10-14 2020-03-27 广西壮族自治区科学技术情报研究所 Scientific and technological project duplicate checking method for automatically extracting near-meaning words based on deep learning algorithm

Also Published As

Publication number Publication date
CN112199936A (en) 2021-01-08

Similar Documents

Publication Publication Date Title
CN111401777B (en) Enterprise risk assessment method, enterprise risk assessment device, terminal equipment and storage medium
CN111159533B (en) Intelligent charging service recommendation method and system based on user image
CN102819772B (en) Power matching network builds material requirements Forecasting Methodology and device
CN112199936B (en) Intelligent analysis method and storage medium for repeated declaration of scientific research projects
CN112199937B (en) Short text similarity analysis method and system, computer equipment and medium thereof
CN112199938A (en) Scientific and technological project similarity analysis method, computer equipment and storage medium
US20020004790A1 (en) Questionnaire analysis system
CN105488019B (en) A kind of equipment for monitoring power quality Fulfill testing report automatically method
CN112214986B (en) An intelligent analysis device for repeated declaration of scientific research projects
Daim et al. Technology diffusion: forecasting with bibliometric analysis and Bass model
CN115809887A (en) Method and device for determining main business range of enterprise based on invoice data
CN113268614B (en) Label system updating method and device, electronic equipment and readable storage medium
CN117114596A (en) Contract project data examination method and system
CN113505273B (en) Data sorting method, device, equipment and medium based on repeated data screening
CN112132690A (en) Foreign exchange product information pushing method and device, computer equipment and storage medium
CN103279549B (en) A method and device for acquiring target data of a target object
CN109471871A (en) Bus management method and device
CN117094688A (en) Digital control method and system for power supply station
CN113537519A (en) Method and device for identifying abnormal equipment
CN117973530A (en) Inference calculation method, device, equipment and storage medium based on large language model
CN117371856A (en) Data quality monitoring method and device, storage medium and computer equipment
CN113626605B (en) Information classification method, device, electronic equipment and readable storage medium
CN116795995A (en) Knowledge graph construction method, knowledge graph construction device, computer equipment and storage medium
Zhang et al. A discrete Jaya algorithm for vehicle routing problems with uncertain demands
CN112906723A (en) Feature selection method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant