[go: up one dir, main page]

CN119336966B - Network rumor recognition system based on artificial intelligence - Google Patents

Network rumor recognition system based on artificial intelligence Download PDF

Info

Publication number
CN119336966B
CN119336966B CN202411898369.9A CN202411898369A CN119336966B CN 119336966 B CN119336966 B CN 119336966B CN 202411898369 A CN202411898369 A CN 202411898369A CN 119336966 B CN119336966 B CN 119336966B
Authority
CN
China
Prior art keywords
news text
text
news
propagation
rumor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202411898369.9A
Other languages
Chinese (zh)
Other versions
CN119336966A (en
Inventor
顾紫奥
张洪松
史志欣
秦琦
唐思涵
魏一鑫
王新月
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Polytechnic College
Original Assignee
Shandong Polytechnic College
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Polytechnic College filed Critical Shandong Polytechnic College
Priority to CN202411898369.9A priority Critical patent/CN119336966B/en
Publication of CN119336966A publication Critical patent/CN119336966A/en
Application granted granted Critical
Publication of CN119336966B publication Critical patent/CN119336966B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明涉及网络谣言识别技术领域,尤其涉及一种基于人工智能的网络谣言识别系统,包括:数据采集模块,用以采集新闻文本;文本分析模块,用以根据预设关键词与新闻文本的错别字数量对新闻文本特征进行分析;用户分析模块,用以对用户行为特征进行分析;传播分析模块,用以对新闻文本的传播特征进行分析;识别模块,用以对新闻文本进行识别,还用以根据新闻文本的来源对新闻文本的来源可信度进行分析,还用以根据新闻文本的发布时间对新闻文本的时间一致性进行分析;存储模块,用以将新闻文本的识别结果进行存储,并向用户进行输出。本发明有效提高了网络谣言识别的准确率。

The present invention relates to the technical field of network rumor identification, and in particular to a network rumor identification system based on artificial intelligence, comprising: a data acquisition module for collecting news texts; a text analysis module for analyzing the features of news texts according to preset keywords and the number of typos in news texts; a user analysis module for analyzing user behavior features; a communication analysis module for analyzing the communication features of news texts; an identification module for identifying news texts, and also for analyzing the source credibility of news texts according to the sources of news texts, and also for analyzing the time consistency of news texts according to the release time of news texts; a storage module for storing the recognition results of news texts and outputting them to users. The present invention effectively improves the accuracy of network rumor identification.

Description

Network rumor recognition system based on artificial intelligence
Technical Field
The invention relates to the technical field of network rumor identification, in particular to a network rumor identification system based on artificial intelligence.
Background
With the development of the internet and social media, various types of information are spread at extremely high speed, but a large amount of false information and network rumors are included. The widespread spread of misleading information not only affects public opinion, but may also negatively impact social stability and mental health of individuals. Therefore, it is important to develop an efficient and accurate network rumor recognition system.
Chinese patent publication No. CN112231562A discloses a network rumor recognition method, namely a system, which comprises the steps of obtaining a text feature matrix according to a plurality of texts containing rumor information, constructing a propagation graph structure, wherein nodes in the graph structure are the texts, adjacent matrixes in the graph structure are forwarding and comment relations between the rumor information and the texts, constructing a graph convolution neural network model, wherein input of the graph convolution neural network model is the text feature matrix and the adjacent matrixes, output of the graph convolution neural network model is the rumor feature matrix, training the neural network model according to the rumor feature matrix to obtain a rumor recognition model, and recognizing network rumors according to the rumor recognition model. According to the method, the graph convolution neural network model is trained according to the forwarding and comment relations of the rumors among a plurality of texts, the neural network model is trained according to the rumor feature matrix, the widely and dispersedly spreading features of the rumor information are effectively captured, the rumor information can be effectively identified, and therefore, when the network rumor is identified, comprehensive analysis is not conducted on text emotion expression, social network user behaviors and data sources, and the problem of low network rumor identification accuracy exists.
Disclosure of Invention
The invention aims to provide an artificial intelligence-based network rumor recognition system which solves at least one of the problems in the prior art.
In order to achieve the above purpose, the invention adopts the following technical scheme:
an artificial intelligence based network rumor identification system comprising:
the data acquisition module is used for acquiring news texts;
The text analysis module is used for analyzing the characteristics of the news text according to the number of mispronounced words of the preset keywords and the news text;
The user analysis module is used for analyzing the behavior characteristics of the user according to the comment times, the praise times, the sharing times and the reply times of the news text publisher in the first monitoring period;
The spreading analysis module is used for analyzing the spreading speed of the news text according to the sharing total number of the news text in the second monitoring period and analyzing the spreading characteristics of the news text according to the spreading speed of the news text and the number of the spreading people of the news text in the second monitoring period;
The recognition module is used for recognizing the news text according to the analysis result of the characteristics of the news text, the analysis result of the behavior characteristics of the user and the analysis result of the propagation characteristics of the news text so as to obtain the recognition result of the news text, and is also used for adjusting the recognition process of the news text according to the number of the historical propagation rumor texts of the news text publisher and the release time of the news text;
and the storage module is used for storing the recognition result of the news text and outputting the recognition result to the user.
Further, the text analysis module comprises a text analysis unit, a text analysis unit and a text analysis unit, wherein the text analysis unit is used for matching a preset keyword with a news text, if the preset keyword is successfully matched with the news text, the preset keyword is used as a rumor keyword of the news text, the occurrence frequency of the ith rumor keyword in the news text is counted, and the occurrence frequency is recorded as fi;
The text analysis unit is also used for segmenting the news text according to the news text, and setting the emotion intensity value of the j-th word of the news text as Dj;
the text analysis unit is also used for counting the number of wrongly written words of the news text and marking the number as B.
Further, the text analysis module further comprises a feature extraction unit, and the feature extraction unit calculates text features of the news text according to the occurrence times fi of the rumor keywords in the news text, the emotion intensity value Dj of each word of the news text and the mispronounced word number B of the news text to obtain features WT of the news text.
Further, the user analysis module is configured to analyze the user behavior characteristics according to the comment times Ct, the praise times R, the sharing times L and the reply times S of the news text publisher in the first monitoring period, so as to obtain the user behavior characteristics HT.
Further, the propagation analysis module includes a speed analysis unit, which is configured to analyze a propagation speed of the news text according to a total sharing number fx of the news text in the second monitoring period, so as to obtain a propagation speed Ve of the news text.
Further, the propagation analysis module further comprises a construction unit, wherein the construction unit is used for constructing the propagator coefficients according to the number of the propagators of the news text in the second monitoring period, and the construction result of the propagator coefficients comprises P1, P2 and P3.
Further, the propagation analysis module further includes a propagation analysis unit, configured to analyze the propagation characteristics of the news text according to the propagation speed Ve of the news text and the number zc of the propagators and the construction result of the propagator coefficient of the news text in the second monitoring period, so as to obtain the propagation characteristics CT of the news text.
Further, the recognition module includes a recognition unit for recognizing the news text according to the analysis result of the characteristics of the news text, the analysis result of the behavior characteristics of the user, and the analysis result of the propagation characteristics of the news text, wherein:
if γ1×wt+γ2×ht+γ3×ct is not more than u0, the recognition unit determines that the news text is a normal news text;
If γ1×wt+γ2×ht+γ3×ct > u0, the recognition unit determines that the news text is a rumor risk news text;
Wherein γ1 is text weight, γ2 is behavior weight, γ3 is propagation weight, and u0 is preset anomaly coefficient.
Further, the recognition module further comprises a source analysis unit, the source analysis unit constructs source anomaly coefficients of the news text according to the number ys of the historical propagation rumor texts of the news text publisher, adjusts the recognition process of the news text according to the source anomaly coefficients, and sets the adjusted preset anomaly coefficients as u1.
Further, the recognition further comprises a consistency analysis unit, the consistency analysis unit analyzes consistency parameters YZ of the news text according to the release days t0 and the preset release days t1 of the news text, compares the consistency parameters YZ of the news text with a preset consistency threshold YZ0, and sets a preset adjustment ratio as eta 1 if the consistency parameters of the news text do not accord with the threshold.
The method has the advantages that through multi-level feature analysis, including text features, user behaviors, propagation features and source credibility, network rumors can be accurately identified, accuracy and efficiency of judgment are improved, the introduction of the user analysis and propagation analysis module is beneficial to better evaluating information propagation risks, powerful support is provided for decision making, and finally, the storage and feedback functions of the system ensure that users can timely acquire identification results, and user experience and credibility are improved. In conclusion, the invention provides an efficient and accurate solution in rumor identification, and is helpful for maintaining a healthy and real network environment.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a schematic diagram of a network rumor recognition system based on artificial intelligence according to the embodiment.
Fig. 2 is a schematic structural diagram of a text analysis module according to the present embodiment.
Fig. 3 is a schematic structural diagram of a propagation analysis module according to the present embodiment.
Fig. 4 is a schematic structural diagram of an identification module according to the present embodiment.
Detailed Description
In order to more clearly illustrate the present invention, the present invention will be further described with reference to preferred embodiments and the accompanying drawings. Like parts in the drawings are denoted by the same reference numerals. It is to be understood by persons skilled in the art that the following detailed description is illustrative and not restrictive, and that this invention is not limited to the details given herein.
It should be noted that although the terms first, second, third, etc. may be used in the embodiments of the present application, the description should not be limited to these terms. These terms are only used to distinguish one from another. For example, a first may also be referred to as a second, and similarly, a second may also be referred to as a first, without departing from the scope of embodiments of the application.
Referring to fig. 1, a schematic diagram of an artificial intelligence based network rumor recognition system according to the present embodiment is shown, which includes,
It can be understood that the method for collecting news texts is not particularly limited in this embodiment, and a person skilled in the art can freely set the method only by meeting the method for collecting news texts, wherein the news texts can be collected from news websites and social media through a crawler technology or a third party API interface.
With continued reference to fig. 1, the system further includes:
The text analysis module is connected with the data acquisition module and is used for analyzing news text characteristics according to the number of mispronounced words of preset keywords and news texts.
Specifically, the setting of the preset keywords is not specifically limited, and a person skilled in the art can freely set the preset keywords only by meeting the setting requirement of the preset keywords, wherein the preset keywords can be set as "burst, urgent, shock, explosive, real, secret, action, masking, decision inner curtain, danger, alarm, threat, fraud, health warning" and the like.
Referring to fig. 2, the text analysis module includes:
The text analysis unit is used for matching the preset keywords with the news text, if the preset keywords are successfully matched with the news text, the preset keywords are used as rumor keywords of the news text, if the preset keywords are not successfully matched with the news text, the preset keywords are not used as rumor keywords of the news text, the occurrence times of the ith rumor keywords in the news text are counted, and the occurrence times are recorded as fi;
The text analysis unit is also used for segmenting the news text according to the news text, and setting the emotion intensity value of the j-th word of the news text as Dj;
The text analysis unit is also used for counting the number of wrongly written characters of the news text and recording the number of wrongly written characters as B, and through analyzing the preset keywords and the number of wrongly written characters, the module can quickly identify possible characteristics of the rumor text and promote preliminary screening.
Specifically, in this embodiment, the number of occurrences of each rumor keyword in the news text, the method of word segmentation in the news text, and the method of obtaining the number of wrongly written words are not specifically limited, and a person skilled in the art may freely set the number of occurrences of each rumor keyword in the news text, the method of word segmentation in the news text, and the method of obtaining the number of wrongly written words in each time, which are only required to meet the set requirements, where word frequency can be counted by SnowNLP, word segmentation can be performed on the news text by using a bargain word segmentation tool, emotion assignment can be performed on each word in the news text by using a hakuh emotion dictionary, and the number of wrongly written words in the news text can be identified by using Hunspell.
With continued reference to fig. 2, the text analysis module further includes:
The feature extraction unit is connected with the text analysis unit, and calculates text features WT of the news text according to the occurrence times of the key words of each rumor in the news text, the emotion intensity value of each word of the news text and the number of wrongly written words of the news text, and sets:
;
According to the multi-dimensional characteristics such as the occurrence frequency of the keywords, the emotion intensity, the number of wrongly written words and the like, the text characteristic WT is comprehensively calculated, the judgment capacity and the accuracy of the system on the news text are improved, and the accuracy of network rumor identification is further improved, wherein w1 is the keyword weight, w2 is the emotion weight, w3 is the wrongly written word weight, w1+w2+w3=1, I is the number of rumor keywords in the news text, N is the total word number of the news text, and NZ is the total word number of the news text.
It can be understood that, in this embodiment, the method for obtaining the total word number (N) and the total word Number (NZ) of the news text is not specifically limited, and can be freely set by a person skilled in the art, so long as the total word number (N) and the total word Number (NZ) of the news text generally meet the requirement of obtaining through a string processing function in a programming language, where the total word number (N) and the total word Number (NZ) can be obtained through the string processing function in the programming language.
With continued reference to fig. 1, the system further includes:
The user analysis module is connected with the text analysis module and is used for analyzing the user behavior characteristics according to the comment times Ct, the praise times R, the sharing times L and the reply times S of the news text publisher in the first monitoring period so as to effectively evaluate the user behavior characteristics, wherein:
The user behavior feature analysis module sets the user behavior feature as HT, sets ht=α×ct/t1+β× (r+l+s)/T1, where T1 is the duration of the first monitoring period, α is the comment weight, β is the interaction weight, and α+β=1, it can be understood that in this embodiment, the setting of the comment weight and the interaction weight is not specifically limited, and a person skilled in the art can freely set the setting requirements of the comment weight and the interaction weight only need to be satisfied, where the optimal value of α is 0.4, and the optimal value of β is 0.6, and by analyzing the interaction behavior (comment, praise, share, etc.) of the news publisher in the first monitoring period, the module can effectively evaluate the user behavior feature HT, reflect the potential risk of rumor propagation, and integrate social interaction into the rumor recognition process by the user behavior analysis, thereby improving the practicability of the system.
It can be understood that, in the embodiment, the comment times Ct, the praise times R, the sharing times L and the reply times S of the news text publisher in the first monitoring period are not specifically limited, and the person skilled in the art can freely set the comment times Ct, the praise times R, the sharing times L and the reply times S of the news text publisher in the first monitoring period only need to be met, wherein the comment times Ct, the praise times R, the sharing times L and the reply times S of the news text publisher in the first monitoring period can be obtained by the API of the social media or the news website, the setting of the first monitoring period is not specifically limited, and the person skilled in the art can freely set the setting of the first monitoring period only need to be met, wherein the news text publisher can be used as the first monitoring period 7 days before the publishing.
With continued reference to fig. 1, the system further includes:
and the transmission analysis module is connected with the user analysis module and is used for analyzing the transmission characteristics of the news text according to the sharing total number and the number of the transmission people of the news text in the second monitoring period.
Referring to fig. 3, the propagation analysis module includes:
The speed analysis unit is used for analyzing the propagation speed Ve of the news text according to the sharing total number fx of the news text in the second monitoring period, timely identifying information propagation trend, setting ve=fx/T2 and T2 as the duration of the second monitoring period, and calculating the propagation speed Ve by analyzing the relationship between the sharing total number and time of the news text in the appointed monitoring period. A high propagation speed generally means that the content is interesting and shared by a large number of users in a short time, possibly indicating that the content is an emergency or information with a high degree of focus. Through the analysis, the system can timely identify the propagation trend of the information and provide basis for rumor identification.
With continued reference to fig. 3, the propagation analysis module further includes:
The construction unit is connected with the speed analysis unit and used for constructing the propagator coefficient according to the number of propagators of the news text in the second monitoring period so as to effectively distinguish the credibility of different propagators and reduce the propagation risk of false information, if cb (k) is less than or equal to q1, the propagator coefficient of the kth propagator is set as P1 by the propagation analysis unit, if q1< cb (k) is less than or equal to q2, the propagator coefficient of the kth propagator is set as P2 by the propagation analysis unit, if cb (k) is greater than q2, the propagator coefficient of the kth propagator is set as P3 by the propagation analysis unit, and the propagator coefficient can be evaluated according to the past propagation records of different propagators, so that the robustness and the effectiveness of the system are further enhanced;
It is understood that, in this embodiment, each preset number and preset coefficient are not specifically limited, and a person skilled in the art can freely set the number and preset coefficient, and only needs to meet the setting requirement of each preset number and preset coefficient, wherein, the optimal value of q1 is 3, the optimal value of q2 is 5, the optimal value of P1 is 1, the optimal value of P2 is 1.2, and the optimal value of P3 is 1.5.
Specifically, the method for acquiring the total sharing number of the news texts, the number of the propagator and the number of the propagator historical propagation rumor texts in the second monitoring period is not limited in this embodiment, and can be freely set by a person skilled in the art, and only the requirements for acquiring the total sharing number of the news texts, the number of the propagator and the number of the propagator historical propagation rumor texts in the second monitoring period are met, wherein the news texts can be acquired through social media or APIs of a news website, the method for acquiring the news texts in the second monitoring period is not limited in this embodiment, and can be freely set by a person skilled in the art, and only the requirements for setting the second monitoring period are met, wherein the second monitoring period can be set to be 5 days after the news texts are released.
With continued reference to fig. 3, the propagation analysis module further includes:
the propagation analysis unit is connected with the construction unit and is used for analyzing the propagation characteristics CT of the news text according to the propagation speed Ve of the news text, the number zc of the propagators of the news text in the second monitoring period and the construction result of the propagator coefficients, and setting:
;
The method comprises the steps of determining a transmission speed of news text, determining a transmission characteristic of the news text, determining a transmission quantity threshold value of CB, determining a transmission characteristic of the news text, determining a transmission speed of VT, determining a transmission quantity threshold value of CB, determining a transmission coefficient of a kth transmitter of the news text by integrating a transmission speed analysis and a transmission coefficient result, and further analyzing the transmission characteristic of the news text to help judge influence and potential public opinion risks of the news content, so that scientific basis is provided for further rumor identification.
It can be understood that, in this embodiment, the setting of the preset propagation speed and the preset propagation number threshold is not specifically limited, and a person skilled in the art can freely set the setting requirement of the preset propagation speed and the preset propagation number threshold only needs to be met, wherein the optimal value of VT is 200 times/day, and the optimal value of CB is 1000.
With continued reference to fig. 1, the system further includes:
the recognition module is connected with the transmission analysis module and is used for recognizing the news text according to the analysis result of the characteristics of the news text, the analysis result of the behavior characteristics of the user and the analysis result of the transmission characteristics of the news text so as to obtain the recognition result of the news text, and adjusting the recognition process of the news text according to the number of the historical transmission rumor texts of a news text publisher and the release time of the news text.
Referring to fig. 4, the identification module includes:
the recognition unit is used for recognizing the news text according to the analysis result of the characteristics of the news text, the analysis result of the behavior characteristics of the user and the analysis result of the propagation characteristics of the news text, wherein:
if γ1×wt+γ2×ht+γ3×ct is not more than u0, the recognition unit determines that the news text is a normal news text;
If γ1×wt+γ2×ht+γ3×ct > u0, the recognition unit determines that the news text is a rumor risk news text;
the method comprises the steps of comprehensively analyzing text characteristics, user behavior characteristics and propagation characteristics of a news text, so that final judgment of the news text is realized, and accuracy of network rumor identification is improved.
It can be understood that the setting of each weight and the preset anomaly coefficient is not specifically limited in this embodiment, and a person skilled in the art can freely set the setting of each weight and the preset anomaly coefficient only by meeting the setting requirement, wherein the optimal value of γ1 is 0.4, the optimal value of γ2 is 0.3, the optimal value of γ3 is 0.3, and the optimal value of u0 is 0.46.
With continued reference to fig. 4, the identification module further includes:
The source analysis unit is connected with the identification unit, constructs source anomaly coefficients of the news texts according to the number ys of the historical propagation rumor texts of the news text publishers, and adjusts the identification process of the news texts according to the source anomaly coefficients, wherein:
If ys=0, the source analysis unit sets the source abnormality factor of the news text to 0;
If ys >0, the source analysis unit sets the source anomaly coefficient of the news text as LY, and sets ly=η×lg (ys+1), wherein η is a preset adjustment ratio, and 0< η <0.4;
The source analysis unit sets the adjusted preset abnormal coefficient as u1, sets u1=u0-LY, and constructs the source abnormal coefficient by analyzing the historical propagation rumor text quantity of the publisher, so that the system can perform risk assessment on the source of the news text to improve the accuracy of network rumor identification.
Specifically, the method for obtaining the number of the history propagation rumor texts of the news text publisher is not specifically limited, and can be freely set by a person skilled in the art, and only the requirement for obtaining the number of the history propagation rumor texts of the news text publisher is met, wherein the method can be obtained through social media or APIs of a news website.
It can be understood that the setting of the preset adjustment ratio is not particularly limited in this embodiment, and a person skilled in the art can freely set the setting of the preset adjustment ratio only by meeting the setting requirement of the preset adjustment ratio, wherein the optimal value of η is 0.3.
With continued reference to fig. 4, the identifying further includes:
The consistency analysis unit is connected with the source analysis unit, analyzes consistency parameters YZ of the news text according to the release days t0 and the preset release days t1 of the news text, and sets YZ=1-1/(1+e t1-t0), and e is a natural logarithm;
The consistency analysis unit compares consistency parameters YZ of the news text with a preset consistency threshold YZ0 and adjusts the construction process of the source anomaly coefficient according to the comparison result, wherein:
if YZ is less than or equal to YZ0, the consistency analysis unit judges that the consistency parameters of the news text accord with the threshold value and does not adjust;
If YZ > YZ0, the consistency analysis unit judges that the consistency parameter of the news text does not accord with the threshold value, sets the preset adjustment proportion as eta 1, sets eta 1 = eta x {1+ exp [3 x (YZ-YZ 0)/(YZ+yz0) -3] }, evaluates the consistency parameter YZ of the news text according to the release time, and is helpful for judging whether the information is a network rumor not, thereby providing important basis for rumor identification and further improving the accuracy of network rumor identification.
Specifically, the embodiment does not specifically limit the manner of obtaining the number of days of release of the news text, and a person skilled in the art can freely set the method, which only needs to meet the requirement of obtaining the number of days of release of the news text, wherein the method can be obtained through social media or an API of a news website.
It can be understood that, in this embodiment, the setting of the preset consistency threshold and the preset number of days of release is not specifically limited, and a person skilled in the art can freely set the setting of the preset consistency threshold and the preset number of days of release only by meeting the setting requirement of the preset consistency threshold and the preset number of days of release, wherein the optimal value of t1 is 5 days, and the optimal value of yz0 is 0.5.
With continued reference to fig. 1, the system further includes:
the storage module is connected with the identification module and used for storing the identification result of the news text and outputting the identification result to a user.
Specifically, the network rumor recognition system based on artificial intelligence is applied to rumor recognition of social media and a network platform, and through comprehensively analyzing multidimensional information such as text characteristics, user behaviors, propagation characteristics, information sources and the like, accuracy and efficiency of rumor recognition are improved, and the network system is beneficial to building a healthier and safer network environment.
It should be understood that the foregoing examples of the present invention are provided merely for clearly illustrating the present invention and are not intended to limit the embodiments of the present invention, and that various other changes and modifications may be made therein by one skilled in the art without departing from the spirit and scope of the present invention as defined by the appended claims.

Claims (5)

1. An artificial intelligence based network rumor identification system comprising:
the data acquisition module is used for acquiring news texts;
The text analysis module is used for analyzing the characteristics of the news text according to the number of mispronounced words of the preset keywords and the news text;
The user analysis module is used for analyzing the behavior characteristics of the user according to the comment times, the praise times, the sharing times and the reply times of the news text publisher in the first monitoring period;
The spreading analysis module is used for analyzing the spreading speed of the news text according to the sharing total number of the news text in the second monitoring period and analyzing the spreading characteristics of the news text according to the spreading speed of the news text and the number of the spreading people of the news text in the second monitoring period;
The recognition module is used for recognizing the news text according to the analysis result of the characteristics of the news text, the analysis result of the behavior characteristics of the user and the analysis result of the propagation characteristics of the news text so as to obtain the recognition result of the news text, and is also used for adjusting the recognition process of the news text according to the number of the historical propagation rumor texts of the news text publisher and the release time of the news text;
the text analysis module comprises a text analysis unit, a storage module and a text analysis unit, wherein the text analysis unit is used for matching a preset keyword with the news text, taking the preset keyword as a rumor keyword of the news text if the preset keyword is successfully matched with the news text, counting the occurrence frequency of the ith rumor keyword in the news text, and marking as fi;
The text analysis unit is also used for segmenting the news text according to the news text, and setting the emotion intensity value of the j-th word of the news text as Dj;
The text analysis unit is also used for counting the number of wrongly written words of the news text and marking the number as B;
The text analysis module further comprises a feature extraction unit, the feature extraction unit calculates text features of the news text according to the occurrence times fi of the key words of each rumor in the news text, the emotion intensity value Dj of each word of the news text and the number B of wrongly written words of the news text to obtain features WT of the news text, and the feature extraction unit is set:
Comprehensively calculating text characteristics WT according to multidimensional characteristics such as the occurrence frequency, emotion intensity, mispronounced word number and the like of the keywords, improving the judgment capability and accuracy of a system on news texts and further improving the accuracy of network rumor recognition, wherein w1 is keyword weight, w2 is emotion weight, w3 is mispronounced word weight, w1+w2+w3=1, I is the number of rumor keywords in the news texts, N is the total word number of the news texts, and NZ is the total word number of the news texts;
the user analysis module is used for analyzing the user behavior characteristics according to the comment times Ct, the praise times R, the sharing times L and the reply times S of the news text publisher in the first monitoring period to obtain user behavior characteristics HT, wherein HT=alpha×Ct/T1 +beta× (R+L+S)/T1 is set as the duration of the first monitoring period, alpha is comment weight, beta is interaction weight, and alpha+beta=1;
The identification module further comprises a source analysis unit, the source analysis unit constructs source anomaly coefficients of the news text according to the number ys of the historical propagation rumor texts of the news text publisher, adjusts the identification process of the news text according to the source anomaly coefficients, and sets the adjusted preset anomaly coefficients as u1;
The identification module further comprises a consistency analysis unit, wherein the consistency analysis unit analyzes consistency parameters YZ of the news text according to the release days t0 and the preset release days t1 of the news text, compares the consistency parameters YZ of the news text with a preset consistency threshold YZ0, and sets a preset adjustment ratio as eta 1, eta 1=eta x {1+exp [3 x (YZ-YZ 0)/(YZ+yz0) -3] } if the consistency parameters of the news text do not accord with the threshold.
2. The artificial intelligence based network rumor recognition system of claim 1, wherein the propagation analysis module includes a speed analysis unit for analyzing the propagation speed of the news text according to the total sharing number fx of the news text in the second monitoring period to obtain the propagation speed Ve of the news text.
3. The artificial intelligence based network rumor recognition system of claim 2, wherein the propagation analysis module includes a construction unit to construct the propagator coefficients according to the number of propagators of the news text in the second monitoring period, and the construction result of the propagator coefficients includes P1, P2 and P3.
4. The artificial intelligence based network rumor recognition system according to claim 3, wherein the propagation analysis module further comprises a propagation analysis unit for analyzing the propagation characteristics of the news text according to the propagation speed Ve of the news text and the number zc of the propagators of the news text and the construction result of the propagators coefficient in the second monitoring period to obtain the propagation characteristics CT of the news text, and setting:
;
Wherein VT is a preset propagation speed, CB is a preset propagation number threshold, and P (k) is a propagator coefficient of a kth propagator of the news text.
5. The artificial intelligence based network rumor recognition system of claim 4, wherein the recognition module comprises a recognition unit for recognizing the news text based on the analysis result of the characteristics of the news text, the analysis result of the behavior characteristics of the user, and the analysis result of the propagation characteristics of the news text, wherein:
if γ1×wt+γ2×ht+γ3×ct is not more than u0, the recognition unit determines that the news text is a normal news text;
If γ1×wt+γ2×ht+γ3×ct > u0, the recognition unit determines that the news text is a rumor risk news text;
Wherein γ1 is text weight, γ2 is behavior weight, γ3 is propagation weight, and u0 is preset anomaly coefficient.
CN202411898369.9A 2024-12-23 2024-12-23 Network rumor recognition system based on artificial intelligence Active CN119336966B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202411898369.9A CN119336966B (en) 2024-12-23 2024-12-23 Network rumor recognition system based on artificial intelligence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202411898369.9A CN119336966B (en) 2024-12-23 2024-12-23 Network rumor recognition system based on artificial intelligence

Publications (2)

Publication Number Publication Date
CN119336966A CN119336966A (en) 2025-01-21
CN119336966B true CN119336966B (en) 2025-07-11

Family

ID=94277131

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202411898369.9A Active CN119336966B (en) 2024-12-23 2024-12-23 Network rumor recognition system based on artificial intelligence

Country Status (1)

Country Link
CN (1) CN119336966B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106598944A (en) * 2016-11-25 2017-04-26 中国民航大学 Civil aviation security public opinion emotion analysis method
CN107797998A (en) * 2016-08-29 2018-03-13 腾讯科技(深圳)有限公司 The recognition methods of user-generated content containing rumour and device
CN108038240A (en) * 2017-12-26 2018-05-15 武汉大学 Based on content, the social networks rumour detection method of user's multiplicity

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7739601B1 (en) * 2002-01-23 2010-06-15 Microsoft Corporation Media authoring and presentation
CN108153884B (en) * 2017-12-26 2020-11-10 厦门大学 Analysis method for propagation of microblog rumors
CN112329444B (en) * 2020-11-10 2022-09-02 南开大学 Early rumor detection method fusing text and propagation structure
CN115661696B (en) * 2022-10-27 2025-04-15 重庆邮电大学 A rumor detection method based on topic visualization

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107797998A (en) * 2016-08-29 2018-03-13 腾讯科技(深圳)有限公司 The recognition methods of user-generated content containing rumour and device
CN106598944A (en) * 2016-11-25 2017-04-26 中国民航大学 Civil aviation security public opinion emotion analysis method
CN108038240A (en) * 2017-12-26 2018-05-15 武汉大学 Based on content, the social networks rumour detection method of user's multiplicity

Also Published As

Publication number Publication date
CN119336966A (en) 2025-01-21

Similar Documents

Publication Publication Date Title
Boukabous et al. Crime prediction using a hybrid sentiment analysis approach based on the bidirectional encoder representations from transformers
US20100174813A1 (en) Method and apparatus for the monitoring of relationships between two parties
CN109918505B (en) Network security event visualization method based on text processing
CN104216954A (en) Prediction device and prediction method for state of emergency topic
Uppal et al. Fake news detection using discourse segment structure analysis
CN109299277A (en) Public opinion analysis method, server and computer-readable storage medium
CN105956740B (en) Semantic risk calculation method based on text logical features
CN113946682B (en) Sensitive text detection method and system based on adaptive graph neural network
CN112559756A (en) Construction method and application method of seismic event knowledge graph
Kim et al. SMS spam filterinig using keyword frequency ratio
CN111858924A (en) A system with network public opinion monitoring and analysis functions
Singh et al. Fighting for Information Credibility: AnEnd-to-End Framework to Identify FakeNews during Natural Disasters.
CN120745646B (en) Method and system for auditing network chat sensitive words based on multidimensional identification
CN113015171A (en) System with network public opinion monitoring and analyzing functions
Girsang Sentiment analysis of covid-19 public activity restriction (ppkm) impact using bert method
CN112528024A (en) Microblog emergency detection method based on multi-feature fusion
CN119336966B (en) Network rumor recognition system based on artificial intelligence
CN119476306B (en) Digital modeling response method, system, equipment and medium based on artificial intelligence
CN118798335A (en) Knowledge graph construction method, device, equipment and readable storage medium
CN116561298A (en) Title generation method, device, equipment and storage medium based on artificial intelligence
Lima et al. Statistical learning from Brazilian fake news
Li et al. An event ontology construction approach to web crime mining
CN111291376B (en) Web vulnerability verification method based on crowdsourcing and machine learning
CN119179808B (en) Information file analysis response management system based on big data
Gascó et al. Evaluating noise perception through online social networks: A text mining approach to designing a noise-event alarm system based on social media content

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant