Final Year of Computer Engineering 2022-23 Semester VII Project Synopsis

This project synopsis is for a final year computer engineering project on detecting fake websites and news. The project aims to classify websites and news articles as real or fake using concepts from artificial intelligence, natural language processing, and machine learning. The team plans to develop a model that can analyze the content, headlines, and publishing source of news to determine authenticity. They will collect a dataset, extract features, train classifiers like naive Bayes and SVM, and build a web interface to display the results. The project aims to address the spread of misinformation online and help users verify the truthfulness of news.

Uploaded by

Aniket Jawale

Computer Engineering Department

Government College of Engineering & Research Avasari Khurd


Savitribai Phule Pune University

Final Year of Computer Engineering


2022-23 Semester VII
Project Synopsis

• Project Group ID: 23

• Title of the Project: Fake Website and News Detection

• Team Members:

Sr.  Roll No   Name                        Email                               Mobile      Sign
1    20223069  Pooja Madan Kadam           Pooja.kadam122020@gcoeara.ac.in     9156397169
2    20223071  Rutuja Vitthal Nighot       Rutuja.nighot122020@gcoeara.ac.in   917295184
3    19121008  Aniket Dnyaneshwar Jawale   aniket.jawale122019@gcoeara.ac.in   8999318385

Date:          Name & Signature          Name & Signature
               Guide                     Project Coordinator

Abstract

In the modern era, where the internet is ubiquitous, people rely on a variety of
online resources for news. With the growing use of social media platforms such as
Facebook and Twitter, news spreads rapidly among millions of users within a very
short span of time. The spread of fake websites and fake news has far-reaching
consequences, ranging from the creation of biased opinions to the swaying of
election outcomes for the benefit of certain candidates. Moreover, spammers use
appealing news headlines as clickbait to generate advertising revenue. In this
project we aim to perform binary classification of news articles and websites
available online using concepts from Artificial Intelligence, Natural Language
Processing and Machine Learning. We aim to give the user the ability to classify
a website or news item as fake or real and to check the authenticity of the
website publishing the news.

• Keywords: Internet, Social Media, Fake News, Fake Website Classification,
Artificial Intelligence, Machine Learning, Websites, Authenticity.

Background and Motivation

There is a vital need to deal with fake data spread across online platforms,
since it causes problems for users in the form of rumors, identity theft, loss
of authenticity and confidentiality, fake profiles, and so on. The dissemination
of false information through social media undermines trust in the news
ecosystem, harms the reputations of individuals and organizations, and causes
fear among the public at large, all of which have the potential to undermine
societal stability. Fake news is very difficult to spot because its wording is
comparable to that of real news and because it is produced with the goal of
winning the public's confidence. As a result, fake news identification is
required.

Problem Definition and Objectives

Fake news is deliberately written misleading material meant to deceive the public.
Authenticity and intent are the two most important aspects of this concept: first,
fake news contains incorrect material that can be verified as false; second, it is
produced with the dishonest goal of misleading readers. The distribution of false
material through social media may have important implications, such as weakening
public faith in the news ecosystem, hurting an individual’s or organization’s
reputation, or causing fear among the general public, all of which can affect
society’s stability. The data may be represented as a collection of tuples
consisting of the headline and text of each of a certain number of news articles.
The fake news identification problem is then to determine whether or not a given
piece of information is fake.

Fake news differs from real news in the methods used to manipulate information.
News material may use deceptive tactics such as fabricating facts to make the
reader believe something they would not otherwise believe. Material may also be
presented as coming from reputable sources when it does not. Additionally, fake
news may use altered material, such as headlines and pictures that do not match
the information delivered, or may place real components and information in a
misleading context.

Literature Survey

Mykhailo Granik et al., in their paper, show a simple approach to fake news
detection using a naive Bayes classifier. The approach was implemented as a
software system and tested against a data set of Facebook news posts. The posts
were collected from three large Facebook pages each from the right and from the
left, as well as three large mainstream political news pages (Politico, CNN, ABC
News). They achieved a classification accuracy of approximately 74%.
Classification accuracy for fake news is slightly worse, which may be caused by
the skewness of the dataset: only 4.9% of it is fake news.
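
The effect of a skewed dataset on accuracy can be illustrated with a small calculation. The sketch below is only an illustration: the 1000 posts and the "always real" baseline are made-up values chosen to match the 4.9% proportion, not data from the paper, and `per_class_report` is a hypothetical helper name.

```python
def per_class_report(y_true, y_pred, positive="fake"):
    """Return accuracy plus precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)
    return {
        "accuracy": correct / len(pairs),
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Illustrative data only: 1000 posts, 49 of them fake (4.9%).
y_true = ["fake"] * 49 + ["real"] * 951
# A trivial baseline that labels every post as real.
always_real = ["real"] * 1000

report = per_class_report(y_true, always_real)
print(report)
```

The baseline never detects a single fake post, yet its overall accuracy is high because almost all posts are real. This is why per-class figures such as recall on the fake class are worth reporting alongside accuracy.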
Himank Gupta et al. gave a framework based on different machine learning
approaches that deals with various problems, including accuracy shortage, time
lag (BotMaker) and the high processing time needed to handle thousands of tweets
in 1 second. First, they collected 400,000 tweets from the HSpam14 dataset, which
they further characterized as 150,000 spam tweets and 250,000 non-spam tweets.
They also derived some lightweight features along with the Top-30 words that
provide the highest information gain from a Bag-of-Words model. They were able
to achieve an accuracy of 91.65% and surpassed the existing solution by
approximately 18%.
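
To make the idea of ranking words by information gain concrete, the following is a minimal sketch in plain Python. It is not the implementation used by Gupta et al.: the function names, the whitespace tokenization and the four toy messages are assumptions made for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())


def information_gain(docs, labels, word):
    """Entropy reduction from knowing whether `word` occurs in a document."""
    with_word = [lab for doc, lab in zip(docs, labels) if word in doc]
    without_word = [lab for doc, lab in zip(docs, labels) if word not in doc]
    n = len(labels)
    conditional = (len(with_word) / n) * entropy(with_word) \
        + (len(without_word) / n) * entropy(without_word)
    return entropy(labels) - conditional


def top_k_words(texts, labels, k=30):
    """Return the k words with the highest information gain (ties: alphabetical)."""
    docs = [set(t.lower().split()) for t in texts]  # bag-of-words as presence sets
    vocabulary = set().union(*docs)
    ranked = sorted(vocabulary, key=lambda w: (-information_gain(docs, labels, w), w))
    return ranked[:k]


# Toy messages, invented for this example.
texts = ["win free prize now", "free prize click", "meeting at noon", "lunch at noon today"]
labels = ["spam", "spam", "ham", "ham"]
top = top_k_words(texts, labels, k=3)
print(top)
```

Words that occur in only one class separate the classes perfectly and therefore rank first; words spread across both classes carry little information and fall to the bottom of the list.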
Marco L. Della Vedova et al. first proposed a novel ML fake news detection
method which, by combining news content and social context features,
outperforms existing methods in the literature, increasing accuracy up to
78.8%. Second, they implemented their method within a Facebook Messenger
chatbot and validated it with a real-world application, obtaining a fake news
detection accuracy of 81.7%.

Methodology

The system will be developed in three parts. The first part is static and works
on a machine learning classifier: we will study and train the model with 4
different classifiers and choose the best classifier for final execution. The
second part is dynamic: it takes a keyword or text from the user and searches
online for the truth probability of the news. The third part checks the
authenticity of the URL input by the user.
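
Choosing the best of several trained classifiers can be sketched as follows. This is a minimal illustration under stated assumptions: the two candidates are hypothetical stand-ins for trained models (any callable that maps text to "real" or "fake"), the validation samples are invented, and accuracy is assumed as the selection criterion, which the synopsis does not specify.

```python
def accuracy(classifier, samples):
    """Fraction of (text, label) samples that the classifier labels correctly."""
    correct = sum(1 for text, label in samples if classifier(text) == label)
    return correct / len(samples)


def select_best(candidates, validation_set):
    """Return (name, score) of the candidate with the highest validation accuracy."""
    scores = {name: accuracy(clf, validation_set) for name, clf in candidates.items()}
    best_name = max(scores, key=scores.get)
    return best_name, scores[best_name]


# Hypothetical stand-ins for trained models.
candidates = {
    "always_real": lambda text: "real",
    "keyword_rule": lambda text: "fake" if "shocking" in text.lower() else "real",
}

# Invented held-out samples, not part of the project's dataset.
validation_set = [
    ("Shocking cure doctors hate", "fake"),
    ("Council approves road budget", "real"),
    ("SHOCKING secret revealed", "fake"),
]

best_name, best_score = select_best(candidates, validation_set)
print(best_name, best_score)
```

Scoring on held-out samples rather than on the training data keeps the comparison fair, since a classifier that merely memorizes its training set would otherwise look best.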
In this project, we are going to use Python and its scikit-learn library. Python
has a huge set of libraries and extensions that can easily be used in machine
learning. The scikit-learn library provides nearly all types of machine learning
algorithms ready for use in Python, so easy and quick evaluation of ML
algorithms is possible. We are also going to use Flask for the web-based
deployment of the model, with the client side implemented using HTML, CSS and
JavaScript.

Figure: System Architecture
(Blocks shown: Dataset, Pre-processing, Data split (train/test), Feature
extraction, Training the classifier, Classification model, User input, Truth
probability, True / False.)
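
The blocks of the architecture can be sketched end to end in plain Python. This is a minimal sketch under stated assumptions, not the project's implementation: the naive Bayes classifier is written from scratch as a stand-in for a library classifier, the eight headlines are invented toy data, and the 0.5 cut-off for the true/false output is an assumption.

```python
import math
import random
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Pre-processing: lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def train_test_split(samples, test_ratio=0.25, seed=0):
    """Data split: shuffle a copy of the samples and cut it into train and test."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * (1 - test_ratio))
    return shuffled[:cut], shuffled[cut:]


class NaiveBayes:
    """Multinomial naive Bayes over word counts with add-one smoothing."""

    def fit(self, samples):
        # Feature extraction: per-class bag-of-words counts.
        self.class_counts = Counter(label for _, label in samples)
        self.word_counts = defaultdict(Counter)
        for text, label in samples:
            self.word_counts[label].update(tokenize(text))
        self.vocabulary = {w for counts in self.word_counts.values() for w in counts}
        return self

    def truth_probability(self, text):
        """Posterior probability that the text belongs to the 'real' class."""
        total_docs = sum(self.class_counts.values())
        words = [w for w in tokenize(text) if w in self.vocabulary]
        log_scores = {}
        for label, doc_count in self.class_counts.items():
            denominator = sum(self.word_counts[label].values()) + len(self.vocabulary)
            score = math.log(doc_count / total_docs)
            for word in words:
                score += math.log((self.word_counts[label][word] + 1) / denominator)
            log_scores[label] = score
        # Normalise in log space to avoid underflow on long texts.
        peak = max(log_scores.values())
        weights = {label: math.exp(s - peak) for label, s in log_scores.items()}
        return weights.get("real", 0.0) / sum(weights.values())

    def predict(self, text):
        """True if the text is classified as real, False if fake."""
        return self.truth_probability(text) >= 0.5


# Invented toy dataset of (headline, label) pairs.
samples = [
    ("Miracle pill cures all diseases overnight", "fake"),
    ("Aliens secretly control world banks", "fake"),
    ("Shocking miracle diet melts fat overnight", "fake"),
    ("Celebrity secretly replaced by clone", "fake"),
    ("City council approves new road budget", "real"),
    ("University publishes annual admission report", "real"),
    ("Government announces new budget for schools", "real"),
    ("Local council opens new public library", "real"),
]

train, test = train_test_split(samples)
model = NaiveBayes().fit(train)
for headline, label in test:
    print(headline, label, round(model.truth_probability(headline), 3), model.predict(headline))
```

Words that never occurred in training are skipped at prediction time, so a text made up entirely of unseen words falls back to the class proportions of the training data.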

Figure: Flow Chart - Proposed model
(Steps shown: Retrieve news from news website, Process news, Store data in
database, Get news data set, Extract features, Training data, Use naive Bayes
and SVM, User input for checking the news, Display result on webpage.)
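
The "store data in database" and "get news data set" steps could look like the sketch below. The synopsis does not name a database, so SQLite (from Python's standard library) is an assumption, as are the table layout, the function names and the example URL.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open (or create) the news store; the schema here is an assumed layout."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS news ("
        " id INTEGER PRIMARY KEY,"
        " source_url TEXT NOT NULL,"
        " headline TEXT NOT NULL,"
        " body TEXT NOT NULL,"
        " UNIQUE(source_url, headline))"
    )
    return conn


def store_article(conn, source_url, headline, body):
    """Insert an article, skipping exact duplicates; return True if a row was added."""
    cursor = conn.execute(
        "INSERT OR IGNORE INTO news (source_url, headline, body) VALUES (?, ?, ?)",
        (source_url, headline.strip(), body.strip()),
    )
    conn.commit()
    return cursor.rowcount == 1


def load_dataset(conn):
    """Return all stored articles as (headline, body) tuples in insertion order."""
    return conn.execute("SELECT headline, body FROM news ORDER BY id").fetchall()


conn = open_store()
added_first = store_article(conn, "https://example.com/a", " Council opens library ", "Body text")
added_again = store_article(conn, "https://example.com/a", "Council opens library", "Body text")
rows = load_dataset(conn)
print(added_first, added_again, rows)
```

The uniqueness constraint keeps a repeatedly retrieved article from entering the dataset twice, which would otherwise give it extra weight during training.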

Functionalities:
1. To check whether the given web URL is fake or not. It will give output in
terms of true or false.
2. To check whether the given news is real or fake. It will give output in
terms of true or false.
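
The synopsis does not state how a URL will be judged, so the following is only a sketch of one possible approach: a few commonly cited structural warning signs, combined with a threshold. The specific signs, their names and the threshold of 2 are all assumptions made for illustration, and such rules alone cannot establish that a site is authentic.

```python
import ipaddress
from urllib.parse import urlsplit


def url_warning_signs(url):
    """Return a list of structural warning signs found in the URL (assumed rules)."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    signs = []
    if parts.scheme != "https":
        signs.append("not_https")
    try:
        ipaddress.ip_address(host)  # raises ValueError unless host is an IP literal
        signs.append("ip_address_host")
    except ValueError:
        pass
    if "@" in parts.netloc:
        signs.append("userinfo_in_url")
    if host.count(".") >= 4:
        signs.append("many_subdomains")
    if host.startswith("xn--") or ".xn--" in host:
        signs.append("punycode_host")
    return signs


def looks_fake(url, threshold=2):
    """True/false output: flag the URL when enough warning signs are present."""
    return len(url_warning_signs(url)) >= threshold


for candidate in ("https://example.com/news", "http://192.168.0.1/login"):
    print(candidate, url_warning_signs(candidate), looks_fake(candidate))
```

In a full system these signs would more likely serve as input features to a trained classifier than as a fixed rule, so that their relative weights are learned from labeled URLs.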

Software and Hardware Requirements

Software: Jupyter Notebook.

Framework: Flask.

Language: Python for backend and HTML, CSS, JS for front end.

Hardware:

1. Processor: i5
2. Hard Disk: 1 TB
3. Memory: 8 GB RAM
