Research Paper On Sentilytics
Research Paper On Sentilytics
INTRODUCTION
Social media platforms' explosive expansion has
produced an enormous amount of user-generated
material, including comments, reviews, and debates.
These remarks offer insightful analysis of audience
perspective, attitudes, and expectations. Content
providers, companies, and organizations all depend on
an awareness of user sentiment if they are to properly
handle user problems, enhance engagement, and
optimize content strategies. Manual examination of
hundreds of comments is not feasible, though, and
most current sentiment analysis systems lack real-time
processing and useful insights [1]. This work presents Figure 1: Overview of the Sentilytics Analysis Workflow
Sentilytics, a Chrome add-on automating comment
scraping, sentiment categorization, visualization, and
AI-driven insights, to close this gap.
LITERATURE REVIEW
Deep learning and NLP have advanced sentiment
analysis and allowed insights from social media
The Sentilytics add-on guarantees flawless data comments. Existing studies, books, papers, or
processing and insight development by means of a documentation on sentiment classification, data
disciplined workflow. As shown in Figure 1, the collecting, visualization, and AI-driven insights are
procedure starts with structured CSV file storage of compiled in this part. Finding important developments
scraping comments made using the YouTube API [1]. and gaps, we show how Sentilytics combines
DistilBERT then sorts the gathered data such that sentiment analysis, real-time scraping, and actionable
every comment is assigned a positive, negative, or insights into one platform.
neutral [3] classification. The system creates visual
representations such as histograms, word clouds, and
pie charts following categorization, therefore
offering an easy
Users of social media now mostly express their ideas This section outlines the methodology of Sentilytics,
on this platform; hence sentiment analysis is a very covering the system architecture, real-time comment
important instrument for public opinion understanding. scraping, sentiment analysis, AI-driven insights,
B. Liu (2022) in Sentiment Analysis: Mining and report generation. The workflow ensures
Opinions, Sentiments, and Emotions [1] emphasizes seamless extraction, processing, and visualization of
the need for obtaining sentiments from user-generated user comments for sentiment-based decision-making.
information. While recent developments in deep
learning, especially transformer-based models, have 3.1. System Architecture
greatly increased accuracy and contextual
understanding, traditional sentiment analysis Sentilytics uses a disciplined approach for sentiment
techniques depended on lexicon- based methodologies analysis, as Figure 1 illustrates. Extensive comments
and machine learning models. extracted via the Chrome Extension Interface from
platform APIs (YouTube, Instagram, X) are stored as a
Deep learning models—especially transformer-based
CSV file. Eliminating stopwords, links, and special
architectures—have shown recent success in sentiment
characters, the preprocessing module cleans the text.
classification applications. By means of a comparative
investigation of BERT, DistilBERT, and RoBERTa, While Matplotlib and Seaborn create sentiment
Joshy and Sundar (2024) [4] discovered that visualizations like count plots, histograms, word
DistilBERT achieves equivalent accuracy with low clouds, and pie charts, DistilBERT labels comments as
computing cost. Jethani (2024) also investigated the either "positive," "negative," or "neutral." Extensive
use of DistilBERT for emotional recognition, stressing themes are extracted from the AI Insights Module
its effectiveness in handling vast textual data volumes. (Llama3) together with practical advice. Results are
shown at last via a Flask web interface for user
Sentiment analysis's application in social media has
involvement.
been investigated in several sectors of business use.
Sprout Social (2024) [6] explores how companies use
sentiment data to improve client involvement.
Likewise, Venngage [7] offers methods for visualizing
sentiment data, stressing the need of graphical
representation in obtaining practical understanding
from big data.
Research already in publication emphasizes the value 3.3. Sentiment Analysis Using
of deep learning in sentiment analysis, the success of
automated comment scraping, and the part data DistilBERT
visualization plays in enabling insights. For content
Sentilytics uses a lightweight transformer model
creators, there is a gap in combining these elements
tailored for sentiment classification, DistilBERT, once
into a single, user-friendly platform, though.
the comments are gathered. Preprocessing starts the
Combining real-time scraping, sentiment
process; NLTK and regex help to remove special
categorization, and AI- driven insights into a complete
characters, links, and stopwords from the text, thus
sentiment analysis platform, Sentilytics seeks to close
preparing it for analysis.
this gap.
DistilBERT sorts the comments into three sentiment
3. Methodology categories—positive, negative, and neutral—after
preprocessing. Various visualizations like count plots
and histograms to indicate the distribution of sentiment Countplot & Histogram – Shows positive,
categories, pie charts to show the proportion of negative, and neutral comment distribution.
positive, negative, and neutral comments, and word
clouds stressing often used phrases in the comments
are produced to communicate the data.
3.7. Conclusion
Sentilytics generates insights using a disciplined
approach for real-time sentiment analysis. It starts with Figure 4: Sample Word Cloud Graph
comment extraction using platform APIs and then uses
DistilBERT to preprocess text and classify sentiment. Word Cloud – It highlights often used
Llama3 (12b) helps to clarify insights by pointing out words while stressing shared themes in user
recurrent themes and recommendations for comments.
development. An interactive Flask interface presents the
The sentiment distribution graphs (Figures 3 and 4)
visual results. As shown in Figure 2: Sentilytics
expose general audience reception. While a lot of
System Architecture, this flawless procedure helps
negative comments point to certain problems that
content authors to make data-driven decisions
require attention, a bigger ratio of positive remarks
depending on sentiment patterns and practical insights.
implies great involvement. The word cloud (Figure 5)
enables content authors to discover audience
4. Results and Discussion expectations and criticism, therefore helping to
The examined sentiment data is presented in this part pinpoint often discussed subjects.
together with discussions of sentiment distribution
patterns, interpretation of important visualizations, and 4.2. Case Studies and Use Cases
comparison of Sentilytics' performance with other
To demonstrate Sentilytics real-world impact, we
sentiment analysis techniques.
present example use cases:
4.1. Visualization and Sentiment Case Study 1: Creator of YouTube content
Distribution finds consistent unfavourable comments on
audio quality. They increase production
Multiple graphical approaches help to clearly show settings by means of AI-driven insights,
the processed sentiment data, hence guiding user thereby increasing engagement.
thoughts. Case Study 2: Brand Monitoring on
Instagram — A brand uses Sentilytics to
react proactively, therefore enhancing
consumer
confidence when it finds customer discontent sentiment classification, artificial intelligence-driven
with delivery times. insights utilizing Llama3, and visual analytics. The
results show its efficiency in spotting sentiment trends,
4.3. Why DistilBERT? displaying comments, and creating actionable
recommendations, thereby supporting data-driven
Driven by DistilBERT, Sentilytics is compared with
decision-making.
conventional sentiment analysis models:
Approaches Accuracy
Processing Multilingual Sarcasm
Sentlytics has difficulties including API rate constraints,
Speed Support Handling language complexity in sentiment classification, and
DistilBERT High Fast Yes Good possible AI biases in insights generating, even with its
VADER Moderate
Fast Limited Poor
success [2][6].
Traditional ML Moderate
Slow Limited Weak
(SVM,
etc.) Improvements ahead include