
e-ISSN:2582-7219

INTERNATIONAL JOURNAL OF
MULTIDISCIPLINARY RESEARCH
IN SCIENCE, ENGINEERING AND TECHNOLOGY

Volume 7, Issue 10, October 2024

Impact Factor: 7.521

Phone: 6381 907 438 | ijmrset@gmail.com | www.ijmrset.com

Transcript Summarizer for YouTube

Harini S, Harshitha Reddy B, Akshaya R, Bhavya G
Department of Computer Science and Business Systems, R.M.D. Engineering College (An Autonomous Institution),
R.S.M Nagar, Kavaraipettai, India

ABSTRACT: Online video content has become an integral part of daily life, and the need for efficient, time-saving
tools for consuming it has grown accordingly. This project presents the development of a YouTube transcript
summarizer website, a tool designed to enhance the accessibility and usability of video transcripts. The objective is
to create a user-friendly web application that automatically extracts YouTube video transcripts and generates concise,
informative summaries. The tool is intended for a wide range of users, including content consumers seeking quick
insights, educators who need to review content efficiently, and businesses looking to streamline content management.

I. INTRODUCTION

Computer science students learn daily from videos, articles, and documentation, and much of that learning happens on
YouTube, which is also a source of entertainment. Summarizing the content of YouTube videos can save a great deal of
time. This project builds a Chrome Extension that sends a request to a back-end REST API, which performs NLP on the
video's transcript and responds with a summarized version. YouTube videos are usually summarized only through manual
descriptions and thumbnails. YouTube is the second most visited website worldwide. Its videos include short films,
music videos, feature films, documentaries, audio recordings, corporate-sponsored movie trailers, live streams, vlogs,
and other content from popular YouTubers. YouTube users watch more than one billion hours of video every day.
This project proposes using a transformer package to summarize video transcripts, thereby providing a meaningful and
relevant summary of the video. T5 is an encoder-decoder model pre-trained on a mixture of unsupervised and supervised
tasks, each of which is converted into a text-to-text format. Because our main concern is summarization, we use a
pre-trained summarization model.

Keywords: Text Summarizer, Chrome Extension, HuggingFace transformers, WebAPI.

II. USE CASE SCENARIO

A. Application of Project: Discuss real-world applications of the YouTube transcript summarizer, such as aiding content
consumers, researchers, or educators. Explain how this tool can save time and enhance the learning experience.
B. Existing System: Provide an overview of any existing tools or services related to video transcript summarization.
Mention their strengths and weaknesses.
C. Proposed System: Describe in detail your YouTube transcript summarizer website, including the user interface,
features, and how it will extract and summarize video transcripts.

III. SOFTWARE SPECIFICATION

The back end uses the Flask framework to receive API calls from the client and respond with the summarized text. This
API works only on YouTube videos that have well-formatted closed captions. The same back end also hosts a web version
of the summarizer, which makes those API calls in a simple way and shows the output within the webpage.

• `/` (Root Endpoint): It displays a general-purpose introductory webpage and provides links to the web
summarizer and the API information. You can visit this endpoint [here](https://ytsum.herokuapp.com/).

IJMRSET © 2024 | An ISO 9001:2008 Certified Journal | 15031

• `/web/` (Web Summarizer Endpoint): It displays the web version of the summarizer tool. The webpage has input
elements and a summarize button. After the user clicks summarize, the API is called and the response is displayed to
the user. You can visit this endpoint [here](https://ytsum.herokuapp.com/web/).
• `/api/` (API Description Endpoint): The webpage at this endpoint describes basic API information for anyone who
would like to use it. Feel free to learn and use our API in your projects.
• `/summarize/` (API Endpoint): This endpoint is for **API purposes only**. For that reason, the response to a
**`GET Request`** at this endpoint is in JSON format.

A. BACK END
APIs have changed the way applications are built, and they are used in a wide variety of applications. To set up our
API, we begin by creating a back-end application directory with an app.py file. This file is initialized with a basic
Flask RESTful boilerplate. We then create a virtual environment to isolate the location where all the dependencies will
reside. Once the virtual environment is activated, we use pip to install the necessary dependencies, including Flask,
youtube-transcript-api, and transformers.

B. GET TRANSCRIPTS
In this module, we use a Python API to obtain the transcripts or subtitles of a specified YouTube video. The API works
with automatically generated subtitles, supports translating subtitles, and, unlike Selenium-based solutions, does not
require a headless browser. In app.py, we define a function that takes the YouTube video ID as its input parameter and
returns the parsed full transcript as its output. The transcript arrives in JSON format with text, start, and duration
attributes, so we extract only the text data from the response and return the transcript as a single string.
This process gives us the complete transcript of the video.
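
The text-extraction step can be sketched as follows. This is a minimal illustration, assuming the transcript has already been fetched and arrives as a list of segments with `text`, `start`, and `duration` fields as described above. The function name `join_transcript` and the sample data are hypothetical and are not taken from the project's code; the fetch call itself is omitted because its exact form depends on the installed version of the transcript library.

```python
from typing import Dict, List, Union

# One caption segment: text plus timing information.
Segment = Dict[str, Union[str, float]]


def join_transcript(segments: List[Segment]) -> str:
    """Concatenate the `text` field of every segment into one string."""
    # Caption lines may contain embedded newlines; collapse them to spaces.
    parts = [" ".join(str(seg["text"]).split()) for seg in segments]
    # Drop segments that are empty after whitespace cleanup.
    return " ".join(p for p in parts if p)


# Hypothetical sample in the shape described above.
sample = [
    {"text": "Hello and welcome", "start": 0.0, "duration": 1.5},
    {"text": "to this\nvideo.", "start": 1.5, "duration": 2.0},
]
print(join_transcript(sample))
```

Only the `text` field is used; the timing fields are ignored because the summarizer needs plain text.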

IV. PROJECT DESCRIPTION

The project follows the flowchart shown in Figure 1. First, the user opens a YouTube video and clicks the
"summarize" button in the Chrome extension. This initiates an HTTP request to the back end of the system.
The back end then requests the transcript using the YouTube video ID obtained from the URL. The
response to this request is a transcript of the video in JSON format. Once the transcript has been converted to text
format, the system performs transcript summarization, which reduces the length of the transcript while
retaining the most important information. Finally, the summarized transcript is displayed in the extension.

A. PERFORM TEXT SUMMARIZATION

Text summarization is the task of condensing a longer text into a shorter summary while preserving the key
information and meaning of the original. There are two main approaches: extractive summarization and abstractive
summarization. Extractive summarization identifies important sentences and phrases in the original text and outputs
only those parts, while abstractive summarization generates new text that is shorter than the original, often using
encoder-decoder models such as BART or T5. For this project, we use the HuggingFace transformers library in Python to
perform abstractive text summarization on the transcript obtained in the previous step. In app.py, we create a function
that accepts the YouTube transcript as input and returns the summarized transcript as output. To perform the
summarization, a tokenizer and a model are instantiated from the checkpoint name. The T5-specific prefix "summarize:"
is added to the transcript that needs to be summarized. The PreTrainedModel.generate() method is then used to generate
the summary.
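
A minimal sketch of this step is given below, under stated assumptions: the checkpoint name `t5-small`, the 512-token input limit, and the generation settings are illustrative choices, not values reported in this paper, and the helper names `build_t5_input` and `summarize` are hypothetical. The model is loaded on every call for brevity; a real service would likely load it once at startup.

```python
T5_PREFIX = "summarize: "


def build_t5_input(transcript: str) -> str:
    """Prepend the T5 task prefix to a whitespace-normalized transcript."""
    return T5_PREFIX + " ".join(transcript.split())


def summarize(transcript: str, checkpoint: str = "t5-small") -> str:
    """Generate an abstractive summary with a pre-trained T5 checkpoint."""
    # Imported here so build_t5_input can be used without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

    # Assumption: truncate to 512 tokens, so the tail of a long transcript is dropped.
    inputs = tokenizer(
        build_t5_input(transcript),
        return_tensors="pt",
        max_length=512,
        truncation=True,
    )
    # Assumed generation settings; adjust for the desired summary length.
    output_ids = model.generate(
        inputs["input_ids"],
        attention_mask=inputs["attention_mask"],
        max_length=150,
        min_length=40,
        num_beams=4,
        early_stopping=True,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize(transcript_text)` downloads the checkpoint on first use and returns the decoded summary string. Because of the truncation, long transcripts would need to be split into chunks and summarized piecewise, which this sketch does not attempt.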

B. REST API ENDPOINT

The next step is to define the resources used in the implementation of this back-end service. Because this is a
straightforward application with a single endpoint, the only resource we need to define is the summarized text. In
app.py, we create a Flask API route with the GET HTTP request method and a URI of
http://[hostname]/api/summarize?youtube_url=#{url}. We then extract the YouTube video ID from the YouTube URL
obtained from the query parameters. After that, we generate the summarized transcript by executing the transcript
generation function followed by the transcript summarizer function. Finally, we return the summarized transcript with
an HTTP status of OK and handle HTTP exceptions as required.
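
The endpoint described above might look roughly like the following sketch. Only the route and the `youtube_url` query parameter come from the text; the JSON field names (`summary`, `error`), the error status codes, and the helper names `extract_video_id` and `create_app` are assumptions made for illustration. The two callables passed to `create_app` stand for the transcript and summarizer functions described in this paper.

```python
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Hosts treated as YouTube watch pages in this sketch (an assumed list).
WATCH_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com"}


def extract_video_id(youtube_url: str) -> Optional[str]:
    """Return the video ID from a YouTube URL, or None if it cannot be found."""
    parsed = urlparse(youtube_url)
    host = (parsed.hostname or "").lower()
    if host == "youtu.be":
        # Short links carry the ID as the path: https://youtu.be/<id>
        return parsed.path.lstrip("/") or None
    if host in WATCH_HOSTS:
        # Watch pages carry the ID in the `v` query parameter.
        values = parse_qs(parsed.query).get("v")
        return values[0] if values else None
    return None


def create_app(get_transcript, summarize):
    """Build a Flask app exposing GET /api/summarize?youtube_url=<url>."""
    # Imported lazily so extract_video_id works without Flask installed.
    from flask import Flask, jsonify, request

    app = Flask(__name__)

    @app.route("/api/summarize", methods=["GET"])
    def summarize_endpoint():
        video_id = extract_video_id(request.args.get("youtube_url", ""))
        if video_id is None:
            return jsonify(error="missing or invalid youtube_url"), 400
        try:
            summary = summarize(get_transcript(video_id))
        except Exception as exc:  # e.g. captions unavailable for this video
            return jsonify(error=str(exc)), 500
        return jsonify(summary=summary), 200

    return app
```

Passing the two functions in as arguments keeps the route independent of any particular transcript library or model, which also makes it straightforward to exercise with stub functions.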

C. DISPLAY SUMMARIZED TEXT

To enable interaction between the extension and the back-end server, we need to add functionality for making HTTP REST
API calls. In popup.js, we attach an event listener to the Summarize button with the event type "click" and pass an
anonymous callback function. In the callback function, we use the chrome.runtime.sendMessage method to send an
action message to contentScript.js asking it to generate the summary. We also add an event listener,
chrome.runtime.onMessage, to listen for result messages from contentScript.js; it executes the outputSummary callback
function. In that callback function, we use JavaScript to display the summary programmatically in the div element. We
also need to inject the content script contentScript.js into the page so that it executes automatically. In
contentScript.js, we add a chrome.runtime.onMessage event listener to listen for the generate message, which executes
the generateSummary callback function. In that callback function, we extract the URL of the current tab, make a GET
HTTP request to the back end using the XMLHttpRequest Web API, and receive the summarized text as the response. We then
send a result action message with the summary payload using chrome.runtime.sendMessage to notify popup.js to display
the summarized text.

ACKNOWLEDGMENT

The success and final outcome of this project required a great deal of guidance, support, and kind cooperation from
many people. We wish to express our sincere thanks to all those who were involved in the completion of this
project.

It is our immense pleasure to express our deep sense of gratitude to our respected chairman Thiru R. S. Munirathinam,
our vice chairman Thiru R. M. Kishore, and our director Thiru R. Jothi Naidu for the facilities and support given by
them in the college.

We are extremely thankful to our principal Dr. N. Anbuchezhian, M.S, M.B.A, M.E, Ph.D., for giving us an opportunity
to serve the purpose of education.

We are indebted to Dr. G. Amudha, M.E, Ph.D., Professor and Head of the Department of Computer Science and Business
Systems, for providing the necessary guidance and constant encouragement for the successful completion of this project
on time.

We extend our sincere thanks and gratitude to our project guide Dr. S. Deepa, B.Tech, M.E, Assistant Professor in the
Department of Computer Science and Business Systems, who guided us all along until the completion of our
project work.

