Image-Captioning-and-segmentation

Dataset

This project uses the Flickr8k dataset from Kaggle.
Please download it from the above link and place it in your local environment if you want to run the full code and experiments.

Description

This repository contains code for image captioning and segmentation using deep learning techniques.
It combines CNN-based feature extraction and LSTM-based sequence modeling to generate captions for images.

Files

main.py — Main training and evaluation script.
.ipynb files — Notebooks with experiments and visualizations.
model.keras — Trained Keras model.
tokenizer.pkl — Tokenizer used for captions.

How to Run

Clone this repository.
Download the dataset from Kaggle and place images and captions in your project folder.
Install requirements (pip install -r requirements.txt if you have one).
Run main.py or open notebooks to start experimenting.

License

This project is for academic and research purposes only.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
flickr8k-image-captioning-using-cnns-lstms (1).ipynb		flickr8k-image-captioning-using-cnns-lstms (1).ipynb
img.png		img.png
img_1.png		img_1.png
img_2.png		img_2.png
img_3.png		img_3.png
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Image-Captioning-and-segmentation

Dataset

Description

Files

How to Run

License

About

Uh oh!

Releases

Packages

Languages

AkankshaSingh5git/Image-Captioning-and-segmentation

Folders and files

Latest commit

History

Repository files navigation

Image-Captioning-and-segmentation

Dataset

Description

Files

How to Run

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages