Skip to content

asshejan/Image-Caption-Generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Image Caption Generation

This project implements an image caption generation model using deep learning techniques. The model processes images and generates descriptive captions using a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). The quality of generated captions is evaluated using the BLEU (Bilingual Evaluation Understudy) score, a widely used metric for measuring the accuracy of machine-generated text against human references.

Preview Image

Include a preview image of the model's output here:

Preview Image

Features

  • Uses CNN (e.g., InceptionV3) for image feature extraction.
  • Implements an LSTM-based decoder for caption generation.
  • Tokenization and vocabulary handling with a custom word index.
  • Evaluation using BLEU Score to measure the quality of generated captions.

Installation

To set up the project, install the required dependencies:

pip install tensorflow numpy matplotlib nltk

Usage

Run the Jupyter Notebook to train and test the model:

jupyter notebook image-caption-generation.ipynb

Evaluation

The model performance is measured using the BLEU score, which evaluates how similar the generated captions are to reference captions. Higher BLEU scores indicate better caption accuracy.

License

This project is licensed under the MIT License.

About

This notebook builds an image caption generation model using CNNs for image feature extraction and LSTMs for text generation. It processes images and captions, trains a deep learning model, and evaluates performance with the BLEU score for accuracy. Useful for AI-driven applications requiring automated image descriptions. πŸš€

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors