🧠 Lightweight BART for Conversational Summarization

This project implements a scaled-down version of the BART (Bidirectional and Auto-Regressive Transformers) model for abstractive summarization of conversational data. The goal is to create a summarization model that can be trained and deployed in resource-constrained environments while maintaining reasonable performance.

🔍 Motivation

With the increasing use of digital communication platforms like Zoom, Microsoft Teams, and Discord, the ability to summarize large volumes of conversational text is more important than ever. However, full-sized transformer models like BART-base are often computationally expensive to train and run. This project explores the performance tradeoffs of a reduced BART model trained on modest hardware and smaller datasets.

🧱 Model Architecture

- Based on the original BART model
- Reduced to:
  - 4 encoder layers
  - 4 decoder layers
  - Smaller hidden sizes and fewer attention heads
- Trained using the Hugging Face transformers library
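
To give a rough sense of what the reduction buys, the sketch below estimates trainable parameter counts for BART-base and for one possible reduced configuration. The README does not state the exact hidden size, feed-forward size, or head count, so the values in `SMALL_BART` (512, 2048, 8) are illustrative assumptions, not the project's actual settings. The helper `estimate_bart_params` is a hypothetical name; it counts weights the way the Hugging Face BART implementation lays them out (shared token embeddings, learned positions with an offset of 2, biased linear layers).

```python
# Reference values for BART-base. Keys mirror BartConfig keyword arguments.
BART_BASE = dict(
    vocab_size=50265, max_position_embeddings=1024, d_model=768,
    encoder_layers=6, decoder_layers=6,
    encoder_ffn_dim=3072, decoder_ffn_dim=3072,
    encoder_attention_heads=12, decoder_attention_heads=12,
)

# ASSUMPTION: only the 4 + 4 layer counts come from this README.
# d_model, FFN sizes, and head counts below are illustrative guesses.
SMALL_BART = dict(
    BART_BASE, d_model=512,
    encoder_layers=4, decoder_layers=4,
    encoder_ffn_dim=2048, decoder_ffn_dim=2048,
    encoder_attention_heads=8, decoder_attention_heads=8,
)


def estimate_bart_params(cfg):
    """Estimate trainable parameters of a BART-style encoder-decoder."""
    d = cfg["d_model"]
    attn = 4 * (d * d + d)  # q, k, v, and output projections, with biases
    norm = 2 * d            # LayerNorm weight and bias

    def ffn(inner):
        # Two linear layers: d -> inner and inner -> d, with biases.
        return 2 * d * inner + inner + d

    # Encoder layer: self-attention, FFN, two LayerNorms.
    enc_layer = attn + ffn(cfg["encoder_ffn_dim"]) + 2 * norm
    # Decoder layer: self-attention, cross-attention, FFN, three LayerNorms.
    dec_layer = 2 * attn + ffn(cfg["decoder_ffn_dim"]) + 3 * norm

    tokens = cfg["vocab_size"] * d  # shared by encoder, decoder, output layer
    positions = 2 * (cfg["max_position_embeddings"] + 2) * d
    emb_norms = 2 * norm            # one embedding LayerNorm per stack

    return (tokens + positions + emb_norms
            + cfg["encoder_layers"] * enc_layer
            + cfg["decoder_layers"] * dec_layer)


base = estimate_bart_params(BART_BASE)    # about 139M
small = estimate_bart_params(SMALL_BART)
print(f"BART-base: {base / 1e6:.1f}M parameters")
print(f"Reduced:   {small / 1e6:.1f}M parameters ({small / base:.0%} of base)")
```

The number of attention heads does not change the parameter count, since the heads split the same projection matrices. Because the keys match `BartConfig` keyword arguments, a dictionary like `SMALL_BART` could be passed as `BartConfig(**SMALL_BART)` when building the model with transformers.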

📊 Datasets

📚 Pretraining

Gigaword Corpus (~700k samples)
Used for learning general summarization patterns and building foundational knowledge.

🗣️ Fine-Tuning

SAMSum Dataset
Dialogue-based summarization dataset with informal, multi-speaker conversational text.
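
As a minimal sketch of how dialogue records might be prepared for fine-tuning, the code below turns SAMSum-style records into source/target pairs. It assumes each record is an object with `id`, `dialogue`, and `summary` fields and that speaker turns are separated by line breaks, which is how the SAMSum corpus is commonly distributed. The function names `load_records` and `to_pairs` and the sample conversation are hypothetical and only for illustration.

```python
import json


def load_records(path):
    """Read a JSON file containing a list of dialogue records."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def to_pairs(records):
    """Convert records into cleaned source/target pairs, skipping empty ones."""
    pairs = []
    for rec in records:
        dialogue = rec.get("dialogue", "")
        summary = rec.get("summary", "").strip()
        # splitlines() handles both "\n" and "\r\n" turn separators.
        turns = [line.strip() for line in dialogue.splitlines() if line.strip()]
        if not turns or not summary:
            continue  # nothing to learn from an empty dialogue or summary
        pairs.append({
            "id": rec.get("id"),
            "source": "\n".join(turns),  # one speaker turn per line
            "target": summary,
            "num_turns": len(turns),
        })
    return pairs


# Illustrative records, not taken from the dataset.
records = [
    {
        "id": "example-1",
        "dialogue": "Ana: Are we still meeting at 3?\r\n"
                    "Ben: Yes, same room as last week.\r\n"
                    "Ana: Great, see you there.",
        "summary": "Ana and Ben will meet at 3 in the same room as last week.",
    },
    {"id": "example-2", "dialogue": "", "summary": "Skipped: no dialogue."},
]

pairs = to_pairs(records)
for pair in pairs:
    print(pair["id"], pair["num_turns"], "turns ->", pair["target"])
```

The resulting `source` and `target` strings are what a tokenizer would then encode as model inputs and labels.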
