GitHub - ldi18/TabularQGAN: Code Release for TabularQGAN: A Quantum Generative Model for Tabular Data

Introduction

This code is the official implementation accompanying TabularQGAN: A Quantum Generative Model for Tabular Data (Bhardwaj et al., 2025). If you use this code, please cite the paper (see Citation). It provides the ability to train the TabularQGAN model and the classical benchmark models.

It is an adaptation of the open source code, qugen. For full documentation on the qugen library please see https://qutacquantum.github.io/qugen/

Installation

This repository declares requires-python = ">=3.10,<3.11" in pyproject.toml. We have only tested it with Python 3.10.19 (pyenv install 3.10.19 && pyenv local 3.10.19).

Create a virtual environment with Python 3.10.x using pyenv:

~/.pyenv/versions/3.10.19/bin/python -m venv .venv
Activate the environment: source .venv/bin/activate
Run pip install -e . to install the project in editable mode with the TabularQGAN code, SDV-based classical baselines, evaluation helpers, and plotting dependencies.

Instructions for training and running models

The script to train the modeles is called 'apps/ehr/train_tabularqgan.py' which is run from the command line.

The training data, model type, and hyperparameters can all be specified via command line arguments. An output of a sample of synthetic data from the trained model.

The tabularQGAN model also saves as pickle files the parameters at each epoch of training and as a csv and a plot the asscoiated overall metric and KL metric for each epoch of training.

Each model type also has a model handler, they are found in qugen/main/generator and are imported by the training script as needed.

Steps (run from the repository root; place the raw Adult CSV as apps/ehr/training_data/adult.csv before step 1):

Ingest and encode the data:

python apps/ehr/data_ingestion_tabular_adults_census.py
Train TabularQGAN (example):

python apps/ehr/train_tabularqgan.py adults_census_10_non_boolean qgan 2 3000 0.1 0.05 0.1 small 11

These two commands are the minimal pipeline for the Adult census 10-qubit non-boolean setup.
Select other model types, datasets, and hyperparameters via command line as needed:

The command line arguments for the script are as follows:

Number	Name	Description
0.	script	hyperparameter_train_discrete.py
1.	data_set_name	str: Options are 'adult_census_10', 'adult_census_15', 'adults_census_10_non_boolean', 'adults_census_15_non_boolean'
2.	model_type	str: Options are 'qgan', 'ctgan' and 'copulagan'
3.	circuit_depth	int: for number of layers
4.	n_epochs	int: for number of epochs to train each model
5.	lr_generator	float: learning rate for the generator
6.	lr_discriminator	float: learning rate for the discriminator
7.	batch_size_fraction	float: batch size fraction for the training
8.	classical_model_size	str: options 'small', 'large' , for the layer width for the classical model.
9.	random_seed	int: random seed for initialising jax

The synthetic data is found at 'experiments/[model_name]/synethetic_data_[model_name].csv' for the tabularQGAN model and 'experiments_classical/[model_name]/synethetic_data_[model_name].csv' for the classical models. Additional files produced in the experiments folder are- pickle files for each iteration, loss curve for KL divergence and overall metric and csv files with best overall metric and kl divergence

Notes

For convenience we have also included a json file with the best found configurations per model and dataset as reported in the paper, called 'best_config.json'.

Only the data processeing code for the adult census data set is provided here as the MIMIC-III dataset cannot not be distributed. See https://physionet.org/content/mimiciii/1.4/ for details.

Citation

@misc{bhardwaj2025tabularqganquantumgenerativemodel,
      title={TabularQGAN: A Quantum Generative Model for Tabular Data}, 
      author={Pallavi Bhardwaj and Caitlin Jones and Lasse Dierich and Aleksandar Vučković},
      year={2025},
      eprint={2505.22533},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2505.22533}, 
}

Contact

pallavi.bhardwaj@sap.com

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
apps		apps
benchmarks		benchmarks
qugen		qugen
vae-bgm_data_generator @ 2b9cf5f		vae-bgm_data_generator @ 2b9cf5f
.gitmodules		.gitmodules
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
best_config.json		best_config.json
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Installation

Instructions for training and running models

Notes

Citation

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Introduction

Installation

Instructions for training and running models

Notes

Citation

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages