Diffusion Model for Synthetic Microscopy Data Generation

A PyTorch-based framework for generating synthetic microscopy data of bacterial cells in microfluidic devices using diffusion models. This project specializes in the "mother machine" - a microfluidic device for long-term single-cell observation.

Results Preview

Generated Videos

Each video consists of 16 consecutive frames showing synthetic bacterial cell growth:

Ground Truth Videos

Features

Image and video diffusion model training
Cell Tracking Challenge (CTC) dataset compatibility
Built on PyTorch and diffusers library
Pre-trained models available
Automated dataset management
Weights & Biases integration for experiment tracking

Installation

Prerequisites

CUDA-capable GPU (recommended)
Anaconda or Miniconda

Setup

conda env create -f env.yaml
conda activate diffusion-env

Dataset

Using the MOMA Dataset

The default "moma" (Mother Machine) dataset follows the Cell Tracking Challenge (CTC) format and is automatically downloaded from Zenodo. All images used to train the model are resized to 256x32 pixels.

Custom Dataset Structure

Support for custom datasets through either:

CTC format conversion
Custom Dataset class implementation

Example CTC structure:

data/
    /moma/
        /CTC/
            train/
                01/
                    t001.tif
                    t002.tif
                02/
                    t001.tif
                    t002.tif

Usage

Training

Train on images or videos:

# For video training
python main.py --dataset moma --data-type video

# For image training
python main.py --dataset moma --data-type image

Model Checkpoints

Checkpoints are saved automatically when loss improves:

models/
    /moma/
        /video/
            /<modelname>/
                config.json                            # Model hyperparameters and architecture settings
                diffusion_pytorch_models.safetensors   # Trained model weights and states
                training_metrics.png                   # Loss curves, Noise schedule distribution & and learning rate 
                epoch_5/                               # Checkpoint directory for each epoch
                    sample_001.gif                     # Generated validation samples
                    sample_002.gif
                    sample_003.gif

Inference

Generate new samples:

python inference.py --dataset moma --data-type video

Output structure:

outputs/
    /moma/
        /video/
            /<modelname>/
                {num_timesteps}_inference_steps_<sample_number>.gif

Model Architecture

Components

VAE Encoder/Decoder: Uses AutoencoderKL for dimensionality reduction
Image Model: UNet2DModel for single-frame generation
Video Model: UNet3DConditionModel for temporal sequence generation (generates 16-frame sequences)

Training Progress

Epoch 1	Epoch 3	Epoch 5	Epoch 7	Epoch 9

Training Metrics

Future Work

Incorporate validation set
Generate segmentation masks in addition to images
Generate longer videos or with larger image sizes

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
data		data
examples		examples
models		models
.gitignore		.gitignore
README.md		README.md
argparse_utils.py		argparse_utils.py
config.py		config.py
data_utils.py		data_utils.py
env.yaml		env.yaml
generate_noisy_or_GT_videos.py		generate_noisy_or_GT_videos.py
inference.py		inference.py
main.py		main.py
model.py		model.py
test_vae.py		test_vae.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diffusion Model for Synthetic Microscopy Data Generation

Results Preview

Generated Videos

Ground Truth Videos

Features

Installation

Prerequisites

Setup

Dataset

Using the MOMA Dataset

Custom Dataset Structure

Usage

Training

Model Checkpoints

Inference

Model Architecture

Components

Training Progress

Training Metrics

Future Work

About

Uh oh!

Releases

Packages

Languages

owen24819/cells-diffusion

Folders and files

Latest commit

History

Repository files navigation

Diffusion Model for Synthetic Microscopy Data Generation

Results Preview

Generated Videos

Ground Truth Videos

Features

Installation

Prerequisites

Setup

Dataset

Using the MOMA Dataset

Custom Dataset Structure

Usage

Training

Model Checkpoints

Inference

Model Architecture

Components

Training Progress

Training Metrics

Future Work

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages