Changelog

All notable changes to the GARAGE project.


[1.0.0] — 2025-09-28 (bioRxiv preprint)

Added

  • Two-stage GARAGE pipeline: GAT-based cell selection + GAT-seeded GAN generation.

  • Built-in support for 4 scRNA-seq datasets: Yan, Pollen, CBMC, Muraro.

  • Wasserstein distance computation via Optimal Transport.

  • Data validation module: CV², Fano, and PCA loading feature selection; Leiden clustering; ARI/NMI/macro-F1 reporting; UMAP visualisation.

  • 5 general-purpose SOTA baselines (PyTorch): GAN, WGAN, F-GAN, VAE, LSH-GAN.

  • 5 scRNA-seq-specific baselines (PyTorch): scGAN, scVAE, scDiffusion, GAN-ROS, VAE-ROS.

  • Biological validation: GAT attention ↔ marker-gene enrichment, rare-cell positive rate analysis.

  • Held-out rare-cell utility experiment with Random Forest classifier.

  • Ablation studies: leakage fraction sweep (0.0–0.3) and multi-seed synthesis (5 seeds).

  • Analysis suite: distribution metrics, clustering evaluation, aggregate losses, summary tables.

  • ReadTheDocs documentation (Sphinx + MyST Parser + sphinx-rtd-theme).

  • config.py — single source of truth for paths and hyper-parameters.

  • CITATION.cff — citation metadata for GitHub/Zotero integration.