Overview
GARAGE
Graph-Attentive Rare-cell-Aware single-cell data GEneration
How to read these docs
GARAGE’s documentation follows the Diátaxis framework to serve different needs at different points in your journey:
Getting Started — first-time setup and your first synthetic dataset.
Tutorials — step-by-step, end-to-end walkthroughs for research use cases.
How‑to Guides — focused recipes for specific tasks.
Theoretical Background — understanding the architecture and methods.
API Reference — function signatures, module-level documentation.
Appendix — glossary, FAQ, troubleshooting, citation, and changelog.
Getting Started
Tutorials
How‑to Guides
Theoretical Background
API Reference
- config module
- data_generation module
- GARAGE — Graph-Attentive Rare-cell-Aware single-cell data GEneration.
DiscriminatorGATClassifierGeneratorgat_subsample()generate_data()load_dataset()main()run_garage()sample_Z()train_gan()- Wasserstein distance between real and generated scRNA-seq distributions.
load_generated()load_real()main()wasserstein_distance()- Core Functions
- data_validation module
- Data validation for GARAGE‑generated scRNA‑seq data.
cluster_and_evaluate()cv2_selection()load_generated()load_labels()load_real()main()plot_umap()sweep_resolution()- Feature selection for scRNA-seq data (Python port of feature_selection.R).
cv2_selection()fano_selection()main()pca_loading_selection()run_feature_selection()- Core Functions
- Resolution Sweep
- Reference Notebook
- biological_analysis module
- Biological validation of attention-prioritised cells.
enrichment_analysis()load_cbmc()main()marker_expression_analysis()per_celltype_positive_rate_analysis()print_summary_interpretation()run_gat_and_get_attention()- Held-out rare-cell utility experiment.
GATClassifierGarageDiscriminatorGarageGeneratorLSHDiscriminatorLSHGeneratorVanillaDiscriminatorVanillaGeneratorevaluate_classifier()evaluate_classifier_downgraded()garage_gat_seeds()generate_garage()generate_lsh_gan()generate_vanilla_gan()knn_subsample()load_data()main()sample_Z()split_rare()train_garage_gan()train_lsh_gan()train_vanilla_gan()- Fixed marker-gene clustering evaluation.
compute_ari_nmi()evaluate_baseline()evaluate_clustering()evaluate_real_reference()evaluate_sweep()get_pseudo_labels()load_garage_data()load_gen_data()load_real_data()main()select_markers_cbmc()select_markers_muraro()select_markers_pollen()select_markers_yan()- Modules Overview
- ablation_study module
- GAN training stability ablation study.
DiscriminatorGeneratorgat_subsample()load_cbmc()load_muraro()load_pollen()load_yan()main()sample_Z()train_gan_with_leakage()CriticFDiscriminatorFGeneratorGATG_DiscriminatorGATG_GeneratorLSHDiscriminatorLSHGeneratorVanillaDiscriminatorVanillaGeneratorWGeneratorfisher_ratio()gat_subsample()knn_subsample()load_labels()load_real()main()sample_Z()train_and_generate_fgan()train_and_generate_gan()train_and_generate_gatgan()train_and_generate_lshgan()train_and_generate_wgan()- Modules Overview
- analysis module
load_real()load_synthetic()main()mmd_rbf()sliced_wasserstein()cluster_and_evaluate()cv2()evaluate_baseline()evaluate_sweep()load_gen()load_labels()load_real()main()fmt_mean_std()main()evaluate_ari()fano_selection()load_real()main()- Fixed marker-gene clustering evaluation (grid sweep).
compute_ari_nmi()evaluate_clustering()evaluate_real_reference()evaluate_sweep()get_pseudo_labels()load_garage_data()load_gen_data()load_real_data()main()select_markers_cbmc()select_markers_muraro()select_markers_pollen()select_markers_yan()load_gan_data()load_garage_data()load_lsh_gan_data()load_real_data()main()mmd_rbf()rbf_kernel()load_gan_data()load_garage_data()load_lsh_gan_data()load_real_data()main()sliced_wasserstein()- Modules Overview
Appendix
Image credits: GARAGE architecture image by the authors; 10x Genomics scRNA-seq comparison used under fair-use for educational illustration.