@sdv-dev

The Synthetic Data Vault Project

Owned and maintained by DataCebo.

The Synthetic Data Vault

The official development organization for the Synthetic Data Vault (SDV) ecosystem.

This organization is owned and maintained by DataCebo and contains the open-source libraries, research projects, and engineering repositories that power the SDV ecosystem.

Core Projects

  • SDV – Synthetic Data Vault
  • RDT – Reversible Data Transforms
  • SDMetrics – Synthetic data evaluation metrics
  • Copulas – Statistical copula models
  • SDGym – Benchmarking framework for synthetic data generation

Enterprise

Looking for enterprise-scale synthetic data generation?

SDV Enterprise provides:

  • Enterprise relational modeling
  • Automated metadata discovery
  • Synthetic data for testing software applications
  • Air-gapped and on-premises deployment
  • AI training datasets
  • Demo data generation
  • Performance testing
  • Enterprise support

© DataCebo

Pinned

  1. SDV (public)

    Synthetic data generation for tabular data.

    Python · 3.5k stars · 416 forks

  2. CTGAN (public)

    Conditional GAN for generating synthetic tabular data.

    Python · 1.6k stars · 329 forks

  3. SDMetrics (public)

    Metrics to evaluate the quality and efficacy of synthetic datasets.

    Python · 261 stars · 52 forks

  4. Copulas (public)

    A library to model multivariate data using copulas.

    Python · 646 stars · 121 forks

  5. SDGym (public)

    Benchmarking synthetic data generation methods.

    Python · 309 stars · 68 forks

  6. RDT (public)

    A library of Reversible Data Transforms.

    Python · 134 stars · 27 forks
