Main

Main folder. Material related to my books on synthetic data and generative AI. Also contains documents that blend components from several folders or cover topics spanning multiple folders.

Stars: 66

This repository contains material related to the author's new book _Synthetic Data and Generative AI_, including code for NoGAN, DeepResampling, and NoGAN_Hellinger. NoGAN is a tabular data synthesizer that outperforms neural-network-based GenAI methods in both speed and quality of results, as measured by state-of-the-art quality metrics. DeepResampling is another fast NoGAN, based on resampling and distribution-free Hierarchical Bayesian Models, with hyperparameter auto-tuning. NoGAN_Hellinger blends NoGAN and DeepResampling, with the loss function replaced by the Hellinger model evaluation metric.

README:

This repository features material that spans multiple folders. For the time being, it relates to my new book _Synthetic Data and Generative AI_, published by Elsevier and available here.

It also includes:

  • NoGAN code, a tabular data synthesizer that runs 1000x faster than GenAI methods based on neural networks. It consistently delivers better results regardless of the evaluation metric (including state-of-the-art new quality metrics that capture a lot more than traditional distances), on categorical features, numerical features, or a mix of both. For details, see technical paper #29, available here.
  • DeepResampling code, another fast NoGAN based on resampling and distribution-free Hierarchical Bayesian Models, with hyperparameter auto-tuning. For details, see technical paper #31, available here.
  • NoGAN_Hellinger code (two scripts), a blend of NoGAN and DeepResampling with the loss function replaced by the Hellinger model evaluation metric. For details, see section 2.4 in the project textbook, here.
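
To give a feel for the Hellinger metric mentioned in the last item, here is a minimal sketch of the standard Hellinger distance between two discrete distributions, plus a one-feature comparison of a real and a synthetic sample. It is not taken from the repository scripts: the function names `hellinger_distance` and `hellinger_from_samples`, the equal-width binning, and the default of 20 bins are all assumptions made for illustration, and the scripts in this folder may compute the metric differently (for instance, over multivariate bins).

```python
import numpy as np

def hellinger_distance(p, q):
    """Hellinger distance between two discrete distributions.

    Inputs are non-negative weights (counts or probabilities) over the
    same bins. The result is 0 for identical distributions and 1 for
    distributions with disjoint support.
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    if p.shape != q.shape:
        raise ValueError("p and q must have the same shape")
    if p.sum() <= 0 or q.sum() <= 0:
        raise ValueError("p and q must each have positive total mass")
    # Normalize so that each vector sums to 1.
    p = p / p.sum()
    q = q / q.sum()
    return float(np.sqrt(0.5 * np.sum((np.sqrt(p) - np.sqrt(q)) ** 2)))

def hellinger_from_samples(real, synth, bins=20):
    """Hypothetical helper: bin two 1-D samples on a shared grid, then compare.

    Equal-width bins over the combined range are an assumption of this
    sketch, not something taken from the repository.
    """
    real = np.asarray(real, dtype=float)
    synth = np.asarray(synth, dtype=float)
    lo = min(real.min(), synth.min())
    hi = max(real.max(), synth.max())
    if lo == hi:
        # Both samples are the same constant: the distributions coincide.
        return 0.0
    edges = np.linspace(lo, hi, bins + 1)
    p, _ = np.histogram(real, bins=edges)
    q, _ = np.histogram(synth, bins=edges)
    return hellinger_distance(p, q)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    real = rng.normal(0.0, 1.0, size=5000)
    close = rng.normal(0.0, 1.0, size=5000)    # same distribution as real
    shifted = rng.normal(3.0, 1.0, size=5000)  # clearly different distribution
    print("real vs close:  ", hellinger_from_samples(real, close))
    print("real vs shifted:", hellinger_from_samples(real, shifted))
```

A lower value means the synthetic feature follows the real one more closely, which is why a quantity of this kind can serve as a loss to minimize during synthesis.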
