Handling imbalanced datasets is a difficult challenge in machine learning. It arises in applications such as payment fraud detection, the diagnosis of cancer and other diseases, and even the detection of cybersecurity attacks, where the cases of interest are rare compared with everything else. How can synthetic data improve massively imbalanced datasets in machine learning?