Sample datasets
Census dataset

COVID-19 dataset

Baseball dataset


To give users practical examples for testing and analytics, we have selected sample datasets suited to different scenarios. The datasets are sourced from well-known repositories and are intended as a starting point for exploring Syntho's features and capabilities. For testing purposes, a multi-table dataset is available; for analytics, there is a single-table dataset:
Census dataset
Use Case: Ideal for analytics and AI model training.
Description: Contains demographic information, including age, education, occupation, and income classification.

Click the link below to download the .csv file.
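Once the .csv file is downloaded, a quick sanity check before any analytics work is to look at how the values of a categorical column are distributed. The following is a minimal sketch using only the Python standard library. The function name `count_column_values` is hypothetical, and the column name you pass in is an assumption: check the header row of the downloaded file for the actual column names.

```python
import csv
from collections import Counter
from typing import Dict, Iterable


def count_column_values(csv_lines: Iterable[str], column: str) -> Dict[str, int]:
    """Count how often each distinct value occurs in one column of a CSV.

    `csv_lines` can be an open text file or any iterable of CSV lines.
    `column` must match a name in the header row (hypothetical here;
    inspect the downloaded file for the real column names).
    """
    reader = csv.DictReader(csv_lines)
    if reader.fieldnames is None or column not in reader.fieldnames:
        raise KeyError(f"column {column!r} not found in CSV header")
    # Missing trailing fields are read as None, so fall back to "".
    counts = Counter((row[column] or "").strip() for row in reader)
    return dict(counts)
```

To use it on the downloaded file, open the file in text mode with `newline=""` and pass the file object together with the name of the column you want to summarize.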
COVID-19 dataset
Use Case: Useful for testing synthetic data generation on multi-table healthcare-related datasets.
Description: Includes tables such as patients, conditions, and encounters, simulated for COVID-19 scenarios.
Source: Synthea COVID Patients Dataset.

Click the link below to download a .zip file containing 10k patient records with COVID-19 in CSV format. To download the version with 100k patient records, please click here.
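Because this dataset ships as a .zip archive of CSV tables, it can help to read every table into memory in one pass before inspecting the relationships between them. The sketch below assumes only that the archive contains one `.csv` file per table. The function name `load_tables_from_zip` is hypothetical, and the table names are derived from whatever file names the archive actually contains.

```python
import csv
import io
import zipfile
from pathlib import PurePosixPath
from typing import Dict, List


def load_tables_from_zip(zip_source) -> Dict[str, List[Dict[str, str]]]:
    """Read every .csv member of a zip archive into a list of row dicts.

    `zip_source` is a path or a binary file-like object.
    Returns a mapping from table name (file name without extension)
    to that table's rows.
    """
    tables: Dict[str, List[Dict[str, str]]] = {}
    with zipfile.ZipFile(zip_source) as archive:
        for name in archive.namelist():
            # Skip directories and any non-CSV members.
            if not name.lower().endswith(".csv"):
                continue
            table_name = PurePosixPath(name).stem
            with archive.open(name) as raw:
                # Assumes UTF-8 encoded CSV files.
                text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                tables[table_name] = list(csv.DictReader(text))
    return tables
```

Calling the function with the path to the downloaded archive returns one entry per table, so `len(tables[name])` gives the row count for each table. For the 100k version, reading all rows into memory at once may be heavy; iterating over each `csv.DictReader` instead of wrapping it in `list(...)` is a reasonable alternative.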
Baseball dataset
Use Case: Suitable for analytics and relational dataset exploration.
Description: Features player statistics and seasonal performance data.
Source: Lahman Baseball Dataset.


Click the link below to download the .zip file.

