Sample Datasets
Last updated
Was this helpful?
Last updated
Was this helpful?
To provide users with practical examples for testing and analytics, we have selected datasets optimized for various scenarios. These datasets are sourced from well-known repositories and are designed to help users get started with Syntho's features effectively. For testing purposes, you can access a multi-table dataset, while for analytics, there is a single-table dataset. Additionally, a two-table sequence dataset is available for sequence-based modeling and evaluation. These datasets serve as a practical starting point for exploring Syntho's features and capabilities:
Use Case: Ideal for analytics and AI model training.
Description: Contains demographic information, including age, education, occupation, and income classification.
Click below link to download .csv
file.
Use Case: Useful for testing synthetic data generation on multi-table healthcare-related datasets.
Description: Includes tables such as patients, conditions, encounters etc. simulated for COVID-19 scenarios.
Source: Synthea COVID Patients Dataset.
Click below link to download .zip
file for 10k patient records with COVID-19 in the CSV format. If you would like to download 100k patient records version, please click here.
Use Case: Suitable for analytics and sequence-based data generation.
Description: Features player statistics and seasonal performance data.
Source: Lahman Baseball Dataset.
Click below link to download .zip
file.