Sample Datasets
To give users practical examples for testing and analytics, we have selected sample datasets from well-known repositories. A single-table dataset is available for analytics, a multi-table dataset for testing, and a two-table sequence dataset for sequence-based modeling and evaluation. Together they serve as a starting point for exploring Syntho's features and capabilities:
Use Case: Ideal for analytics and AI model training.
Description: Contains demographic information, including age, education, occupation, and income classification.
Click the link below to download the .csv file.
Use Case: Useful for testing synthetic data generation on multi-table healthcare-related datasets.
Description: Includes tables such as patients, conditions, and encounters, simulated for COVID-19 scenarios.
Source: Synthea COVID Patients Dataset.
Click the link below to download the .zip file containing 10k patient records with COVID-19 in CSV format. To download the version with 100k patient records, click here.
Use Case: Suitable for analytics and sequence-based data generation.
Description: Features player statistics and seasonal performance data.
Source: Lahman Baseball Dataset.
Click the link below to download the .zip file.
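After downloading a .zip archive that contains CSV tables, such as the multi-table dataset above, it can help to check which tables it holds before uploading them anywhere. The following is a minimal sketch using only the Python standard library. It is not part of Syntho's tooling: the function name `summarize_zip_tables` and the example file name are hypothetical, and it assumes the archive members are UTF-8 encoded CSV files with a header row.

```python
import csv
import io
import zipfile


def summarize_zip_tables(zip_source):
    """Return {member_name: (column_names, row_count)} for each CSV in a zip.

    zip_source may be a file path or a binary file-like object.
    Assumes UTF-8 CSV files whose first row is a header (an assumption,
    not something the dataset pages guarantee).
    """
    summary = {}
    with zipfile.ZipFile(zip_source) as archive:
        for name in archive.namelist():
            if not name.lower().endswith(".csv"):
                continue  # skip directories and non-CSV members
            with archive.open(name) as raw:
                text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                reader = csv.reader(text)
                header = next(reader, [])           # first row = column names
                row_count = sum(1 for _ in reader)  # remaining rows = records
            summary[name] = (header, row_count)
    return summary


# Example usage (the path is a placeholder for wherever you saved the download):
# for table, (columns, rows) in summarize_zip_tables("sample_dataset.zip").items():
#     print(table, rows, columns)
```

Listing the columns and row counts per table is a quick way to confirm that the download is complete and to see which tables are likely linked by shared key columns.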