> For the complete documentation index, see [llms.txt](https://docs.syntho.ai/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.syntho.ai/overview/get-started/use-cases-and-configuration.md). # Use Cases & Configuration Start with your goal. Then pick the first use case to implement. Each use case optimizes for a different outcome. For example,. testing, realism, privacy, consistency, or speed. Each has different configuration requirements. ### Getting started Before you dive into a use case, make sure the basics are covered: * [Prerequisites](/overview/get-started/prerequisites.md) * [Deploy Syntho](/deploy-syntho/introduction.md) #### Generation approaches * **AI-generated synthesis**: best when you need statistical utility with strong privacy, or extra rows. * **Rule-based generation**: best when values must follow explicit business logic. * **Masking / de-identification**: best when you need format-preserving replacements and stable keys/relationships. * **Hybrid**: best when one approach alone does not meet your requirements. #### Key configuration decisions These decisions drive most success (and most rework). **1) Pick the workspace mode that matches your starting point** * **De-identify**: you already have a production-like dataset and mainly need to replace identifiers. * **Mock or mask all**: you need “production-like” formats but you don’t want to keep original values. * **Mock all**: you have little/no source data and want to generate everything from scratch. * **Synthesize all**: you have enough rows and want maximum statistical utility with strong privacy. **2) Decide if you should reshape to a single entity table** AI synthesis works best on a single table. It is often worth creating a SQL view first (especially for ML, analytics and data sharing). * [Use SQL views as input tables](/setup-workspaces/create-a-workspace/use-sql-views-as-input-tables.md) * [Cross-table relationships limitations](/configure-a-data-generation-job/configure-column-settings/ai-powered-generation/table-relationships.md) **3) Choose masking, rule-based, and AI synthesis** * Use **masking** when downstream systems validate formats (emails, IBANs, UUIDs). * Use **rule-based / calculated columns** when the business logic must always hold (profit = revenue - costs). * Use **AI synthesis** when you need privacy + statistical utility for indirect identifiers (age, gender, weight). #### Governance, compliance, and automation (reference) * Use workspace roles and sharing to control who can view data and change generators. See [Workspace & user management](/overview/get-started/syntho-bootcamp/8.-workspace-and-user-management.md). * Automate workspace setup, scans, and runs via the [Syntho REST API](/syntho-api/syntho-rest-api.md). * For deployment options, see [Deploy Syntho](/deploy-syntho/introduction.md). ### Use cases

	Cover image
Use Case 1: Application & API Testing Privacy-safe, production-like test data for application and API tests in non-production.	/files/I0kbZfdGenRDB21GS5vO	/pages/034fkmKsQX3DK0iB7UVm
Use Case 2: Load & Stress Generate large volumes and edge cases for performance testing without sensitive production data.	/files/GzMpv3u3uevUiBGFKFqD	/pages/NkTf8L2XkXSNqgHYP8MA
Use Case 3: Demo Data Realistic demo data that contains no real identifiers and can be shared safely.	/files/ujHbESJyWHjW78cL7Qu6	/pages/gtzgHUwDJVk6MFeP72L9
Use Case 4: ETL & Data Pipeline Testing End-to-end pipeline testing without using production data.	/files/oaK560iE4ICpL1AhHdLO	/pages/m0lRco6Dnu1jsuaHJaEA
Use Case 5: Feature Development Shift-left testing with realistic synthetic data when production data is unavailable or restricted.	/files/GzMpv3u3uevUiBGFKFqD	/pages/OABjH6uhizuKtzt5cuCG
Use Case 6: ML Model Development Generate feature datasets when real data is scarce or sensitive.	/files/ujHbESJyWHjW78cL7Qu6	/pages/c4s5IDwQiiYHo4SD8Lb8
Use Case 7: Analytics Sandboxes Secure sandboxes for exploratory analytics and data science.	/files/ujHbESJyWHjW78cL7Qu6	/pages/n2MVcgykJHrCjz460ag5
Use Case 8: Cloud & Data Migration Validate data workflows during migrations.	/files/q1kuF8aCbmqiHC1gUDLz	/pages/P8PBQJs0PbTAZSlhQmlX
Use Case 9: Data Sharing & Monetization Share data with strong privacy protection.	/files/q1kuF8aCbmqiHC1gUDLz	/pages/qjGJAhVnPr3eAhxOw7do
Use Case 10: Data Subsetting Reduce data footprint while preserving integrity.	/files/pRyl3u4klwSFcFnF2Lrk	/pages/xq3wmL0rf8lmm7PUdIDe
Use Case 11: Accelerate PoCs & Pilots Deliver privacy-safe datasets fast to validate ideas, integrations, and workflows.	/files/q1kuF8aCbmqiHC1gUDLz	/pages/dRqPKvaPcwWLB81BtcED
Use Case 12: Training & Education Create safe, realistic datasets for onboarding, workshops, and hands-on training.	/files/pRyl3u4klwSFcFnF2Lrk	/pages/pGk7k01EA5f5uwtQyyfs

Baseline workflow (applies to every use case)

Use this checklist to go from “use case” to a repeatable job. {% stepper %} {% step %} ### Prerequisites Confirm access, schema alignment, and environment readiness. * [Prerequisites](/overview/get-started/prerequisites.md) {% endstep %} {% step %} ### Create a workspace Pick the source + destination, then choose a workspace mode that matches your starting point. * [Create a workspace](/setup-workspaces/create-a-workspace.md) * [Workspace modes](/setup-workspaces/create-a-workspace/workspace-modes.md) {% endstep %} {% step %} ### Configure generators Start from the simplest approach that meets the goal. * [Introduction to data generators](/overview/get-started/introduction-to-data-generators.md) * [Generators](/configure-a-data-generation-job/configure-column-settings.md) {% endstep %} {% step %} ### Handle keys and relationships (relational schemas) Make FK behavior explicit before your first big run. * [Referential integrity & foreign keys](/overview/get-started/syntho-bootcamp/6.-referential-integrity-and-foreign-keys.md) * [Manage foreign keys](/configure-a-data-generation-job/manage-foreign-keys.md) * [Key generators](/configure-a-data-generation-job/configure-column-settings/key-generators.md) {% endstep %} {% step %} ### Validate and sync Validate early, then resync whenever the schema drifts. * [Validate and synchronize workspace](/configure-a-data-generation-job/generation-and-validation/validate-and-synchronize-workspace.md) {% endstep %} {% step %} ### Tune generation settings Optimize performance and reduce write errors before scaling up. * [View and adjust generation settings](/configure-a-data-generation-job/generation-and-validation/view-and-adjust-generation-settings.md) * [Large workloads](/overview/get-started/syntho-bootcamp/9.-large-workloads.md) {% endstep %} {% endstepper %}

--- # Agent Instructions This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com. ## Querying This Documentation If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question. Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter: ``` GET https://docs.syntho.ai/overview/get-started/use-cases-and-configuration.md?ask=&goal= ``` `ask` is the immediate question: it should be specific, self-contained, and written in natural language. `goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal. The response will contain a direct answer to the question and relevant excerpts and sources from the documentation. Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.