7. Workspace synchronization & validation


Last updated 22 days ago


Before generating synthetic data, Syntho performs automated checks to ensure your workspace aligns with the schemas of your source and destination databases. This ensures compatibility, preserves referential integrity, and prevents generation failures.


Source schema validation

Your source database schema may evolve over time: tables may be added, columns removed, or data types changed. Syntho automatically detects such changes during generation and warns you if your workspace is out of sync.

Automatically detected changes

Syntho will detect and synchronize:

  • Table-level changes: primary key updates, constraint modifications, table removal

  • Column-level changes: type changes, nullable status, constraints, or key assignments

Unresolved schema mismatches may cause the generation job to fail, so it is best practice to synchronize your workspace regularly.
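
The kind of drift described above can be illustrated with a minimal sketch that compares two schema snapshots. This is not Syntho's implementation or API: the `Column` type, the `diff_schemas` function, and the snapshot layout (`{table: {column: Column}}`) are hypothetical names chosen for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    """Hypothetical column description; not a Syntho type."""
    data_type: str
    nullable: bool = True
    primary_key: bool = False


def diff_schemas(workspace, source):
    """Compare two {table: {column: Column}} snapshots and list the drift."""
    changes = []
    # Table-level changes: tables present on only one side.
    for table in sorted(workspace.keys() - source.keys()):
        changes.append(f"table removed: {table}")
    for table in sorted(source.keys() - workspace.keys()):
        changes.append(f"table added: {table}")
    # Column-level changes for tables present on both sides.
    for table in sorted(workspace.keys() & source.keys()):
        old, new = workspace[table], source[table]
        for name in sorted(old.keys() - new.keys()):
            changes.append(f"column removed: {table}.{name}")
        for name in sorted(new.keys() - old.keys()):
            changes.append(f"column added: {table}.{name}")
        for name in sorted(old.keys() & new.keys()):
            # Dataclass equality covers type, nullability, and key assignment.
            if old[name] != new[name]:
                changes.append(f"column changed: {table}.{name}")
    return changes


# What the workspace last saw versus what the source looks like now.
workspace = {
    "customers": {
        "id": Column("integer", nullable=False, primary_key=True),
        "email": Column("varchar(255)"),
    },
    "orders": {"id": Column("integer", nullable=False, primary_key=True)},
}
source = {
    "customers": {"id": Column("bigint", nullable=False, primary_key=True)},
}

drift = diff_schemas(workspace, source)
for change in drift:
    print(change)
```

In this example the sketch reports a removed table, a removed column, and a changed column type, which are the categories of change listed above.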


Destination schema validation

Syntho also validates your destination schema to ensure it is ready to receive the generated synthetic data. Issues here can block the generation process or result in incorrect output.

Table-level issues (blocking)

  • Table does not exist

  • Primary or foreign key mismatch with source

Resolution

Ensure the table structure matches the source schema. Remove or align keys as needed.
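
If you want to check for these two issues yourself before running a job, the idea can be sketched as follows. The example uses two in-memory SQLite databases as stand-ins for the source and destination, and covers primary keys only, not foreign keys. The helper names (`table_names`, `primary_key`, `blocking_table_issues`) are hypothetical, and the introspection queries are SQLite-specific; other databases expose this metadata differently.

```python
import sqlite3


def table_names(conn):
    """Return the set of user tables in a SQLite database."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table'"
    ).fetchall()
    return {row[0] for row in rows}


def primary_key(conn, table):
    """Return the primary key column names of a table, in key order."""
    # PRAGMA table_info rows: (cid, name, type, notnull, dflt_value, pk);
    # pk is the 1-based position in the primary key, or 0 if not part of it.
    rows = conn.execute(f'PRAGMA table_info("{table}")').fetchall()
    return [row[1] for row in sorted(rows, key=lambda r: r[5]) if row[5] > 0]


def blocking_table_issues(source, destination):
    """List source tables that are missing or keyed differently downstream."""
    issues = []
    dest_tables = table_names(destination)
    for table in sorted(table_names(source)):
        if table not in dest_tables:
            issues.append(f"{table}: table does not exist in destination")
        elif primary_key(source, table) != primary_key(destination, table):
            issues.append(f"{table}: primary key differs from source")
    return issues


source = sqlite3.connect(":memory:")
source.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, email TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER);
""")

destination = sqlite3.connect(":memory:")
destination.executescript("""
    CREATE TABLE customers (id INTEGER, email TEXT PRIMARY KEY);
""")

issues = blocking_table_issues(source, destination)
for issue in issues:
    print(issue)
```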

Table-level warnings (non-blocking)

  • Table already contains data → Truncate the table in the destination

  • Table is part of a circular reference → Disable foreign key constraints temporarily
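
As a rough illustration of both resolutions, the sketch below empties two tables that reference each other while foreign key enforcement is temporarily switched off. It uses an in-memory SQLite database purely as a stand-in: SQLite has no `TRUNCATE` statement, so `DELETE FROM` plays that role here, and `PRAGMA foreign_keys` is SQLite-specific. Your destination database will have its own statements for truncating tables and disabling constraints. The table names and the `empty_tables` helper are hypothetical.

```python
import sqlite3

# isolation_level=None keeps the connection in autocommit mode; SQLite ignores
# PRAGMA foreign_keys while a transaction is open.
conn = sqlite3.connect(":memory:", isolation_level=None)

# Load two tables that reference each other, with enforcement off so the
# circular rows can be inserted, then switch enforcement on.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript("""
    CREATE TABLE employees (
        id INTEGER PRIMARY KEY,
        department_id INTEGER REFERENCES departments(id)
    );
    CREATE TABLE departments (
        id INTEGER PRIMARY KEY,
        manager_id INTEGER REFERENCES employees(id)
    );
    INSERT INTO employees VALUES (1, 1);
    INSERT INTO departments VALUES (1, 1);
""")
conn.execute("PRAGMA foreign_keys = ON")


def row_count(conn, table):
    return conn.execute(f'SELECT COUNT(*) FROM "{table}"').fetchone()[0]


def empty_tables(conn, tables):
    """Delete all rows with foreign key checks off, then re-enable them."""
    conn.execute("PRAGMA foreign_keys = OFF")
    try:
        for table in tables:
            conn.execute(f'DELETE FROM "{table}"')
    finally:
        # Re-enable enforcement even if a delete fails.
        conn.execute("PRAGMA foreign_keys = ON")


# With enforcement on, the circular reference blocks a plain delete.
try:
    conn.execute("DELETE FROM employees")
    blocked = False
except sqlite3.IntegrityError:
    blocked = True

empty_tables(conn, ["employees", "departments"])
remaining = [row_count(conn, t) for t in ("employees", "departments")]
fk_enabled = conn.execute("PRAGMA foreign_keys").fetchone()[0]
print(blocked, remaining, fk_enabled)
```

Re-enabling the constraints in a `finally` block keeps the "temporarily" in "disable temporarily" honest: enforcement comes back even when emptying a table fails.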

Column-level issues (blocking)

  • Missing columns

  • Extra non-nullable columns without defaults

  • Non-nullable columns not present in the source

Resolution

Adjust column definitions to match the source schema exactly.
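
A minimal sketch of the first two checks, assuming column metadata has already been read into plain dictionaries. `ColumnDef` and `blocking_column_issues` are hypothetical names for this example and do not correspond to a Syntho API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ColumnDef:
    """Hypothetical column metadata; not a Syntho type."""
    nullable: bool = True
    has_default: bool = False


def blocking_column_issues(source, destination):
    """Compare {column: ColumnDef} mappings for a single table."""
    issues = []
    # Columns the source has but the destination lacks.
    for name in sorted(source.keys() - destination.keys()):
        issues.append(f"missing column: {name}")
    # Destination-only columns that generated rows cannot fill.
    for name in sorted(destination.keys() - source.keys()):
        column = destination[name]
        if not column.nullable and not column.has_default:
            issues.append(f"extra non-nullable column without default: {name}")
    return issues


source = {
    "id": ColumnDef(nullable=False),
    "email": ColumnDef(),
}
destination = {
    "id": ColumnDef(nullable=False),
    "created_at": ColumnDef(nullable=False),                 # flagged
    "note": ColumnDef(),                                      # nullable, fine
    "status": ColumnDef(nullable=False, has_default=True),   # default, fine
}

issues = blocking_column_issues(source, destination)
for issue in issues:
    print(issue)
```

An extra destination column is only a problem in this sketch when nothing can populate it: a nullable column accepts NULL and a column with a default fills itself.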

Column-level warnings (non-blocking)

  • Data type mismatches

  • Unexpected uniqueness or check constraints

  • Missing default values

  • Column length or max value is lower than source

Resolution

Align data types, constraints, and value limits between source and destination columns.
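
Two of these warnings, data type mismatches and a destination length lower than the source, can be sketched as a simple comparison. `ColumnSpec` and `column_warnings` are hypothetical names, and the type strings are compared literally, whereas a real check would need to account for database-specific type aliases.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ColumnSpec:
    """Hypothetical column metadata; not a Syntho type."""
    data_type: str
    max_length: Optional[int] = None


def column_warnings(source, destination):
    """Compare columns that exist on both sides of a single table."""
    warnings = []
    for name in sorted(source.keys() & destination.keys()):
        src, dst = source[name], destination[name]
        if src.data_type != dst.data_type:
            warnings.append(
                f"{name}: type {dst.data_type} differs from source {src.data_type}"
            )
        # Only compare lengths when both sides declare one.
        if (
            src.max_length is not None
            and dst.max_length is not None
            and dst.max_length < src.max_length
        ):
            warnings.append(
                f"{name}: length {dst.max_length} is lower than source {src.max_length}"
            )
    return warnings


source = {
    "name": ColumnSpec("varchar", 255),
    "age": ColumnSpec("integer"),
}
destination = {
    "name": ColumnSpec("varchar", 100),
    "age": ColumnSpec("smallint"),
}

warnings = column_warnings(source, destination)
for warning in warnings:
    print(warning)
```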


Final step: generate

After resolving all validation issues and confirming schema synchronization, you can safely proceed to generate synthetic data.
