Calculated columns


Last updated 16 days ago


Calculated columns allow you to define custom formulas that generate or transform data using logical, mathematical, statistical, or text-based operations. These formulas are applied after other generators, such as mockers or AI synthesis, so that all base data is available before the calculation is performed.

When to use

  • To clean or reformat data (e.g. trimming, date formatting)

  • To apply conditional logic (e.g. gender-based name generation)

  • To compute derived values from other columns (e.g. sales + tax)

  • To categorize or flag data based on specific criteria

When not to use

  • For straightforward mock data generation without dependencies

  • When no conditional logic is needed

  • If the column is already correctly populated or handled by simpler generators



Calculated columns follow a structured expression syntax combining functions, column references, constants, and mockers.

Example formulas

[Total Sales] + ([Total Sales] * [Tax Rate])
IF([Gender] = 'M', MOCK_FIRST_NAME, IF([Gender] = 'F', MOCK_FIRST_NAME_FEMALE, 'nothing'))
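
For readers more familiar with general-purpose code, the two formulas above can be read roughly as the following Python. This is only an illustrative sketch: the function names and the small name pools are hypothetical stand-ins for Syntho's mockers, not part of the product.

```python
import random

# Hypothetical name pools standing in for MOCK_FIRST_NAME and
# MOCK_FIRST_NAME_FEMALE; the real mockers use Syntho's own data.
FIRST_NAMES = ["Alex", "Sam", "Robin"]
FEMALE_FIRST_NAMES = ["Anna", "Maria", "Sofia"]


def total_with_tax(total_sales: float, tax_rate: float) -> float:
    """Mirrors: [Total Sales] + ([Total Sales] * [Tax Rate])"""
    return total_sales + (total_sales * tax_rate)


def name_for_gender(gender: str, rng: random.Random) -> str:
    """Mirrors the nested IF formula, evaluated for one row."""
    if gender == "M":
        return rng.choice(FIRST_NAMES)
    if gender == "F":
        return rng.choice(FEMALE_FIRST_NAMES)
    # Fallback branch of the inner IF.
    return "nothing"
```

The nested IF behaves like an if / else-if / else chain: the inner IF is evaluated only when the outer condition is false.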

Key syntax rules

  • Column reference: [ColumnName] for same-table columns

  • Functions: IF(), AND(), DATE(), etc.

  • Mockers: Use MOCK_FIRST_NAME, or MOCK_CONSISTENT_FIRST_NAME for consistent mapping

  • Operators: +, -, *, /, =, <>, <, >

  • Constants: strings ("text"), numbers (100), or dates (DATE(2020, 12, 31))

  • Avoid: Column names starting with _, which will cause formula errors
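
As a rough illustration of the column-reference and underscore rules above, the sketch below pulls [ColumnName] references out of a formula string and flags any that start with an underscore. It is a hypothetical helper for checking formulas before pasting them into Syntho, not something Syntho provides, and it assumes column names contain no square brackets.

```python
import re

# Matches [ColumnName]; assumes names themselves contain no brackets.
COLUMN_REF = re.compile(r"\[([^\[\]]+)\]")


def column_references(formula: str) -> list[str]:
    """Return every column name referenced in the formula, in order."""
    return COLUMN_REF.findall(formula)


def underscore_columns(formula: str) -> list[str]:
    """Return referenced column names that start with an underscore."""
    return [name for name in column_references(formula) if name.startswith("_")]


if __name__ == "__main__":
    print(column_references("[Total Sales] + ([Total Sales] * [Tax Rate])"))
    print(underscore_columns("[_internal_id] + [Amount]"))
```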


Using mockers in formulas

To insert mock data dynamically within formulas:

  • Type MOCK_ and choose from the autosuggest list

  • Use names like MOCK_FIRST_NAME, MOCK_COMPANY_EMAIL

  • For consistent mapping, use MOCK_CONSISTENT_FIRST_NAME, etc.
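
This page does not describe how consistent mapping works internally. Assuming it means that the same input value always produces the same mock value, the difference between a regular and a consistent mocker can be sketched as follows. The function names, the name pool, and the hash-based lookup are all illustrative assumptions, not Syntho's implementation.

```python
import hashlib
import random

# Hypothetical name pool; the real mockers use Syntho's own data.
FIRST_NAMES = ["Alex", "Sam", "Robin", "Jamie"]


def mock_first_name(rng: random.Random) -> str:
    """Regular mocker: a fresh random pick on every call."""
    return rng.choice(FIRST_NAMES)


def mock_consistent_first_name(original: str) -> str:
    """Consistent mocker (assumed behavior): the pick depends only on the
    original value, so equal inputs always map to the same mock value."""
    digest = hashlib.sha256(original.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(FIRST_NAMES)
    return FIRST_NAMES[index]
```

Under this assumption, a value that appears in several rows or tables would receive the same replacement everywhere, which is what makes a consistent mocker useful when values need to stay aligned.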



Calculated columns give you full control over how synthetic values are created or transformed. They’re ideal for applying custom logic while retaining flexibility in the data generation process.

Supported data types

Generator             Supported data types
Calculated Columns    Categorical, Discrete, Continuous

Syntax and structure