# Databricks

<div data-full-width="false"><figure><img src="https://content.gitbook.com/content/U61B9DqtWCNO3Z30vnjh/blobs/uEtlxY85XzLR8diKyuKB/ChatGPT%20Image%20Sep%209,%202025,%2004_10_21%20PM.png" alt="" width="375"><figcaption></figcaption></figure></div>

Source and Destination Databases

{% hint style="info" %}
**Important**

This connector can only be used as a source database. The generated data can be written as Parquet files to [Local Filesystem](https://docs.syntho.ai/setup-workspaces/create-a-workspace/connect-to-a-database/local-filesystem), [Azure Data Lake Storage (ADLS)](https://docs.syntho.ai/setup-workspaces/create-a-workspace/connect-to-a-database/azure-data-lake-storage-adls), or [Amazon Simple Storage Service (S3)](https://docs.syntho.ai/setup-workspaces/create-a-workspace/connect-to-a-database/amazon-simple-storage-service-s3).
{% endhint %}

## Before you begin

Gather the following connection information:

* Server hostname and port number of the server that hosts the database you want to connect to
* Catalog name and name of the database you want to connect to
* HTTP path to the data source
* Personal Access Token

To find your cluster's server hostname and HTTP path, follow the instructions in [Construct the JDBC URL](https://docs.databricks.com/user-guide/bi/jdbc-odbc-bi.html#http-path) on the Databricks website.
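
As a rough illustration of how these values relate to one another, the sketch below assembles them into a JDBC URL in the personal-access-token form used by the Databricks JDBC driver (`AuthMech=3`, `UID=token`). The class and method names are hypothetical, and the hostname, HTTP path, and token are placeholders. Syntho asks for these values as separate fields, so this is only a way to sanity-check what you have gathered, not a description of how Syntho connects internally.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatabricksConnectionInfo:
    """Values gathered before creating the workspace (hypothetical helper)."""

    server_hostname: str
    http_path: str
    access_token: str
    port: int = 443

    def jdbc_url(self, mask_token: bool = True) -> str:
        # AuthMech=3 with UID=token is the personal-access-token form
        # of the Databricks JDBC URL. The token is masked by default so
        # the URL can be printed or logged safely.
        password = "****" if mask_token else self.access_token
        return (
            f"jdbc:databricks://{self.server_hostname}:{self.port};"
            f"httpPath={self.http_path};AuthMech=3;UID=token;PWD={password}"
        )


info = DatabricksConnectionInfo(
    server_hostname="adb-1111111111111111.0.azuredatabricks.net",
    http_path="sql/protocolv1/o/1234567890123456/0000-111111-demo123",
    access_token="dapi-example-token",  # placeholder, not a real token
)
print(info.jdbc_url())
```

If the printed URL looks malformed, for example because the hostname still includes `https://`, the same problem is likely to affect the values entered in Syntho.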

If you first need to load data into Databricks, see [Importing Data into Databricks](https://docs.syntho.ai/setup-workspaces/create-a-workspace/connect-to-a-database/databricks/importing-data-into-databricks).

## Connect and set up the workspace

1. Launch **Syntho** and select **Connect to a database** (or **Create workspace**).
2. Under **The connection details**, choose **Databricks** from the **Type** dropdown.
3. Fill in the required fields:
   * **Server hostname** → e.g. `adb-1111111111111111.0.azuredatabricks.net`
   * **Catalog name** → e.g. `demo_catalog`
   * **Database name** → e.g. `marketing_db`
   * **HTTP Path** → e.g. `sql/protocolv1/o/1234567890123456/0000-111111-demo123`
   * **Port number** → default is `443`
   * **Personal Access Token** → (See [Personal Access Tokens](https://docs.databricks.com/aws/en/dev-tools/auth/pat) on the **Databricks** website for information on access tokens.)
   * **Warehouse ID (optional)** → the ID of the SQL Warehouse to query through. Recommended whenever possible, because it is usually faster and more stable for large databases. If omitted, Syntho falls back to JDBC retrieval via the Spark driver.
4. Click **Create Workspace** to complete the setup.\
   If Syntho cannot connect, first verify that your credentials are correct. If the problem persists, your computer may be unable to locate the server. Contact your **network administrator** or **database administrator** for support.
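
Before contacting an administrator, it can help to check whether the server is reachable from your machine at all. The sketch below is a minimal check that uses only the Python standard library. The function name `can_reach` is hypothetical, and the check covers only DNS resolution and the TCP connection, not the HTTP path or the access token.

```python
import socket


def can_reach(hostname: str, port: int = 443, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to hostname:port can be opened."""
    try:
        # create_connection resolves the hostname and opens the socket.
        # DNS failures, refused connections, and timeouts all surface
        # as OSError (or a subclass of it).
        with socket.create_connection((hostname, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, `can_reach("adb-1111111111111111.0.azuredatabricks.net")` with your own hostname in place of the placeholder. A result of `False` points to a network, DNS, or firewall issue rather than a credentials problem.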

## Supported Databricks versions

The table below lists the supported Databricks versions and their corresponding Apache Spark versions.

| Databricks Version | Spark Version |
| ------------------ | ------------- |
| 16.2               | 3.5.0         |
| 15.4 LTS           | 3.5.0         |
| 14.3 LTS           | 3.5.0         |

**Note:** Version 13 is no longer supported.

## Supported data types

The following table shows, for each **Databricks** data type, which generator types currently support it.

| Data Type      | AI-powered Generation                                                                                                      | Mockers                                                                                                          | Mask                                                                                                          | Calculated Columns                                                                                                               |
| -------------- | -------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------- |
| INT            | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#discrete)    | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/numeric-integer) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/numeric-integer) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/calculated-columns#supported-data-types) |
| SMALLINT       | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#discrete)    | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/numeric-integer) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/numeric-integer) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/calculated-columns#supported-data-types) |
| TINYINT        | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#discrete)    | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/numeric-integer) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/numeric-integer) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/calculated-columns#supported-data-types) |
| BIGINT         | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#discrete)    | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/numeric-integer) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/numeric-integer) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/calculated-columns#supported-data-types) |
| DECIMAL        | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#continuous)  | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/numeric-decimal) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/numeric-decimal) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/calculated-columns#supported-data-types) |
| FLOAT          | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#continuous)  | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/numeric-decimal) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/numeric-decimal) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/calculated-columns#supported-data-types) |
| DOUBLE         | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#continuous)  | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/numeric-decimal) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/numeric-decimal) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/calculated-columns#supported-data-types) |
| STRING         | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#categorical) | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/text)            | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/text)            | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/calculated-columns#supported-data-types) |
| BINARY         | False                                                                                                                      | True\*                                                                                                           | True\*                                                                                                        | True\*                                                                                                                           |
| BOOLEAN        | False                                                                                                                      | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/other)           | False                                                                                                         | True\*                                                                                                                           |
| DATE           | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#datetime)    | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/datetime)        | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/datetime)        | False                                                                                                                            |
| TIMESTAMP      | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#datetime)    | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/datetime)        | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/datetime)        | False                                                                                                                            |
| TIMESTAMP\_NTZ | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/ai-powered-generation#datetime)    | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/datetime)        | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mask/datetime)        | False                                                                                                                            |
| ARRAY          | False                                                                                                                      | True\*                                                                                                           | True\*                                                                                                        | True\*                                                                                                                           |
| STRUCT         | False                                                                                                                      | [True](https://docs.syntho.ai/configure-a-data-generation-job/configure-column-settings/mockers/other#json)      | False                                                                                                         | False                                                                                                                            |
| MAP            | False                                                                                                                      | True\*                                                                                                           | True\*                                                                                                        | True\*                                                                                                                           |
| VARIANT        | False                                                                                                                      | True\*                                                                                                           | True\*                                                                                                        | True\*                                                                                                                           |
| OBJECT         | False                                                                                                                      | True\*                                                                                                           | True\*                                                                                                        | True\*                                                                                                                           |
| ENUM           | False                                                                                                                      | False                                                                                                            | False                                                                                                         | False                                                                                                                            |

{% hint style="info" %}
**\*** These data types are not actively supported. A generator may still show `True` for them, which means you can apply the generator even though the type is not actively supported. Duplication is fully supported for these data types.
{% endhint %}

## Limitations

* Enter catalog, database, and schema names in lowercase letters. Names that contain capital letters must also be entered in lowercase to ensure a proper connection.
* Schema, table, and column names containing single quotes (`'`) or backticks (\`) are not supported.
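
Both limitations can be checked before you create the workspace. The sketch below is one possible pre-check. The function name `name_problems` is hypothetical, and the sketch applies both rules to any name it is given, which is slightly stricter than the list above (the lowercase rule is stated for catalog, database, and schema names; the quote and backtick rule for schema, table, and column names).

```python
# Characters the Limitations section lists as unsupported in names.
UNSUPPORTED_CHARS = ("'", "`")


def name_problems(name: str) -> list[str]:
    """Return a list of problems found; an empty list means the name passes."""
    problems = []
    if name != name.lower():
        problems.append("contains capital letters; enter it in lowercase")
    for ch in UNSUPPORTED_CHARS:
        if ch in name:
            problems.append(f"contains unsupported character {ch!r}")
    return problems


for candidate in ("marketing_db", "Demo_Catalog", "customer's_orders"):
    issues = name_problems(candidate)
    print(candidate, "->", issues if issues else "ok")
```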
