Hive
Last updated
Was this helpful?
Last updated
Was this helpful?
Syntho Beta feature
Important
This connector can only be used as a source database. The generated data can be written to Azure Data Lake Storage (ADLS) or Amazon Simple Storage Service (S3) as Parquet files.
Before you begin, gather this connection information:
Name of the server that hosts the database you want to connect to and port number
User name and password
Are you connecting to an SSL server?
Launch Syntho and select Connect to a database, or under Create workspace, select Hive. For a complete list of data connections, select More under From database. Then do the following:
Enter the name of the server that hosts the database and the port number to use.
Optionally, enter the schema name.
Enter user name and password.
Select the Require SSL check box when connecting to an SSL server.
Select Create Workspace.
If Syntho can't make the connection, verify that your credentials are correct. If you still can't connect, your computer is having trouble locating the server. Contact your network administrator or database administrator.
Hive only
In Hive databases, source tables are often partitioned based on three columns treated as index columns. These columns are used for ordering in queries, but they do not always form unique composites.
TINYINT
discrete
SMALLINT
discrete
INT
discrete
BIGINT
discrete
FLOAT
continuous
DOUBLE
continuous
DECIMAL
continuous
TIMESTAMP
datetime
False
DATE
datetime
False
STRING
categorical
VARCHAR
categorical
CHAR
categorical
BOOLEAN
bool
False
False
False
BINARY
bytes
False
False
False
False
ARRAY
categorical
MAP
categorical
STRUCT
categorical
Data types that are labeled as "no active support" mean that they are not actively supported, however, you may still be able to apply generators (e.g., AI-powered generation, mask, mockers or calculated columns) to these columns. Duplication is fully supported for these types.
To address this, use the partitioning columns along with the additional columns specified through the "ORDER BY" dropdown. This approach ensures unique and consistent ordering, leveraging both the partitioning logic and user-defined columns. For more information, check in table settings.