Quick Start
Follow the quickstart guide for the best practices on setting up Hydra — add realtime analytics to any project in less than a day.
Sign-up
-
To get started, please sign up at https://www.hydra.so/
-
Check your email. Once approved, select “Sign Up” below listed in the purple link
Hydra
-
Production Hydra databases are set to a Small plan with 2 Performance CPU & 8GB RAM with 50GB on-disk storage. These hardware specs and limits can be modified at any time. To modify before a database is created, select the “Scaling and Storage” drop down menu before finishing the “create a project” setup flow.
-
Development Hydra databases are 8 shared-CPU & 4GB RAM with 10GB of on-disk storage. Development databases must be upgraded to Hydra production plans to scale hardware specs and storage.
-
Hydra selects the latest supported version of Postgres by default. This setting can be modified during setup or at any point afterward.
-
Hydra selects US-East by default. If you’d like a different region for your Postgres database, please contact our support team.
-
Note, on-disk storage can not be scaled down to a smaller footprint once provisioned per instance.
Once Completed, your Postgres instance will be listed as “Running”, showing both the database specifications and the Postgres connection string.
Enable Analytics Engine
Hydra’s Analytics Engine is powered by duckdb and is enabled automatically when needed: for example, whenever you use DuckDB functions (such as read_csv
), when you query DuckDB tables, and when running COPY table TO 's3://...'
. However, if you want queries which only touch Postgres tables to use DuckDB execution you need to run SET duckdb.force_execution TO true
’. This feature is opt-in. To avoid doing that for every session, you can configure it for a certain user by doing ALTER USER my_analytics_user SET duckdb.force_execution TO true
.
Load Data into Hydra
The following setup instructions are for populating data into Hydra from an existing Postgres database. View the detailed guides for migrating data from Heroku, Render, AWS RDS, and self-managed Postgres. If you do not have an existing database, jump ahead to the next step or load sample data in “playing_with_hydra” example below.
Capture a backup file of your existing database. You will need to know the hostname, username, password, and database name of the database you wish to capture.
Restoring data into Hydra
Using the captured data backup, you can copy your data into Hydra using pg_restore
as follows. You will need the hostname, username, password, and database name of your Hydra database. You can find these on the Hydra dashboard.
Using Sample data
You can use the following queries in the SQL Editor tab. The queries creates a table, adds data, and retrieves the data from the table.
Run each statement by clicking the green triangle next to each query.
Connect Data Lake Storage
To enable a data lake, you can use Hydra’s globally-distributed data lake, AWS S3, GCP Cloud Storage, or Cloudflare R2. If you’d like Hydra to support any additional storage vendors, please contact our support team.
-
Connect to Postgres using your Hydra Postgres connection string.
-
Adding a secret to the
duckdb.secrets
table:
If you would like to try Hydra’s globally distributed storage beta, please contact our support team for setup.
Tutorial
With your Hydra Postgres database now setup, data loaded, and connected with Data Lake storage view our complete guide on Tutorial section.
Next Steps
Next, we recommend following the views documentation for best practices. Setting up views and establishing caching with Data Lake storage is a great way to pre-compute and avoid network latency between compute and storage layers.