Get Started

This page describes how to get started with hydro, with a focus on installation.

Standalone Installation

PyPI

To install hydro, use pip:

pip install spark-hydro
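
After installing, it can be useful to confirm that the package is visible to the Python environment you intend to use. The following is a minimal sketch using only the standard library; the helper name `installed_version` is hypothetical and not part of hydro. It looks up the distribution by the same name passed to pip.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name="spark-hydro"):
    """Return the installed version of a distribution, or None if it is absent.

    `installed_version` is an illustrative helper, not a hydro API.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # The distribution is not installed in this environment.
        return None


found = installed_version()
if found is None:
    print("spark-hydro is not installed in this environment")
else:
    print(f"spark-hydro {found} is installed")
```

If the lookup reports that the package is missing, the pip command was most likely run against a different interpreter or virtual environment.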

To start an interactive PySpark shell with Delta Lake enabled:

pyspark --packages io.delta:delta-core_2.12:2.2.0 --conf "spark.sql.extensions=io.delta.sql.DeltaSparkSessionExtension" --conf "spark.sql.catalog.spark_catalog=org.apache.spark.sql.delta.catalog.DeltaCatalog"
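
For scripts or notebooks, the same settings can be applied when building a session in code instead of on the command line. This is a sketch under assumptions rather than hydro's documented method: it maps `--packages` to the `spark.jars.packages` property and reuses the two `--conf` values unchanged. The names `SPARK_CONF` and `build_session`, and the default application name, are illustrative.

```python
# Delta Lake package coordinates, copied from the pyspark command.
DELTA_PACKAGE = "io.delta:delta-core_2.12:2.2.0"

# The command-line options expressed as Spark configuration properties.
# Assumption: --packages corresponds to spark.jars.packages.
SPARK_CONF = {
    "spark.jars.packages": DELTA_PACKAGE,
    "spark.sql.extensions": "io.delta.sql.DeltaSparkSessionExtension",
    "spark.sql.catalog.spark_catalog": "org.apache.spark.sql.delta.catalog.DeltaCatalog",
}


def build_session(app_name="hydro-getting-started"):
    """Create (or reuse) a SparkSession with the Delta Lake settings applied."""
    # Imported inside the function so this module loads without pyspark.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName(app_name)
    for key, value in SPARK_CONF.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()
```

Calling `build_session()` requires pyspark to be installed and, on first use, network access so that Spark can download the Delta Lake package.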

For other methods, follow this guide.

Databricks

PyPI

To install hydro on a Databricks cluster, follow these directions.