Skip to content

Commit

Permalink
Update Getting started.md
Browse files Browse the repository at this point in the history
  • Loading branch information
ghislainfourny authored Oct 24, 2024
1 parent 0e78311 commit c1bd37e
Showing 1 changed file with 6 additions and 4 deletions.
10 changes: 6 additions & 4 deletions docs/Getting started.md
Original file line number Diff line number Diff line change
Expand Up @@ -78,11 +78,11 @@ If you use Linux, Florian Kellner also kindly contributed an [installation scrip

RumbleDB requires an Apache Spark installation on Linux, Mac or Windows.

It is straightforward to directly [download it](https://spark.apache.org/downloads.html), unpack it and put it at a location of your choosing. We recommend to pick Spark 3.2.2. Let us call this location SPARK_HOME (it is a good idea, in fact to also define an environment variable SPARK_HOME pointing to the absolute path of this location).
It is straightforward to directly [download it](https://spark.apache.org/downloads.html), unpack it and put it at a location of your choosing. We recommend to pick Spark 3.4.3. Let us call this location SPARK_HOME (it is a good idea, in fact to also define an environment variable SPARK_HOME pointing to the absolute path of this location).

What you need to do then is to add the subdirectory "bin" within the unpacked directory to the PATH variable. On macOS this is done by adding

export SPARK_HOME=/path/to/spark-3.2.2-bin-hadoop3.2
export SPARK_HOME=/path/to/spark-3.4.3-bin-hadoop3.2
export PATH=$SPARK_HOME/bin:$PATH

(with SPARK_HOME appropriately set to match your unzipped Spark directory) to the file .zshrc in your home directory, then making sure to force the change with
Expand Down Expand Up @@ -111,9 +111,11 @@ Like Spark, RumbleDB is just a download and no installation is required.

In order to run RumbleDB, you simply need to download one of the small .jar files from the [download page](https://github.com/RumbleDB/rumble/releases) and put it in a directory of your choice, for example, right besides your data.

If you use Spark 3.2+, use rumbledb-1.22.0-for-spark-3.2.jar.
If you use Spark 3.4+, use rumbledb-1.22.0-for-spark-3.4.jar.

If you use Spark 3.3+, use rumbledb-1.22.0-for-spark-3.3.jar.
If you use Spark 3.5+, use rumbledb-1.22.0-for-spark-3.5.jar.

If you use Spark 4.0+ (preview), use rumbledb-1.22.0-for-spark-4.0.jar.

These jars do not embed Spark, since you chose to set it up separately. They will work with your Spark installation with the spark-submit command.

Expand Down

0 comments on commit c1bd37e

Please sign in to comment.