Spark onboarding
To get started with Prophecy, you can use the in-product walkthrough for a SQL model or a Spark pipeline. This page describes the Spark onboarding option. You can return to the walkthrough at any time, even after you have completed it.
Get started
- Open the Prophecy homepage.
- Click Onboarding.
- Select Low-code Spark.
This opens up the visual pipeline canvas! As you begin, notice that Prophecy automatically attaches to a Spark cluster. We provide a default fabric (execution environment) for this onboarding project.
Follow the steps
In this onboarding quick start, follow the orange dot in the product interface to click through the tutorial.
To complete steps automatically, click Auto-connect Gems or Auto-configure the Gem. You can also select Skip to complete the entire pipeline design.
At a high level, this walkthrough helps you:
- Create multiple new CSV datasets using Source gems.
- Connect input datasets and define join conditions using a Join gem.
- Add a new column using a SQL expression in a Reformat gem.
- Use common aggregations like count and sum in an Aggregate gem.
- Save your transformed data using a Target gem.
- Understand the underlying Python code that is used when running the pipeline.
- Run a pipeline and review interim datasets.
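To make the flow of data through these steps concrete, here is a minimal sketch in plain Python using only the standard library. It is not the code Prophecy generates (the pipeline in the walkthrough runs on Spark). The dataset contents, column names, and function names below are all hypothetical and chosen only for illustration. Each function stands in for one gem, and each intermediate variable in `run_pipeline` plays the role of an interim dataset.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample data standing in for the CSV files read by two Source gems.
CUSTOMERS_CSV = """customer_id,first_name,last_name
1,Ada,Lovelace
2,Alan,Turing
"""

ORDERS_CSV = """order_id,customer_id,amount
100,1,25.50
101,1,10.00
102,2,7.25
"""


def read_source(text):
    """Source step: parse CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def join(customers, orders):
    """Join step: inner join of orders to customers on customer_id."""
    by_id = {c["customer_id"]: c for c in customers}
    return [
        {**by_id[o["customer_id"]], **o}
        for o in orders
        if o["customer_id"] in by_id
    ]


def reformat(rows):
    """Reformat step: add a full_name column, similar to a SQL concat expression."""
    return [{**r, "full_name": f'{r["first_name"]} {r["last_name"]}'} for r in rows]


def aggregate(rows):
    """Aggregate step: count orders and sum amounts per customer."""
    totals = defaultdict(lambda: {"order_count": 0, "total_amount": 0.0})
    for r in rows:
        t = totals[r["full_name"]]
        t["order_count"] += 1
        t["total_amount"] += float(r["amount"])
    return [{"full_name": name, **t} for name, t in totals.items()]


def write_target(rows):
    """Target step: serialize the result rows as CSV text."""
    out = io.StringIO()
    writer = csv.DictWriter(
        out,
        fieldnames=["full_name", "order_count", "total_amount"],
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


def run_pipeline():
    """Run every step in order; each variable is an interim dataset."""
    customers = read_source(CUSTOMERS_CSV)
    orders = read_source(ORDERS_CSV)
    joined = join(customers, orders)
    reformatted = reformat(joined)
    summary = aggregate(reformatted)
    return summary, write_target(summary)
```

Calling `run_pipeline()` returns the aggregated rows together with the CSV text a target would store. In the product, the same sequence is assembled visually on the canvas, and the equivalent logic runs on the attached Spark cluster.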
What's next?
The in-product walkthrough has helped familiarize you with the visual canvas. Next, you can create your own project and do more with your data!
To understand an entire project workflow, including connecting to execution environments and deploying pipelines, try one of our getting started tutorials.