Scheduling
Prophecy lets you schedule and automate your data pipeline runs. Once you have developed a pipeline, you can run it at specific frequencies using Prophecy's native scheduler.
Schedules define when the pipeline will run and whether to send alerts about the pipeline run. For a summary of the pipeline scheduling workflow, review the diagram below.
Parameters
The following table describes schedule parameters for a pipeline. Each pipeline has its own independent schedule.
Parameter | Description | Default |
---|---|---|
Frequency | How often the pipeline will run. | Daily |
Repeat at | The day and time at which the pipeline run repeats. Example: Repeat every Monday and Friday at 2:00 AM. | Varies depending on your chosen frequency |
Timezone | The time zone in which the Repeat at time is interpreted. | Your local time zone |
Alerts on the full job | Toggle that enables sending an email on the start, success, and/or failure of the pipeline run. | Off |
Enabled | Toggle that enables or disables the schedule. | Off |
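To make the interaction between Repeat at, Timezone, and Enabled concrete, here is a minimal Python sketch that computes the next run of a weekly schedule. The `WeeklySchedule` class and its fields are hypothetical names invented for illustration. They are not part of any Prophecy API, and the native scheduler may resolve run times differently (for example, around daylight saving transitions).

```python
from dataclasses import dataclass
from datetime import datetime, time, timedelta
from typing import FrozenSet, Optional
from zoneinfo import ZoneInfo


@dataclass(frozen=True)
class WeeklySchedule:
    """Hypothetical model of a weekly schedule; not a Prophecy API."""
    weekdays: FrozenSet[int]   # 0 = Monday ... 6 = Sunday
    repeat_at: time            # wall-clock time in `timezone`
    timezone: str              # IANA name, e.g. "America/New_York"
    enabled: bool = False      # mirrors the "Enabled" default of Off

    def next_run(self, now: datetime) -> Optional[datetime]:
        """Return the first run strictly after `now`, or None if disabled."""
        if not self.enabled or not self.weekdays:
            return None
        tz = ZoneInfo(self.timezone)
        local_now = now.astimezone(tz)
        # Check today plus the next seven days; daylight saving edge
        # cases are ignored in this sketch.
        for offset in range(8):
            day = local_now.date() + timedelta(days=offset)
            candidate = datetime.combine(day, self.repeat_at, tzinfo=tz)
            if day.weekday() in self.weekdays and candidate > local_now:
                return candidate
        return None


# "Repeat every Monday and Friday at 2:00 AM", interpreted in New York time.
schedule = WeeklySchedule(
    weekdays=frozenset({0, 4}),
    repeat_at=time(2, 0),
    timezone="America/New_York",
    enabled=True,
)
now = datetime(2024, 1, 3, 12, 0, tzinfo=ZoneInfo("UTC"))  # a Wednesday
print(schedule.next_run(now))
```

The same schedule with `enabled=False` returns `None`, which mirrors the table: a configured schedule does nothing until it is switched on.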
Schedule a pipeline
To run a pipeline on a recurring basis, you need to configure a schedule and publish your project. Let’s walk through both.
Configure the schedule
To create a new schedule:
- Open a pipeline in the project editor.
- Expand the Options (ellipsis) menu in the project header.
- Click Schedule.
- Fill in the schedule parameters, starting with the frequency.
- Enable the schedule using the toggle in the top right.
- Click Schedule to save the schedule.
Enable the schedule
After configuring the schedule, complete the following steps to ensure the schedule actually runs:
- Save your project as a draft.
- Publish your project.
Why do you need to publish the project? Publishing the project performs two key actions.
Defines the execution environment (fabric)
When you publish a project, you select one or more fabrics that serve as the execution environments for deployed projects. For each selected fabric, a separate deployment is created. Pipeline schedules are only enabled for deployed projects.
If you do not select any fabrics during project publication, no deployments will be created. As a result, no scheduled executions will occur, even if a schedule has been configured.
Determines the pipeline version for execution
Publication determines which version of the pipeline runs during a scheduled execution. When a new project version is published, scheduled executions for the associated fabric(s) automatically begin using the updated pipeline version. Each deployment is isolated per fabric: publishing a new version to one fabric does not update deployments on other fabrics.
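The per-fabric behavior described above can be pictured with a small Python model. This is only an illustrative sketch: the `Project` class, its methods, and the fabric names `dev` and `prod` are hypothetical, and nothing here reflects how Prophecy implements deployments internally.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Project:
    """Hypothetical model of per-fabric deployments; not a Prophecy API."""
    name: str
    # Maps fabric name -> project version deployed on that fabric.
    deployments: Dict[str, str] = field(default_factory=dict)

    def publish(self, version: str, fabrics: List[str]) -> None:
        """Create or update one deployment for each selected fabric."""
        for fabric in fabrics:
            self.deployments[fabric] = version

    def scheduled_version(self, fabric: str) -> Optional[str]:
        """Version a scheduled run would use, or None if nothing is deployed."""
        return self.deployments.get(fabric)


project = Project("sales_reporting")

# Publishing without selecting a fabric creates no deployment,
# so no scheduled execution can occur.
project.publish("v1", fabrics=[])
print(project.deployments)

# Publishing to two fabrics creates two separate deployments.
project.publish("v1", fabrics=["dev", "prod"])

# Publishing v2 to dev only leaves the prod deployment on v1.
project.publish("v2", fabrics=["dev"])
print(project.scheduled_version("dev"), project.scheduled_version("prod"))
```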
Disable a schedule
To stop scheduled pipeline runs:
- Open the pipeline schedule.
- Use the toggle to disable the schedule.
- Click Schedule to save.
- Save your project as a draft.
- Publish the project.
Disabling a schedule also requires publishing your project. Because schedule status is part of the deployment configuration, the change does not take effect until the next project version is published.
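One way to reason about this is to treat the toggle as an edit to a draft, and publication as the step that copies the draft into the deployment. The Python sketch below models only that idea. The `PipelineSchedule` class and its attribute names are hypothetical and are not drawn from Prophecy itself.

```python
from dataclasses import dataclass


@dataclass
class PipelineSchedule:
    """Hypothetical draft-versus-deployed model; not a Prophecy API."""
    draft_enabled: bool = False     # state saved in the schedule dialog
    deployed_enabled: bool = False  # state the scheduler acts on

    def set_enabled(self, enabled: bool) -> None:
        """Flip the toggle and save; this changes the draft only."""
        self.draft_enabled = enabled

    def publish(self) -> None:
        """Publishing carries the draft state into the deployment."""
        self.deployed_enabled = self.draft_enabled

    def will_run(self) -> bool:
        """Whether scheduled executions occur for the deployed project."""
        return self.deployed_enabled


schedule = PipelineSchedule()
schedule.set_enabled(True)
schedule.publish()

# Disabling without publishing leaves the deployed schedule running.
schedule.set_enabled(False)
print(schedule.will_run())

# After publishing, the schedule stops.
schedule.publish()
print(schedule.will_run())
```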
Monitor scheduled pipelines
You and your team members might have many scheduled pipelines in your Prophecy environment. To see a list of deployed projects, scheduled pipelines, and pipeline run history, open the Monitoring page in Prophecy.
External schedulers
By default, SQL projects leverage a Prophecy-native scheduler to automate pipeline runs. While we recommend using the Prophecy scheduler, you can also use external schedulers like Databricks Jobs or Apache Airflow if preferred. Projects that leverage the Simple Git Storage Model are not compatible with external schedulers. To learn more about external schedulers, visit Orchestration.