October 2024
3.4.* (October 28, 2024)
- Prophecy Python libs version: 1.9.24
- Prophecy Scala libs version: 8.4.0
Features
Spark Copilot Enhancements
Limit data preview in interims: There is a new global level flag that admins can use to disable Data sampling for a given Fabric. This flag overrides the Pipeline level Data sampling settings. When disabled, you won't be able to see production data in the interims when you run the Pipeline.
Data sampling is enabled on by default. When left enabled, Data sampling uses the Pipeline's data sampling settings. Prophecy samples data during the interactive run experience to provide the best debugging experience for users.
For more information, see Interims.
SQL Copilot Enhancements
Data tests: We now support model and column data tests, which are dbt macro generated tests that can be parametrized and applied to a given model or any number of columns.
You can create a new data test definition to use in your model or column test, or you can use an out-of-the-box supported dbt Simple data test.
For more information, see Use model and column tests.
Variant type support: You can use Prophecy to convert your variant schemas into flat, structured formats to make them easier to understand and use for analytics.
This is available for when you want to determine the variant schema of your Snowflake array or object.
Using the variant schema functionality, you can do the following:
- Infer the variant schema
- Configure the parsing limit for inferring the column structure
- Use a nested column inside of the Visual Expression Builder
For more information, see Variant schema.
Flatten Schema Gem: When processing raw data it can be useful to flatten complex data types like
Struct
s andArray
s into simpler, flatter schemas. This Gem builds upon the variant type support by allowing you to preserve all schemas, and not just the first one.FlattenSchema works on Snowflake sources that have nested columns that you'd like to extract into a flat schema.
For more information, see FlattenSchema.
Deduplicate Gem: You can use the deduplicate Gem to remove rows with duplicate values of specified columns. There are four Row to keep options that you can use in your deduplicate Gem.
For more information, see Deduplicate.
Gem builder: You can add custom Gems to your SQL projects using the Gem builder. You can create custom source, target, and transformation Gems, and then publish them for your team to use.
Our SQL Gem builder supports Databricks and Snowflake SQL.
For more information, see Gem builder.
AI Copilot Enhancements
- Voice interface in Copilot conversations: You can now use audio to interact with Data Copilot. Copilot will transcribe your request and read aloud the text response shown in the chat.
You must use Google Chrome to use the voice interface.
- Enhanced Gem Modification for AI Copilot: We’ve upgraded the graph modification experience in AI Copilot. Now, you can specify particular gems for Copilot to focus on, allowing you to better control and limit the scope of changes. This added flexibility is especially valuable when working with large model graphs, making targeted edits more efficient and precise.
For example, here are a few ideas that you can base your prompts on:
- Propagate the customer ID through all the Gems
- Add a reformat Gem after this
- Join selected Gems on common columns
- Join this Gem with payment data
Added/updated Gems are highlighted in yellow.
Use previous prompts in chat with AI Copilot: Now you can revert to a previous prompt, and Copilot will return the result for that prompt. Go ahead and edit the previous prompt - AI Copilot will accept the edits and generate a suggestion accordingly.
Prompt suggestions: AI Copilot will now make suggested prompts in the Chat interface. Click a suggested prompt to view the resulting model graph.
Updated Explain dialog design: We've updated the Explain dialog design so that it's cleaner and easier to use.
Minor Improvements
Code diff on merge screen: You can view the code changes of your commit when you start the process to commit changes. This gives you granular change visibility so that you can understand the detailed changes being made to your Pipelines, Models, datasets, and other entities.
You can also use manual merge to resolve conflicts. This provides you with simple, yet effective ways to resolve merge conflicts for granular changes. For SQL, you can also toggle on Code Changes to view and edit the code directly on the files.
For more information, see the Git docs.
Databricks Volumes support: Prophecy now supports Databricks Volumes. When you run a Python or Scala Pipeline via a Job, you must bundle them as whl/jar artifacts. These artifacts must then be made accessible to the Databricks Job in order to use them as a library installed on the cluster.
You can designate a path to a Volume for uploading the whl/jar files on the Providers tab of your Databricks Fabric.
Prophecy Lib version: You can now find out what the current Prophecy Library version is on your clusters. Use the Cluster Attached dropdown to see the log for the current version in the Fabric connection logs.