Skip to main content

107 docs tagged with "gems"

View all tags

Aggregate gem

Group data and apply aggregation methods or pivot operations

Avro

Parameters and properties to read from and write to Avro files

BigQuery

Parameters and properties to read from and write to the BigQuery warehouse

Buffer gem

Expand or contracts the boundaries of a polygon or line

CosmosDB

Parameters and properties to read from and write to the CosmosDB warehouse

CountRecords

Returns one integer that represents the count of records in the input dataset

CreatePoint gem

Create geographic points with longitude and latitude coordinates

CSV

Parameters and properties to read from and write to CSV files

DataCleansing gem

Standardize data formats and address missing or null values in the data

Delta

Parameters and properties to read from and write to Delta files

Delta Table

Read from or write to tables managed by a Delta table metastore

Directory gem

Return a listing of all the files in a specified directory

DynamicInput

Run SQL queries that update dynamically at runtime

DynamicSelect gem

Dynamically filter columns of your dataset based on a set of conditions

DynamicSelect gem

Dynamically filter columns of your dataset based on a set of conditions

Email gem

Send your pipeline output tables to others via email

EmailData gem

Send data from your Spark pipeline to others by email

Filter gem

Filter your data based on a custom filter condition

Fixed Format

Parameters and properties to read from and write to Fixed Format files

Functions

Build functions with SQL macros to be used in gem expressions

Gems

Power your pipelines with gems

Gems

Transform your data with Prophecy gems

Heatmap gem

Generate spatial heatmaps from geo point data using hexagons

Hive Table

Read from or write to tables managed by a Hive metastore

Iceberg

Read from or write to tables managed by Iceberg

JDBC

Parameters and properties to read from and write to the JDBC warehouse

Join

Join two or more datasets

Join gem

Join one or more DataFrames on conditions

JSON

Parameters and properties to read from and write to JSON files

Kafka

Parameters and properties to read from and write to Kafka files

Limit gem

Limit the number of columns processed

MongoDB

Parameters and properties to read from and write to the MongoDB warehouse.

ORC

Parameters and properties to read from and write to ORC files

Parquet

Parameters and properties to read from and write to Parquet files

Pivot gem

Convert your table from long to wide format

PolyBuild gem

Create a polygon or polyline from a set of coordinates

Redshift

Parameters and properties to read from and write to the Redshift warehouse.

Reformat gem

Use expressions to reformat column names and values

Reformat gem

Select one or more columns or values using expressions and functions

RestAPIEnrich gem

Enrich DataFrame with content from rest API response based on configuration

SampleRows gem

Sample records by choosing a specific number or percentage of records

Script gem

Leverage a Python script in your pipeline

Seed

Parameters and properties to read from Seed files

Simplify gem

Decrease the number of nodes that make up a polygon or polyline

Smartsheet

Use data from Smartsheet in your Spark pipeline

Snowflake

Parameters and properties to read from and write to the Snowflake warehouse.

Tableau gem

Send data from your Spark pipeline to Tableau

Text

Parameters and properties to read from and write to Text file

ToDo

Create a placeholder gem in your pipeline

Unpivot gem

Use the Unpivot gem to transform your data from a wide format to a long format

Upload files

Learn how to upload files to your Spark pipeline

XLSX (Excel)

Parameters and properties to read from and write too XLSX (Excel) files

XML

Parameters and properties to read from and write to XML files