Skip to main content

Join

SQL

Use the Join gem to combine data from two or more tables based on a shared column value. This helps you link related information, such as customer details and purchase history, or user activity logs and account records.

Input and Output

PortDescription
in0The first input table in the join.
in1The second input table in the join.
inNOptional: Additional input table for the join.
outA single table that results from the join operation(s).

To add additional input ports, click + next to Ports.

Parameters

To configure the Join gem, you need to define join conditions and select the columns that will appear in the output table.

Join conditions

You can add one or more join conditions to the gem depending on the number of input tables added.

ParametersDescription
Join typeThe different join types you can choose from. These may vary by SQL provider. Learn about different join types below.
Join conditionThe condition that matches rows between tables.
Custom Join

If you want to use a type of join that is available in your SQL warehouse, you can type the name of that join directly in Prophecy.

Expressions

ParametersDescription
ExpressionsDefines the output columns that will be returned by the gem. If left empty, Prophecy passes through all the input columns without any modifications.

Example

Assume you have two tables: orders and customers. You want the orders table to include customer information, so you need to join the tables based on customer ID. You only want to preserve records in the output that have a match. To do so:

  1. Connect orders to in0 and customers to in1.
  2. Choose Inner Join as the join type.
  3. If using a visual expression, use the following join condition: in0.CustomerID equals in1.customer_id
  4. If using the code expression, use the following SQL join condition: in0.CustomerID = in1.customer_id
  5. Leave the Expressions tile empty.
  6. Save and run the gem.

Join types

Suppose there are two tables, Employees and Departments, with the following contents:

Employees

EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_ID
1Alice10
2Bob20
3Charlie30
4DavidNULL
5Eve20

Departments

DEPARTMENT_IDDEPARTMENT_NAME
10HR
20Engineering
30Sales
40Marketing

INNER JOIN

Inner Join will return columns from both the tables and only the matching records as long as the condition is satisfied.

For example, if the Join condition provided was employees.department_id = departments.department_id, the sample query would be:

SELECT e.employee_id, e.employee_name, d.department_name
FROM employees e
INNER JOIN departments d
ON e.department_id = d.department_id;
EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_NAME
1AliceHR
2BobEngineering
5EveEngineering
3CharlieSales

LEFT JOIN / LEFT OUTER JOIN

Left Join (or Left Outer join) will return columns from both the tables and match records with records from the left table. The result-set will contain null for the rows for which there is no matching row on the right side.

For example, if the Join condition provided was employees.department_id = departments.department_id, the sample query would be:

SELECT e.employee_id, e.employee_name, d.department_name
FROM employees e
LEFT JOIN departments d
ON e.department_id = d.department_id;
EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_NAME
1AliceHR
2BobEngineering
3CharlieSales
4DavidNULL
5EveEngineering

RIGHT JOIN / RIGHT OUTER JOIN

Right Join (or Right Outer join) will return columns from both the tables and match records with records from the right table. The result-set will contain null for the rows for which there is no matching row on the left side.

For example, if the Join condition provided was employees.department_id = departments.department_id, the sample query would be:

SELECT e.employee_id, e.employee_name, d.department_name
FROM employees e
OUTER JOIN departments d
ON e.department_id = d.department_id;
EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_NAME
1AliceHR
2BobEngineering
5EveEngineering
3CharlieSales
NULLNULLMarketing

FULL JOIN / FULL OUTER JOIN

Full Outer Join will return columns from both the tables and matching records with records from the left table and records from the right table. The result-set will contain NULL values for the rows for which there is no matching.

For example, if the Join condition provided was employees.department_id = departments.department_id, the sample query would be:

SELECT e.employee_id, e.employee_name, d.department_name
FROM employees e
FULL OUTER JOIN departments d
ON e.department_id = d.department_id;
EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_NAME
1AliceHR
2BobEngineering
3CharlieSales
4DavidNULL
5EveEngineering
NULLNULLMarketing

CROSS JOIN

Returns the Cartesian product of two datasets. It combines all rows from both tables. Cross Join will not have any Join conditions specified.

For example, the sample query would be:

SELECT e.employee_id, e.employee_name, d.department_name
FROM employees e
CROSS JOIN departments d;
EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_NAME
1AliceHR
1AliceEngineering
1AliceSales
1AliceMarketing
2BobHR
2BobEngineering
2BobSales
2BobMarketing
3CharlieHR
3CharlieEngineering
3CharlieSales
3CharlieMarketing
4DavidHR
4DavidEngineering
4DavidSales
4DavidMarketing
5EveHR
5EveEngineering
5EveSales
5EveMarketing

NATURAL INNER JOIN

A natural join (or Natural Inner Join) is identical to an explicit Inner Join but it automatically joins columns with the same names in both tables. Natural Join will not have any join conditions specified.

For example, the sample query would be:

SELECT e.employee_id, e.employee_name, d.department_name
FROM employees e
CROSS JOIN departments d;
EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_NAME
1AliceHR
2BobEngineering
5EveEngineering
3CharlieSales

NATURAL LEFT OUTER JOIN

A natural Left Outer join (or Natural Left Join) is identical to an explicit Left Outer Join but it automatically joins columns with the same names in both tables. Natural Left Outer Join will not have any join conditions specified.

For example, the sample query would be:

SELECT e.employee_id, e.employee_name, d.department_name
FROM employees e
NATURAL LEFT OUTER JOIN departments d;
EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_NAME
1AliceHR
2BobEngineering
3CharlieSales
4DavidNULL
5EveEngineering

NATURAL RIGHT OUTER JOIN

A natural Left Right join (or Natural Right Join) is identical to an explicit Right Outer Join but it automatically joins columns with the same names in both tables. Natural Right Outer Join will not have any join conditions specified.

For example, the sample query would be:

SELECT e.employee_id, e.employee_name, d.department_name
FROM employees e
NATURAL RIGHT OUTER JOIN departments d;
EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_NAME
1AliceHR
2BobEngineering
5EveEngineering
3CharlieSales
NULLNULLMarketing

NATURAL FULL OUTER JOIN

A natural Full Outer join (or Natural Full Join) is identical to an explicit Full Outer Join but it automatically joins columns with the same names in both tables. Natural Full Outer Join will not have any join conditions specified.

For example, the sample query would be:

SELECT e.employee_id, e.employee_name, d.department_name
FROM employees e
NATURAL FULL OUTER JOIN departments d;
EMPLOYEE_IDEMPLOYEE_NAMEDEPARTMENT_NAME
1AliceHR
2BobEngineering
3CharlieSales
4DavidNULL
5EveEngineering
NULLNULLMarketing