Skip to main content

DataCleansing

Use the DataCleansing gem to standardize data formats and address missing or null values in the data.

Parameters

ParameterDescription
Remove nulls from entire datasetRemoves any rows that contain null values.
This operates on all columns—not just those you selected to clean.
Select columns to cleanSpecifies the columns to apply data cleansing transformations to.
Replace null values in columnReplaces null values in selected columns with a specified default.
Example: 0 for numeric columns, empty string for text
Remove unwanted charactersRemoves specified characters from all values in the selected columns.
Example: remove whitespaces or punctuation
Modify caseConverts text in selected columns to a specified case format.
Example: lowercase, UPPERCASE, Title Case

Example

Assume you have a dataset that includes all entries from a feedback survey.

NameDateRatingFeedback
Ada2025-04-185I really enjoy the product
scott2025-04-185NULL
emma2025-04-172The product is confusing
NULL2025-04-173NULL

The following is one way to configure a DataCleansing gem for this table:

  1. Select columns to clean: Name
  2. Replace null values in column: Not provided
  3. Modify case: Title Case

Result

After the transformation, the table will look like:

NameDateRatingFeedback
Ada2025-04-185I really enjoy the product
Scott2025-04-185NULL
Emma2025-04-172The product is confusing
Not provided2025-04-173NULL