Remove Duplicates step

# What to use the Remove Duplicates step for

Use this step to Remove duplicates from the specified data. This can be limited to particular columns. This step can be used when importing data from Google Sheets or CSV files as well as scraping data.

You can use this step to:

  • Filter scrasped data for duplicates
  • Clean up data stored in CSV files
  • Remove duplicates from Google Sheet

# How to configure the Remove Duplicates step

# Data

Select the data to deduplicate.

# Columns to check

Specify a list of column numbers, each separated with a comma. Only these columns will be checked for duplicates.

For example, entering 'A,B' here will check for duplicates in columns A and B only.

# Output

The step outputs a preview of the deduplicated data.

Configuration settings

    Step type

    Manipulate data