Skip to main content

Enrich and Normalize Data

Clean Up Data Using Find/Replace And Other Bulk Operations

Oftentimes it is necessary to migrate data from one location to another. As part of this exercise, cleaning up the data is undertaken in order to create a “clean slate”. Conscia supports this by providing a toolbox of bulk operations that can be run on the data to perform various tasks such as find / replace, mass apply and type conversion.

alt_text .

To access the data operations:

  1. Navigate to the source collection and field of interest
  2. While hovering over the field header, click on the arrow to expose the inline menu
  3. From the menu, select TRANSFORM and the target operation.

Note that each field will display a different set of transformations (only those transformations applicable to the current data type are shown).

Data Normalization with Bulk-Apply Data Transformation Features

When you are pulling data from multiple data sources and/or is being managed by siloed departments and teams, it is common to have inconsistencies.

The following transformations are available to standardize and normalize your data through Control Center and can also be operationalized as data preparation recipes that can be applied on a schedule.

Enrichment FeatureDescription
Change CaseChanges the sentence case of values in the selected column and data set.
Concatenate FieldsCombines two columns into a new column with values separated by a user-specified delimiter.
Convert Array to StringConverts the values in an array to a delimited string.
Convert Date Field FormatConverts date values in a selected field to an alternative date format.
Convert Field to NumberChanges the type of a field by converting it from a text to number
Convert Field to TextConverts the type of a field to text
Remove Non-printable CharactersRemoves any non-printable characters found within the values in a field
Trim WhitespaceRemoves whitespaces in values of the selection column
Find and Replace (Single)Find a specified pattern in the data set and replaces it with the user defined value or pattern
Find and Replace (Multiple)Allows a user to reference a collection containing a series of find and replace values. When a value is found in the target column, it is replaced with the corresponding value from the lookup table.
Mass Apply (Single)Overwrites a field value with a user defined target value
Mass Apply (Multiple)Overwrites values in the target column. The user can define either to overwrite the target column with the mass applied values, or append the values to the end.
Mass RemoveRemoves a value from a selected field
Group Children RecordsAggregates child records under a parent record
Extract PatternTakes in a regular expression (regex) pattern and outputs a new column containing text which matches the user defined pattern. Each match is separated by a pipe (|) in the new column.
Extract Unique ValuesExtracts all unique values stored within a field. Output is stored as a separate collection.
Spell CheckChecks the spellings of words in a string column and suggest correct spellings.
Split Field (Single)Splits data into separate columns given a character. Contents of this column will be split on the given character and stored in a new single array column
Split Field (Multiple)Splits a value within the target field into multiple fields. The number of generated fields is based on the number of delimiters found.