
Data Cleaning 101: 5 steps to start cleaning your data

If you’ve been following our series so far, you will have learned all about why you should care about data cleansing. Did you know that bad data can cost companies up to a quarter of their revenue? So, cleaning bad data is paramount for good data practice. But for more reality checks, catch up on the first instalment: why should you care about data cleansing? 

Cleaning bad data can be tricky

This article presents 5 essential steps for you to better understand your org, your data, and how to start the data cleansing process. By following these steps, you can go forward and make well-informed decisions for your company. You may even discover some quick wins along the way.

Follow through these 5 steps to begin your journey to cleaner data.

1. Ask yourself: why do I need this data?

It may sound broad, but knowing what you want to get from your data is vital. So take a step back and look at your end goal. Are you collecting all of the information you need? Or maybe you’re collecting too much? After all, there’s no point collecting data if you’re never going to use it. One less field you’re capturing is one less field you need to maintain!

Start by creating and establishing a plan outlining what clean data means for you and your org. You can’t do anything until you understand your org and users. 

For some prompts, start here:

  • What data do I need to achieve my business goals?
  • Am I capturing everything I need to succeed?
  • Am I collecting some data for the sake of collecting data?
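To make the last prompt concrete, here is a minimal sketch of how you might measure how often each field is actually filled in. It assumes your records have been exported as a list of dictionaries; the field names and the 25% threshold are hypothetical and only there for illustration.

```python
from collections import Counter

def field_fill_rates(records):
    """Return the share of records in which each field holds a non-empty value."""
    if not records:
        return {}
    filled = Counter()
    fields = set()
    for record in records:
        for name, value in record.items():
            fields.add(name)
            # Treat None and blank strings as "not captured".
            if value is not None and str(value).strip() != "":
                filled[name] += 1
    return {name: filled[name] / len(records) for name in sorted(fields)}

# Hypothetical sample records; the field names are made up for illustration.
sample = [
    {"Email": "a@example.com", "Fax": "", "Industry": "Retail"},
    {"Email": "b@example.com", "Fax": None, "Industry": ""},
    {"Email": "", "Fax": None, "Industry": "Energy"},
    {"Email": "d@example.com", "Fax": "", "Industry": "Retail"},
]

rates = field_fill_rates(sample)
# Fields filled in fewer than a quarter of records are candidates for retiring.
rarely_used = [name for name, rate in rates.items() if rate < 0.25]
print(rates)
print(rarely_used)
```

A field that is almost never populated is a good starting point for the "do I need this?" conversation, although a low fill rate alone doesn't prove the field is unnecessary.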

2. Examine where your data is coming from

It’s time to identify all the different ways that data is entering your org. It’s likely that when you dig deeper, there will be more entry points than you think!

A few entry points to consider:

  • service agents
  • sales reps
  • partner users
  • web-to-lead forms
  • integrations

Once you’ve made your list and found the holes, you can do something about them!
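One way to find the holes is to compare how much bad data each entry point lets in. The sketch below assumes each exported record carries a `source` key naming its entry point, and uses a deliberately simple, hypothetical quality rule (a lead with no email counts as bad); swap in whatever "bad" means for your org.

```python
from collections import defaultdict

def bad_record_share_by_source(records, is_bad):
    """Group records by their 'source' key and return the share flagged as bad."""
    totals = defaultdict(int)
    bad = defaultdict(int)
    for record in records:
        source = record.get("source", "unknown")
        totals[source] += 1
        if is_bad(record):
            bad[source] += 1
    return {source: bad[source] / totals[source] for source in totals}

def missing_email(record):
    # Hypothetical quality rule: a lead without an email counts as bad.
    return not (record.get("email") or "").strip()

# Made-up sample data for illustration.
leads = [
    {"source": "web-to-lead", "email": ""},
    {"source": "web-to-lead", "email": "x@example.com"},
    {"source": "sales rep", "email": "y@example.com"},
    {"source": "integration", "email": None},
]

shares = bad_record_share_by_source(leads, missing_email)
# Entry points ordered from leakiest to cleanest.
worst_first = sorted(shares, key=shares.get, reverse=True)
print(worst_first)
```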

3. Plug the holes to clean the data

I saw a great analogy on Reddit:

“I think of improving data quality as a sinking boat. If you are sinking, you need to plug the holes first (sources of bad data) and then start bailing out the water (getting rid of the bad data) second.”

(lziemke)

Simply put, if data enters your org clean, this saves you from battling with your records later on. This should be the initial focus. Thankfully there are lots of useful ways to make this a reality with minimal effort that don’t impede the end user experience.

Where possible, think of picklists, validation rules, or even consider managed packages available from the AppExchange.
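To illustrate the idea behind picklists and validation rules, here is a rough sketch in Python of checking a record before it is saved. This is not Salesforce validation rule syntax; the picklist values, field names and rules are all assumptions chosen for the example.

```python
import re

# Hypothetical picklist values and rules; real orgs will define their own.
ALLOWED_COUNTRIES = {"United Kingdom", "United States", "France"}
POSTCODE_REQUIRED_FOR = {"United Kingdom", "United States"}
EMAIL_PATTERN = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")

def validate_lead(record):
    """Return a list of human-readable problems; an empty list means the record passes."""
    problems = []
    country = record.get("country") or ""
    # Picklist-style check: only known values are accepted.
    if country not in ALLOWED_COUNTRIES:
        problems.append(f"Country '{country}' is not an allowed picklist value")
    # Validation-rule-style check: one field is required depending on another.
    if country in POSTCODE_REQUIRED_FOR and not record.get("postal_code"):
        problems.append("Postal code is required for this country")
    if not EMAIL_PATTERN.match(record.get("email") or ""):
        problems.append("Email is missing or malformed")
    return problems

clean = {"country": "United Kingdom", "postal_code": "RG1 1AA", "email": "jo@example.com"}
dirty = {"country": "UK", "postal_code": "", "email": "jo@example"}

print(validate_lead(clean))
print(validate_lead(dirty))
```

The point of the sketch is the order of events: the record is rejected with a clear message at the point of entry, so nobody has to hunt for the problem later.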

4. Try the AppExchange!

Solutions on the AppExchange can be a cost-effective and time-efficient way to help you combat unclean data. Managed package providers are often experts in their field, so finding a ready-to-go data management solution will help you clean your data in no time!

So how do you choose the right app for you? This isn’t an exhaustive list, but here is a checklist of data cleansing questions to ask managed package providers:

  1. Does the solution automatically resolve data quality issues?
  2. Does the solution require changes to the user experience?
  3. Does the solution clean all my data entry points?
  4. Does the solution clean up pre-existing data?
  5. How long does it take to implement the solution?

After all, don’t reinvent the wheel when it’s already turning – if you have an issue, it’s likely that hundreds of others do too!

5. Start building good data habits for cleaner data

So you’ve taken a good look at your business, identified where your data is coming from, and even potentially put some actions in place to begin cleaning up your org. Now it’s time to maintain this good practice as your org grows and share your wisdom.

Here are a few ways you can build good data habits in your organization:

  • Get users involved and trained in what to look out for
  • Identify entry points that are inputting bad data
  • Schedule regular manual checks for your data 
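A regular check doesn’t have to be entirely manual. As one possible starting point, the sketch below flags records that look like duplicates once case and stray whitespace are ignored. The record layout and the choice of `name` and `city` as matching fields are assumptions for illustration, and exact matching on normalised values will miss fuzzier duplicates.

```python
from collections import defaultdict

def normalise(value):
    """Lower-case and collapse whitespace so trivially different values compare equal."""
    return " ".join(str(value or "").lower().split())

def find_possible_duplicates(records, key_fields):
    """Group record ids whose normalised key fields match exactly."""
    groups = defaultdict(list)
    for record in records:
        key = tuple(normalise(record.get(field)) for field in key_fields)
        groups[key].append(record["id"])
    # Only groups with more than one record are worth a human look.
    return [ids for ids in groups.values() if len(ids) > 1]

# Made-up sample data for illustration.
accounts = [
    {"id": 1, "name": "Acme Ltd", "city": "Reading"},
    {"id": 2, "name": "  acme  ltd", "city": "reading "},
    {"id": 3, "name": "Globex", "city": "London"},
]

duplicates = find_possible_duplicates(accounts, ["name", "city"])
print(duplicates)
```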

Operations Director, Beth, is our resident data cleaner and a strong believer in the power of good data practice:

“There is nothing more satisfying than having a good old spring clean in Salesforce! It’s great to identify trends and get to the root cause of how bad data is able to get in. Popping in a scheduled time every two weeks is the way I’ve found works best. Don’t let it build up and be a task you keep on putting off!”

Beth Clements, ProvenWorks

We live in a ‘need it done yesterday’ society so safeguarding your org against the perils of bad data is a sure way to give you back your time and let you focus on the things that matter.

Remember: the best day to start cleaning your data was yesterday. The second best day is today.

Life’s better after cleaning bad data from your org.

Stay tuned for the final instalment of our data blog series, where you’ll learn about some of our favourite cleaning apps. See you back here soon!

Data Cleaning 101: Why should you care about data cleansing?

Unclean data stinks up your system. It can travel through your business, slowing you down, causing confusion and costing you money. Don’t believe us? Between 2019 and 2021, over 40% of sales reps did not have enough information about leads and accounts to make effective sales, and there are plenty more stats around data cleansing coming up.

This is the first in our three-part series taking you through the importance of data cleansing and some top tips for practising clean data habits… so make sure to stick around!

An introduction to data cleansing

Bad data… Does it really matter? How do I clean it? And what can I do about it in the future?

Don’t worry, we have you covered with the ‘whats’, ‘whys’ and ‘hows’ of dirty data throughout this series. First off, a few definitions…

Clean data: Data that is without error and in its entirety so it can be used effectively by everyone.

Data cleansing: The process of identifying incomplete, incorrect, inaccurate or irrelevant data and modifying, validating, deleting or replacing it.

So how does unclean data actually affect me?

Over the past 6 years Salesforce and Salesforce bloggers have noted the persistent and holistic impact that bad data has on companies of all sizes. Let’s take a look at some numbers:

As we’ve already discussed, between 2019 and 2021, over 40% of sales reps did not have enough information about leads and accounts to make effective sales, and therefore struggled to achieve monthly and annual sales targets. This inevitably affects generated revenue.

Why is data cleansing important?

  1. Ensure consistency, validity and confidence in your data by cleansing it regularly. This foundation can yield better results and help reach goals quicker.
  2. Reduce the time employees spend sorting through bad data and let users be more productive on the tasks that matter – like growing your business or organization!
  3. Eradicate data privacy worries for customers and clients. It’s outright good practice; you wouldn’t want your private mail sent to an incorrect address!
  4. Protect your company’s reputation. Saving bad or incomplete information affects all areas of your company, from sales and marketing to the senior management, stopping everyone from carrying out their job well and making beneficial decisions, which directly affects the reputation of your business and its revenue.

The good news

All is not lost! The University of Texas has estimated that making the data you enter just 10% more accurate could considerably boost your revenue, not only for bigger enterprises but also for B2B and B2C firms.

So now you know what data cleansing is, how do you practise it?

Watch out for the next post in Data Cleaning 101 for our five best tips for cleansing your Salesforce org. You don’t want to miss that one!

Want to learn more about the impact of poor data?

Check out our blog: What is the impact of poor quality data? 

Can I import data into Salesforce without being an Administrator?

Yes, you can!

SimpleImport by ProvenWorks for Salesforce CRM allows you to import data from Excel into Salesforce without being an admin user. Administrators can choose to remain in control over the fields and objects users can import into by giving a non-admin user a license to SimpleImport with restrictions in place. These restrictions will allow the admin to specify the exact objects the user will have access to import into. See below, we’ve restricted this sales user to only be able to import into the Lead object.



Want to take it a step further and restrict the fields users can import into as well? Take a look at our guided import wizard functionality which gives users a drag and drop component with your predefined object and field mappings. This means that if a user were to try to import into a field that wasn’t configured in the mapping, the import would fail. You can open up importing to non-admin users with the assurance that they can only import into objects and fields specified by the admin.
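The idea of a predefined mapping can be pictured with a short sketch. To be clear, this is not how SimpleImport is implemented; it is a hypothetical Python illustration in which `LEAD_MAPPING`, `UnmappedColumnError` and `map_row` are invented names, showing why an import fails when a file contains a column the admin has not mapped.

```python
# Hypothetical admin-defined mapping: spreadsheet column -> target field on Lead.
LEAD_MAPPING = {
    "First Name": "FirstName",
    "Last Name": "LastName",
    "Company": "Company",
    "Email": "Email",
}

class UnmappedColumnError(ValueError):
    """Raised when an import file contains a column the admin has not mapped."""

def map_row(row, mapping):
    """Translate one spreadsheet row into target field names, rejecting unmapped columns."""
    unmapped = sorted(set(row) - set(mapping))
    if unmapped:
        # Fail the whole row rather than silently dropping or guessing fields.
        raise UnmappedColumnError(
            f"Columns not configured for import: {', '.join(unmapped)}"
        )
    return {mapping[column]: value for column, value in row.items()}

print(map_row({"First Name": "Jo", "Company": "Acme"}, LEAD_MAPPING))
```

Failing loudly on an unexpected column is the design choice that gives the admin the assurance described above: nothing reaches a field that wasn’t explicitly allowed.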

Community and Portal User Imports

We understand that you don’t want community and portal users to be able to import into any object and fields in Salesforce, so our Managed Import component can also be used in a Salesforce Community. This is perfect for recurring imports like product and sales data, or deal registration.


Where to find SimpleImport

Install SimpleImport from the Salesforce AppExchange



Book a free demo

Arrange a free demo of SimpleImport to see it in action.

AddressTools: address verification statuses

AddressTools enables Salesforce users to verify address data through a number of means such as Scheduled Batch Address Verification, the Address Verification Flow Component and at point of entry with the package’s core functionality!

Six statuses are returned by the address verification processes. Their values and descriptions are listed below:

  1. Verified – Address verified without change
  2. Corrected – Address verified with corrections
  3. Ambiguous – Address values match multiple verified addresses
  4. Parsed but not found – Address values partially matched to a verified address
  5. Not Matched – Address values cannot be matched or parsed
  6. Not Checked – Address has not been verified
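If you work with these statuses in your own reporting or scripts, it can help to treat them as a fixed set of values. The sketch below models the six statuses listed above as a Python enum; the triage policy (which statuses need a human review) is an assumption for illustration and not part of AddressTools.

```python
from enum import Enum

class AddressStatus(Enum):
    """The six statuses listed above; values mirror the labels in the list."""
    VERIFIED = "Verified"
    CORRECTED = "Corrected"
    AMBIGUOUS = "Ambiguous"
    PARSED_BUT_NOT_FOUND = "Parsed but not found"
    NOT_MATCHED = "Not Matched"
    NOT_CHECKED = "Not Checked"

# Assumed triage policy for illustration only; adjust to your own process.
NEEDS_REVIEW = {
    AddressStatus.AMBIGUOUS,
    AddressStatus.PARSED_BUT_NOT_FOUND,
    AddressStatus.NOT_MATCHED,
}

def triage(status_label):
    """Map a status label to a suggested next step: 'verify', 'review' or 'ok'."""
    status = AddressStatus(status_label)  # raises ValueError for unknown labels
    if status is AddressStatus.NOT_CHECKED:
        return "verify"
    if status in NEEDS_REVIEW:
        return "review"
    return "ok"

print(triage("Corrected"))
print(triage("Ambiguous"))
```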

If you’re looking to verify your address data in Salesforce, get in contact with info@provenworks.com to see how we can help!

How to: set up automated international address verification via Process Builder

Before getting started:

  • Requires an active verification credits subscription in your organization.
  • The address block requiring verification must be configured in the AddressTools Administration Address Blocks.
  • This feature is only suitable for low-volume data upsertion due to dependencies on ‘future methods’. Check out Scheduled Address Verification for high-volume implementations.

Create a New Process

Navigate to Setup | Process Automation | Process Builder.

Select New.

Provide a Name and API Name.

Set The process starts when to A record change.

Press Save.

Choose Object

Select Add Object to open the Choose Object panel.


Select Find an object…

Choose the relevant Object.

Under Start the process, select the appropriate option for your use case from the available options.

Press Save.

Add Criteria

Select Add Criteria to open the Define Criteria panel.

Name the criteria using the Name Criteria field.


Set Criteria for Executing Actions to Conditions are met.

Under Set Conditions, select Find a field…, select the associated Address Status field for the address block, and press Choose to confirm.


Set Operator to Equals.

Set Type to Picklists.

Set Value to Not Checked.

If you want to add more criteria, select Add Row.

Set Conditions to All of the conditions are met (AND).

Press Save.
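As a rough mental model of what these criteria do, the Python sketch below evaluates a list of conditions with AND logic, so the action only fires for records whose address status is still Not Checked. It is an illustration rather than how Process Builder works internally, and `BillingAddressStatus__c` is a hypothetical field API name standing in for your own Address Status field.

```python
def should_verify(record, conditions):
    """Return True only when every (field, expected value) pair matches: the AND option."""
    return all(record.get(field) == expected for field, expected in conditions)

# Hypothetical field API name; use the Address Status field of your own address block.
CONDITIONS = [("BillingAddressStatus__c", "Not Checked")]

# Made-up records for illustration.
records = [
    {"Id": "001A", "BillingAddressStatus__c": "Not Checked"},
    {"Id": "001B", "BillingAddressStatus__c": "Verified"},
]

to_verify = [r["Id"] for r in records if should_verify(r, CONDITIONS)]
print(to_verify)
```

Every extra row added with Add Row would become one more pair in the list, and the record must satisfy all of them.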

Add Action

Select Add Action to open the Select and Define Action panel.

Set Action Type to Apex.



Name the action using the Action Name field.

Set Apex Class to Premise Level Address Verification.

Set Country-field API name | Type to String.


Set Country-field API name | Value to the address block’s country field API name.

Set ID of record to validate | Type to Field Reference.

Select Find a field… to open the Select a Field modal.

Select the Object’s ID field.

Select Choose.

Select Save.

Activate and Test

Select Activate.

Select Confirm.


Insert a record into your object with a complete address block that meets the rules configured in the Add Criteria section. Once the record has been inserted, you may need to refresh the page to display the updated status.

Contact Us

If you are experiencing any issues with the setup, please reach out to: support@provenworks.com where a member of our team will be happy to assist.



Information: what data is sent to ProvenWorks servers when using AddressTools?

This article will cover:

  • AddressTools Free
  • AddressTools Premium

The org details that are public on the partner support interface are sent to us, and we receive general org information, which is used for licensing and credit verification.

If you use our premise-level address verification or batch verification, then address blocks are sent to our servers for processing.

Contact Us

If you have any other questions relating to AddressTools Free or Premium then please do not hesitate to contact us at support@provenworks.com. We will be happy to answer any questions or help with any problems you may have.

Notice: AddressTools “query access to the ValidatedField__c object”

AddressTools Spring’20 Notice

In Spring’20, Salesforce is restricting access to custom settings which had previously been accessible by all users within your Org by default.

If you are experiencing the following error: The current user does not have query access to the ValidatedField__c object or one of its fields which access is required.

You will be required to manually assign the profile with access to the AddressTools custom settings. This can either be done by:

Profile Custom Setting Definitions.

Go to Setup | Profiles | [Profile in Question] | Custom Setting Definitions | Edit. Move CountryComplete Settings and available AddressTools Fields to Validate to the enabled lists. Select Save.

View All Custom Settings Profile Permission.

Go to Setup | Profiles | [Profile in Question]. Search for View All Custom Settings and check it. Save the changes.

Both of the above can either be applied via a Permission Set or directly on the affected profiles themselves.

To find out more about this change and how to configure permission sets, you can read the official Salesforce release notes (releasenotes.doc.salesforce).

Contact Us

If you have any other questions then please do not hesitate to contact us at: support@provenworks.com. We will be happy to answer any questions or help with any problems you may have.

How to: update the Fields and Objects available in SimpleImport

SimpleImport caches field and object data when it first loads in each Salesforce session. If a field or object is created or edited during an active session, you must log out of your Salesforce org and log in again for the changes to be picked up. When you reload SimpleImport in the new session, the changes to the organization’s metadata will become available.
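The behaviour described above is typical of a per-session cache. The sketch below is a hypothetical Python illustration of that pattern, not SimpleImport’s actual code: `SessionMetadataCache` and the field names are invented, and the session is represented by a simple string id.

```python
class SessionMetadataCache:
    """Caches metadata once per session; a new session starts with an empty cache."""

    def __init__(self, fetch_metadata):
        self._fetch_metadata = fetch_metadata
        self._cache = {}

    def fields_for(self, session_id):
        # Fetch only on the first load of a session; later calls reuse the copy.
        if session_id not in self._cache:
            self._cache[session_id] = list(self._fetch_metadata())
        return self._cache[session_id]

# Stand-in for the org's metadata; the field names are hypothetical.
org_fields = ["Name", "Email"]
cache = SessionMetadataCache(lambda: org_fields)

before = cache.fields_for("session-1")
org_fields.append("Region__c")            # a field is created mid-session
same_session = cache.fields_for("session-1")
new_session = cache.fields_for("session-2")

print(same_session)
print(new_session)
```

In this model the new field only shows up for `session-2`, which mirrors why logging out and back in makes the updated fields and objects appear.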