DATA CLEANING WITH SQL (pt2)

Checking for Duplicates

Checking for duplicates is an important part of confirming the integrity of our data.
Some data fields are not meant to contain duplicates, for example the UNIQUE ID column in the video below.





We need to confirm that there are no duplicate values in the UNIQUE ID column.

We can do that using the query below:

SELECT UniqueID, COUNT(UniqueID)
FROM PROJ.dbo.Nash
GROUP BY UniqueID
HAVING COUNT(UniqueID) > 1

Running the query returns no rows, so we can see that no duplicate values exist in the UNIQUE ID field.
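
To see what the same check looks like when duplicates do exist, here is a minimal sketch that can be run on its own in SQL Server. The table variable @Sample, its RowLabel column, and the rows in it are made up for illustration and are not part of the dataset used in this post.

```sql
-- Hypothetical sample data: @Sample and its rows exist only for this illustration.
DECLARE @Sample TABLE (UniqueID INT, RowLabel VARCHAR(20));

INSERT INTO @Sample (UniqueID, RowLabel)
VALUES (1, 'first'),
       (2, 'second'),
       (2, 'second again'),   -- deliberate duplicate of UniqueID 2
       (3, 'third');

-- Group the rows by ID and keep only the IDs that occur more than once.
SELECT UniqueID, COUNT(UniqueID) AS Occurrences
FROM @Sample
GROUP BY UniqueID
HAVING COUNT(UniqueID) > 1;
```

With this sample data the query returns a single row, UniqueID 2 with an Occurrences value of 2. An empty result, like the one we got for our table, means every ID appears exactly once.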

