beginner

What is the main purpose of finding duplicates in a database?

To identify records that appear more than once based on certain columns, which helps in cleaning data and ensuring accuracy.

beginner

Which SQL clause is commonly used to group rows when searching for duplicates?

The GROUP BY clause groups rows that have the same values in specified columns, allowing aggregate functions like COUNT() to find duplicates.
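
As a rough illustration of grouping on more than one column, the sketch below uses Python's built-in sqlite3 module with an in-memory database. The customers table, its columns, and the sample names are hypothetical and exist only for this example. It lists every group with its count, so you can see that rows are grouped together only when both columns match.

```python
import sqlite3

# Hypothetical sample data: a customers table where one name pair repeats.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT)"
)
conn.executemany(
    "INSERT INTO customers (first_name, last_name) VALUES (?, ?)",
    [("Ana", "Silva"), ("Ana", "Silva"), ("Ana", "Costa"), ("Ben", "Silva")],
)

# Rows fall into the same group only when BOTH columns have the same values.
group_counts = conn.execute(
    """
    SELECT first_name, last_name, COUNT(*) AS occurrences
    FROM customers
    GROUP BY first_name, last_name
    ORDER BY first_name, last_name
    """
).fetchall()

for row in group_counts:
    print(row)
```

Only the ("Ana", "Silva") group has a count above one here; sharing just a first name or just a last name is not enough to form a group.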

beginner

Write a simple SQL query to find duplicate email addresses in a users table.

SELECT email, COUNT(*) AS count FROM users GROUP BY email HAVING COUNT(*) > 1;
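
To try the query above without a database server, one option is Python's built-in sqlite3 module. This is only a sketch: the table layout (an id plus an email column) and the sample addresses are assumptions made for the example.

```python
import sqlite3

# In-memory database with hypothetical sample rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT)")
conn.executemany(
    "INSERT INTO users (email) VALUES (?)",
    [
        ("a@example.com",),
        ("b@example.com",),
        ("a@example.com",),
        ("c@example.com",),
        ("a@example.com",),
    ],
)

# The duplicate-email query from the flashcard, run unchanged.
duplicates = conn.execute(
    "SELECT email, COUNT(*) AS count FROM users GROUP BY email HAVING COUNT(*) > 1"
).fetchall()

print(duplicates)  # [('a@example.com', 3)]
```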

intermediate

Why is using HAVING COUNT(*) > 1 important in finding duplicates?

Because HAVING filters groups after aggregation, it selects only those groups where the count of records is more than one, indicating duplicates.
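
The difference between filtering before and after grouping may be easier to see side by side. The sketch below uses Python's sqlite3 module; the active column and the sample rows are hypothetical and added only to give WHERE something to filter on.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT, active INTEGER)")
conn.executemany(
    "INSERT INTO users (email, active) VALUES (?, ?)",
    [
        ("a@example.com", 1),
        ("a@example.com", 1),
        ("b@example.com", 1),
        ("b@example.com", 0),
        ("c@example.com", 1),
    ],
)

# No filter: every email appears once with its row count.
all_groups = conn.execute(
    "SELECT email, COUNT(*) FROM users GROUP BY email ORDER BY email"
).fetchall()

# HAVING is applied after grouping, so it can test the aggregated count.
having_only = conn.execute(
    "SELECT email, COUNT(*) FROM users GROUP BY email "
    "HAVING COUNT(*) > 1 ORDER BY email"
).fetchall()

# WHERE is applied before grouping: inactive rows are dropped first,
# then the remaining rows are grouped and counted.
where_then_having = conn.execute(
    "SELECT email, COUNT(*) FROM users WHERE active = 1 GROUP BY email "
    "HAVING COUNT(*) > 1 ORDER BY email"
).fetchall()

print(all_groups)         # [('a@example.com', 2), ('b@example.com', 2), ('c@example.com', 1)]
print(having_only)        # [('a@example.com', 2), ('b@example.com', 2)]
print(where_then_having)  # [('a@example.com', 2)]
```

In the last query, b@example.com drops out because one of its two rows was removed by WHERE before the count was taken.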

intermediate

How can indexing help in finding duplicates efficiently?

An index on the columns being checked keeps their values in sorted order, so the database can often read matching values side by side and group and count them without first sorting the whole table.
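
One way to check whether an index is actually used is to ask the database for its query plan. The sketch below does this in SQLite through Python's sqlite3 module, using SQLite's EXPLAIN QUERY PLAN statement. The index name idx_users_email, the helper plan_details, and the generated sample data are assumptions for illustration, and the exact plan text differs between SQLite versions and between database engines.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT)")
# Hypothetical data: 200 rows spread over 50 distinct addresses.
conn.executemany(
    "INSERT INTO users (email) VALUES (?)",
    [(f"user{i % 50}@example.com",) for i in range(200)],
)

query = "SELECT email, COUNT(*) AS count FROM users GROUP BY email HAVING COUNT(*) > 1"


def plan_details(connection, sql):
    """Return the text 'detail' column of SQLite's EXPLAIN QUERY PLAN output."""
    return [row[3] for row in connection.execute("EXPLAIN QUERY PLAN " + sql)]


plan_before = plan_details(conn, query)

# Hypothetical index name; the indexed column matches the GROUP BY column.
conn.execute("CREATE INDEX idx_users_email ON users (email)")
plan_after = plan_details(conn, query)

print("before index:", plan_before)
print("after index: ", plan_after)
```

If the planner picks up the index, its name appears in the second plan. The query returns the same rows either way; only the way the database reaches them changes.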

Which SQL clause helps group rows to find duplicates?

A. WHERE

B. ORDER BY

C. GROUP BY

D. JOIN

What does HAVING COUNT(*) > 1 do in a duplicate search query?

A. Filters groups with more than one record

B. Orders the results by count

C. Limits the output to one record

D. Joins tables

Which function counts the number of rows in each group?

A. SUM()

B. MAX()

C. AVG()

D. COUNT()

Why is indexing useful when finding duplicates?

A. It speeds up grouping and searching

B. It deletes duplicates automatically

C. It slows down queries

D. It changes data types

Which SQL keyword is used to filter rows before grouping?

A. HAVING

B. WHERE

C. GROUP BY

D. SELECT

Explain how to write a SQL query to find duplicate records based on one or more columns.

Describe why indexing columns can improve the performance of duplicate detection queries.