Concept Flow - Removing duplicates (drop_duplicates)
Start with DataFrame
Check each row for duplicates
Mark duplicates as True/False
Keep first occurrence, remove others
Return DataFrame without duplicates
End
The process checks each row in the data, marks duplicates, keeps the first occurrence, and removes the rest to return a clean DataFrame.