Microsoft Office Tutorials and References
In Depth Information
mated by using a fairly non-intuitive advanced filter technique. Now, removing duplicate rows is very easy,
thanks to Excel's Remove Duplicates command (introduced in Excel 2007).
Start by moving the cell cursor to any cell within your data range. Choose Data ⇒ Data Tools ⇒ Remove Duplic-
ates, and Excel displays the Remove Duplicates dialog box shown in Figure 16-5.
If your data is in a table, you can also use Table Tools ⇒ Design ⇒ Data Tools ⇒ Remove
Duplicates. These two commands work exactly the same.
Figure 16-5: Use the Remove Duplicates dialog box to delete duplicate rows.
The Remove Duplicates dialog box lists all the columns in your data range or table. Place a check mark next to
the columns that you want to be included in the duplicate search. Most of the time, you'll want to select all the
columns, which is the default. Click OK, and Excel weeds out the duplicate rows and displays a message that
tells you how many duplicates it removed. It would be nice if Excel gave you the option to just highlight the du-
plicates so you could look them over, but it doesn't. If Excel deletes too many rows, you can undo the procedure
by clicking Undo (or by pressing Ctrl+Z).
When you select all columns in the Remove Duplicates dialog box, Excel will delete a row only if the content
of every column is duplicated. In some situations, you may not care about matching some columns, so you
would deselect those columns in the Remove Duplicates dialog box. For example, if each row has a unique ID
code, Excel would never find any duplicate rows, so you'd want to uncheck that column in the Remove Duplic-
ates dialog box.
When duplicate rows are found, the first row is kept, and subsequent duplicate rows are deleted.