How do I dedupe my recordset?

**gpl** · Sep 28 '10, 09:41 PM

I'm assuming that the 1. 2. 3. at the start of each row is a unique identifier.
Assume your table is called TABLE.

Code:

Select t.* from TABLE t inner join
(
-- lowest ID for each distinct value of SixChar
Select SixChar, Min(ID) as ID
from TABLE
group by SixChar
) u
on t.ID=u.ID

The inner query creates a derived table holding each distinct value of SixChar along with the lowest ID for that value. The join then returns the whole row from TABLE for each of those IDs, so you get exactly one row per SixChar.
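If you want to try the query on throwaway data first, here is a minimal sketch using Python's built-in sqlite3 module as a stand-in for your database. The table name `addresses`, the `Street` column and the sample rows are all made up for illustration; only ID and SixChar come from the thread.

```python
import sqlite3

# In-memory database, so nothing touches your real data.
conn = sqlite3.connect(":memory:")

# Hypothetical table: ID is the unique row identifier, SixChar the dedupe key.
conn.execute(
    "CREATE TABLE addresses (ID INTEGER PRIMARY KEY, SixChar TEXT, Street TEXT)"
)
conn.executemany(
    "INSERT INTO addresses (ID, SixChar, Street) VALUES (?, ?, ?)",
    [
        (1, "12AB1C", "High St"),
        (2, "12AB1C", "High Street"),  # duplicate SixChar, higher ID
        (3, "7XY9ZZ", "Mill Ln"),
        (4, "7XY9ZZ", "Mill Lane"),    # duplicate SixChar, higher ID
        (5, "3CD4EF", "Park Rd"),
    ],
)

# Same shape as the query above: join back to the lowest ID per SixChar.
dedupe_sql = """
SELECT t.*
FROM addresses t
INNER JOIN (
    SELECT SixChar, MIN(ID) AS ID
    FROM addresses
    GROUP BY SixChar
) u ON t.ID = u.ID
ORDER BY t.ID
"""
deduped_rows = conn.execute(dedupe_sql).fetchall()
print(deduped_rows)
# [(1, '12AB1C', 'High St'), (3, '7XY9ZZ', 'Mill Ln'), (5, '3CD4EF', 'Park Rd')]
```

Rows 2 and 4 drop out because a lower ID already exists for their SixChar value.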

It will of course be a bit trickier if you do not have a unique row identifier.
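One common way round a missing identifier is to number the rows within each SixChar group and keep only the first. This is a sketch of that idea, not necessarily what suits your data: it assumes your database supports ROW_NUMBER() (SQLite 3.25 or later does, which is what is used here through Python's sqlite3), and the table name `addresses` and the `Street` column are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical table with NO unique ID column.
conn.execute("CREATE TABLE addresses (SixChar TEXT, Street TEXT)")
conn.executemany(
    "INSERT INTO addresses (SixChar, Street) VALUES (?, ?)",
    [
        ("12AB1C", "High St"),
        ("12AB1C", "High Street"),
        ("7XY9ZZ", "Mill Ln"),
    ],
)

# Number the rows inside each SixChar group, then keep row 1 of each group.
# The ORDER BY inside OVER() decides which duplicate survives.
first_per_group_sql = """
SELECT SixChar, Street
FROM (
    SELECT SixChar, Street,
           ROW_NUMBER() OVER (PARTITION BY SixChar ORDER BY Street) AS rn
    FROM addresses
) numbered
WHERE rn = 1
ORDER BY SixChar
"""
first_rows = conn.execute(first_per_group_sql).fetchall()
print(first_rows)
# [('12AB1C', 'High St'), ('7XY9ZZ', 'Mill Ln')]
```

Which duplicate you keep depends entirely on the ORDER BY inside OVER(), so pick a column that gives you the row you actually want.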

Note that the house number and postcode together will uniquely identify an address.

**MWilson** · Sep 30 '10, 11:39 AM

That is spot on, thank you very much!

Within that same bit of code, is it possible to have the 'deduped' data placed into a permanent table? As far as I can see in my SQL studio, the derived table can't be worked on in its own right.

**gpl** · Sep 30 '10, 11:51 AM

Yes, of course, standard SQL allows for this, in one of two ways:
1] If a table called ddtable with the same structure as TABLE already exists:

Code:

Insert into ddtable
Select t.* from TABLE t inner join
(
Select SixChar, Min(ID) as ID
from TABLE
group by SixChar
) u
on t.ID=u.ID

2] If you want to create the table and populate it at the same time:

Code:

Select t.*
INTO ddtable
from TABLE t inner join
(
Select SixChar, Min(ID) as ID
from TABLE
group by SixChar
) u
on t.ID=u.ID

The chief difference between the two approaches is that the second one creates the table at the same time as filling it.
The first approach appends data to ddtable every time you run it.
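To see the append behaviour of the first approach, and one way to make it repeatable, here is a small sketch run through Python's sqlite3 module. The source table name `src` and the sample rows are assumptions for illustration, and clearing ddtable before each load is just one option, not the only one. The second approach is not shown because SQLite has no `SELECT ... INTO` for creating tables.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical source table plus an empty ddtable with a matching structure.
conn.execute("CREATE TABLE src (ID INTEGER PRIMARY KEY, SixChar TEXT)")
conn.execute("CREATE TABLE ddtable (ID INTEGER, SixChar TEXT)")
conn.executemany(
    "INSERT INTO src (ID, SixChar) VALUES (?, ?)",
    [(1, "12AB1C"), (2, "12AB1C"), (3, "7XY9ZZ")],
)

# Approach 1 from the thread: insert the deduped rows into an existing table.
insert_sql = """
INSERT INTO ddtable
SELECT t.*
FROM src t
INNER JOIN (
    SELECT SixChar, MIN(ID) AS ID
    FROM src
    GROUP BY SixChar
) u ON t.ID = u.ID
"""

def count_rows():
    """Return the current number of rows in ddtable."""
    return conn.execute("SELECT COUNT(*) FROM ddtable").fetchone()[0]

# Running the insert twice appends the same two rows twice.
conn.execute(insert_sql)
conn.execute(insert_sql)
count_after_two_runs = count_rows()

# Emptying ddtable first makes each load start from a clean table.
conn.execute("DELETE FROM ddtable")
conn.execute(insert_sql)
count_after_reset = count_rows()

print(count_after_two_runs, count_after_reset)
# 4 2
```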
