How efficient are queries that use DISTINCT (vs filtering data yourself)

**rski** · May 19 '11, 07:11 AM

Did you try to optimize the query in any way?
Is it a very complex query? Did you check the explain plan for it?
I think that processing 1 million rows in the application will also be very time consuming. How do you want to code that distinct in the application (using some hash table?)
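For reference, a hash-table approach on the application side could look roughly like the sketch below. It is only an illustration: the function name `distinct_rows` and the sample rows are made up, and it assumes each row arrives as a hashable tuple.

```python
from typing import Hashable, Iterable, Iterator


def distinct_rows(rows: Iterable[Hashable]) -> Iterator[Hashable]:
    """Yield each row the first time it is seen, preserving input order."""
    seen = set()  # hash set of rows already emitted
    for row in rows:
        if row not in seen:
            seen.add(row)
            yield row


# Hypothetical sample data standing in for a query result.
rows = [(1, "a"), (2, "b"), (1, "a"), (3, "c"), (2, "b")]
unique = list(distinct_rows(rows))
print(unique)
```

Note that the set holds every distinct row in memory, and every duplicate row still has to be fetched from the database before it can be discarded.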

**Brosert** · Oct 12 '11, 01:52 AM

1) Not sure what you mean by optimisation. I am making sure that, where possible, indexes are being used for joins (the data was designed to be queried roughly how I query it).
2) The query joins on 3 tables, but I wouldn't consider it overly complex.
3) No, I did not have a look at the explain plan, because I didn't really think it was overly relevant to the question. I hoped someone might discuss whether, on the whole, they think it more efficient to use DISTINCT or to filter in the app.
4) 1 million rows may be time consuming either way, but the specific question I was interested in is whether using DISTINCT is likely to be more or less efficient than filtering on the application side.

For the record, I have found that in this case at least, it seems a lot more efficient to filter in the application rather than on the database, and it is far more flexible.

**rski** · Oct 14 '11, 06:41 AM

4) Yes, but if you filter on the application side, you have to send 1 million rows to the application, which is bad from a performance point of view. If the database can do the work for you (like returning only the distinct values), you should let it.
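
A small way to see the difference in rows transferred is sketched below. It uses Python's built-in `sqlite3` module with an in-memory database purely for illustration; the `orders` table, its columns, and the data are invented, and the thread does not say which database engine is actually in use, so real timings will differ.

```python
import sqlite3

# Hypothetical in-memory table with many duplicate (customer_id, city) pairs.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer_id INTEGER, city TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(i % 100, f"city{i % 10}") for i in range(10_000)],
)

# Database-side: only the distinct rows are returned to the application.
db_side = conn.execute(
    "SELECT DISTINCT customer_id, city FROM orders"
).fetchall()

# Application-side: every row is fetched, then duplicates are dropped locally.
all_rows = conn.execute("SELECT customer_id, city FROM orders").fetchall()
app_side = set(all_rows)

print("rows fetched for app-side filtering:", len(all_rows))
print("rows fetched with DISTINCT:", len(db_side))
print("same result:", set(db_side) == app_side)
conn.close()
```

Both approaches end up with the same set of rows, but the application-side version has to pull all 10,000 rows first. Which one is faster overall depends on the engine, the query plan, and the network, so it is worth measuring both against the real data.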
