Table structure and query efficiency

**Skarjune** · Oct 20 '06, 03:15 PM

Re: Table structure and query efficiency

Jody wrote:

I'm wondering is this the best way to achieve it and how efficient is it
assuming that potentially there could be hundreds of thousands of
widgets ?

I'd also considered using a pivot table with 3 columns - widgetid,
saleid and leaseid and then joining the widget table to the pivot table...

Jody, you've walked through your scenario quite well. So, here's a
comment.

I like UNIONs, but your "pivot table" idea could provide a useful
intermediary as an alternative. As for scalable performance, that will
rely upon indexing the criteria fields properly. If the application is
mostly a read for the customers, you can index as many fields as
possible, which should give efficiency as long as you are using numeric
datatypes where applicable and avoiding the LIKE operator (BETWEEN is
OK as it's actually just a compound Boolean).

For categories with known lists, use lookup tables and relate/index
those to the widget table so that you can search categoryids with
integers rather than text fields. If you pull the ids from the user
selections, then the queries are no more complicated. Even if you want
to use text criteria, it will always run faster if you run text
conditions against the lookup table with that joined to the widget
table via ids--although you can end up with lots of JOINs to manage.

Try modeling both and see what works for you. Either way, it seems that
you might want to label the rows for Sale or Lease for the customer,
which can be accomplished by hacking a flag field such as ['Buy' AS
Availability] and ['Lease' AS Availability]

**Jody** · Oct 21 '06, 11:55 PM

Re: Table structure and query efficiency

"Skarjune" <dhs@wordimage. comwrote in
news:1161358456 .365654.252480@ b28g2000cwb.goo glegroups.com:

Thanks for the advice. I already created a field called forSaleLease in
the main widget table which uses bits 0 and 1 as flags to indicate which
it is. I include this check in each of the unions so it shouldn't need
to go on with the rest of the joins. As you suggest also in the first
part of each query I select a constant as the flag so I know which
select statement generated the result (select 1 as searchType... union
select 2 as searchType).

I also have a table for the categories and just use the id in the widget
table (although I validate the category first and as they are so small I
plan on caching them in my application code and then just looking them
up in code from the index returned). I usually try to avoid any
redundant fields and use lookup tables with id's instead of literal
values where possible.

I have always preferred to do it that way, althugh you end up with often
complicated joins (nested queries, dericed tables, etc), or at least
many tables involved. I wasn't sure whether as databases grew it
sometimes became neccessary to implement some redundancy or storing
literial values in main tables to avoid overhead of additional joins.

I'd already implemented the system using the unions so I guess will
leave it for now and try to find some time later to test the
alternative. As the data is only small at the moment the basic tests I
have done thus far have been inconclusive.

I find I spend more time on the initial design and stressing over the
'correct' or 'most efficient' design than I do on coding the thing! I
guess until data reaches a certain point it is hard to evaluate any
performance penalties and adding indexes or additional restructing is
always an option.

Thanks again.
Jody

Jody wrote:
>

>I'm wondering is this the best way to achieve it and how efficient is
>it assuming that potentially there could be hundreds of thousands of
>widgets ?

>

>I'd also considered using a pivot table with 3 columns - widgetid,
>saleid and leaseid and then joining the widget table to the pivot
>table...

>
Jody, you've walked through your scenario quite well. So, here's a
comment.
>
I like UNIONs, but your "pivot table" idea could provide a useful
intermediary as an alternative. As for scalable performance, that will
rely upon indexing the criteria fields properly. If the application is
mostly a read for the customers, you can index as many fields as
possible, which should give efficiency as long as you are using
numeric datatypes where applicable and avoiding the LIKE operator
(BETWEEN is OK as it's actually just a compound Boolean).
>
For categories with known lists, use lookup tables and relate/index
those to the widget table so that you can search categoryids with
integers rather than text fields. If you pull the ids from the user
selections, then the queries are no more complicated. Even if you want
to use text criteria, it will always run faster if you run text
conditions against the lookup table with that joined to the widget
table via ids--although you can end up with lots of JOINs to manage.
>
Try modeling both and see what works for you. Either way, it seems
that you might want to label the rows for Sale or Lease for the
customer, which can be accomplished by hacking a flag field such as
['Buy' AS Availability] and ['Lease' AS Availability]
>
>

Table structure and query efficiency

Table structure and query efficiency

Comment

Comment