best way to match values in two tables...

**dwblas** · Aug 3 '10, 04:15 PM

You want to do this the other way around, i.e., look up the items in the dictionary. I would also suggest that you strip() the keys before using them to get rid of any surrounding whitespace.

Code:

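## TmpDict is the dictionary already built from table1.csv
reader = open("D:\\temp\\table2.csv", 'r')
for line in reader:
    line = line.strip()
    Tmp2Arr = line.split(',')
    ## strip the key once and use the same value for the test and the lookup
    key = Tmp2Arr[0].strip()
    if key in TmpDict:
        print(TmpDict[key])
reader.close()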

You can also use the intersection of 2 sets if you only want to identify the elements that are the same.
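
A minimal sketch of the set approach, assuming the keys have already been read from the first column of each table. The sample values and the names `table1_keys` and `table2_keys` are made up for illustration:

```python
# Hypothetical sample keys standing in for the first column of each table.
table1_keys = ["A100 ", "B200", "C300"]
table2_keys = ["B200", " C300", "D400"]

# Strip whitespace so " C300" and "C300" compare equal, then build sets.
set1 = set(key.strip() for key in table1_keys)
set2 = set(key.strip() for key in table2_keys)

# Keys present in both tables; set1.intersection(set2) is equivalent.
common = set1 & set2
print(sorted(common))
```

This only tells you which keys match. If you also need the other columns for each match, the dictionary lookup is still the way to go.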

This works but slow... and very memory intensive (my table1 is 500MB and table2 is extremely large)

If a dictionary does not work for whatever reason, the next step up is an SQL database. SQLite comes with Python, so post back if you want some help in that area.
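
For what it's worth, a rough sketch of what the SQLite route might look like, using the sqlite3 module from the standard library. The table layout, the column names (`item_id`, `f1`, `f5`, `f6`) and the sample rows are assumptions for illustration, not your actual data:

```python
import sqlite3

# Hypothetical rows standing in for table1.csv (key plus the three kept
# fields) and for the key column of table2.csv.
table1_rows = [("A100", "x1", "y1", "z1"), ("B200", "x2", "y2", "z2")]
table2_keys = [("B200",), ("C300",)]

# ":memory:" keeps this sketch self-contained; passing a file path instead
# would keep the data on disk rather than in RAM.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE table1 (item_id TEXT PRIMARY KEY, f1 TEXT, f5 TEXT, f6 TEXT)"
)
conn.execute("CREATE TABLE table2 (item_id TEXT)")
conn.executemany("INSERT INTO table1 VALUES (?, ?, ?, ?)", table1_rows)
conn.executemany("INSERT INTO table2 VALUES (?)", table2_keys)
conn.commit()

# The join returns table1's fields for every table2 key that has a match.
matches = conn.execute(
    "SELECT t1.item_id, t1.f1, t1.f5, t1.f6 "
    "FROM table2 AS t2 JOIN table1 AS t1 ON t1.item_id = t2.item_id"
).fetchall()
conn.close()

for row in matches:
    print(",".join(row))
```

The PRIMARY KEY on table1 gives the join an index to work with, so the database does the matching without everything having to sit in a Python dictionary.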

Code:

TmpDict = {}
reader = open("D:\\temp\\table1.csv", 'r')
for line in reader:
    line = line.strip()
    TmpArr = line.split(',')
    ##
    ## indentation error here in the code as posted (this line should be indented)
    ## as posted, the three fields were stored as one comma-joined string:
    ## TmpDict[TmpArr[0]] = TmpArr[1] + ',' + TmpArr[5] + ',' + TmpArr[6]
    ##
    ## also a list is slightly faster (the key is stripped so lookups match)
    TmpDict[TmpArr[0].strip()] = [TmpArr[1], TmpArr[5], TmpArr[6]]
reader.close()

## then you can just join them
## an example
test_list = ["one", "two", "three"]
print(",".join(test_list))

**erbrose** · Aug 3 '10, 04:46 PM

wow, thanks dwblas!
That dramatically increased the search speed! Still a lot to learn with Python..
Very much appreciated!
Eric

**MMcCarthy** · Aug 4 '10, 07:20 AM

If your query has been answered please mark as Best Answer the post you feel gave you the solution.

Mary
