Hello,
I have a problem with a small asp-solution that searches for PDF-documents
with
indexing service.
For some files in the search result I get gibberish returned, such as
*************** *************** *************** *********
I$OYDURSURGXFWV SURGXFHGLQ0H[LFR/DERUDWRU\5HSRUW/DERUDWRU\2UGHUH U5HVSRQVLEOH6
WDWXV)HPLQLQH*2 7-RKDQVVRQ6XVDQQH )LQDO'DWH)LQDO3 URMHFW3URMHFW1D PH&RVWSODFH9HU
1R$9$523'36XPPD U\7KHUHVXOWV5XQ 2II7KHSURGXFWVW KDWZHUHSURGXFHG ZHUHEDG7KHVXUID F
HPDWHULDOZDVK\G URSKRELFDQGDOOW KHSURGXFWVKDGUX QRII6HHSLFWXUH7 KHSURGXFWVWKDWS U
RGXFHGZHUHJRRG, WZDVWKHVDPHSURG XFWVWKDWSURGXFH GEXWZLWKVSXQERQ G%XURSHVXUIDFHP D
WHULDO7KHSURGXF WVKDGIDVWLQOHWJ RRGVSUHDGLQJLQW KHFRUHDQGQRUXQR II'RVLPDW7KHSUR G
XFWSURGXFHGZDVE DG6HYHUDORIWKHS URGX
*************** *************** *************** *********
while other files returns "good text" like this:
*************** *************** *************** *********
Feminine 865106-Date Final Projectname Orderer 2004-06-02 ALVARO PDP
Johansson Susanne Distributed to: Internal test Alvaro v. 20-21 Summary
Mission Background Comments Conclusion Test methods Test objects Sample No:
20040527-001-01 Alvaro Labrep 2_2.rep SEBJOIS 2004-03-17 Printed by:
labreporter 2004-06-02 15:51:51Laborat ory Report No:20040527-001 Rev: 1
Status:Final Brand /Name SABA Ultr
*************** *************** *************** *********
The only difference between these files are that they seem to be saved with
different PDF versions or something like that (looking in File --> Document
Properties of the files).
The "bad" file has the following information there:
Creator: Windows NT 4.0
Producer: Acrobat Distiller Daemon 3.01 for HP-UX A.09.01 and later (HPPA)
PDF version: 1.1 (Acrobat 2.x)
The "good" file has the following information:
Creator: AdobePS5.dll Version 5.1.2
Producer: Acrobat Distiller 4.0 for Windows
PDF version: 1.3 (Acrobat 4.x)
A small part of the code looks like this:
*************** *************** *************** *********
set objConnection = Server.CreateOb ject("ADODB.Con nection")
set objIndex = Server.CreateOb ject("ADODB.Rec ordset")
objConnection.C onnectionString = "Provider=MSIDX S;"
objConnection.O pen
strSQL = "SELECT Characterizatio n, Filename, Path FROM
se_got_data.lim spdf..SCOPE() WHERE "
objIndex.Open strSQL, objConnection
do until objIndex.EOF
Response.write objIndex("Chara cterization")
objIndex.MoveNe xt
loop
objConnection.C lose
Set objConnection = nothing
*************** *************** *************** *********
The problem seems to be this Characterizatio n-part of the earlier version of
PDFs. Has anyone experienced anything like this before??
Best regards
Martin Emanuelsson
Gothenburg, Sweden
I have a problem with a small asp-solution that searches for PDF-documents
with
indexing service.
For some files in the search result I get gibberish returned, such as
*************** *************** *************** *********
I$OYDURSURGXFWV SURGXFHGLQ0H[LFR/DERUDWRU\5HSRUW/DERUDWRU\2UGHUH U5HVSRQVLEOH6
WDWXV)HPLQLQH*2 7-RKDQVVRQ6XVDQQH )LQDO'DWH)LQDO3 URMHFW3URMHFW1D PH&RVWSODFH9HU
1R$9$523'36XPPD U\7KHUHVXOWV5XQ 2II7KHSURGXFWVW KDWZHUHSURGXFHG ZHUHEDG7KHVXUID F
HPDWHULDOZDVK\G URSKRELFDQGDOOW KHSURGXFWVKDGUX QRII6HHSLFWXUH7 KHSURGXFWVWKDWS U
RGXFHGZHUHJRRG, WZDVWKHVDPHSURG XFWVWKDWSURGXFH GEXWZLWKVSXQERQ G%XURSHVXUIDFHP D
WHULDO7KHSURGXF WVKDGIDVWLQOHWJ RRGVSUHDGLQJLQW KHFRUHDQGQRUXQR II'RVLPDW7KHSUR G
XFWSURGXFHGZDVE DG6HYHUDORIWKHS URGX
*************** *************** *************** *********
while other files returns "good text" like this:
*************** *************** *************** *********
Feminine 865106-Date Final Projectname Orderer 2004-06-02 ALVARO PDP
Johansson Susanne Distributed to: Internal test Alvaro v. 20-21 Summary
Mission Background Comments Conclusion Test methods Test objects Sample No:
20040527-001-01 Alvaro Labrep 2_2.rep SEBJOIS 2004-03-17 Printed by:
labreporter 2004-06-02 15:51:51Laborat ory Report No:20040527-001 Rev: 1
Status:Final Brand /Name SABA Ultr
*************** *************** *************** *********
The only difference between these files are that they seem to be saved with
different PDF versions or something like that (looking in File --> Document
Properties of the files).
The "bad" file has the following information there:
Creator: Windows NT 4.0
Producer: Acrobat Distiller Daemon 3.01 for HP-UX A.09.01 and later (HPPA)
PDF version: 1.1 (Acrobat 2.x)
The "good" file has the following information:
Creator: AdobePS5.dll Version 5.1.2
Producer: Acrobat Distiller 4.0 for Windows
PDF version: 1.3 (Acrobat 4.x)
A small part of the code looks like this:
*************** *************** *************** *********
set objConnection = Server.CreateOb ject("ADODB.Con nection")
set objIndex = Server.CreateOb ject("ADODB.Rec ordset")
objConnection.C onnectionString = "Provider=MSIDX S;"
objConnection.O pen
strSQL = "SELECT Characterizatio n, Filename, Path FROM
se_got_data.lim spdf..SCOPE() WHERE "
objIndex.Open strSQL, objConnection
do until objIndex.EOF
Response.write objIndex("Chara cterization")
objIndex.MoveNe xt
loop
objConnection.C lose
Set objConnection = nothing
*************** *************** *************** *********
The problem seems to be this Characterizatio n-part of the earlier version of
PDFs. Has anyone experienced anything like this before??
Best regards
Martin Emanuelsson
Gothenburg, Sweden
Comment