Need help with some very Practical Extraction

**KevinADC** · Jan 28 '07, 08:08 PM

as long as the pattern is on the same line this should work:

Code:

my @digits = ();
foreach (@array_of_data){
    if ( /mydigits=(\d+)/i ){
       print "found $1 in this line: $_\n";
       push @digits,$1;
    }
}
print "$_\n" for @digits;

but can be changed if the pattern is broken over multiple lines.

**theapeman** · Jan 28 '07, 10:09 PM

Wow, thank you, that is very helpful. Love the i modifier for case insensitivity!

As I suspected, the HTML doc looks like one long line to Perl, so what happens is it finds the first instance, say, 123456789, prints

"found 123456789 in this line:"

followed by what to you and me looks like more than 3000 lines of HTML, then prints:

"123456789"

and then quits. But that is more than I could get it to do before, and this definitely has me pointed in the right direction, so thanks again. :)

**theapeman** · Jan 28 '07, 10:40 PM

Wow, I just realized what you did with the parentheses and the $1 to extract only the digits. Awesome!

**KevinADC** · Jan 28 '07, 11:39 PM

See how this works:

Code:

use WWW::Mechanize;
$url = "http://someurl";
my $mechanize = WWW::Mechanize->new(autocheck => 1);
$mechanize->get($url);
my $string_of_data = $mechanize->content;
my @digits = $string_of_data =~ m/mydigits=(\d+)/igm;
print "$_\n" for @digits;

if that doesn't work, change the 'm' after 'ig' to an 's'

Need help with some very Practical Extraction

Need help with some very Practical Extraction

Comment

Comment

Comment

Comment