issues simply parsing a whitespace-delimited textfile in pythonscript

**Damon Getsman** · Jun 27 '08, 04:25 PM

Re: issues simply parsing a whitespace-delimited textfile in pythonscript

Okay, so I manged to kludge around the issue by not using
the .readline() in my 'for' statement. Instead, I'm slurping the
whole file into a new list that I put in for that purpose, and
everything seems to be working just fine. However, I don't know WHY
the other method failed and I'm at a loss for why that didn't work and
this is working. I'd really like to know the why about this issue so
that I don't have to use crappy coding practice and kludge around it
the next time I have an issue like this.

Any ideas much appreciated.

Damon G.

**Paul McGuire** · Jun 27 '08, 04:25 PM

Re: issues simply parsing a whitespace-delimited textfile in pythonscript

On May 21, 10:59 am, Damon Getsman <dgets...@amire hab.netwrote:

I'm having an issue parsing lines of 'last' output that I have stored
in a /tmp file. The first time it does a .readline() I get the full
line of output, which I'm then able to split() and work with the
individual fields of without any problem. Unfortunately, the second
time that I do a .readline() on the file, I am only receiving the
first character of the first field. Looking through the /tmp file
shows that it's not corrupted from the format that it should be in at
all... Here's the relevant script:
>
----
#parse
Lastdump = open('/tmp/esd_tmp', 'r')
>
#find out what the last day entry is in the wtmp
cur_rec = Lastdump.readli ne()
work = cur_rec.split()
>
if debug == 1:
print work
print " is our split record line from /tmp/esd_tmp\n"
>
startday = work[3]
>
if debug == 1:
print startday + " is the starting day\n"
print days
print " is our dictionary of days\n"
print days[startday] + " is our ending day\n"
>
for cur_rec in Lastdump.readli ne():
work = cur_rec.split()
>

<snip>

for cur_rec in Lastdump.readli ne():

is the problem. readline() returns a string containing the next
line's worth of text, NOT an iterator over all the subsequent lines in
the file. So your code is really saying:

next_line_in_fi le = Lastdump.readli ne():
for cur_rec in next_line_in_fi le:

which of course, is iterating over a string character by character.

Since you are opening Lastdump (not great casing for a variable name,
BTW - looks like a class name with that leading capital letter), it
gives you an iterator already. Try this instead:

lastdump = open('/tmp/esd_tmp', 'r')

cur_rec = lastdump.next()

...

for cur_rec in lastdump:

...

This should get you over the hump on reading the file.

Also, may I suggest this method for splitting up each record line, and
assigning individual fields to variables:

user,s1,s2,day, month,date,time ,desc = cur_rec.split(N one,7)

-- Paul

**Damon Getsman** · Jun 27 '08, 04:25 PM

Re: issues simply parsing a whitespace-delimited textfile in pythonscript

On May 21, 11:15 am, Paul McGuire <pt...@austin.r r.comwrote:

<snip>
>
for cur_rec in Lastdump.readli ne():
>
is the problem. readline() returns a string containing the next
line's worth of text, NOT an iterator over all the subsequent lines in
the file. So your code is really saying:
>
next_line_in_fi le = Lastdump.readli ne():
for cur_rec in next_line_in_fi le:
>
which of course, is iterating over a string character by character.
>
Since you are opening Lastdump (not great casing for a variable name,
BTW - looks like a class name with that leading capital letter), it
gives you an iterator already. Try this instead:
>
lastdump = open('/tmp/esd_tmp', 'r')
>
cur_rec = lastdump.next()
>
...
>
for cur_rec in lastdump:
>
...
>
This should get you over the hump on reading the file.
>
Also, may I suggest this method for splitting up each record line, and
assigning individual fields to variables:
>
user,s1,s2,day, month,date,time ,desc = cur_rec.split(N one,7)
>
-- Paul

Well the individual variables isn't exactly appropriate as I'm only
going to be using 2 of the fields. I think I will set those to
individual variables with a slice of what you mentioned, though, for
readability. Thank you for the tips, they were all much appreciated.

-Damon

issues simply parsing a whitespace-delimited textfile in pythonscript

issues simply parsing a whitespace-delimited textfile in pythonscript

Comment

Comment

Comment