[Rd] seek(con, 0, "end",
rw="r") does not always work correctly (PR#7901)
tplate at blackmesacapital.com
tplate at blackmesacapital.com
Fri May 27 18:13:11 CEST 2005
ligges at statistik.uni-dortmund.de wrote:
> tplate at blackmesacapital.com wrote:
>
>
>>I've noticed that seek(con, 0, "end", rw="r") on a file connection does
>>not always work correctly after a write (R 2.1.0 on Windows).
>>
>>[Is a call to fflush() needed inside file_seek() in main/connections.c?]
>
>
>
> If you have an idea where to fflush() precisely and your patch works,
> please tell it! I'll happily run some test cases where seeking matters.
>
I couldn't see why the current code was returning a bad value under some
conditions. (That's why didn't offer anything more than a suggestion).
My suggestion to use an fflush() was a guess (hence the question mark,
but evidence for the guess being correct was that doing a flush at the R
command line made the whole thing work correctly.) To be safe, I would
try to put a flush() right at the beginning of file_seek(), before the
call to f_tell(). I tried this, and with the modification the test case
I gave produced correct output. Here's how the beginning of my modified
file_seek() function (in main/connections.c) looks:
static double file_seek(Rconnection con, double where, int origin, int rw)
{
Rfileconn this = con->private;
FILE *fp = this->fp;
#if defined(HAVE_OFF_T) && defined(__USE_LARGEFILE)
off_t pos;
#else
#ifdef Win32
off64_t pos;
#else
long pos;
#endif
#endif
int whence = SEEK_SET;
fflush(fp);
pos = f_tell(fp);
/* make sure both positions are set */
> Note that ?seek currently tells us "The value returned by seek(where=NA)
> appears to be unreliable on Windows systems, at least for text files."
> It would be nice if this comment could be removed, of course ....
May the explanation could be given that this happens with text files
because Windows inserts extra characters at end-of-lines when reading
"text" mode files (but with binary files, things should be fine.) This
particular issue is documented in Microsoft Windows documentation (e.g.,
at http://msdn2.microsoft.com/library/75yw9bf3(en-us,vs.80).aspx, found
by searching on Google using the terms "fseek windows documentation").
Are there any known issues using seek with binary files under Windows?
If there are not, then the caveat could be made specific to text files
and all vagueness removed.
-- Tony Plate
>
> Uwe Ligges
>
>
>
>
>>Example (see the lines with the "***WRONG***" comment)
>>
>> > # seek(, rw="r") on a file does not always work correctly after a write
>> > f <- file("tmp3.txt", "w+b")
>> > # Write something earlier in the file
>> > seek(f, 10, rw="w")
>>[1] 0
>> > writeLines(c("ghi", "jkl"), f)
>> > seek(f, 20, rw="w")
>>[1] 18
>> > writeLines(c("abc"), f)
>> > seek(f, 0, "end", rw="w")
>>[1] 24
>> > # Try to read at the end of the file
>> > seek(f, 0, "end", rw="r")
>>[1] 0
>> > readLines(f, -1)
>>character(0)
>> > seek(f, 0, "end", rw="w")
>>[1] 18
>> > # write something at the end of the file
>> > writeLines(c("def"), f)
>> > # Try to read at the end of the file
>> > # flush(f) # flushing here makes the seek work correctly
>> > seek(f, 0, "end", rw="r")
>>[1] 24
>> > seek(f, NA, rw="r") # ***WRONG*** (should return 28)
>>[1] 24
>> > readLines(f, -1) # ***WRONG*** (should return character(0))
>>[1] "def"
>> > seek(f, 20, rw="r")
>>[1] 28
>> > readLines(f, -1)
>>[1] "abc" "def"
>> > seek(f, 0, "end", rw="r") # now it works correctly
>>[1] 28
>> > seek(f, NA, rw="r")
>>[1] 28
>> > readLines(f, -1)
>>character(0)
>> > close(f)
>> >
>> > version
>> _
>>platform i386-pc-mingw32
>>arch i386
>>os mingw32
>>system i386, mingw32
>>status
>>major 2
>>minor 1.0
>>year 2005
>>month 04
>>day 18
>>language R
>> >
>>
>>-- Tony Plate
>>
>>______________________________________________
>>R-devel at stat.math.ethz.ch mailing list
>>https://stat.ethz.ch/mailman/listinfo/r-devel
>
>
> ______________________________________________
> R-devel at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel
>
More information about the R-devel
mailing list