Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How about return int64_t or long long #102

Open
lygstate opened this issue Aug 6, 2021 · 2 comments
Open

How about return int64_t or long long #102

lygstate opened this issue Aug 6, 2021 · 2 comments

Comments

@lygstate
Copy link

lygstate commented Aug 6, 2021

return (sb.st_size - curpos); //FIXME: can overflow

@mheily
Copy link
Owner

mheily commented Aug 14, 2021

Returning an int64_t just transfers responsibility for the problem to the caller, because the return value is stored in a int32_t inside struct kevent.

The return value of this function is stored in an intptr_t, which is a signed long. By default, off_t is also a signed long, but it can also be a long long if the program is compiled with -D_FILE_OFFSET_BITS=64. This is one way that it could overflow.

There's also a race between the calls to lseek() and stat(). If the file was truncated (or shrank) in between these two system calls, the value returned by lseek() could be greater than sb.st_size, which would result in a negative value being returned by the function. It would probably be better to return sb.st_size in this case.

I think there's some benefit in sanity checking the return value a bit more, to ensure that the value isn't negative and isn't greater than the size of the file. Interestingly, I see the following clang-tidy warning:

Clang-Tidy: Narrowing conversion from 'uintptr_t' (aka 'unsigned long') to signed type 'int' is implementation-defined

I'm not sure where clang-tidy is getting uintptr_t from..

@arr2036
Copy link
Collaborator

arr2036 commented Aug 24, 2021

Looking through the man pages the value returned by lseek can be greater than the file size under normal conditions as well, as you're allowed to write past the current end of the file.

     The lseek() function allows the file offset to be set beyond the end of the existing end-
     of-file of the file. If data is later written at this point, subsequent reads of the data
     in the gap return bytes of zeros (until data is actually written into the gap).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants