optimization problem: ptr not kept in register
Ian Lance Taylor
iant@google.com
Wed Mar 26 07:36:00 GMT 2014
On Tue, Mar 25, 2014 at 12:38 PM, Peter A. Felvegi
<petschy@praire-chicken.com> wrote:
>
> The reduced test case is at the end. It encodes data into a buffer in a loop
> with variable length encoding (not a working real encoding). For some
> reason, the write ptr is not kept in a register, but loaded/stored when
> used/updated. There is a potential function call in the loop, but there are
> __builtin_expect hints, so I think it would be possible to use a register
> for the ptr and store just before the call, and load it back right after the
> call. This would speed up the common code path: less code, less loads and
> stores.I measured around 20-30% more runtime, compared to a version where a
> pointer goes in and the updated ptr is returned. However, passing/returning
> the ptr has other issues, esp for a decoder, that would return the decoded
> value normally, not the ptr.
You marked the encode_noinline function as noinline, and encode can
call encode_noinline. The encode_noinline function could change any
part of global memory, and in particular could change the value of
n->next. So the loop has to reload that value, in case it was
changed.
Ian
More information about the Gcc-help
mailing list