[PATCH][ARM] Implement vectorizer cost hooks
Richard Earnshaw
rearnsha@arm.com
Mon Feb 11 10:57:00 GMT 2013
On 05/02/13 18:18, Christophe Lyon wrote:
> Hi,
>
> Following the discussion about "disable peeling" [1] a few weeks ago,
> it turned out that the vectorizer cost model needed some
> implementation for ARM.
>
> The attached patch implements arm_builtin_vectorization_cost and
> arm_add_stmt_cost, providing default costs when aligned and unaligned
> loads/stores have the same cost (=1). init_cost and finish_cost still
> use the default implementation (I noticed that x86 has chosen to
> duplicate the default implementation without changing it, why?)
>
> Benchmarking shows very little variation, expect a noticeable +1.6% on coremark.
>
> If this is OK, we can then discuss how to disable peeling completely
> when aligned and unaligned accesses have the same cost (and thus where
> peeling is a loss of performance). I think adding a new hook is
> necessary, since target descriptions may use different models for
> these costs (eg x86 makes no difference between unaligned loads and
> unaligned stores).
>
> Thanks,
>
> Christophe.
>
> [1] http://gcc.gnu.org/ml/gcc/2012-12/msg00036.html
>
> 2013-02-05 Christophe Lyon <christophe.lyon@linaro.org>
>
> * config/arm/arm.c (arm_builtin_vectorization_cost)
> (arm_add_stmt_cost): New functions.
> (TARGET_VECTORIZE_BUILTIN_VECTORIZATION_COST)
> (TARGET_VECTORIZE_ADD_STMT_COST): Define.
> (struct processor_costs): New struct type.
> (default_arm_cost): New struct of type processor_costs.=
>
Christophe,
Thanks for the patch. This is mostly OK, but please can you make the
following changes.
+struct processor_costs {
Please name this something like cpu_vec_costs. It's not the only cost
table in the back-end.
+struct processor_costs default_arm_cost = { /* arm generic costs. */
Similarly, use something like default_arm_vec_cost.
+const struct processor_costs *arm_cost = &default_arm_cost;
And here. But better still, link this through the current_tune table
rather than introducing a new global.
Finally,
@@ -27256,4 +27272,130 @@ arm_validize_comparison (rtx *comparison, rtx
* op1, rtx * op2)
}
+/* Vectorizer cost model implementation. */
Please put the patch in a more suitable location rather than just
dumping it at the end of the file. There are already numerous functions
related to costs that are mostly grouped together. I suggest this goes
near the rtx_costs code.
R.
More information about the Gcc-patches
mailing list