This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Re: [PATCH v3] [aarch64] Add CPU support for Ampere Computing's eMAG.

From: Philipp Tomsich <philipp dot tomsich at theobroma-systems dot com>
To: Christoph Müllner <christoph dot muellner at theobroma-systems dot com>
Cc: Kyrill Tkachov <kyrylo dot tkachov at foss dot arm dot com>, Richard Earnshaw <Richard dot Earnshaw at arm dot com>, James Greenhalgh <James dot Greenhalgh at arm dot com>, Marcus Shawcroft <Marcus dot Shawcroft at arm dot com>, gcc-patches at gcc dot gnu dot org
Date: Wed, 21 Nov 2018 13:26:21 +0100
Subject: Re: [PATCH v3] [aarch64] Add CPU support for Ampere Computing's eMAG.
References: <20181120180055.16226-1-christoph.muellner@theobroma-systems.com> <5BF532E6.1050600@foss.arm.com> <F46A653A-95DD-4A05-89A5-39846D95B3D4@theobroma-systems.com>

Committing is currently slowed down by the speed of Subversion (my Subversion
tree was outdated), so it should only be a matter of days ... ;-)

> On 21.11.2018, at 12:15, Christoph Müllner <christoph.muellner@theobroma-systems.com> wrote:
> 
>> 
>> On 21.11.2018, at 11:26, Kyrill Tkachov <kyrylo.tkachov@foss.arm.com> wrote:
>> 
>> Hi Christoph,
>> 
>> On 20/11/18 18:00, Christoph Muellner wrote:
>>> Tested with "make check" and no regressions found.
>>> 
>>> This patch depends on the struct xgene1_prefetch_tune,
>>> which has been acknowledged already:
>>> https://gcc.gnu.org/ml/gcc-patches/2018-11/msg00985.html
>>> 
>>> *** gcc/ChangeLog ***
>>> 
>>> 2018-xx-xx  Christoph Muellner <christoph.muellner@theobroma-systems.com>
>>> 
>>> 	* config/aarch64/aarch64-cores.def: Define emag.
>>> 	* config/aarch64/aarch64-tune.md: Regenerated with emag.
>>> 	* config/aarch64/aarch64.c (emag_tunings): New struct.
>>> 	* doc/invoke.texi: Document mtune value.
>> 
>> This looks ok to me but you'll need a maintainer to approve.
>> You mentioned this depends on your previously approved patches.
>> Do you have write access or do you need someone to commit them for you?
> 
> I don't have write access.
> But I have already contacted somebody with write access to get my ACK'ed changes in.
> 
> Thanks,
> Christoph
> 
>> 
>> Thanks,
>> Kyrill
>> 
>>> Signed-off-by: Christoph Muellner <christoph.muellner@theobroma-systems.com>
>>> ---
>>> gcc/config/aarch64/aarch64-cores.def |  3 +++
>>> gcc/config/aarch64/aarch64-tune.md   |  2 +-
>>> gcc/config/aarch64/aarch64.c         | 25 +++++++++++++++++++++++++
>>> gcc/doc/invoke.texi                  |  2 +-
>>> 4 files changed, 30 insertions(+), 2 deletions(-)
>>> 
>>> diff --git a/gcc/config/aarch64/aarch64-cores.def b/gcc/config/aarch64/aarch64-cores.def
>>> index 1f3ac56..68cca00 100644
>>> --- a/gcc/config/aarch64/aarch64-cores.def
>>> +++ b/gcc/config/aarch64/aarch64-cores.def
>>> @@ -61,6 +61,9 @@ AARCH64_CORE("thunderxt88",   thunderxt88,   thunderx,  8A,  AARCH64_FL_FOR_ARCH
>>> AARCH64_CORE("thunderxt81",   thunderxt81,   thunderx,  8A,  AARCH64_FL_FOR_ARCH8 | AARCH64_FL_CRC | AARCH64_FL_CRYPTO, thunderx,  0x43, 0x0a2, -1)
>>> AARCH64_CORE("thunderxt83",   thunderxt83,   thunderx,  8A,  AARCH64_FL_FOR_ARCH8 | AARCH64_FL_CRC | AARCH64_FL_CRYPTO, thunderx,  0x43, 0x0a3, -1)
>>> +/* Ampere Computing cores. */
>>> +AARCH64_CORE("emag",        emag,      xgene1,    8A,  AARCH64_FL_FOR_ARCH8 | AARCH64_FL_CRC | AARCH64_FL_CRYPTO, emag, 0x50, 0x000, 3)
>>> +
>>> /* APM ('P') cores. */
>>> AARCH64_CORE("xgene1",      xgene1,    xgene1,    8A,  AARCH64_FL_FOR_ARCH8, xgene1, 0x50, 0x000, -1)
>>> diff --git a/gcc/config/aarch64/aarch64-tune.md b/gcc/config/aarch64/aarch64-tune.md
>>> index fade1d4..2fc7f03 100644
>>> --- a/gcc/config/aarch64/aarch64-tune.md
>>> +++ b/gcc/config/aarch64/aarch64-tune.md
>>> @@ -1,5 +1,5 @@
>>> ;; -*- buffer-read-only: t -*-
>>> ;; Generated automatically by gentune.sh from aarch64-cores.def
>>> (define_attr "tune"
>>> -	"cortexa35,cortexa53,cortexa57,cortexa72,cortexa73,thunderx,thunderxt88p1,thunderxt88,thunderxt81,thunderxt83,xgene1,falkor,qdf24xx,exynosm1,phecda,thunderx2t99p1,vulcan,thunderx2t99,cortexa55,cortexa75,cortexa76,ares,tsv110,saphira,cortexa57cortexa53,cortexa72cortexa53,cortexa73cortexa35,cortexa73cortexa53,cortexa75cortexa55,cortexa76cortexa55"
>>> +	"cortexa35,cortexa53,cortexa57,cortexa72,cortexa73,thunderx,thunderxt88p1,thunderxt88,thunderxt81,thunderxt83,emag,xgene1,falkor,qdf24xx,exynosm1,phecda,thunderx2t99p1,vulcan,thunderx2t99,cortexa55,cortexa75,cortexa76,ares,tsv110,saphira,cortexa57cortexa53,cortexa72cortexa53,cortexa73cortexa35,cortexa73cortexa53,cortexa75cortexa55,cortexa76cortexa55"
>>> 	(const (symbol_ref "((enum attr_tune) aarch64_tune)")))
>>> diff --git a/gcc/config/aarch64/aarch64.c b/gcc/config/aarch64/aarch64.c
>>> index f7f88a9..995aafe 100644
>>> --- a/gcc/config/aarch64/aarch64.c
>>> +++ b/gcc/config/aarch64/aarch64.c
>>> @@ -957,6 +957,31 @@ static const struct tune_params xgene1_tunings =
>>>   &xgene1_prefetch_tune
>>> };
>>> +static const struct tune_params emag_tunings =
>>> +{
>>> +  &xgene1_extra_costs,
>>> +  &xgene1_addrcost_table,
>>> +  &xgene1_regmove_cost,
>>> +  &xgene1_vector_cost,
>>> +  &generic_branch_cost,
>>> +  &xgene1_approx_modes,
>>> +  6, /* memmov_cost  */
>>> +  4, /* issue_rate  */
>>> +  AARCH64_FUSE_NOTHING, /* fusible_ops  */
>>> +  "16",	/* function_align.  */
>>> +  "16",	/* jump_align.  */
>>> +  "16",	/* loop_align.  */
>>> +  2,	/* int_reassoc_width.  */
>>> +  4,	/* fp_reassoc_width.  */
>>> +  1,	/* vec_reassoc_width.  */
>>> +  2,	/* min_div_recip_mul_sf.  */
>>> +  2,	/* min_div_recip_mul_df.  */
>>> +  17,	/* max_case_values.  */
>>> +  tune_params::AUTOPREFETCHER_OFF,	/* autoprefetcher_model.  */
>>> +  (AARCH64_EXTRA_TUNE_NO_LDP_STP_QREGS),	/* tune_flags.  */
>>> +  &xgene1_prefetch_tune
>>> +};
>>> +
>>> static const struct tune_params qdf24xx_tunings =
>>> {
>>>   &qdf24xx_extra_costs,
>>> diff --git a/gcc/doc/invoke.texi b/gcc/doc/invoke.texi
>>> index e016dce..ac81fb2 100644
>>> --- a/gcc/doc/invoke.texi
>>> +++ b/gcc/doc/invoke.texi
>>> @@ -15288,7 +15288,7 @@ Specify the name of the target processor for which GCC should tune the
>>> performance of the code.  Permissible values for this option are:
>>> @samp{generic}, @samp{cortex-a35}, @samp{cortex-a53}, @samp{cortex-a55},
>>> @samp{cortex-a57}, @samp{cortex-a72}, @samp{cortex-a73}, @samp{cortex-a75},
>>> -@samp{cortex-a76}, @samp{ares}, @samp{exynos-m1}, @samp{falkor},
>>> +@samp{cortex-a76}, @samp{ares}, @samp{exynos-m1}, @samp{emag}, @samp{falkor},
>>> @samp{qdf24xx}, @samp{saphira}, @samp{phecda}, @samp{xgene1}, @samp{vulcan},
>>> @samp{thunderx}, @samp{thunderxt88}, @samp{thunderxt88p1}, @samp{thunderxt81},
>>> @samp{tsv110}, @samp{thunderxt83}, @samp{thunderx2t99},
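
Note on the aarch64-cores.def hunk quoted above: the new "emag" entry and the existing "xgene1" entry share implementer ID 0x50 and part number 0x000 and differ only in the last field, the variant (3 for emag, -1 for xgene1, where -1 matches any variant). The sketch below illustrates how such a triple could be matched against a MIDR_EL1-style value (implementer in bits [31:24], variant in bits [23:20], part number in bits [15:4]). It is a minimal illustration under stated assumptions, not GCC's driver code: the names core_id, cores and match_core are hypothetical, and trying the more specific entry first is an assumption of this sketch.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified mirror of the identification fields of an
   AARCH64_CORE entry: implementer ID, part number and variant.
   A variant of -1 means "match any variant".  */
struct core_id
{
  const char *name;
  unsigned implementer;
  unsigned part;
  int variant;
};

/* The two entries from the patch that share implementer 0x50 and
   part 0x000.  The more specific entry (emag, variant 3) is listed
   first so that it wins over the catch-all xgene1 entry.  */
static const struct core_id cores[] =
{
  { "emag",   0x50, 0x000, 3 },
  { "xgene1", 0x50, 0x000, -1 },
};

/* Decode the MIDR_EL1-style fields of MIDR and return the name of the
   first matching core, or "generic" if nothing matches.  */
static const char *
match_core (uint32_t midr)
{
  unsigned implementer = (midr >> 24) & 0xff;  /* bits [31:24] */
  int variant = (int) ((midr >> 20) & 0xf);    /* bits [23:20] */
  unsigned part = (midr >> 4) & 0xfff;         /* bits [15:4]  */

  for (size_t i = 0; i < sizeof cores / sizeof cores[0]; i++)
    if (cores[i].implementer == implementer
        && cores[i].part == part
        && (cores[i].variant == -1 || cores[i].variant == variant))
      return cores[i].name;

  return "generic";
}
```

With this table, a value carrying implementer 0x50, part 0x000 and variant 3 resolves to "emag", while any other variant with the same implementer and part falls through to "xgene1".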

References:
- [PATCH v3] [aarch64] Add CPU support for Ampere Computing's eMAG.
  - From: Christoph Muellner
- Re: [PATCH v3] [aarch64] Add CPU support for Ampere Computing's eMAG.
  - From: Kyrill Tkachov
- Re: [PATCH v3] [aarch64] Add CPU support for Ampere Computing's eMAG.
  - From: Christoph Müllner
