Differences between revisions 9 and 10
Revision 9 as of 2010-06-18 13:59:13
Size: 1658
Editor: TobiasBurnus
Comment: typo
Revision 10 as of 2010-07-08 14:11:18
Size: 2283
Editor: TobiasBurnus
Comment: Update OpenMP 3.1 item
Deletions are marked like this. Additions are marked like this.
Line 24: Line 24:
 * Implement untied tasks (no compliance issue; needs to be well tuned to be actually faster; cf. page 53 of [[https://iwomp.zih.tu-dresden.de/downloads/omp30-tasks.pdf|pdf]])
 * OpenMP v3.1, when released ("The OpenMP ARB has [[ftp://ftp.nag.co.uk/sc22wg5/N1801-N1850/N1817.txt|started]] developing OpenMP 3.1, which is scheduled to be released for public review at [[http://www.ccs.tsukuba.ac.jp/workshop/IWOMP2010/|IWOMP, the International Workshop for OpenMP]], in June 2010. This update will include clarifications, and several extensions.")
 * Implement `untied` tasks (no compliance issue; needs to be well tuned to be actually faster; cf. page 53 of [[https://iwomp.zih.tu-dresden.de/downloads/omp30-tasks.pdf|pdf]])
 * OpenMP v3.1, when released ("The OpenMP ARB has [[ftp://ftp.nag.co.uk/sc22wg5/N1801-N1850/N1817.txt|started]] developing OpenMP 3.1, which is scheduled to be released for public review at [[http://www.ccs.tsukuba.ac.jp/workshop/IWOMP2010/|IWOMP, the International Workshop for OpenMP]], in June 2010. This update will include clarifications, and several extensions.") The release of a public draft has been delayed and is seemingly now available before [[http://sc10.supercomputing.org/|Supercomputing 2010]] (= before mid of November). For the new features, see also the [[http://www.springerlink.com/content/978-3-642-13216-2|OWOMP 2010 proceedings]] and a [[http://www-949.ibm.com/software/rational/cafe/blogs/ccpp-parallel-multicore/2010/06/21/the-view-from-iwomp-2010-trip-report|blog entry]]. Especially, user-defined reductions (a major item), affinity, atomics extensions (support capture/write), and task scheduling items (`taskyield` construct, `final` clause) are to be expected.

OpenMP

This page contains information on GCC's implementation of the OpenMP standard and related functionality like the auto parallelizer (-ftree-parallelize-loops).

As of GCC 4.2, the compiler implements version 2.5 of the OpenMP standard and as of 4.4 it implements version 3.0 of the OpenMP standard.

OpenMP Documentation

Automatic Parallelization

(-ftree-parallelize-loops)

  • Streamization

TODO List

Feel free to add new items to this list as you run into issues or features that would be interesting to add. Send mail to the list and/or the GCC OpenMP maintainers if any item in this list sounds interesting but is hard to understand.

  • Fix PR 35423.

  • Fine tune the auto scheduling feature for parallel loops.

  • Implement untied tasks (no compliance issue; needs to be well tuned to be actually faster; cf. page 53 of pdf)

  • OpenMP v3.1, when released ("The OpenMP ARB has started developing OpenMP 3.1, which is scheduled to be released for public review at IWOMP, the International Workshop for OpenMP, in June 2010. This update will include clarifications, and several extensions.") The release of a public draft has been delayed and is seemingly now available before Supercomputing 2010 (= before mid of November). For the new features, see also the OWOMP 2010 proceedings and a blog entry. Especially, user-defined reductions (a major item), affinity, atomics extensions (support capture/write), and task scheduling items (taskyield construct, final clause) are to be expected.

None: openmp (last edited 2015-01-29 08:24:51 by tschwinge)