Changes between Version 2 and Version 3 of doc/app/recommended_compiler_options
- Timestamp:
- Aug 23, 2018 12:30:58 PM (6 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
doc/app/recommended_compiler_options
v2 v3 1 = '''This page is under construction! ''' =1 ^= '''This page is under construction! ''' = 2 2 \\\\\\ 3 3 … … 8 8 Compilers currently used by the PALM group are Intel, Cray, PGI and GNU-Fortran. We recommend to use these compilers with the following options (library and include-file options are omitted here): 9 9 10 Intel (version 14 and higher): 10 '''Intel (version 14 and higher) for optimized code:'''\\{{{-fpe0 -O3 -xHost -fp-model source -ftz -no-prec-div -no-prec-sqrt -ip}}} 11 11 12 12 ||='''Option''' =||='''meaning''' =|| … … 15 15 ||-O3 ||Performs O2 optimizations and enables more aggressive loop transformations such as Fusion, Block-Unroll-and-Jam, and collapsing IF statements. || 16 16 ||-xHost ||This option tells the compiler to generate instructions for the highest instruction set available on the compilation host processor. || 17 ||-fp-model source ||Controls the semantics of floating-point calculations. Rounds intermediate results to source-defined precision. '''This option is required to force jobs to deliver reproduceble results. If not set, two identical PALM runsmay produce different results.''' Code performance is slightly reduced when using this option. ||17 ||-fp-model source ||Controls the semantics of floating-point calculations. Rounds intermediate results to source-defined precision. '''This option is required to force jobs to deliver reproduceble results. If not set, two PALM runs with exactly identical setup may produce different results.''' Code performance is slightly reduced when using this option. || 18 18 ||-ftz ||This option flushes denormal results to zero when the application is in the gradual underflow mode. It may improve performance if the denormal values are not critical to your application's behavior. || 19 19 ||-no-prec-div ||Enables optimizations that give slightly less precise results than full IEEE division. ||