Changes between Version 219 and Version 220 of doc/tec/changelog_2018


Ignore:
Timestamp:
Mar 9, 2013 12:36:25 AM (12 years ago)
Author:
raasch
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • doc/tec/changelog_2018

    v219 v220  
    3131GPU porting of {{{pres}}}, {{{swap_timelevel}}}. Further porting of {{{poisfft}}} (including the tridiagonal solver), which now runs completely on GPU without any host/device data transfer for serial and parallel runs (but parallel runs still require data transfer before and after the MPI transpositions). The tridiagonal routines have been split into external subroutines (instead using embedded routines with {{{CONTAINS}}}). There is no distinction between parallel/non-parallel runs in {{{poisfft}}} and {{{tridia}}} any more. The respective preprocessor directives have been removed. The tridia routines have been moved to the end of file {{{poisfft.f90}}} because of a probable bug in the PGI compiler 12.5 (otherwise "invalid device function" is indicated during runtime). Resorting from/to array {{{work}}} have been changed in the {{{transpose}}} routines. {{{work}}} now has 4 dimensions instead of 1. Adjustments of openACC directives. Output of accelerator board information. (cuda_fft_interfaces, fft_xy, flow_statistics, header, init_3d_model, palm, poisfft, pres, prognostic_equations, swap_timelevel, time_integration, transpose, .mrun.config.imuk_gpu)
    3232
    33 Optimization of {{{tridia}}} routines: constant elements and coefficients of array {{{tri}}} are stored in separate arrays {{{ddzuw}}} and {{{tric}}} and only calculated once at beginning. Last dimension of {{{tri}}} has been reduced from 5 to 2. Routine {{{poisfft_init}}} is now called internally from {{{poisfft}}}. (Makefile, Makefile_check, init_grid, init_3d_model, modules, palm, poisfft, poisfft_hybrid)
     33Optimization of {{{tridia}}} routines: constant elements and coefficients of array {{{tri}}} are stored in separate arrays {{{ddzuw}}} and {{{tric}}} and only calculated once at beginning. Last dimension of {{{tri}}} has been reduced from 5 to 2. Routine {{{poisfft_init}}} is now called internally from {{{poisfft}}}. (Makefile, Makefile_check, init_pegrid, init_3d_model, modules, palm, poisfft, poisfft_hybrid)
    3434
    3535CPU-time per grid point and timestep is output to {{{CPU_MEASURES}}} file. (cpu_statistics, modules, time_integration)