Skip to content

Latest commit

 

History

History
44 lines (32 loc) · 1.7 KB

part1.md

File metadata and controls

44 lines (32 loc) · 1.7 KB

Part 1: 3D multi-XPUs diffusion solver

Steady state solution of a diffusive process for given physical time steps using the pseudo-transient acceleration (using the so-called "dual-time" method).

💡 Use ParallelStencil.jl and ImplicitGlobalGrid.jl for the (multi-)XPU implementation. You are free to use either @parallel or @parallel_indices type of kernel definition.

Intro

What's all about. Brief overview about:

  • the process
  • the equations
  • the aims
  • ...

Methods

The methods to be used:

  • spatial and temporal discretisation
  • solution approach
  • hardware
  • ...

Results

Results section

3D diffusion

Report an animation of the 3D solution here and provide and concise description of the results. Unleash your creativity to enhance the visual output.

Performance

Briefly elaborate on performance measurement and assess whether you are compute or memory bound for the given physics on the targeted hardware.

Memory throughput

Strong-scaling on CPU and GPU -> optimal "local" problem sizes.

Weak scaling

Multi-GPU weak scaling

Work-precision diagrams

Provide a figure depicting convergence upon grid refinement; report the evolution of a value from the quantity you are diffusing for a specific location in the domain as function of numerical grid resolution. Potentially compare against analytical solution.

Provide a figure reporting on the solution behaviour as function of the solver's tolerance. Report the relative error versus a well-converged problem for various tolerance-levels.

Discussion

Discuss and conclude on your results

References

Provide here refs if needed.