First version.
- Use custom execution policies (
dynk::RangePolicy
anddynk::MDRangePolicy
) that create Kokkos execution policies instead of recreating Kokkos execution policies from themselves.
- Use fences for the layer approach, after and before the parallel block.
- Add documentation.