Added Limitations

This commit is contained in:
Patrick Lipka 2024-12-16 11:28:28 +01:00
parent 956923151f
commit 3473ce65ee
1 changed files with 3 additions and 0 deletions

View File

@ -161,6 +161,9 @@ To add a new kernel to the project, follow these steps:
- The kernels execution should be parallelizable using all of the available strategies (`omp` (OpenMP) and `eventify` (Eventify) for now). You can add more strategies by extending the `strategy` namespace.
- The `VECTOR_SIZE` preprocessor variable defines the size of the input data and should be appropriate for the kernel you are implementing.
## Known Isuues and Limitations
- The instantiation of Eventify's `task_system` is inckluded in the kernel timing, leading to a constant overhead compared to OpenMP. On NVIDIA Grace, this is 2.8 ms. It's ongoning discussion whether to include it or not.
## Contributing