Added Limitations
parent 956923151f
commit 3473ce65ee
@@ -161,6 +161,9 @@ To add a new kernel to the project, follow these steps:
- The kernel’s execution should be parallelizable using all of the available strategies (`omp` (OpenMP) and `eventify` (Eventify) for now). You can add more strategies by extending the `strategy` namespace.
- The `VECTOR_SIZE` preprocessor variable defines the size of the input data and should be appropriate for the kernel you are implementing.
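
The two requirements above can be sketched as follows. This is only an illustration under stated assumptions: the fallback value for `VECTOR_SIZE`, the function name `axpy_range`, and the kernel itself are hypothetical and not taken from the project. The point is that the kernel works on an index range with independent iterations, so any strategy can split the input among its workers.

```cpp
#include <cstddef>
#include <vector>

// Assumption: VECTOR_SIZE is normally supplied by the build
// (for example via -DVECTOR_SIZE=...). The fallback below is hypothetical.
#ifndef VECTOR_SIZE
#define VECTOR_SIZE 1024
#endif

static_assert(VECTOR_SIZE > 0, "VECTOR_SIZE must be positive");

// Hypothetical kernel: y[i] += a * x[i] for i in [begin, end).
// Iterations do not depend on each other, so disjoint ranges can be
// processed concurrently by whichever strategy is selected.
void axpy_range(double a,
                const std::vector<double>& x,
                std::vector<double>& y,
                std::size_t begin,
                std::size_t end) {
    for (std::size_t i = begin; i < end; ++i) {
        y[i] += a * x[i];
    }
}
```

Processing `[0, VECTOR_SIZE / 2)` and `[VECTOR_SIZE / 2, VECTOR_SIZE)` separately gives the same result as one pass over the full range, which is the property a parallel strategy relies on.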
## Known Issues and Limitations
- The instantiation of Eventify's `task_system` is included in the kernel timing, which adds a constant overhead compared to OpenMP. On NVIDIA Grace, this overhead is 2.8 ms. Whether to include it in the timing is still under discussion.
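
The effect of this limitation can be sketched with a stand-in type. Everything here is an assumption for illustration: `FakeTaskSystem` is not Eventify's `task_system`, its 3 ms constructor delay is an arbitrary placeholder, and the function names are hypothetical. The sketch only shows how placing the construction inside or outside the timed region changes what is measured.

```cpp
#include <chrono>
#include <thread>

// Hypothetical stand-in for a runtime with a costly constructor.
// This is NOT Eventify's task_system; the sleep is an arbitrary placeholder.
struct FakeTaskSystem {
    FakeTaskSystem() {
        std::this_thread::sleep_for(std::chrono::milliseconds(3));
    }
    void run() const { /* the kernel would be executed here */ }
};

using Clock = std::chrono::steady_clock;

// Converts a pair of time points to elapsed milliseconds.
double elapsed_ms(Clock::time_point start, Clock::time_point end) {
    return std::chrono::duration<double, std::milli>(end - start).count();
}

// Construction inside the timed region: the result includes the
// constant start-up cost on every measurement.
double time_with_setup() {
    const auto start = Clock::now();
    FakeTaskSystem ts;
    ts.run();
    return elapsed_ms(start, Clock::now());
}

// Construction before the timed region: only the kernel execution
// is measured.
double time_without_setup() {
    FakeTaskSystem ts;
    const auto start = Clock::now();
    ts.run();
    return elapsed_ms(start, Clock::now());
}
```

With this stand-in, `time_with_setup()` reports at least the constructor delay, while `time_without_setup()` reports only the time spent in `run()`. Which of the two is the fairer comparison against OpenMP is the open question noted above.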
## Contributing