VisionCpp
0.0.1
|
specialisation Device_ for sycl More...
#include <sycl_device.hpp>
Public Member Functions | |
Device_ () | |
template<size_t LC, size_t LR, size_t CGT, size_t RGT, size_t CLT, size_t RLT, typename Expr > | |
void | execute (Expr &expr) const |
specialisation Device_ for sycl
device | type supported by sycl |
Definition at line 32 of file sycl_device.hpp.
|
inline |
Definition at line 40 of file sycl_device.hpp.
References visioncpp::sycl.
|
inline |
generating the short class name for the AMD gpu
replacing the the leaf node in the expression tree with a placeholder number
submitting the lambda expression to the sycl queue.
creating global accessors on all input output buffers
create the tuple of local output accessor
starting point of local tuples
merge it with all the other existing tuples
submitting the kernel lambda to the parallel
creating the index access for each thread
creating the eval expression for evaluating the expression tree. The output now moved to the front so the Output_offset should be reduced by one.
Definition at line 52 of file sycl_device.hpp.
References visioncpp::internal::tools::tuple::append(), visioncpp::extract_accessors(), and visioncpp::internal::tools::tuple::size().