aboutsummaryrefslogtreecommitdiffhomepage
path: root/doc/UsingNVCC.dox
diff options
context:
space:
mode:
authorGravatar Gael Guennebaud <g.gael@free.fr>2014-01-24 13:24:30 +0100
committerGravatar Gael Guennebaud <g.gael@free.fr>2014-01-24 13:24:30 +0100
commit6cf938df53b5f26690a0e8034e10d11640d3a5c2 (patch)
treecdaf246271fa21e3388e048c2439f7e3024eb90d /doc/UsingNVCC.dox
parentafcfb560a2b92ad8b73f03fff80e691050fd5adc (diff)
Add a minimalistic page on CUDA with Eigen.
Diffstat (limited to 'doc/UsingNVCC.dox')
-rw-r--r--doc/UsingNVCC.dox32
1 files changed, 32 insertions, 0 deletions
diff --git a/doc/UsingNVCC.dox b/doc/UsingNVCC.dox
new file mode 100644
index 000000000..e9df5de04
--- /dev/null
+++ b/doc/UsingNVCC.dox
@@ -0,0 +1,32 @@
+
+namespace Eigen {
+
+/** \page TopicCUDA Using Eigen in CUDA kernels
+
+\b Disclaimer: this page is about an \b experimental feature in %Eigen.
+
+Staring from CUDA 5.0, the CUDA compiler, \c nvcc, is able to properly parse %Eigen's code (almost).
+A few adaptations of the %Eigen's code already allows to use some parts of %Eigen in your own CUDA kernels.
+To this end you need the devel branch of %Eigen, CUDA 5.0 or greater with GCC.
+
+Known issues:
+
+ - \c nvcc with MS Visual Studio does not work (patch welcome)
+
+ - \c nvcc with \c clang does not work (patch welcome)
+
+ - \c nvcc 5.5 with gcc-4.7 (or greater) has issues with the standard \c <limits> header file. To workaround this, you can add the following before including any other files:
+ \code
+ // workaround issue between gcc >= 4.7 and cuda 5.5
+ #if (defined __GNUC__) && (__GNUC__>4 || __GNUC_MINOR__>=7)
+ #undef _GLIBCXX_ATOMIC_BUILTINS
+ #undef _GLIBCXX_USE_INT128
+ #endif
+ \endcode
+
+ - On 64bits system Eigen uses \c long \c int as the default type for indexes and sizes. On CUDA device, it would make sense to default to 32 bits \c int.
+ However, to keep host and CUDA code compatible, this cannot be done automatically by %Eigen, and the user is thus required to define \c EIGEN_DEFAULT_DENSE_INDEX_TYPE to \c int throughout his code (or only for CUDA code if there is no interaction between host and CUDA code through %Eigen's object).
+
+*/
+
+}