eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-17 18:41:31 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	7ad07fc6f2	Update documentation for aligned_allocator	2017-09-20 10:22:00 +02:00
LaFeuille	7c9b07dc5c	Typo fix alignmeent ->alignment	2017-09-20 06:38:39 +00:00
Christoph Hertzberg	23f8b00bc8	clang provides __has_feature(is_enum) (but not <type_traits>) in C++03 mode	2017-09-14 19:26:03 +02:00
Christoph Hertzberg	0c9ad2f525	std::integral_constant is not C++03 compatible	2017-09-14 19:23:38 +02:00
Gael Guennebaud	6d42309f13	Fix compilation of Vector::operator()(enum) by treating enums as Index	2017-09-07 14:34:30 +02:00
Gael Guennebaud	21633e585b	bug #1462 : remove all occurences of the deprecated __CUDACC_VER__ macro by introducing EIGEN_CUDACC_VER	2017-08-24 11:06:47 +02:00
Gael Guennebaud	b0f55ef85a	merge	2017-02-21 17:04:10 +01:00
Gael Guennebaud	9b6e365018	Fix linking issue.	2017-02-21 16:52:22 +01:00
Gael Guennebaud	3d200257d7	Add support for automatic-size deduction in reshaped, e.g.: mat.reshaped(4,AutoSize); <-> mat.reshaped(4,mat.size()/4);	2017-02-21 15:57:25 +01:00
Gael Guennebaud	1e3aa470fa	Fix long to int conversion	2017-02-21 13:56:01 +01:00
Gael Guennebaud	b3fc0007ae	Add support for mat(all) as an alias to mat.reshaped(mat.size(),fix<1>);	2017-02-21 13:49:09 +01:00
Gael Guennebaud	63798df038	Fix usage of CUDACC_VER	2017-02-20 08:16:36 +01:00
Gael Guennebaud	5937c4ae32	Fall back is_integral to std::is_integral in c++11	2017-02-13 17:14:26 +01:00
Jonathan Hseu	3453b00a1e	Fix vector indexing with uint64_t	2017-02-11 21:45:32 -08:00
Gael Guennebaud	4b22048cea	Fallback Reshaped to MapBase when possible (same storage order and linear access to the nested expression)	2017-02-11 15:32:53 +01:00
Gael Guennebaud	e43016367a	Forgot to include a file in previous commit	2017-02-11 10:34:18 +01:00
Benoit Steiner	8b3cc54c42	Added a new EIGEN_HAS_INDEXED_VIEW define that set to 0 for older compilers that are known to fail to compile the indexed views (I used the define from the indexed_views.cpp test). Only include the indexed view methods when the compiler supports the code. This makes it possible to use Eigen again in complex code bases such as TensorFlow and older compilers such as gcc 4.8	2017-02-10 13:08:49 -08:00
Gael Guennebaud	0256c52359	Include clang in the list of non strict MSVC (just to be sure)	2017-02-10 13:41:52 +01:00
Gael Guennebaud	0eceea4efd	Define EIGEN_COMP_GNUC to reflect version number: 47, 48, 49, 50, 60, ...	2017-02-01 23:36:40 +01:00
Gael Guennebaud	9036cda364	Cleanup intitial reshape implementation: - reshape -> reshaped - make it compatible with evaluators.	2017-01-29 14:57:45 +01:00
Gael Guennebaud	0e89baa5d8	import yoco xiao's work on reshape	2017-01-29 14:29:31 +01:00
Gael Guennebaud	d024e9942d	MSVC 1900 release is not c++14 compatible enough for us. The 1910 update seems to be fine though.	2017-01-27 22:17:59 +01:00
Rasmus Munk Larsen	edaa0fc5d1	Revert PR-292. After further investigation, the memcpy->memmove change was only good for Haswell on older versions of glibc. Adding a switch for small sizes is perhaps useful for string copies, but also has an overhead for larger sizes, making it a poor trade-off for general memcpy. This PR also removes a couple of unnecessary semi-colons in Eigen/src/Core/AssignEvaluator.h that caused compiler warning everywhere.	2017-01-26 12:46:06 -08:00
Gael Guennebaud	25a1703579	Merged in ggael/eigen-flexidexing (pull request PR-294) generalized operator() for indexed access and slicing	2017-01-26 08:04:23 +00:00
Gael Guennebaud	28351073d8	Fix unamed type as template argument (ok in c++11 only)	2017-01-25 22:54:51 +01:00
Gael Guennebaud	607be65a03	Fix duplicates of array_size bewteen unsupported and Core	2017-01-25 22:53:58 +01:00
Gael Guennebaud	296d24be4d	bug #1381 : fix sparse.diagonal() used as a rvalue. The problem was that is "sparse" is not const, then sparse.diagonal() must have the LValueBit flag meaning that sparse.diagonal().coeff(i) must returns a const reference, const Scalar&. However, sparse::coeff() cannot returns a reference for a non-existing zero coefficient. The trick is to return a reference to a local member of evaluator<SparseMatrix>.	2017-01-25 17:39:01 +01:00
Rasmus Munk Larsen	3be5ee2352	Update copy helper to use fast_memcpy.	2017-01-24 14:22:49 -08:00
Rasmus Munk Larsen	e6b1020221	Adds a fast memcpy function to Eigen. This takes advantage of the following: 1. For small fixed sizes, the compiler generates inline code for memcpy, which is much faster. 2. My colleague eriche at googl dot com discovered that for large sizes, memmove is significantly faster than memcpy (at least on Linux with GCC or Clang). See benchmark numbers measured on a Haswell (HP Z440) workstation here: https://docs.google.com/a/google.com/spreadsheets/d/1jLs5bKzXwhpTySw65MhG1pZpsIwkszZqQTjwrd_n0ic/pubhtml This is of course surprising since memcpy is a less constrained version of memmove. This stackoverflow thread contains some speculation as to the causes: http://stackoverflow.com/questions/22793669/poor-memcpy-performance-on-linux Below are numbers for copying and slicing tensors using the multithreaded TensorDevice. The numbers show significant improvements for memcpy of very small blocks and for memcpy of large blocks single threaded (we were already able to saturate memory bandwidth for >1 threads before on large blocks). The "slicingSmallPieces" benchmark also shows small consistent improvements, since memcpy cost is a fair portion of that particular computation. The benchmarks operate on NxN matrices, and the names are of the form BM_$OP_${NUMTHREADS}T/${N}. Measured improvements in wall clock time: Run on rmlarsen3.mtv (12 X 3501 MHz CPUs); 2017-01-20T11:26:31.493023454-08:00 CPU: Intel Haswell with HyperThreading (6 cores) dL1:32KB dL2:256KB dL3:15MB Benchmark Base (ns) New (ns) Improvement ------------------------------------------------------------------ BM_memcpy_1T/2 3.48 2.39 +31.3% BM_memcpy_1T/8 12.3 6.51 +47.0% BM_memcpy_1T/64 371 383 -3.2% BM_memcpy_1T/512 66922 66720 +0.3% BM_memcpy_1T/4k 9892867 6849682 +30.8% BM_memcpy_1T/5k 14951099 10332856 +30.9% BM_memcpy_2T/2 3.50 2.46 +29.7% BM_memcpy_2T/8 12.3 7.66 +37.7% BM_memcpy_2T/64 371 376 -1.3% BM_memcpy_2T/512 66652 66788 -0.2% BM_memcpy_2T/4k 6145012 6117776 +0.4% BM_memcpy_2T/5k 9181478 9010942 +1.9% BM_memcpy_4T/2 3.47 2.47 +31.0% BM_memcpy_4T/8 12.3 6.67 +45.8 BM_memcpy_4T/64 374 376 -0.5% BM_memcpy_4T/512 67833 68019 -0.3% BM_memcpy_4T/4k 5057425 5188253 -2.6% BM_memcpy_4T/5k 7555638 7779468 -3.0% BM_memcpy_6T/2 3.51 2.50 +28.8% BM_memcpy_6T/8 12.3 7.61 +38.1% BM_memcpy_6T/64 373 378 -1.3% BM_memcpy_6T/512 66871 66774 +0.1% BM_memcpy_6T/4k 5112975 5233502 -2.4% BM_memcpy_6T/5k 7614180 7772246 -2.1% BM_memcpy_8T/2 3.47 2.41 +30.5% BM_memcpy_8T/8 12.4 10.5 +15.3% BM_memcpy_8T/64 372 388 -4.3% BM_memcpy_8T/512 67373 66588 +1.2% BM_memcpy_8T/4k 5148462 5254897 -2.1% BM_memcpy_8T/5k 7660989 7799058 -1.8% BM_memcpy_12T/2 3.50 2.40 +31.4% BM_memcpy_12T/8 12.4 7.55 +39.1 BM_memcpy_12T/64 374 378 -1.1% BM_memcpy_12T/512 67132 66683 +0.7% BM_memcpy_12T/4k 5185125 5292920 -2.1% BM_memcpy_12T/5k 7717284 7942684 -2.9% BM_slicingSmallPieces_1T/2 47.3 47.5 +0.4% BM_slicingSmallPieces_1T/8 53.6 52.3 +2.4% BM_slicingSmallPieces_1T/64 491 476 +3.1% BM_slicingSmallPieces_1T/512 21734 18814 +13.4% BM_slicingSmallPieces_1T/4k 394660 396760 -0.5% BM_slicingSmallPieces_1T/5k 218722 209244 +4.3% BM_slicingSmallPieces_2T/2 80.7 79.9 +1.0% BM_slicingSmallPieces_2T/8 54.2 53.1 +2.0 BM_slicingSmallPieces_2T/64 497 477 +4.0% BM_slicingSmallPieces_2T/512 21732 18822 +13.4% BM_slicingSmallPieces_2T/4k 392885 390490 +0.6% BM_slicingSmallPieces_2T/5k 221988 208678 +6.0% BM_slicingSmallPieces_4T/2 80.8 80.1 +0.9% BM_slicingSmallPieces_4T/8 54.1 53.2 +1.7% BM_slicingSmallPieces_4T/64 493 476 +3.4% BM_slicingSmallPieces_4T/512 21702 18758 +13.6% BM_slicingSmallPieces_4T/4k 393962 404023 -2.6% BM_slicingSmallPieces_4T/5k 249667 211732 +15.2% BM_slicingSmallPieces_6T/2 80.5 80.1 +0.5% BM_slicingSmallPieces_6T/8 54.4 53.4 +1.8% BM_slicingSmallPieces_6T/64 488 478 +2.0% BM_slicingSmallPieces_6T/512 21719 18841 +13.3% BM_slicingSmallPieces_6T/4k 394950 397583 -0.7% BM_slicingSmallPieces_6T/5k 223080 210148 +5.8% BM_slicingSmallPieces_8T/2 81.2 80.4 +1.0% BM_slicingSmallPieces_8T/8 58.1 53.5 +7.9% BM_slicingSmallPieces_8T/64 489 480 +1.8% BM_slicingSmallPieces_8T/512 21586 18798 +12.9% BM_slicingSmallPieces_8T/4k 394592 400165 -1.4% BM_slicingSmallPieces_8T/5k 219688 208301 +5.2% BM_slicingSmallPieces_12T/2 80.2 79.8 +0.7% BM_slicingSmallPieces_12T/8 54.4 53.4 +1.8 BM_slicingSmallPieces_12T/64 488 476 +2.5% BM_slicingSmallPieces_12T/512 21931 18831 +14.1% BM_slicingSmallPieces_12T/4k 393962 396541 -0.7% BM_slicingSmallPieces_12T/5k 218803 207965 +5.0%	2017-01-24 13:55:18 -08:00
Gael Guennebaud	d83db761a2	Add support for std::integral_constant	2017-01-24 16:28:12 +01:00
Gael Guennebaud	bc10201854	Add test for multiple symbols	2017-01-24 16:27:51 +01:00
Gael Guennebaud	ddd83f82d8	Add support for "SymbolicExpr op fix<N>" in C++98/11 mode.	2017-01-24 10:54:42 +01:00
Gael Guennebaud	228fef1b3a	Extended the set of arithmetic operators supported by FixedInt (-,+,*,/,%,&,\|)	2017-01-24 10:53:51 +01:00
Gael Guennebaud	bb52f74e62	Add internal doc	2017-01-24 10:13:35 +01:00
Gael Guennebaud	41c523a0ab	Rename fix_t to FixedInt	2017-01-24 09:39:49 +01:00
Gael Guennebaud	ca79c1545a	Add std:: namespace prefix to all (hopefully) instances if size_t/ptrdfiff_t	2017-01-23 22:02:53 +01:00
Gael Guennebaud	4d302a080c	Recover compile-time size from seq(A,B) when A and B are fixed values. (c++11 only)	2017-01-19 20:34:18 +01:00
Gael Guennebaud	54f3fbee24	Exploit fixed values in seq and reverse with C++98 compatibility	2017-01-19 19:57:32 +01:00
Gael Guennebaud	7691723e34	Add support for fixed-value in symbolic expression, c++11 only for now.	2017-01-19 19:25:29 +01:00
Gael Guennebaud	f3ccbe0419	Add a Symbolic::FixedExpr helper expression to make sure the compiler fully optimize the usage of last and end.	2017-01-18 23:16:32 +01:00
Gael Guennebaud	15471432fe	Add a .reverse() member to ArithmeticSequence.	2017-01-18 11:35:27 +01:00
Gael Guennebaud	e4f8dd860a	Add missing operator*	2017-01-18 10:49:01 +01:00
Gael Guennebaud	71e5b71356	Add a get_runtime_value helper to deal with pointer-to-function hack, plus some refactoring to make the internals more consistent.	2017-01-17 11:33:57 +01:00
Gael Guennebaud	23bfcfc15f	Add missing overload of get_compile_time for c++98/11	2017-01-17 10:30:21 +01:00
Gael Guennebaud	edff32c2c2	Disambiguate the two versions of fix for doxygen	2017-01-17 10:29:33 +01:00
Gael Guennebaud	4989922be2	Add support for symbolic expressions as arguments of operator()	2017-01-16 22:21:23 +01:00
Gael Guennebaud	12e22a2844	typos in doc	2017-01-16 16:31:19 +01:00
Gael Guennebaud	a9232af845	Introduce a variable_or_fixed<N> proxy returned by fix<N>(val) to pass both a compile-time and runtime fallback value in case N means "runtime". This mechanism is used by the seq/seqN functions. The proxy object is immediately converted to pure compile-time (as fix<N>) or pure runtime (i.e., an Index) to avoid redundant template instantiations.	2017-01-16 16:17:01 +01:00
Gael Guennebaud	6e97698161	Introduce a EIGEN_HAS_CXX14 macro	2017-01-16 16:13:37 +01:00
Gael Guennebaud	752bd92ba5	Large code refactoring: - generalize some utilities and move them to Meta (size(), array_size()) - move handling of all and single indices to IndexedViewHelper.h - several cleanup changes	2017-01-11 17:24:02 +01:00

... 2 3 4 5 6 ...

1275 Commits