GitHub-Proxy/eigen - eigen - Git: MartinFarm

GitHub-Proxy/eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-21 04:21:07 +08:00

Go to file

Sameer Agarwal b55b5c7280 Speed up row-major matrix-vector product on ARM

The row-major matrix-vector multiplication code uses a threshold to
check if processing 8 rows at a time would thrash the cache.

This change introduces two modifications to this logic.

1. A smaller threshold for ARM and ARM64 devices.

The value of this threshold was determined empirically using a Pixel2
phone, by benchmarking a large number of matrix-vector products in the
range [1..4096]x[1..4096] and measuring performance separately on
small and little cores with frequency pinning.

On big (out-of-order) cores, this change has little to no impact. But
on the small (in-order) cores, the matrix-vector products are up to
700% faster. Especially on large matrices.

The motivation for this change was some internal code at Google which
was using hand-written NEON for implementing similar functionality,
processing the matrix one row at a time, which exhibited substantially
better performance than Eigen.

With the current change, Eigen handily beats that code.

2. Make the logic for choosing number of simultaneous rows apply
unifiormly to 8, 4 and 2 rows instead of just 8 rows.

Since the default threshold for non-ARM devices is essentially
unchanged (32000 -> 32 * 1024), this change has no impact on non-ARM
performance. This was verified by running the same set of benchmarks
on a Xeon desktop.

2019-02-01 15:23:53 -08:00

Add recent gemm related changesets and various cleanups in perf-monitoring

2019-01-29 11:53:47 +01:00

Fix numerous shadow-warnings for GCC<=4.8

2018-08-28 18:32:39 +02:00

Simplify handling of tests that must fail to compile.

2018-12-12 15:48:36 +01:00

MIsc. source and comment typos

2018-03-11 10:01:44 -04:00

Fixed compilation error due to obsolete internal::abs and internal::sqrt function calls

2014-03-26 22:02:48 -04:00

Slightly extend discussions on auto and move the content of the Pit falls wiki page here.

2019-01-30 13:09:21 +01:00

Speed up row-major matrix-vector product on ARM

2019-02-01 15:23:53 -08:00

PR 572: Add initializer list constructors to Matrix and Array (include unit tests and doc)

2019-01-21 16:25:57 +01:00

Enable "old" CMP0026 policy (not perfect, but better than dozens of warning)

2018-12-08 18:59:51 +01:00

Simplify handling and non-splitted tests and include split_test_helper.h instead of re-generating it. This also allows us to modify it without breaking existing build folder.

2018-07-16 18:55:40 +02:00

bug #1669 : fix PartialPivLU/inverse with zero-sized matrices.

2019-01-29 10:27:13 +01:00

Workaround lack of support for arbitrary packet-type in Tensor by manually loading half/quarter packets in tensor contraction mapper.

2019-01-30 16:48:01 +01:00

.hgeol

Added a pattern which forces LF line endings for *.sh files.

2013-07-31 18:20:58 +02:00

.hgignore

ignore all *build* sub directories

2017-12-14 14:22:14 +01:00

CMakeLists.txt

Bypass inline asm for non compatible compilers.

2019-01-23 23:43:13 +01:00

COPYING.BSD

Intel(R) MKL support added.

2011-12-05 14:52:21 +07:00

COPYING.GPL

there's no reason why we should follow the FSF's stupid recommendation for the naming of these files, right? This could give the wrong impression that Eigen is only GPL-licensed.

2009-11-14 23:26:07 -05:00

COPYING.LGPL

Replace COPYING.LGPL by a copy of the LGPL 2.1 (instead of LGPL 3).

2012-09-10 13:27:44 -04:00

COPYING.MINPACK

add COPYING.MINPACK

2012-07-15 11:46:22 -04:00

COPYING.MPL2

add COPYING.MPL2

2012-07-15 10:20:59 -04:00

COPYING.README

Replace COPYING.LGPL by a copy of the LGPL 2.1 (instead of LGPL 3).

2012-09-10 13:27:44 -04:00

CTestConfig.cmake

Optimize the product of a householder-sequence with the identity, and optimize the evaluation of a HouseholderSequence to a dense matrix using faster blocked product.

2018-07-11 17:16:50 +02:00

CTestCustom.cmake.in

Allow to filter out build-error messages

2018-07-24 20:12:49 +02:00

eigen3.pc.in

Further fixes for CMAKE_INSTALL_PREFIX correctness

2015-11-07 21:29:24 -05:00

INSTALL

finally, the right fix: set CTEST_BUILD_TARGET.

2009-10-04 20:27:44 -04:00

README.md

Add links where to make PRs and report bugs into README.md

2018-04-13 21:05:28 +00:00

signature_of_eigen3_matrix_library

improve the scripts for building unit tests:

2009-11-25 21:26:37 -05:00

README.md

Eigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms.

For more information go to http://eigen.tuxfamily.org/.

For pull request please only use the official repository at https://bitbucket.org/eigen/eigen.

For bug reports and feature requests go to http://eigen.tuxfamily.org/bz.

Languages

C++ 84.9%

Fortran 8.5%

C 2.8%

CMake 1.9%

Cuda 1.2%

Other 0.6%