GitHub-Proxy/eigen - eigen - Git: MartinFarm

GitHub-Proxy/eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-12 16:11:29 +08:00

Go to file

Gustavo Lima Chaves 1024a70e82 gebp: Add new ½ and ¼ packet rows per (peeling) round on the lhs

MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The patch works by altering the gebp lhs packing routines to also
consider ½ and ¼ packet lenght rows when packing, besides the original
whole package and row-by-row attempts. Finally, gebp itself will try
to fit a fraction of a packet at a time if:

i) ½ and/or ¼ packets are available for the current context (e.g. AVX2
   and SSE-sized SIMD register for x86)

ii) The matrix's height is favorable to it (it may be it's too small
    in that dimension to take full advantage of the current/maximum
    packet width or it may be the case that last rows may take
    advantage of smaller packets before gebp goes row-by-row)

This helps mitigate huge slowdowns one had on AVX512 builds when
compared to AVX2 ones, for some dimensions. Gains top at an extra 1x
in throughput. This patch is a complement to changeset 4ad359237aeb519dbd4b55eba43057b37988838c
.

Since packing is changed, Eigen users which would go for very
low-level API usage, like TensorFlow, will have to be adapted to work
fine with the changes.

2018-12-21 11:03:18 -08:00

add changesets related to matrix product perf.

2018-12-13 10:33:29 +01:00

Fix numerous shadow-warnings for GCC<=4.8

2018-08-28 18:32:39 +02:00

Simplify handling of tests that must fail to compile.

2018-12-12 15:48:36 +01:00

MIsc. source and comment typos

2018-03-11 10:01:44 -04:00

Fixed compilation error due to obsolete internal::abs and internal::sqrt function calls

2014-03-26 22:02:48 -04:00

bug #1615 : slightly increase the default unrolling limit to compensate for changeset 101ea26f5e18919972b321b5f7e3ef4e07be3fd6

2018-12-13 10:42:39 +01:00

gebp: Add new ½ and ¼ packet rows per (peeling) round on the lhs

2018-12-21 11:03:18 -08:00

Simplify handling of tests that must fail to compile.

2018-12-12 15:48:36 +01:00

Enable "old" CMP0026 policy (not perfect, but better than dozens of warning)

2018-12-08 18:59:51 +01:00

Simplify handling and non-splitted tests and include split_test_helper.h instead of re-generating it. This also allows us to modify it without breaking existing build folder.

2018-07-16 18:55:40 +02:00

Add regression test for bug #1174

2018-12-12 18:03:31 +01:00

Fix shorten-64-to-32 warning. Use regular memcpy if num_threads==0.

2018-12-12 14:45:31 -08:00

.hgeol

Added a pattern which forces LF line endings for *.sh files.

2013-07-31 18:20:58 +02:00

.hgignore

ignore all *build* sub directories

2017-12-14 14:22:14 +01:00

CMakeLists.txt

Simplify handling of tests that must fail to compile.

2018-12-12 15:48:36 +01:00

COPYING.BSD

Intel(R) MKL support added.

2011-12-05 14:52:21 +07:00

COPYING.GPL

there's no reason why we should follow the FSF's stupid recommendation for the naming of these files, right? This could give the wrong impression that Eigen is only GPL-licensed.

2009-11-14 23:26:07 -05:00

COPYING.LGPL

Replace COPYING.LGPL by a copy of the LGPL 2.1 (instead of LGPL 3).

2012-09-10 13:27:44 -04:00

COPYING.MINPACK

add COPYING.MINPACK

2012-07-15 11:46:22 -04:00

COPYING.MPL2

add COPYING.MPL2

2012-07-15 10:20:59 -04:00

COPYING.README

Replace COPYING.LGPL by a copy of the LGPL 2.1 (instead of LGPL 3).

2012-09-10 13:27:44 -04:00

CTestConfig.cmake

Optimize the product of a householder-sequence with the identity, and optimize the evaluation of a HouseholderSequence to a dense matrix using faster blocked product.

2018-07-11 17:16:50 +02:00

CTestCustom.cmake.in

Allow to filter out build-error messages

2018-07-24 20:12:49 +02:00

eigen3.pc.in

Further fixes for CMAKE_INSTALL_PREFIX correctness

2015-11-07 21:29:24 -05:00

INSTALL

finally, the right fix: set CTEST_BUILD_TARGET.

2009-10-04 20:27:44 -04:00

README.md

Add links where to make PRs and report bugs into README.md

2018-04-13 21:05:28 +00:00

signature_of_eigen3_matrix_library

improve the scripts for building unit tests:

2009-11-25 21:26:37 -05:00

README.md

Eigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms.

For more information go to http://eigen.tuxfamily.org/.

For pull request please only use the official repository at https://bitbucket.org/eigen/eigen.

For bug reports and feature requests go to http://eigen.tuxfamily.org/bz.

Languages

C++ 84.9%

Fortran 8.5%

C 2.8%

CMake 1.9%

Cuda 1.2%

Other 0.6%