Martin Kroeker
6154f72d6d
Copy BUILD_ settings to the LAPACK make.inc
2020-10-11 18:25:16 +02:00
Martin Kroeker
ae8b0d257a
Set BUILD_ options to 1 instead of just defining them
2020-10-11 18:08:21 +02:00
Martin Kroeker
1da32cc1fc
Add cblas_xerbla interface
2020-10-11 17:45:41 +02:00
Martin Kroeker
8c5e08076e
If none of the BUILD_ options is set, enable them all
2020-10-11 17:33:51 +02:00
Martin Kroeker
5f23bdf437
remove debug output
2020-10-11 17:23:08 +02:00
Martin Kroeker
b593e6b650
Merge pull request #2885 from martin-frbg/ifexists
...
Improve CMAKE check for conflicting config_kernel.h
2020-10-11 15:45:24 +02:00
Martin Kroeker
082c86a538
Merge pull request #2884 from martin-frbg/sse_fixup
...
Add workaround for unwanted default activation of -msse3 in DYNAMIC_ARCH builds
2020-10-11 15:14:03 +02:00
Martin Kroeker
e396ec8b56
Allow building support for only a subset of variable types
2020-10-11 15:11:15 +02:00
Martin Kroeker
68e6823d36
Adapt for supporting only a subset of variable types
2020-10-11 15:01:32 +02:00
Martin Kroeker
887e00fd7f
Adapt for supporting only a subset of variable types
2020-10-11 14:58:57 +02:00
Martin Kroeker
886a8e3190
Adapt for supporting only a subset of variable types
2020-10-11 14:57:32 +02:00
Martin Kroeker
0f7d73ff6d
Allow supporting only a subset of variable types
2020-10-11 14:53:26 +02:00
Martin Kroeker
6b6adf8a4a
Allow compiling only a subset of kernels for specific variable types
2020-10-11 14:52:09 +02:00
Martin Kroeker
a6570108c5
Add Makefile support for enabling only some variable types
2020-10-11 14:49:58 +02:00
Martin Kroeker
ef552bc578
Add Makefile support for enabling only some variable types
2020-10-11 14:49:06 +02:00
Martin Kroeker
efe1ad4700
Add Makefile support for enabling only some variable types
2020-10-11 14:48:23 +02:00
Martin Kroeker
b27ca78a21
Adapt to having only a subset of variable types supported
2020-10-11 14:46:24 +02:00
Martin Kroeker
93454022a9
Adapt to having only a subset of variable types supported
2020-10-11 14:45:40 +02:00
Martin Kroeker
20cf1d773f
Adapt to having only a subset of variable types supported
2020-10-11 14:44:56 +02:00
Martin Kroeker
5c657fffad
Adapt to having only a subset of variable types supported
2020-10-11 14:44:13 +02:00
Martin Kroeker
b262058059
Adapt to having only a subset of variable types supported
2020-10-11 14:43:13 +02:00
Martin Kroeker
bc319cee82
Adapt to having only a subset of variable types supported
2020-10-11 14:42:26 +02:00
Martin Kroeker
e5966f8606
Adapt to having only a subset of variable types supported
2020-10-11 14:41:43 +02:00
Martin Kroeker
9df12eb08f
Adapt to having only a subset of variable types supported
2020-10-11 14:40:51 +02:00
Martin Kroeker
cf53970bcb
Adapt to having only a subset of variable types supported
2020-10-11 14:40:06 +02:00
Martin Kroeker
dcd51d5c72
Adapt to having only a subset of variable types supported
2020-10-11 14:39:19 +02:00
Martin Kroeker
b8f95354c7
Adapt to having only a subset of variable types supported
2020-10-11 14:38:25 +02:00
Martin Kroeker
d33de97d60
Adapt to having only a subset of variable types supported
2020-10-11 14:36:45 +02:00
Martin Kroeker
6a83c591d6
Adapt for having only a subset of variable types
2020-10-11 14:34:12 +02:00
Martin Kroeker
f6d2827d0c
Adapt ctests to having only a subset of types in the build
2020-10-11 14:32:00 +02:00
Martin Kroeker
08f4749eb4
Adapt tests to having only a subset of types in the build
2020-10-11 14:25:24 +02:00
Martin Kroeker
63d7dad04c
Adapt utests for builds supportin only some variable types
2020-10-11 14:15:35 +02:00
Martin Kroeker
ac653c94f3
Merge branch 'develop' into issue2588-cmake
2020-10-11 13:57:07 +02:00
Martin Kroeker
190b74dd24
Add files via upload
2020-10-11 13:26:05 +02:00
Martin Kroeker
9d43140d61
Improve check for conflicting config_kernel.h
2020-10-11 12:58:17 +02:00
Martin Kroeker
8ef600f1a3
Merge pull request #95 from xianyi/develop
...
rebase
2020-10-11 12:53:18 +02:00
Martin Kroeker
88928650c4
Merge pull request #2883 from martin-frbg/issue2872
...
Minor CMAKE fixes
2020-10-11 10:30:33 +02:00
Martin Kroeker
7a53128481
Add whitelist of DYNAMIC_ARCH kernels for which -msse3 needs to be enabled
2020-10-11 01:06:46 +02:00
Martin Kroeker
0c773b8205
Do not rely on HAVE_SSE3 in DYNAMIC_ARCH builds
2020-10-11 01:04:57 +02:00
Martin Kroeker
fbda20c856
Merge pull request #94 from xianyi/develop
...
rebase
2020-10-11 01:03:00 +02:00
Martin Kroeker
82a497ec5d
restore PRESCOTT default for DYNAMIC_LIST
2020-10-11 00:43:09 +02:00
Martin Kroeker
de27e4f5fb
Stop DYNAMIC_ARCH build if the toplevel source contains a stray config_kernel.h from a gmake build
...
This is unlikely to happen in practice, but if it does, the rogue file would get included instead of the dynamically generated version for each target_core, leading to very confusing errors like "invalid operands (undefined UND and ABS sections)" in compilation of the assembly kernels as macros like PREFETCH would remain undefined
2020-10-11 00:40:22 +02:00
Martin Kroeker
e1b7123bbe
Merge pull request #2867 from Qiyu8/usimd-floatdot
...
Optimize the performance of dot by using universal intrinsics in X86/ARM
2020-10-10 12:10:25 +02:00
Qiyu8
f32d34a015
add sse3 compiler flag
2020-10-10 10:36:15 +08:00
Martin Kroeker
599777ecb7
Merge pull request #2879 from martin-frbg/issue2839
...
Default BLAS3_MEM_ALLOC_THRESHOLD on all platforms to 32
2020-10-06 23:26:52 +02:00
Martin Kroeker
7812486091
Use generic C for D/Z nrm2 kernels on Windows to work around fpu exception bug
2020-10-06 21:33:16 +02:00
Matti Picus
a5b164946c
add fninit to reset fpu registers before assembler routines
2020-10-05 22:13:25 +03:00
Martin Kroeker
a5feea6611
make BLAS3_MEM_ALLOC_THRESHOLD configurable on non-Windows
2020-10-04 23:01:06 +02:00
Martin Kroeker
dc8e4e1959
Reduce the BLAS3 heap allocation threshold to 32 and mark it as configurable
2020-10-04 22:59:24 +02:00
Martin Kroeker
cccd1438da
Merge pull request #93 from xianyi/develop
...
rebase
2020-10-04 22:57:11 +02:00