Commit Graph

  • 1c0b03efb4 Merge branch 'develop' into develop Martin Kroeker 2020-10-11 23:34:14 +02:00
  • c589c3e2a1 Merge pull request #2882 from martin-frbg/issue2709 Martin Kroeker 2020-10-11 22:22:30 +02:00
  • ec638a82bf Merge pull request #2852 from martin-frbg/issue2588-cmake Martin Kroeker 2020-10-11 22:21:33 +02:00
  • caa0d757ca repair TABs Martin Kroeker 2020-10-11 18:29:34 +02:00
  • 6154f72d6d Copy BUILD_ settings to the LAPACK make.inc Martin Kroeker 2020-10-11 18:25:16 +02:00
  • ae8b0d257a Set BUILD_ options to 1 instead of just defining them Martin Kroeker 2020-10-11 18:08:21 +02:00
  • 1da32cc1fc Add cblas_xerbla interface Martin Kroeker 2020-10-11 17:45:41 +02:00
  • 8c5e08076e If none of the BUILD_ options is set, enable them all Martin Kroeker 2020-10-11 17:33:51 +02:00
  • 5f23bdf437 remove debug output Martin Kroeker 2020-10-11 17:23:08 +02:00
  • b593e6b650 Merge pull request #2885 from martin-frbg/ifexists Martin Kroeker 2020-10-11 15:45:24 +02:00
  • 082c86a538 Merge pull request #2884 from martin-frbg/sse_fixup Martin Kroeker 2020-10-11 15:14:03 +02:00
  • e396ec8b56 Allow building support for only a subset of variable types Martin Kroeker 2020-10-11 15:11:15 +02:00
  • 68e6823d36 Adapt for supporting only a subset of variable types Martin Kroeker 2020-10-11 15:01:32 +02:00
  • 887e00fd7f Adapt for supporting only a subset of variable types Martin Kroeker 2020-10-11 14:58:57 +02:00
  • 886a8e3190 Adapt for supporting only a subset of variable types Martin Kroeker 2020-10-11 14:57:32 +02:00
  • 0f7d73ff6d Allow supporting only a subset of variable types Martin Kroeker 2020-10-11 14:53:26 +02:00
  • 6b6adf8a4a Allow compiling only a subset of kernels for specific variable types Martin Kroeker 2020-10-11 14:52:09 +02:00
  • a6570108c5 Add Makefile support for enabling only some variable types Martin Kroeker 2020-10-11 14:49:58 +02:00
  • ef552bc578 Add Makefile support for enabling only some variable types Martin Kroeker 2020-10-11 14:49:06 +02:00
  • efe1ad4700 Add Makefile support for enabling only some variable types Martin Kroeker 2020-10-11 14:48:23 +02:00
  • b27ca78a21 Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:46:24 +02:00
  • 93454022a9 Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:45:40 +02:00
  • 20cf1d773f Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:44:56 +02:00
  • 5c657fffad Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:44:13 +02:00
  • b262058059 Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:43:13 +02:00
  • bc319cee82 Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:42:26 +02:00
  • e5966f8606 Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:41:43 +02:00
  • 9df12eb08f Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:40:51 +02:00
  • cf53970bcb Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:40:06 +02:00
  • dcd51d5c72 Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:39:19 +02:00
  • b8f95354c7 Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:38:25 +02:00
  • d33de97d60 Adapt to having only a subset of variable types supported Martin Kroeker 2020-10-11 14:36:45 +02:00
  • 6a83c591d6 Adapt for having only a subset of variable types Martin Kroeker 2020-10-11 14:34:12 +02:00
  • f6d2827d0c Adapt ctests to having only a subset of types in the build Martin Kroeker 2020-10-11 14:32:00 +02:00
  • 08f4749eb4 Adapt tests to having only a subset of types in the build Martin Kroeker 2020-10-11 14:25:24 +02:00
  • 63d7dad04c Adapt utests for builds supporting only some variable types Martin Kroeker 2020-10-11 14:15:35 +02:00
  • ac653c94f3 Merge branch 'develop' into issue2588-cmake Martin Kroeker 2020-10-11 13:57:07 +02:00
  • 190b74dd24 Add files via upload Martin Kroeker 2020-10-11 13:26:05 +02:00
  • 9d43140d61 Improve check for conflicting config_kernel.h Martin Kroeker 2020-10-11 12:58:17 +02:00
  • 8ef600f1a3 Merge pull request #95 from xianyi/develop Martin Kroeker 2020-10-11 12:53:18 +02:00
  • 88928650c4 Merge pull request #2883 from martin-frbg/issue2872 Martin Kroeker 2020-10-11 10:30:33 +02:00
  • 7a53128481 Add whitelist of DYNAMIC_ARCH kernels for which -msse3 needs to be enabled Martin Kroeker 2020-10-11 01:06:46 +02:00
  • 0c773b8205 Do not rely on HAVE_SSE3 in DYNAMIC_ARCH builds Martin Kroeker 2020-10-11 01:04:57 +02:00
  • fbda20c856 Merge pull request #94 from xianyi/develop Martin Kroeker 2020-10-11 01:03:00 +02:00
  • 82a497ec5d restore PRESCOTT default for DYNAMIC_LIST Martin Kroeker 2020-10-11 00:43:09 +02:00
  • de27e4f5fb Stop DYNAMIC_ARCH build if the toplevel source contains a stray config_kernel.h from a gmake build Martin Kroeker 2020-10-11 00:40:22 +02:00
  • e1b7123bbe Merge pull request #2867 from Qiyu8/usimd-floatdot Martin Kroeker 2020-10-10 12:10:25 +02:00
  • f32d34a015 add sse3 compiler flag Qiyu8 2020-10-10 10:36:15 +08:00
  • 599777ecb7 Merge pull request #2879 from martin-frbg/issue2839 Martin Kroeker 2020-10-06 23:26:52 +02:00
  • 7812486091 Use generic C for D/Z nrm2 kernels on Windows to work around fpu exception bug Martin Kroeker 2020-10-06 21:33:16 +02:00
  • a5b164946c add fninit to reset fpu registers before assembler routines Matti Picus 2020-10-05 22:13:25 +03:00
  • a5feea6611 make BLAS3_MEM_ALLOC_THRESHOLD configurable on non-Windows Martin Kroeker 2020-10-04 23:01:06 +02:00
  • dc8e4e1959 Reduce the BLAS3 heap allocation threshold to 32 and mark it as configurable Martin Kroeker 2020-10-04 22:59:24 +02:00
  • cccd1438da Merge pull request #93 from xianyi/develop Martin Kroeker 2020-10-04 22:57:11 +02:00
  • f032d8966e Merge pull request #2874 from Flamefire/memory_fixes Martin Kroeker 2020-10-04 15:16:51 +02:00
  • f6e4cf2f9d Merge pull request #2876 from Flamefire/omp_fork_fix Martin Kroeker 2020-10-03 22:52:17 +02:00
  • 9828343e12 Merge pull request #2878 from brada4/asms Martin Kroeker 2020-10-03 22:51:49 +02:00
  • d2333e7842 aarch64 fix std=c18 compilation User User-User 2020-10-03 18:00:34 +03:00
  • 3094fc6c83 Lazily reinit threads after a fork in OMP mode Alexander Grund 2020-10-01 15:41:42 +02:00
  • 3c05f54df8 Avoid out of bounds access on invalid memory free Alexander Grund 2020-10-01 10:48:45 +02:00
  • dee7c49938 Fix TABs and trailing space Alexander Grund 2020-10-01 10:43:16 +02:00
  • d3c0d6811b Merge pull request #2873 from martin-frbg/issue2871 Martin Kroeker 2020-10-01 06:38:22 +02:00
  • 9637cd1fd1 Merge pull request #2865 from thisch/backticks Martin Kroeker 2020-10-01 06:38:06 +02:00
  • 2367726578 Remove redundant status message Martin Kroeker 2020-09-30 23:28:49 +02:00
  • 5464eb13ea Change ifdef linux to __linux for C11 compatibility Martin Kroeker 2020-09-30 22:59:41 +02:00
  • e1574cbc83 Change ifdef linux to __linux for C11 compatibility Martin Kroeker 2020-09-30 22:50:21 +02:00
  • 0b2bb5696a Change ifdef linux to __linux for C11 compatibility Martin Kroeker 2020-09-30 22:47:25 +02:00
  • a7d5d0078d Change ifdef linux to __linux for C11 compatibility Martin Kroeker 2020-09-30 22:46:25 +02:00
  • be40440ec5 Change ifdef linux to __linux for C11 compatibility Martin Kroeker 2020-09-30 22:45:18 +02:00
  • 2bf70c8e3b Change ifdef linux to __linux for C11 compatibility Martin Kroeker 2020-09-30 22:43:25 +02:00
  • 60e6c68e38 Adapt ARM architect Qiyu8 2020-09-29 16:36:14 +08:00
  • 64629cb5c7 Merge pull request #91 from xianyi/develop Martin Kroeker 2020-09-28 22:48:53 +02:00
  • 1b1a757f5f Optimize the performance of dot by using universal intrinsics in X86/ARM Qiyu8 2020-09-28 20:36:53 +08:00
  • 0d98ce202c Merge pull request #2866 from RajalakshmiSR/p10_dcopy Martin Kroeker 2020-09-28 07:22:54 +02:00
  • 2df4235e00 Optimize dcopy/zcopy for POWER10 Rajalakshmi Srinivasaraghavan 2020-09-27 21:42:32 -05:00
  • fe8cd5ae7e Consolidate usage of backticks for build options Thomas Hisch 2020-09-28 00:42:17 +02:00
  • ba31c8f5f9 Merge pull request #2853 from Qiyu8/usimd-daxpy Martin Kroeker 2020-09-27 23:19:59 +02:00
  • e961d4d609 Merge pull request #2864 from martin-frbg/lapack445 Martin Kroeker 2020-09-27 23:11:17 +02:00
  • 7ed25e9e10 Fix underflow/rounding errors in LAPACK (S,D)LANV2 Martin Kroeker 2020-09-27 22:59:20 +02:00
  • 7b169379e0 Merge pull request #2863 from martin-frbg/readmefixes Martin Kroeker 2020-09-27 22:50:25 +02:00
  • 7f539fb850 Update cpu list, outline cmake build, clarify scope of set_num_threads extension Martin Kroeker 2020-09-27 22:48:41 +02:00
  • caf7a12295 Merge pull request #90 from xianyi/develop Martin Kroeker 2020-09-27 22:35:45 +02:00
  • 72b5b73647 Merge pull request #2850 from xiaojiayuan111/develop Martin Kroeker 2020-09-27 12:12:35 +02:00
  • 881c15179f remove default support for FMA4 on zen architect Qiyu8 2020-09-27 09:35:50 +08:00
  • 896bbd55e1 Add support for building only selected variable types Martin Kroeker 2020-09-26 23:25:55 +02:00
  • c5a32288c6 Work around sgemm_r/dgemm_r not being properly defined with BUILD_COMPLEX/BUILD_COMPLEX16 Martin Kroeker 2020-09-26 23:24:37 +02:00
  • dfaafd3b55 Merge pull request #2854 from martin-frbg/travis-graviton Martin Kroeker 2020-09-23 21:59:18 +02:00
  • f2e9a24e1a Add AWS Graviton2 build Martin Kroeker 2020-09-23 19:02:20 +02:00
  • 98153875e9 Adapt tests to having only a subset of types in the library Martin Kroeker 2020-09-22 23:28:57 +02:00
  • 0eaae30e8c Adapt tests to having only a subset of types in the build Martin Kroeker 2020-09-22 23:28:03 +02:00
  • dfbc62ef7e Support building only a subset of types Martin Kroeker 2020-09-22 23:25:59 +02:00
  • b475b4bd0d Support building only a subset of types Martin Kroeker 2020-09-22 23:25:04 +02:00
  • 357bff06b5 Add BUILD_vartype defines Martin Kroeker 2020-09-22 23:24:22 +02:00
  • 988a6f429e Add BUILD_vartype defines Martin Kroeker 2020-09-22 23:23:33 +02:00
  • e5e2fbd593 Support building only selected types Martin Kroeker 2020-09-22 23:21:30 +02:00
  • 3287848c8f Support building only selected types Martin Kroeker 2020-09-22 23:20:51 +02:00
  • 26611af8e1 fix grouping of sources used for more than one type Martin Kroeker 2020-09-22 23:20:05 +02:00
  • b886bd672b add defines for building a subset of types Martin Kroeker 2020-09-22 23:18:55 +02:00
  • 61fae59298 Merge pull request #88 from xianyi/develop Martin Kroeker 2020-09-22 23:15:33 +02:00
  • 33d22f99f1 Merge pull request #2851 from martin-frbg/travis-xcode12 Martin Kroeker 2020-09-22 21:44:55 +02:00