Commit Graph

  • 0d007adb18 fix clang_cl-flang job to use flang-new after the llvm update Martin Kroeker 2024-05-30 23:30:16 +02:00
  • 997f99fb24 deploy: b9a1c9a06c martin-frbg 2024-05-30 19:33:02 +00:00
  • b9a1c9a06c Merge pull request #4725 from Neumann-A/patch-1 Martin Kroeker 2024-05-30 21:32:32 +02:00
  • ff6670cb83 don't generate non-cblas files for gemm_batch Martin Kroeker 2024-05-30 18:26:02 +02:00
  • dd4505c5dd Fix CMake warning Alexander Neumann 2024-05-30 09:04:23 +02:00
  • 362a063396 remove return value Martin Kroeker 2024-05-29 23:16:58 +02:00
  • d0794f88dc add gemm_batch driver Martin Kroeker 2024-05-29 15:49:20 +02:00
  • 833a8880c6 add cblas_?gemm_batch Martin Kroeker 2024-05-29 15:47:50 +02:00
  • 89c7bbcba6 add cblas_?gemm_batch Martin Kroeker 2024-05-29 15:47:02 +02:00
  • 103637887e add cblas_?gemm_batch Martin Kroeker 2024-05-29 15:46:10 +02:00
  • 0073affe63 Merge pull request #4693 from goplanid/locks-improvement Martin Kroeker 2024-05-26 23:14:52 +02:00
  • 834e633d79 Merge pull request #4718 from martin-frbg/issue4713 Martin Kroeker 2024-05-26 16:38:18 +02:00
  • 3833190454 Merge pull request #4716 from martin-frbg/lapack1018 Martin Kroeker 2024-05-26 14:01:31 +02:00
  • cf7e668fe8 Merge pull request #4709 from martin-frbg/docsbuildbranch Martin Kroeker 2024-05-26 14:01:05 +02:00
  • 8b4996a2d5 Override icx's default fast math mode to ensure correct NaN handling Martin Kroeker 2024-05-26 13:16:03 +02:00
  • 616cc28d82 Override icx's default fast math mode to ensure correct NaN handling Martin Kroeker 2024-05-26 12:59:11 +02:00
  • 41b07c0dac deploy: 772116879d martin-frbg 2024-05-26 10:27:10 +00:00
  • 772116879d Merge pull request #4717 from bartoldeman/zscal-float-inf-fix Martin Kroeker 2024-05-26 12:26:41 +02:00
  • 62f7b244ff Replace use of FLT_MAX in x86_64 zscal.c by isinf() Bart Oldeman 2024-05-24 17:20:27 +00:00
  • 7ebbe3cc72 Fix potential bounds error (Reference-LAPACK PR 1018) Martin Kroeker 2024-05-23 23:12:19 +02:00
  • 791e015024 Fix potential bounds error (Reference-LAPACK PR 1018) Martin Kroeker 2024-05-23 23:11:14 +02:00
  • 4dd715d220 Fix potential bounds error (Reference-LAPACK PR 1018) Martin Kroeker 2024-05-23 23:09:55 +02:00
  • e2c1a1e269 Fix potential bounds error (Reference-LAPACK PR 1018) Martin Kroeker 2024-05-23 23:08:27 +02:00
  • e112191b54 POWER: Fix issues in zscal to address lapack failures Rajalakshmi Srinivasaraghavan 2024-05-22 08:00:06 -05:00
  • 172d91846f Don't try to deploy docs in a fork Martin Kroeker 2024-05-20 22:53:43 +02:00
  • 700ea74a37 Merge pull request #4705 from martin-frbg/issue4703 Martin Kroeker 2024-05-18 21:38:22 +02:00
  • aa259b141d Merge pull request #4704 from amritahs-ibm/saxpy_perf_fix Martin Kroeker 2024-05-18 19:11:25 +02:00
  • 1e6cd90065 deploy: 25b34e67f9 martin-frbg 2024-05-18 15:26:45 +00:00
  • 25b34e67f9 Merge pull request #4678 from ev-br/codspeed Martin Kroeker 2024-05-18 16:51:02 +02:00
  • 6494f432df Fix INTERFACE64 builds on Loongarch64 Martin Kroeker 2024-05-18 16:49:03 +02:00
  • 81cf0db047 DOC: add a readme for benchmarks/pybench Evgeni Burovski 2024-05-18 15:30:00 +03:00
  • 9f28161837 BENCH: add benchmarks using codspeed.io Evgeni Burovski 2024-05-06 11:36:30 +03:00
  • 5015548d18 Merge pull request #4700 from martin-frbg/fix4698 Martin Kroeker 2024-05-16 15:38:01 +02:00
  • 0050a9660b Correctly detect ARM Neoverse V2 CPUs. Matthias Langer 2024-05-16 09:59:52 +00:00
  • d3ea02fdef deploy: ce96e0e50f martin-frbg 2024-05-16 07:45:02 +00:00
  • ce96e0e50f Merge pull request #4699 from ChipKerchner/fixSwapVectorOrder Martin Kroeker 2024-05-16 09:28:20 +02:00
  • a3f6b13bc9 remove spurious brace Martin Kroeker 2024-05-16 09:25:53 +02:00
  • 3a1417671a POWER: Fixing endianness issue in cswap/zswap kernel for AIX Chip Kerchner 2024-05-15 19:36:46 -05:00
  • 668f48f4fc Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls (#4698) Martin Kroeker 2024-05-15 23:58:14 +02:00
  • 39c96063fb Merge pull request #4694 from martin-frbg/issue3660 Martin Kroeker 2024-05-15 22:14:41 +02:00
  • f5c080f083 Fix CMAKE syntax in kernel file parsing of IFNEQ conditionals (#4695) Martin Kroeker 2024-05-15 20:58:31 +02:00
  • c2d46288f3 deploy: 9a2a6a2e52 martin-frbg 2024-05-15 16:35:56 +00:00
  • 9a2a6a2e52 Merge pull request #4696 from frjohnst/restore_second Martin Kroeker 2024-05-15 18:35:20 +02:00
  • 87026ac1b1 Revert "fix conlict between PR 4515 and AIX shared obj support" frjohnst 2024-05-15 09:45:17 -04:00
  • 56d3d1039c Revert "resolve second_ conflict which breaks xlf timef" frjohnst 2024-05-15 09:44:29 -04:00
  • 2957281275 Introduce a lower limit for multithreading Martin Kroeker 2024-05-14 18:59:21 +02:00
  • 5fd871d7ea Introduce a lower limit for multithreading Martin Kroeker 2024-05-14 18:48:03 +02:00
  • 588fc0976c deploy: 6ca9ffa7f5 martin-frbg 2024-05-14 16:13:13 +00:00
  • 6ca9ffa7f5 Merge pull request #4655 from yamazakimitsufumi/update_2d_thread_distribution Martin Kroeker 2024-05-14 18:12:43 +02:00
  • 0dc80a5c8d locks improvement Deeksha Goplani 2024-05-13 22:17:23 +05:30
  • b45a78c6e9 fix zdotu argument passing in utest_ext on windows (#4691) Martin Kroeker 2024-05-13 14:50:50 +02:00
  • 1c2b341aac deploy: 1ab9f50561 martin-frbg 2024-05-13 09:01:02 +00:00
  • 1ab9f50561 Merge pull request #4690 from mattip/blasint Martin Kroeker 2024-05-13 11:00:33 +02:00
  • 87b3d9054f Fix regression SAXPY when compiler with OpenXL compiler. Amrita H S 2024-05-07 11:31:36 -05:00
  • 243640c354 use blasint instead of int to quiet warnings Matti Picus 2024-05-12 10:24:16 +03:00
  • f0560f906f Merge pull request #4689 from martin-frbg/issue4684 Martin Kroeker 2024-05-11 14:39:54 +02:00
  • 278eb4273f deploy: e1e0d9a2ae martin-frbg 2024-05-11 11:39:14 +00:00
  • e1e0d9a2ae Merge pull request #4688 from XiWeiGu/loongarch64_fixed_gcc14_compilation Martin Kroeker 2024-05-11 13:38:45 +02:00
  • d8baf2f2ea Support compilation without CBLAS Martin Kroeker 2024-05-11 13:10:54 +02:00
  • a6c184d150 forward NO_CFLAGS to the CFLAGS, if set Martin Kroeker 2024-05-11 13:07:30 +02:00
  • ecf8b588a9 loongarch64: Fixed GCC14 compilation issue gxw 2024-05-11 16:10:38 +08:00
  • d7aeb31879 deploy: 8da6f7e5f2 martin-frbg 2024-05-10 09:29:41 +00:00
  • 8da6f7e5f2 Merge pull request #4686 from XiWeiGu/loongarch64_dgemm_kernel_16x6 Martin Kroeker 2024-05-10 11:29:12 +02:00
  • f9a26240a7 loongarch64: Fixed icamax_lsx gxw 2024-05-09 19:15:03 +08:00
  • cb0f707409 loongarch64: Fixed utest fork:safety gxw 2024-05-09 15:36:25 +08:00
  • 637c650f4f loongarch64: Add buffer offset for target LOONGSON3R5 gxw 2024-05-08 14:40:48 +08:00
  • 3cd72d8911 deploy: 5d678f1831 martin-frbg 2024-05-09 11:17:57 +00:00
  • 5d678f1831 Merge pull request #4685 from martin-frbg/issue4660-2 Martin Kroeker 2024-05-09 13:17:29 +02:00
  • b45d8e1ab2 remove stray comma Martin Kroeker 2024-05-09 12:33:19 +02:00
  • 0b9826bc2a deploy: 5500b4ab26 martin-frbg 2024-05-08 17:04:26 +00:00
  • 5500b4ab26 Merge pull request #4680 from theAeon/develop Martin Kroeker 2024-05-08 19:03:57 +02:00
  • 6017ad7146 loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 gxw 2024-04-28 16:36:17 +08:00
  • d66aa63478 Merge pull request #4681 from martin-frbg/fix4662-2 Martin Kroeker 2024-05-08 01:44:32 +02:00
  • f0f1ff7820 fix HUGETLB allocation for TLS mode as well Martin Kroeker 2024-05-08 00:40:36 +02:00
  • edfe1aa471 Expose whether locking is enabled in get_config Andrew Robbins 2024-05-07 11:06:27 -04:00
  • edeb5259a1 Merge pull request #4679 from martin-frbg/fix4662 Martin Kroeker 2024-05-07 15:57:50 +02:00
  • 4376b6f7d2 Restore Loongson LA64ARCH handling Martin Kroeker 2024-05-07 14:42:01 +02:00
  • d497004746 deploy: 8735b54fa8 martin-frbg 2024-05-07 11:45:18 +00:00
  • 8735b54fa8 Merge pull request #4662 from martin-frbg/hugetlb-doc Martin Kroeker 2024-05-07 13:32:07 +02:00
  • fc10673fd3 Merge branch 'develop' into hugetlb-doc Martin Kroeker 2024-05-07 13:31:39 +02:00
  • c4e8b964f7 deploy: c20189cc82 martin-frbg 2024-05-06 15:10:57 +00:00
  • c20189cc82 Merge pull request #4677 from martin-frbg/issue4676 Martin Kroeker 2024-05-06 17:10:19 +02:00
  • bbd227ce4a Add Intel Meteor Lake and Emerald Rapids Martin Kroeker 2024-05-06 00:11:44 +02:00
  • 40dca68563 deploy: f034745ce6 martin-frbg 2024-05-04 13:59:01 +00:00
  • f034745ce6 Merge pull request #4675 from martin-frbg/issue4619 Martin Kroeker 2024-05-04 15:50:13 +02:00
  • a82ecadc11 mention LD_LIBRARY_PATH Martin Kroeker 2024-05-04 15:48:48 +02:00
  • 8ea58e57bf deploy: b859f6f191 martin-frbg 2024-05-04 13:21:19 +00:00
  • b859f6f191 Merge pull request #4617 from cyk2018/patch-1 Martin Kroeker 2024-05-04 15:20:52 +02:00
  • dc99b61380 sort unwanted interdependencies of alloc_shm and alloc_hugetlb Martin Kroeker 2024-05-04 14:49:00 +02:00
  • 9c4e10fbd1 sort hugetlb and shm alloc options Martin Kroeker 2024-05-04 14:48:02 +02:00
  • a63d71129c Merge pull request #4671 from martin-frbg/issue4668 Martin Kroeker 2024-04-30 20:06:42 +02:00
  • 3d26837a35 Suppress GCC14 error exit in the f2c-converted LAPACK Martin Kroeker 2024-04-30 19:05:18 +02:00
  • 7c915e64ca Silence a GCC14 warning/error in the f2c-converted LAPACK Martin Kroeker 2024-04-30 17:48:14 +02:00
  • 397ece7b83 deploy: edacf9b397 martin-frbg 2024-04-30 06:51:17 +00:00
  • edacf9b397 Work around spurious BLAS3 test errors on LOONGSON3R3/4 (#4667) Martin Kroeker 2024-04-30 08:50:47 +02:00
  • 6a8b157f10 deploy: 89e3fd0821 martin-frbg 2024-04-29 15:23:51 +00:00
  • 89e3fd0821 Merge pull request #4666 from martin-frbg/issue4633 Martin Kroeker 2024-04-29 17:23:20 +02:00
  • b1d722fc0c Fix cast to work with INTERFACE64 (especially on big-endian) Martin Kroeker 2024-04-29 15:37:26 +02:00
  • 8c54f71bc8 deploy: 1031d161f6 martin-frbg 2024-04-25 16:06:02 +00:00
  • 1031d161f6 Merge pull request #4663 from ayappanec/develop Martin Kroeker 2024-04-25 18:05:33 +02:00