Commit Graph

  • a48b117636 Update version information for 0.3.28 Martin Kroeker 2024-08-12 18:22:20 +02:00
  • d8f740791a tweak threshold a little more to cover POWER10 fma Martin Kroeker 2024-08-12 14:50:49 +02:00
  • 73e13b0273 flesh out HERK prototype Martin Kroeker 2024-08-12 14:45:40 +02:00
  • 824306baab flesh out HERK prototype Martin Kroeker 2024-08-12 14:44:13 +02:00
  • 7ca835a82c address clang array overflow warning Martin Kroeker 2024-08-10 13:44:56 +02:00
  • a87c4d26dd Merge pull request #4857 from nekopsykose/ppc Martin Kroeker 2024-08-10 00:15:28 +02:00
  • 1265eee85c fix cmake typo for power10 cc version check psykose 2024-08-09 20:38:05 +02:00
  • f0e9e93a2b deploy: cb38d666da martin-frbg 2024-08-09 01:41:29 +00:00
  • cd3945b998 Update version to 0.3.28.dev Martin Kroeker 2024-08-08 23:09:45 +02:00
  • cbd321aecb Update versin to 0.3.28.dev Martin Kroeker 2024-08-08 23:08:52 +02:00
  • cb38d666da Merge pull request #4855 from OpenMathLib/release-0.3.0 Martin Kroeker 2024-08-08 23:08:07 +02:00
  • 5ef8b19646 Merge pull request #4854 from OpenMathLib/develop v0.3.28 release-0.3.0 Martin Kroeker 2024-08-08 22:41:46 +02:00
  • 884a949a0d Merge branch 'release-0.3.0' into develop Martin Kroeker 2024-08-08 22:41:26 +02:00
  • 116bc767d8 Update version to 0.3.28 Martin Kroeker 2024-08-08 22:23:02 +02:00
  • 91d6722a3d Update version to 0.3.28 Martin Kroeker 2024-08-08 22:22:24 +02:00
  • 2c8e001efe Merge pull request #4853 from martin-frbg/changelog0328 Martin Kroeker 2024-08-08 21:14:40 +02:00
  • e33ee60651 deploy: 1c2bfea1bb martin-frbg 2024-08-08 17:17:44 +00:00
  • 1c2bfea1bb Merge pull request #4852 from martin-frbg/fix4814 Martin Kroeker 2024-08-08 19:16:48 +02:00
  • 1df95bb23a Update Changelog.txt for 0.3.28 Martin Kroeker 2024-08-08 18:51:25 +02:00
  • 7878976236 disable forwarding from SBGEMM to SBGEMV for now Martin Kroeker 2024-08-08 18:03:38 +02:00
  • d92cc96978 Merge pull request #4851 from martin-frbg/test3m Martin Kroeker 2024-08-08 00:07:17 +02:00
  • 76db713e79 fix invocation of GEMM3M tests Martin Kroeker 2024-08-07 21:37:20 +02:00
  • deae7cf1ec Merge pull request #4850 from martin-frbg/generic_3m Martin Kroeker 2024-08-07 21:35:38 +02:00
  • 46e331a917 remove the unworkable GEMM3M restriction from GENERIC again Martin Kroeker 2024-08-07 19:41:10 +02:00
  • ccc23338d7 have the dummy GEMM3M kernel at least forward to regular GEMM Martin Kroeker 2024-08-07 19:39:02 +02:00
  • fe0a69e308 even less invasive Harmen Stoppels 2024-08-07 16:43:45 +02:00
  • f49371c1ba Set CMake 3.0 policies to NEW Harmen Stoppels 2024-08-07 16:40:11 +02:00
  • 1ef9f24b39 Revert "require consistent minimal cmake version" Harmen Stoppels 2024-08-07 16:37:02 +02:00
  • a24acffaef deploy: 753c7ebe17 martin-frbg 2024-08-07 12:10:30 +00:00
  • 753c7ebe17 Merge pull request #4835 from martin-frbg/revertwin4359 Martin Kroeker 2024-08-07 14:09:32 +02:00
  • 5b07ec643c require consistent minimal cmake version Harmen Stoppels 2024-08-07 09:43:47 +02:00
  • b0ac6d8f10 deploy: 3b8d7dfdca martin-frbg 2024-08-06 22:15:41 +00:00
  • 3b8d7dfdca Merge pull request #4846 from martin-frbg/lapack1025 Martin Kroeker 2024-08-07 00:04:37 +02:00
  • 797ae08dbe Add explanation of LAPACK_STRLEN Martin Kroeker 2024-08-06 21:38:00 +02:00
  • 923b79de47 make the type of the hidden arguments configurable via LAPACK_STRLEN (Reference-LAPACK PR 1025) Martin Kroeker 2024-08-06 17:55:14 +02:00
  • cc36db643e Support new LAPACK build option LAPACK_STRLEN Martin Kroeker 2024-08-06 17:31:03 +02:00
  • 7e8118d94e Support new build option LAPACK_STRLEN Martin Kroeker 2024-08-06 17:30:17 +02:00
  • 5bdd3a05f0 Merge pull request #4841 from martin-frbg/lapack1033 Martin Kroeker 2024-08-05 23:50:40 +02:00
  • ae9e0e36c3 Merge pull request #4842 from martin-frbg/lapack1030 Martin Kroeker 2024-08-05 22:23:44 +02:00
  • bce48d4a13 Fix typos and sytrd boundary workspace (Reference-LAPACK PR 1030) Martin Kroeker 2024-08-05 17:37:07 +02:00
  • c8b4ceca85 prevent compilers from using FMA (Reference-LAPACK PR 1033) Martin Kroeker 2024-08-05 16:45:05 +02:00
  • 47b38c50ff deploy: 14a8a9a43c martin-frbg 2024-08-05 13:36:02 +00:00
  • 14a8a9a43c Merge pull request #4840 from martin-frbg/issue4823 Martin Kroeker 2024-08-05 15:35:25 +02:00
  • a4845fa12d set MACOSX_RPATH to true on Apple Martin Kroeker 2024-08-04 23:38:22 +02:00
  • 19f8a8d61c Merge pull request #4839 from martin-frbg/fix4794 Martin Kroeker 2024-08-04 21:38:58 +02:00
  • cf483d9f64 Merge pull request #4836 from martin-frbg/issue4275-3 Martin Kroeker 2024-08-04 12:27:23 +02:00
  • 50397e017a Merge pull request #4838 from martin-frbg/fix4662-3 Martin Kroeker 2024-08-04 11:32:10 +02:00
  • 4143af7855 deploy: ae27b02213 martin-frbg 2024-08-04 08:12:17 +00:00
  • ae27b02213 Merge pull request #4837 from martin-frbg/dyn_riscv_cmake Martin Kroeker 2024-08-04 10:11:40 +02:00
  • f1c9803f9a add proper return statement Martin Kroeker 2024-08-04 00:14:31 +02:00
  • 60abcc3991 add proper return statement Martin Kroeker 2024-08-04 00:13:31 +02:00
  • 5257f807a9 fix invalid ifdef syntax in HUGETLB handling Martin Kroeker 2024-08-04 00:03:17 +02:00
  • 2aed90171a Add riscv sources for DYNAMIC_ARCH Martin Kroeker 2024-08-03 23:58:10 +02:00
  • e8bd97ab4b add RISCV64 entries for DYNAMIC_ARCH Martin Kroeker 2024-08-03 23:56:59 +02:00
  • f40819476c mention RISCV64 as a permitted architecture for DYNAMIC_ARCH Martin Kroeker 2024-08-03 23:54:35 +02:00
  • 7af3c552d3 use TARGET rather than CORE from Makefile.conf_last to fill in pkgconfig Martin Kroeker 2024-08-03 23:51:57 +02:00
  • a3dfcd8da7 deploy: 2c2b6bcf96 martin-frbg 2024-08-03 16:52:41 +00:00
  • 2c2b6bcf96 Merge pull request #4831 from martin-frbg/gemmforward Martin Kroeker 2024-08-03 18:52:11 +02:00
  • 6468dc1142 restore the coarse locking of the pre-4359 version Martin Kroeker 2024-08-02 16:39:47 +02:00
  • abff4baa4d re-enable queue struct members related to locking Martin Kroeker 2024-08-02 16:37:01 +02:00
  • 1a7b8c650d Merge branch 'develop' into betterPowerGEMVTail Chip Kerchner 2024-08-01 14:59:12 -05:00
  • e2334d0218 Remove GEMV hack. Chip Kerchner 2024-08-01 14:44:40 -05:00
  • 42d8865234 fix typo Martin Kroeker 2024-08-01 12:24:45 +02:00
  • 9eecd0d33b enable GEMM/GEMV forwarding for riscv and ppc Martin Kroeker 2024-07-31 23:29:12 +02:00
  • fcb88b9d52 enable GEMM/GEMV forwarding for riscv and ppc Martin Kroeker 2024-07-31 23:21:35 +02:00
  • 9afd0c8afd Merge pull request #4814 from Mousius/gemv-proxy Martin Kroeker 2024-07-31 23:18:01 +02:00
  • 96fa4775ae deploy: edbf093c98 martin-frbg 2024-07-31 17:45:49 +00:00
  • edbf093c98 Update zarch SCAL kernels to handle INF and NAN arguments (#4829) Martin Kroeker 2024-07-31 19:45:15 +02:00
  • ba2e989c67 Add accumulators to AArch64 GEMV Kernels Chris Sidebottom 2024-07-31 13:07:35 +01:00
  • b26424c6a2 Allow opt into GEMM -> GEMV forwarding Chris Sidebottom 2024-07-24 11:25:28 +01:00
  • 90eb863d4b Re-add accidental removal Chris Sidebottom 2024-07-23 23:39:07 +01:00
  • 28b5334f22 Complete implementation of GEMV forwarding Chris Sidebottom 2024-07-23 20:42:39 +00:00
  • 3db5dbc88e forward to GEMV when one argument is actually a vector Martin Kroeker 2024-05-20 22:40:04 +02:00
  • 5b041a4c03 deploy: 136a4edc5f martin-frbg 2024-07-30 20:19:50 +00:00
  • 136a4edc5f Merge pull request #4830 from martin-frbg/jenk Martin Kroeker 2024-07-30 22:19:14 +02:00
  • 86c15f028b Update Jenkinsfile.pwr Martin Kroeker 2024-07-30 21:21:34 +02:00
  • a13015b656 try requesting ubuntu22 instead of latest Martin Kroeker 2024-07-30 19:10:18 +02:00
  • d11e734002 Merge pull request #4827 from Mousius/a64fx-gcc11 Martin Kroeker 2024-07-29 16:36:13 +02:00
  • 54ce33e851 Fix GCC11 check for A64FX target Chris Sidebottom 2024-07-29 15:28:59 +01:00
  • 2e644584b9 deploy: 6d071f1a1c martin-frbg 2024-07-29 13:34:20 +00:00
  • 6d071f1a1c Merge pull request #4826 from Mousius/a64fx-fallback Martin Kroeker 2024-07-29 15:33:43 +02:00
  • 3ed226d3f8 Re-add ISCLANG filter Chris Sidebottom 2024-07-29 11:32:59 +01:00
  • 85ca003ae7 Add fallback compile options for A64FX target Chris Sidebottom 2024-07-29 11:14:25 +01:00
  • 260f47b0b7 deploy: 05bf35f296 martin-frbg 2024-07-27 18:28:39 +00:00
  • 05bf35f296 Merge pull request #4822 from martin-frbg/issue4821 Martin Kroeker 2024-07-27 20:28:06 +02:00
  • 175008caf8 harden against a dashed suffix to the gcc version number Martin Kroeker 2024-07-27 19:08:02 +02:00
  • 886acfc444 Merge pull request #4819 from martin-frbg/issue4776 Martin Kroeker 2024-07-26 16:57:35 +02:00
  • 597effebfe deploy: 092986582f martin-frbg 2024-07-26 13:21:06 +00:00
  • 4460d3ee7f re-enable the sgesdd benchmark Martin Kroeker 2024-07-26 15:07:52 +02:00
  • 092986582f Merge pull request #4818 from martin-frbg/docs_winbuild Martin Kroeker 2024-07-26 14:57:53 +02:00
  • 9d3b4893e4 deploy: 25e148ec58 martin-frbg 2024-07-26 12:57:17 +00:00
  • 25e148ec58 Merge pull request #4817 from martin-frbg/fix4807 Martin Kroeker 2024-07-26 14:56:44 +02:00
  • a090011fbf just use numeric constants in dimensions Martin Kroeker 2024-07-26 12:56:12 +02:00
  • 7006492863 replace "Preview" in the MSVC vcvarsall path with "Community" Martin Kroeker 2024-07-26 12:49:57 +02:00
  • db5328e85b make array dimensions constant Martin Kroeker 2024-07-26 12:45:39 +02:00
  • d9ae4609fb remove C99 requirement Martin Kroeker 2024-07-26 11:15:33 +02:00
  • a875304eb0 fix inverted conditional for NAN handling Martin Kroeker 2024-07-26 09:50:20 +02:00
  • 24acdd6bbb correct offset Martin Kroeker 2024-07-26 09:49:24 +02:00
  • fb7c53c5e5 Merge pull request #4807 from martin-frbg/scalfixes Martin Kroeker 2024-07-25 23:42:50 +02:00
  • 15c53dd2e0 Merge pull request #4794 from XiWeiGu/Fixed_Numpy_CI_Test Martin Kroeker 2024-07-25 23:42:13 +02:00