Commit Graph

  • b1f6c4a1e4 Merge pull request #4160 from Mousius/sve-sniff Martin Kroeker 2023-07-26 13:46:16 +02:00
  • 9ff84dc3f2 remove unused status variable Martin Kroeker 2023-07-26 10:02:44 +02:00
  • 94adf98bb8 remove unused status variable Martin Kroeker 2023-07-26 08:31:37 +02:00
  • 3326b924b3 remove status variable blas_num_threads_set; initialize openmp thread maximum on startup Martin Kroeker 2023-07-26 00:31:24 +02:00
  • ea669c8ae9 simplify openmp thread limit handling Martin Kroeker 2023-07-26 00:27:14 +02:00
  • 24586bc4ff Disambiguate whilelt Chris Sidebottom 2023-07-25 14:50:38 +01:00
  • f971ef55f2 Add ARMV8SVE to AArch64 Dynamic Dispatch Chris Sidebottom 2023-07-25 11:56:33 +01:00
  • aea2a4622b Use latest non-SVE kernels in ARMV8SVE Chris Sidebottom 2023-07-25 14:05:17 +01:00
  • 826a9d5fa4 Adds tail undisturbed for RVV Level 2 operations Octavian Maghiar 2023-07-25 11:36:23 +01:00
  • 42cbcf58bf EMPTY: [skip ci] [skip cirrus] steppi 2023-07-24 16:38:52 -04:00
  • b92033e3be EMPTY: [skip ci] steppi 2023-07-24 16:20:56 -04:00
  • 7c8ea130a3 Set up cirun workflow for arm64 graviton steppi 2023-07-18 14:36:03 -04:00
  • 7976deff80 Fix file permissions (issue 4095) martin-frbg 2023-07-23 20:37:07 +02:00
  • fec4867748 Fix file permissions (issue 4095) martin-frbg 2023-07-23 20:31:55 +02:00
  • 25037ae875 Fix actual arguments in some LAPACK procedure calls (Reference-LAPACK PR 885) (#4155) Martin Kroeker 2023-07-22 23:14:25 +02:00
  • 8df0289db6 Adds tail undisturbed for RVV Level 1 operations Octavian Maghiar 2023-07-20 15:28:35 +01:00
  • bd01dc354b Merge pull request #4151 from martin-frbg/issue4101 Martin Kroeker 2023-07-20 13:21:07 +02:00
  • 3bdcf3259d Merge branch 'xianyi:develop' into issue4101 Martin Kroeker 2023-07-20 08:23:20 +02:00
  • 5cb4f5940d Merge pull request #4152 from martin-frbg/shutup-4098 Martin Kroeker 2023-07-20 08:22:57 +02:00
  • 76ef1672f8 Override DSDOT with generic code to get rid of qemu precision error Martin Kroeker 2023-07-19 22:31:07 +02:00
  • 8a27a274a1 Merge pull request #4150 from martin-frbg/armsve Martin Kroeker 2023-07-19 22:25:55 +02:00
  • b34f19a365 Ensure that a premature call to set_num_threads will not overwrite unrelated memory Martin Kroeker 2023-07-19 22:19:22 +02:00
  • 66904f8148 Ensure that a premature call will not overwrite unrelated memory Martin Kroeker 2023-07-19 22:14:34 +02:00
  • 5c58994eb2 Add fallback warning Martin Kroeker 2023-07-19 18:27:41 +02:00
  • ca7199f249 Treat newer Neoverse as N1 if SVE unavailable (may be disabled in container/cloud env) Martin Kroeker 2023-07-19 14:48:42 +02:00
  • 9e81a3a0a2 Merge pull request #4100 from martin-frbg/cirrusm1gccmake Martin Kroeker 2023-07-18 08:04:29 +02:00
  • ada9e442eb Add Apple M1 build using gcc,gmake and OpenMP Martin Kroeker 2023-07-17 23:13:56 +02:00
  • 81228fc586 Merge pull request #4147 from martin-frbg/aldern Martin Kroeker 2023-07-17 09:11:23 +02:00
  • 8da6aca2ec Support Alder Lake N (fam 6 exmodel 11 model 14) as Haswell Martin Kroeker 2023-07-16 22:15:15 +02:00
  • b61e64da6f Merge pull request #4142 from exyntech/armv8-as-arm64 Martin Kroeker 2023-07-15 23:15:49 +02:00
  • f82a197143 Merge pull request #4137 from felixonmars/patch-1 Martin Kroeker 2023-07-15 19:41:06 +02:00
  • 0a637cc403 Fix workspace query corner cases to always return at least 1 (Reference-LAPACK PR 883) (#4146) Martin Kroeker 2023-07-15 16:37:42 +02:00
  • 4c43d1eeba Fix C prototypes and LAPACKE headers for ?GEDMD/?GEDMDQ (#4134) Martin Kroeker 2023-07-15 07:47:19 +02:00
  • 49077e7bde Merge pull request #4145 from martin-frbg/issue4144 Martin Kroeker 2023-07-14 12:44:05 +02:00
  • 3d31191b0f Work around Clang failing to disambiguate SVE intrinsics and add AppleClang crossbuild to MacOS/arm64 DYNAMIC_ARCH in AzureCI (#4140) Martin Kroeker 2023-07-14 11:06:48 +02:00
  • 04cdf5efb4 fix typo and missing declaration Martin Kroeker 2023-07-14 00:05:00 +02:00
  • 5e1103b8d7 Update rotg.c Martin Kroeker 2023-07-13 23:35:38 +02:00
  • cfa0a80664 Restore initialization of data variables Martin Kroeker 2023-07-13 23:23:12 +02:00
  • 9567305e4c Restore initialization of data01,data02 Martin Kroeker 2023-07-13 23:21:18 +02:00
  • 4cc232bb07 Merge branch 'xianyi:develop' into issue4130 Martin Kroeker 2023-07-13 21:40:22 +02:00
  • 7c75c8b2fe fix truncated edit Martin Kroeker 2023-07-13 21:40:12 +02:00
  • 0f2ce93904 typo fix Martin Kroeker 2023-07-13 10:56:59 +02:00
  • affeef0b9c Fix gmake build not always picking the right ARM64 arch options for clang (#4136) Martin Kroeker 2023-07-13 08:38:03 +02:00
  • e08743d977 Update to use safe scaling algorithm from Reference-LAPACK PR 527 Martin Kroeker 2023-07-12 23:02:36 +02:00
  • 45b2cd2fb2 treat armv8 CMAKE_SYSTEM_PROCESSOR as arm64 Andy Mroczkowski 2023-07-12 09:37:45 -04:00
  • 1e4a3a2b5e Fixes RVV masked intrinsics for izamax/izamin kernels Octavian Maghiar 2023-07-12 12:55:50 +01:00
  • 494313e75e Merge pull request #4138 from martin-frbg/fix4126 Martin Kroeker 2023-07-11 20:41:02 +02:00
  • afef854863 Add C versions of C/ZRSCL Martin Kroeker 2023-07-11 17:08:27 +02:00
  • 35dedb68ce Add C versions of C/ZRSCL Martin Kroeker 2023-07-11 17:07:30 +02:00
  • a721fccfdc Fix riscv64 detection in system_check.cmake Felix Yan 2023-07-11 16:34:20 +03:00
  • 2edebc5fb9 Merge pull request #4133 from martin-frbg/issue4132 Martin Kroeker 2023-07-10 01:50:38 +02:00
  • bcebe9b4c9 Merge pull request #4131 from martin-frbg/lapack878 Martin Kroeker 2023-07-10 01:50:16 +02:00
  • 26fd4b9c8c Merge pull request #4129 from martin-frbg/lapack876 Martin Kroeker 2023-07-10 01:49:55 +02:00
  • 22ad23abb1 Merge pull request #4126 from martin-frbg/lapack839 Martin Kroeker 2023-07-10 01:49:33 +02:00
  • 351645b8af Merge pull request #4123 from martin-frbg/lapack867 Martin Kroeker 2023-07-10 01:48:18 +02:00
  • f5413447aa Merge pull request #4122 from martin-frbg/issue4121 Martin Kroeker 2023-07-09 22:57:44 +02:00
  • 5dd1d9cacd Merge pull request #4120 from martin-frbg/jenkinsbadge Martin Kroeker 2023-07-09 22:57:11 +02:00
  • 15dfb2f2cf Merge pull request #4118 from XiWeiGu/develop Martin Kroeker 2023-07-09 22:56:47 +02:00
  • 7e93ab1b9e Fix info code returned for invalid ldb Martin Kroeker 2023-07-09 17:00:25 +02:00
  • 63ec2ad395 Fix computation of UPLO in LAPACKE_?larfb (Reference-LAPACK PR 878) Martin Kroeker 2023-07-08 10:00:05 +02:00
  • bed3a6a304 Fix segfault when NRHS is zero (Reference-LAPACK PR 876) Martin Kroeker 2023-07-07 10:13:41 +02:00
  • e3277e134c Add CRSCL/ZRSCL (Reference-LAPACK PR 839) Martin Kroeker 2023-07-05 14:22:59 +02:00
  • 1d399ad0d6 Add CRSCL/ZRSCL (Reference-LAPACK PR839) Martin Kroeker 2023-07-05 14:17:26 +02:00
  • 069353bd44 Add reciprocal scaling of a complex vector and use it in C/ZGETF2 (Reference-LAPACK PR839) Martin Kroeker 2023-07-05 14:13:58 +02:00
  • e1958eb705 Fixes RVV masked intrinsics for iamax/iamin/imax/imin kernels Octavian Maghiar 2023-07-05 11:34:00 +01:00
  • 1363a7c4f1 Correct the order of eigenvalues/vector for 2x2 matrices (Reference-LAPACK PR 867) Martin Kroeker 2023-07-05 10:10:13 +02:00
  • 2f9f70584e Fix target list syntax for ARM64 DYNAMIC_ARCH Martin Kroeker 2023-07-05 09:48:54 +02:00
  • ac698cedad Add compiler options for ARM64 SVE targets in DYNAMIC_ARCH builds Martin Kroeker 2023-07-05 09:47:49 +02:00
  • a32af56761 Add status badges for OSUOSL's POWERCI and IBMZ-CI services Martin Kroeker 2023-07-02 21:50:15 +02:00
  • 4d0b7fbec0 Adjust M1 crossbuilds after image autoupgrade and add an M1-to-Android crossbuild on Cirrus CI (#4117) Martin Kroeker 2023-07-02 12:38:57 +02:00
  • 6deb52812d LoongArch64: Add WhereAmI() gxw 2023-06-30 14:11:01 +08:00
  • 2183dbcfe2 Merge pull request #4113 from xianyi/revert-4103-windows_perf Martin Kroeker 2023-06-30 00:00:25 +02:00
  • bb862b82d5 Fix integer overflow in multithreading threshold calculation for SYMM/SYRK (#4116) Martin Kroeker 2023-06-29 23:59:25 +02:00
  • cbb5deb2cc Merge pull request #4114 from xianyi/jenkins Martin Kroeker 2023-06-29 08:31:06 +02:00
  • f7b916076b Add Jenkins configuration files for OSUOSL powerci and ibmz-ci jenkins Martin Kroeker 2023-06-28 21:30:59 +02:00
  • e14a025bb1 Temporily walk around zaxpy vector kernel bug. Xianyi Zhang 2023-06-28 11:17:38 +00:00
  • 616fdea82a Revert "Improve Windows threading performance scaling" revert-4103-windows_perf Martin Kroeker 2023-06-28 09:45:17 +02:00
  • b4f233e881 Merge pull request #4108 from martin-frbg/c910-iamax Martin Kroeker 2023-06-27 18:15:18 +02:00
  • ceaee7dc64 remove the limitation to -O1 again Martin Kroeker 2023-06-27 16:13:23 +02:00
  • 772b0cc715 Fix early bailout Martin Kroeker 2023-06-27 16:12:27 +02:00
  • d6be5036d7 Fix IDAMAX Martin Kroeker 2023-06-26 21:19:33 +02:00
  • dc2439108d Merge pull request #4107 from martin-frbg/lapack860 Martin Kroeker 2023-06-26 21:18:19 +02:00
  • 3688c42628 Merge pull request #4106 from martin-frbg/lapack852 Martin Kroeker 2023-06-26 18:09:28 +02:00
  • 49202228d7 Remove OMP-private variable outside OMP context (LAPACK PR860) Martin Kroeker 2023-06-26 16:29:04 +02:00
  • a0e413ef05 Update ssytrd_sb2st.FRemove OMP-private variable outside OMP context (LAPACK PR860) Martin Kroeker 2023-06-26 16:28:15 +02:00
  • 5b69217808 Update zhetrd_hb2st.FRemove OMP-private variable outside OMP context (LAPACK PR860) Martin Kroeker 2023-06-26 16:26:21 +02:00
  • b9fb63c05e Remove OMP-private variable outside OMP context (LAPACK PR860) Martin Kroeker 2023-06-26 16:25:08 +02:00
  • 329bd3410b Remove legacy warning comments and rename variable LAMDA to LAMBDA (Reference-LAPACK PR 852) Martin Kroeker 2023-06-26 15:44:10 +02:00
  • 4ecb68554a Merge pull request #4086 from sl1pkn07/fix4083 Martin Kroeker 2023-06-26 14:53:34 +02:00
  • 8488dc9876 Merge pull request #4105 from martin-frbg/c910v-fix Martin Kroeker 2023-06-26 01:19:39 +02:00
  • b13787971e Enforce -O1 to work around miscompilation by the vendor gcc Martin Kroeker 2023-06-25 22:37:56 +02:00
  • 1fe96f8da7 Fix failures to handle increments of zero Martin Kroeker 2023-06-25 22:36:57 +02:00
  • f863895ebf Merge pull request #4104 from martin-frbg/lapack855 Martin Kroeker 2023-06-25 20:58:49 +02:00
  • ebef09219b Update Makefile.riscv64 utest-riscv Martin Kroeker 2023-06-25 16:20:19 +02:00
  • 7acff1be1c Merge pull request #4103 from mseminatore/windows_perf Martin Kroeker 2023-06-25 16:01:12 +02:00
  • 572e482b38 Fix inline documentation of S/DLASD0 (Reference-LAPACK PR 855) Martin Kroeker 2023-06-25 12:42:05 +02:00
  • d6991dd230 fix missing #endif Mark Seminatore 2023-06-24 15:43:32 -07:00
  • 7783a9af02 attempt to fix old mingw gcc issue Mark Seminatore 2023-06-24 14:35:11 -07:00
  • 6001418057 Add pragma to restrict optimization Martin Kroeker 2023-06-24 12:34:43 +02:00
  • 426cd09595 Add pragma to restrict optimization Martin Kroeker 2023-06-24 12:33:55 +02:00