Commit Graph

  • 326b200b08 Merge pull request #3996 from martin-frbg/issue3989 Martin Kroeker 2023-04-07 23:31:51 +02:00
  • 3effdc1505 Protect CROSS_PATH against spurious addition of linebreaks from isolated dashes Martin Kroeker 2023-04-07 19:32:22 +02:00
  • 654d87d73a Merge pull request #3994 from rgommers/fix-ssyconvf-export Martin Kroeker 2023-04-07 18:15:14 +02:00
  • d677214570 Remove the badge for the dead drone.io service and add Cirrus CI in its place Martin Kroeker 2023-04-07 14:11:16 +02:00
  • a4ee1c84f0 Export ssyconvf symbol Ralf Gommers 2023-04-07 12:50:36 +01:00
  • ca8544be6d Merge pull request #3991 from martin-frbg/lapack808 Martin Kroeker 2023-04-04 15:27:17 +02:00
  • d175b8f56f Refactor ?GEBAL (Reference-LAPACK PR 808) Martin Kroeker 2023-04-03 15:02:10 +02:00
  • 5f1fb27c40 Rename cirrus.yml to .cirrus.yml Martin Kroeker 2023-04-03 11:00:17 +02:00
  • ab0755590f Merge pull request #3990 from martin-frbg/cirrus Zhang Xianyi 2023-04-03 16:54:40 +08:00
  • 65b7bf9f3e Add Apple M1 testing via Cirrus CI Martin Kroeker 2023-04-03 10:51:38 +02:00
  • 516f22b8ca Update version to 0.3.23.dev Martin Kroeker 2023-04-01 22:25:55 +02:00
  • 3e8f51e7cf Update version to 0.3.23.dev Martin Kroeker 2023-04-01 22:25:07 +02:00
  • f9a701b6dd Merge pull request #3988 from xianyi/release-0.3.0 Martin Kroeker 2023-04-01 22:24:26 +02:00
  • 394a9fbafe Increment version to 0.3.23 v0.3.23 Martin Kroeker 2023-04-01 22:18:01 +02:00
  • 8f32384633 Increment version to 0.3.23 Martin Kroeker 2023-04-01 22:17:27 +02:00
  • af3606d9fb Merge pull request #3987 from xianyi/develop Martin Kroeker 2023-04-01 22:16:24 +02:00
  • cd2e80ca2e Merge branch 'release-0.3.0' into develop Martin Kroeker 2023-04-01 22:15:52 +02:00
  • e2614eb6ce Merge pull request #3986 from martin-frbg/changelog0323 Martin Kroeker 2023-04-01 22:08:43 +02:00
  • 1f70481384 Update with 0.3.23 changes Martin Kroeker 2023-04-01 20:33:31 +02:00
  • eb0793bfd0 Merge pull request #3984 from martin-frbg/develop Martin Kroeker 2023-04-01 11:35:52 +02:00
  • 30222d0832 Merge pull request #3971 from HellerZheng/risc-v Zhang Xianyi 2023-04-01 12:43:43 +08:00
  • 36fcb52094 Fix logic - we want real OR imaginary part of X to be nonzero here Martin Kroeker 2023-04-01 00:02:54 +02:00
  • 397108fba2 serialize shared prerequisites Guillaume Horel 2023-03-31 09:22:40 -04:00
  • 281e834566 do not pass -j flag to the MAKE variable Guillaume Horel 2023-03-30 15:15:25 -04:00
  • d708951375 Merge pull request #3980 from martin-frbg/fix3941-2 Martin Kroeker 2023-03-30 06:56:05 +02:00
  • 6c431239da Split test condition in LU computation - non-denormal for computation, exact zero for reporting singularity Martin Kroeker 2023-03-29 22:14:21 +02:00
  • 23f2c4ca5b Merge pull request #3978 from martin-frbg/fix3941 Martin Kroeker 2023-03-29 16:22:27 +02:00
  • 12aabb9f9b fix conditional Martin Kroeker 2023-03-29 09:44:33 +02:00
  • fd0614cbc0 Merge pull request #3975 from martin-frbg/issue3974 Martin Kroeker 2023-03-28 22:57:27 +02:00
  • 912d713b52 redo lost edit Martin Kroeker 2023-03-28 18:31:04 +02:00
  • dc15c18efc Fix build failures seen with the NO_LAPACK option - cspr/csymv/csyr belong on the LAPACK list Martin Kroeker 2023-03-28 16:33:09 +02:00
  • 5d9d382e36 Merge pull request #3970 from linouxis9/develop Martin Kroeker 2023-03-28 16:22:27 +02:00
  • 867eee60af Merge pull request #3972 from h-vetinari/ifndef Martin Kroeker 2023-03-28 16:17:49 +02:00
  • f2659516ef remove unqualified ifdef's for NO_LAPACK(E) H. Vetinari 2023-03-28 18:51:30 +11:00
  • 6b74bee2f9 Update TARGET=x280 description. Heller Zheng 2023-03-27 18:59:24 -07:00
  • 280b6d57d1 Add more Intel Raptor Lake CPUIDs linouxis9 2023-03-28 00:55:23 +02:00
  • 13add99540 Update Changelog for 0.3.22 Martin Kroeker 2023-03-27 00:13:07 +02:00
  • c05da5960d Update Changelog for 0.3.22 (#3964) Martin Kroeker 2023-03-27 00:11:05 +02:00
  • 1c2a60e6cf Update version to 0.3.22.dev Martin Kroeker 2023-03-27 00:00:04 +02:00
  • 3c9654219f Update version to 0.3.22.dev Martin Kroeker 2023-03-26 23:37:13 +02:00
  • cf97c01244 Merge pull request #3963 from xianyi/release-0.3.0 Martin Kroeker 2023-03-26 23:34:58 +02:00
  • e46971b9d5 Merge branch 'develop' into release-0.3.0 v0.3.22 Martin Kroeker 2023-03-26 23:34:17 +02:00
  • 5736dbae16 Merge pull request #3961 from martin-frbg/lapack807 Martin Kroeker 2023-03-26 23:17:36 +02:00
  • 09ace3cf23 Merge pull request #3846 from lilh9598/sbgemm_opt Martin Kroeker 2023-03-26 19:04:57 +02:00
  • c48bbe93d4 Replace the conditionally defined index variable INDIBL with a constant 1 Martin Kroeker 2023-03-26 17:33:21 +02:00
  • 2d39e715e2 Merge pull request #3960 from martin-frbg/symmsyrk_sp Martin Kroeker 2023-03-26 17:02:39 +02:00
  • f2d6b1c70e Add multithreading threshold Martin Kroeker 2023-03-26 00:25:28 +01:00
  • a495ffc554 Rework multithreading threshold Martin Kroeker 2023-03-26 00:23:57 +01:00
  • 1659750861 Add ARMV5 to CMAKE prebuild.cmake for cross-compiles (#3959) Martin Kroeker 2023-03-26 00:22:06 +01:00
  • 7f56c31e21 Merge pull request #3958 from tttapa/develop Martin Kroeker 2023-03-25 19:34:57 +01:00
  • cf80350b89 Merge pull request #3957 from martin-frbg/issue3954 Martin Kroeker 2023-03-25 18:33:58 +01:00
  • a79ae116e2 Add ARMV6 to cmake/prebuild.cmake, based on cpuid_arm.c and param.h Pieter Pas 2023-03-23 14:18:51 +01:00
  • 280c8469d4 Fix detection of Cray CCE after detection changes for gfortran on Cray Martin Kroeker 2023-03-25 14:18:21 +01:00
  • c68afeb567 Merge pull request #3956 from martin-frbg/issue3955 Martin Kroeker 2023-03-24 23:06:25 +01:00
  • c2b01c6f81 Update to 3.11.0 Martin Kroeker 2023-03-24 22:54:43 +01:00
  • 02920be53a Merge pull request #3953 from martin-frbg/issue3801 Martin Kroeker 2023-03-24 13:51:37 +01:00
  • 244147495a Do not use multithreading for small workloads Martin Kroeker 2023-03-23 23:13:02 +01:00
  • 2885e4d898 Merge pull request #3951 from martin-frbg/issue3940 Martin Kroeker 2023-03-22 15:14:00 +01:00
  • e09e494f25 Merge pull request #3950 from martin-frbg/issue3856-syr Martin Kroeker 2023-03-22 10:42:57 +01:00
  • dac2d7aef0 Merge pull request #3952 from martin-frbg/cleanup3936 Martin Kroeker 2023-03-22 09:00:14 +01:00
  • 743ae5ab18 move declaration of openmp_nthreads under the related ifdef Martin Kroeker 2023-03-21 21:07:37 +01:00
  • 579bc86671 remove call to omp_set_num_threads Martin Kroeker 2023-03-21 20:58:56 +01:00
  • ab32f832a8 fix stray blank on continuation line Martin Kroeker 2023-03-21 08:29:05 +01:00
  • e359787e28 restore C/Z SPMV, SPR, SYR,SYMV Martin Kroeker 2023-03-21 07:43:03 +01:00
  • 6ca6eb7613 Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations Martin Kroeker 2023-03-20 23:04:12 +01:00
  • 2b77f28810 Merge pull request #3949 from martin-frbg/lapack750 Martin Kroeker 2023-03-20 22:48:06 +01:00
  • 164eafd61d Improve inline documentation of ?GEJSV (Reference-LAPACK PR 750) Martin Kroeker 2023-03-20 19:51:07 +01:00
  • 871b730dc5 Create SECURITY.md Martin Kroeker 2023-03-20 15:12:35 +01:00
  • 7719dbecde Merge pull request #3946 from martin-frbg/lapack682 Martin Kroeker 2023-03-20 13:48:57 +01:00
  • 147e2fbf87 Add numerical tests for TRECV3 (Reference-LAPACK 682) Martin Kroeker 2023-03-20 10:04:05 +01:00
  • 2a83ec1f79 Rewrite to use FMA with Householder reflectors Martin Kroeker 2023-03-20 10:00:42 +01:00
  • e1c3c34178 Standardize coding style (Reference-LAPACK 682) Martin Kroeker 2023-03-20 09:59:03 +01:00
  • 1374a2d08b This PR adapts latest spec changes Add prefix (_riscv) for all riscv intrinsics Update some intrinsics' parameter, like vfredxxxx, vmerge Heller Zheng 2023-03-19 23:59:03 -07:00
  • f7b9391119 Merge pull request #3945 from ahojnnes/user/joschonb/fix-prebuild-path Martin Kroeker 2023-03-19 23:14:18 +01:00
  • fbfe474583 f Johannes Schönberger 2023-03-19 16:23:44 +01:00
  • ec4700087e Wrap variables in cmake/prebuild in quotes to handle spaces in paths Johannes Schönberger 2023-03-19 16:09:47 +01:00
  • e8db1fe89b Merge pull request #3943 from martin-frbg/llvm15 Martin Kroeker 2023-03-18 11:24:52 +01:00
  • de937b3194 Add clang option to avoid running out of registers in AVX512 assembly Martin Kroeker 2023-03-17 21:22:37 +01:00
  • f3d21039ce Improve fix from PR3924 (#3941) Martin Kroeker 2023-03-16 15:09:32 +01:00
  • 19f17c8bc6 Merge pull request #3893 from HellerZheng/develop Zhang Xianyi 2023-03-15 10:17:13 +08:00
  • 20511dfa65 Merge pull request #3919 from sergei-lewis/risc-v-latest-rvv-intrinsics Zhang Xianyi 2023-03-15 10:16:19 +08:00
  • 8d6813ec41 Merge pull request #3938 from martin-frbg/issue3937 Martin Kroeker 2023-03-10 15:33:07 +01:00
  • 19d6795122 Merge pull request #3936 from martin-frbg/issue3933 Martin Kroeker 2023-03-10 15:32:50 +01:00
  • 50c263716e Correct DGEMM_UNROLL_M value for A53 in cross-compile Martin Kroeker 2023-03-10 11:53:24 +01:00
  • e298d613fa initialize status variable for openblas_set_num_threads Martin Kroeker 2023-03-08 23:43:15 +01:00
  • 05aa88268f add status variable for openblas_set_num_threads Martin Kroeker 2023-03-08 23:41:57 +01:00
  • 9f9d0012a3 observe thread limit imposed by openblas_set_num_threads() Martin Kroeker 2023-03-08 23:38:34 +01:00
  • fe5d3ca8e0 Merge pull request #3935 from martin-frbg/omp_place_num Martin Kroeker 2023-03-08 22:41:18 +01:00
  • e38ab079a0 Fix OpenMP thread counting returning places rather than cores Martin Kroeker 2023-03-08 19:17:33 +01:00
  • 9feaaa3f39 Merge pull request #3932 from martin-frbg/issue3931 Martin Kroeker 2023-03-03 12:01:25 +01:00
  • 8272dfc552 Handle unrecognized ASM compiler (from Arm Compiler 22.1) Martin Kroeker 2023-03-03 00:21:59 +01:00
  • f616c86404 Merge pull request #3930 from sergei-lewis/dot-kernel-early-bail Martin Kroeker 2023-03-02 16:46:25 +01:00
  • cb0a70e0e2 dot.c early bail fix Sergei Lewis 2023-03-02 09:51:10 +00:00
  • 9b61be4545 factoring riscv64/dot.c fix into separate PR as requested Sergei Lewis 2023-03-01 17:40:42 +00:00
  • 5925178d03 Merge pull request #3924 from martin-frbg/numpy22025 Martin Kroeker 2023-02-27 15:59:44 +01:00
  • f58080278f Merge pull request #3923 from xctan/fix-cmake-riscv64 Zhang Xianyi 2023-02-27 09:39:30 +08:00
  • 3d27cbd9a3 avoid overflow in division Martin Kroeker 2023-02-26 23:44:14 +01:00
  • a39ced0551 avoid overflow in division Martin Kroeker 2023-02-26 23:42:20 +01:00
  • 6a0de3aa39 Add missing RISC-V architecture in arch.cmake xctan 2023-02-26 20:21:57 +08:00
  • 2406958629 * update intrinsics to match latest spec at https://github.com/riscv-non-isa/rvv-intrinsic-doc (in particular, __riscv_ prefixes for rvv intrinsics) * fix multiple numerical stability and corner case issues * add a script to generate arbitrary gemm kernel shapes * add a generic zvl256b target to demonstrate large gemm kernel unrolls Sergei Lewis 2023-02-24 10:44:55 +00:00