Commit Graph

  • e284c048df Merge branch 'develop' into risc-v Xianyi Zhang 2022-12-03 11:56:55 +08:00
  • 0a24f631e9 Merge pull request #3844 from Mousius/switch-ratio-16 Martin Kroeker 2022-12-02 12:48:43 +01:00
  • 65984fbe68 Merge pull request #3847 from bartoldeman/scal-benchmark Martin Kroeker 2022-12-02 11:51:50 +01:00
  • f6f0d13b9f Merge pull request #3842 from Mousius/sve-dot Martin Kroeker 2022-12-02 08:30:51 +01:00
  • 5c3169ecd8 dscal: use ymm registers in Haswell microkernel Bart Oldeman 2022-12-01 07:48:05 -05:00
  • eea006a688 Wrap SVE header with __has_include check Chris Sidebottom 2022-11-29 17:53:38 +00:00
  • fd4f52c797 Add SVE implementation for sdot/ddot Chris Sidebottom 2022-11-17 06:49:57 +00:00
  • b6a4ef98b9 Merge pull request #3845 from Mousius/asimd-dot-opt Martin Kroeker 2022-11-30 21:07:30 +01:00
  • 2fb096315e Set SWITCH_RATIO for Arm(R) Neoverse(TM) V1 CPUs Chris Sidebottom 2022-11-25 14:46:24 +00:00
  • bae45d94d1 scal benchmark: eliminate y, move init/timing out of loop Bart Oldeman 2022-11-29 08:02:45 -05:00
  • fdac8a97c1 Add sbgemm_ncopy_8 and sbgemm_tcopy_4 lilianhuang 2022-11-29 04:46:14 -05:00
  • 387e8970cd Fix merge problem; Update compiling COMMON_OPT per review comments. Heller Zheng 2022-11-28 21:42:29 -08:00
  • 135718eafc Improve the performance of sbgemm_tcopy on neoversen2 lilianhuang 2022-11-28 04:17:54 -05:00
  • 4f7b77e08a Remove unnecessary instructions from Advanced SIMD dot Chris Sidebottom 2022-11-25 15:24:32 +00:00
  • e9a911fb9f Merge pull request #3841 from martin-frbg/lapack755+764 Martin Kroeker 2022-11-23 22:38:06 +01:00
  • bf0e8d67b5 Merge pull request #3840 from martin-frbg/lapack760 Martin Kroeker 2022-11-23 19:16:25 +01:00
  • a5470521ee Fix array indexation in copy, and fix test (Reference-LAPACK PR764) Martin Kroeker 2022-11-23 15:31:25 +01:00
  • b0393ea4e1 Fix test (Reference-LAPACK PR764) Martin Kroeker 2022-11-23 15:27:46 +01:00
  • 0d26f1a4c7 Fix wrong indexation in test (Reference-LAPACK PR755) Martin Kroeker 2022-11-23 15:22:27 +01:00
  • 19fd2d7f00 Use LSAME for character comparison (Reference-LAPACK PR755) Martin Kroeker 2022-11-23 15:19:07 +01:00
  • 663bf68dbd Merge pull request #3839 from martin-frbg/lapack758 Martin Kroeker 2022-11-23 14:57:56 +01:00
  • c2ba4e6249 Remove unnecessary return in void function call (Reference-LAPACK PR760) Martin Kroeker 2022-11-23 10:43:34 +01:00
  • 74962c7f53 Remove unnecessary return in void function call (Reference-LAPACK PR760) Martin Kroeker 2022-11-23 10:42:29 +01:00
  • d952cbf7bc Remove unnecessary return in void function call (Reference-LAPACK PR760) Martin Kroeker 2022-11-23 10:41:50 +01:00
  • 7694ff495f Remove unnecessary return in void function call (Reference-LAPACK PR760) Martin Kroeker 2022-11-23 10:40:59 +01:00
  • 825ae316e2 Fix typo in EXTERNAL (Reference-LAPACK PR760) Martin Kroeker 2022-11-23 10:36:10 +01:00
  • 730ed549e6 Fix typo in EXTERNAL (Reference-LAPACK PR760) Martin Kroeker 2022-11-23 10:35:23 +01:00
  • bc3393f703 Fix array dimension (Reference-LAPACK 758) Martin Kroeker 2022-11-23 10:31:18 +01:00
  • 0b2f8dabbf Fix array dimension (Reference-LAPACK 758) Martin Kroeker 2022-11-23 10:30:35 +01:00
  • b4c9228441 Merge pull request #3838 from martin-frbg/lapa311 Martin Kroeker 2022-11-22 17:39:51 +01:00
  • e6e2a63650 Update LAPACK version number to 3.11.0 Martin Kroeker 2022-11-22 14:02:21 +01:00
  • 8408357bab Update LAPACK version number to 3.11.0 Martin Kroeker 2022-11-22 14:01:48 +01:00
  • ba8fb8b4b2 Merge pull request #3837 from martin-frbg/lapack655+697 Martin Kroeker 2022-11-22 13:51:57 +01:00
  • cabf9453e2 Merge pull request #3836 from martin-frbg/lapack665+735 Martin Kroeker 2022-11-22 09:25:24 +01:00
  • 3918d8504e nrm2 simple optimization Heller Zheng 2022-11-21 19:06:07 -08:00
  • d321357558 Fix bug in DORCSD2BY1 (from Reference-LAPACK PR697) Martin Kroeker 2022-11-21 21:19:44 +01:00
  • afcd7e88b6 Improve convergence of DLAED4/SLAED4 (Reference-LAPACK PR655) Martin Kroeker 2022-11-21 21:18:39 +01:00
  • f8f2bebf11 Fix function documentation for LAPACK ?TPRFB (Reference-LAPACK PR665) Martin Kroeker 2022-11-21 20:01:47 +01:00
  • c45edcb537 Fix typo in comment (Reference-LAPACK PR735) Martin Kroeker 2022-11-21 19:59:33 +01:00
  • 880a3fb20f Merge pull request #3835 from martin-frbg/lapack217 Martin Kroeker 2022-11-21 19:56:28 +01:00
  • 50aba02910 Simplify ?SYSWAPR and fix its documentation (Reference-LAPACK 217) Martin Kroeker 2022-11-21 18:00:31 +01:00
  • 0b68dd6a9b Merge pull request #3834 from martin-frbg/lapack631 Martin Kroeker 2022-11-21 08:30:14 +01:00
  • 9343499256 Merge pull request #3833 from martin-frbg/lapack712+747 Martin Kroeker 2022-11-21 08:29:49 +01:00
  • 7ae4269add Use new algorithms for computing Givens rotations (Reference-LAPACK PR631) Martin Kroeker 2022-11-20 22:52:28 +01:00
  • e00f0fb26a Fix function documentation (Reference-LAPACK PR747) Martin Kroeker 2022-11-20 22:46:58 +01:00
  • 31d2145988 Set scale early for robust triangular solvers (Reference-LAPACK PR712) Martin Kroeker 2022-11-20 22:44:36 +01:00
  • 1d5a3aff0d Merge pull request #3832 from martin-frbg/lapack681+698 Martin Kroeker 2022-11-20 22:40:52 +01:00
  • c6816bb576 Use normwise criterion in multishift QZ (Reference-LAPACK PR698) Martin Kroeker 2022-11-20 19:39:12 +01:00
  • 6f09e4c121 Improve FMA usage in ?LAQR5 (Reference-LAPACK PR681) Martin Kroeker 2022-11-20 19:37:28 +01:00
  • f63c93274c Merge pull request #3831 from martin-frbg/lapack647+697+702 Martin Kroeker 2022-11-20 19:34:41 +01:00
  • aaea0804bc Fix function documentation (Reference-LAPACK PR697) Martin Kroeker 2022-11-20 16:38:57 +01:00
  • b946820502 Fix uninitialized variable (Reference-LAPACK PR647) Martin Kroeker 2022-11-20 16:36:19 +01:00
  • 9e29312c83 Fix type precision and function documentation (Reference-LAPACK PRs 647+702) Martin Kroeker 2022-11-20 16:34:45 +01:00
  • b1102fe250 Merge pull request #3830 from martin-frbg/lapack691+698 Martin Kroeker 2022-11-20 16:29:46 +01:00
  • 3f31b69121 Add quick return if scaling with one (Reference-LAPACK PR674) Martin Kroeker 2022-11-20 13:30:25 +01:00
  • 60af35bfab Fix workspace query for ?SYEVD and ?HEEVD (Reference-LAPACK PR691) Martin Kroeker 2022-11-20 13:25:21 +01:00
  • eea1636380 Use normwise criterion for INF eigenvalues in QZ (Reference-LAPACK PR698) Martin Kroeker 2022-11-20 13:22:55 +01:00
  • 1714d640f1 Merge pull request #3829 from martin-frbg/lapack684+739 Martin Kroeker 2022-11-20 13:06:51 +01:00
  • 88cd91c490 Fix stray character Martin Kroeker 2022-11-19 23:15:20 +01:00
  • f157d6d671 Add C equivalents of ?GELST (for Reference-LAPACK PR739) Martin Kroeker 2022-11-19 22:50:57 +01:00
  • 5ff46f4092 Add ?GELST (Reference-LAPACK PR739) Martin Kroeker 2022-11-19 22:49:31 +01:00
  • 1d32ce5135 Add ?GELST (Reference-LAPACK PR739) Martin Kroeker 2022-11-19 22:42:50 +01:00
  • 1497336b20 Add tests for ?GELST (Reference-LAPACK PR739) Martin Kroeker 2022-11-19 22:39:16 +01:00
  • d0afbd8d29 Add new routines for ?GELST similar to ?GELS (Reference-LAPACK PR739) Martin Kroeker 2022-11-19 22:34:42 +01:00
  • 63014e99ae Cast work array sizes to integer (Reference-LAPACK PR 684) Martin Kroeker 2022-11-19 22:31:33 +01:00
  • 3a38dad18f Merge pull request #3828 from martin-frbg/lapack703-3 Martin Kroeker 2022-11-19 22:28:48 +01:00
  • 15967809ad Define type conversions explicitly (Reference-LAPACK PR703) Martin Kroeker 2022-11-19 19:15:09 +01:00
  • 4ab4d6ff82 Merge pull request #3827 from martin-frbg/lapack703-2 Martin Kroeker 2022-11-19 18:51:23 +01:00
  • 4e60737c2d Define type conversions explicitly (Reference-LAPACK PR 703) Martin Kroeker 2022-11-19 15:22:46 +01:00
  • e9b0f5a364 Define type conversions explicitly (Reference-LAPACK PR 703) Martin Kroeker 2022-11-19 15:11:05 +01:00
  • 08bc43c73d Define type conversions explicitly (Reference-LAPACK PR 703) Martin Kroeker 2022-11-19 15:04:30 +01:00
  • 35295912a3 Define type conversions explicitly (Reference-LAPACK PR 703) Martin Kroeker 2022-11-19 14:57:54 +01:00
  • 943372bdf5 Merge branch 'develop' into develop HellerZheng 2022-11-18 10:12:46 +08:00
  • ec0ae034bc Merge pull request #3826 from martin-frbg/lapack540+725 Martin Kroeker 2022-11-17 22:07:22 +01:00
  • 9fe75af528 Add a LAPACKE interface for ?LANGB (Reference-LAPACK PR725) Martin Kroeker 2022-11-17 19:46:53 +01:00
  • 48c9c6efb9 Add ?LANGB interface (Reference-LAPACK PR725) Martin Kroeker 2022-11-17 18:19:14 +01:00
  • e4a31c0d23 add ?LANGB interface (Reference-LAPACK PR725) Martin Kroeker 2022-11-17 18:15:04 +01:00
  • 6c9dbe5afa Add a LAPACKE interface for ?LANGB (Reference-LAPACK PR725) Martin Kroeker 2022-11-17 18:09:49 +01:00
  • 4f82699ec9 Fix errors in LAPACKE ?tpmqrt for row major matrices (Reference-LAPACK PR540) Martin Kroeker 2022-11-17 18:06:17 +01:00
  • 76ae221330 Merge pull request #3824 from martin-frbg/issue3822 Martin Kroeker 2022-11-17 16:38:11 +01:00
  • 26b5009ebc Merge pull request #3820 from martin-frbg/lapack700 Martin Kroeker 2022-11-17 14:19:05 +01:00
  • f92dd6e303 change line endings from CRLF to LF Martin Kroeker 2022-11-17 10:18:36 +01:00
  • f73cfb7e2c change line endings from CRLF to LF Martin Kroeker 2022-11-17 09:39:56 +01:00
  • 1688c7da43 change line endings from CRLF to LF Martin Kroeker 2022-11-16 22:24:01 +01:00
  • 258a83a642 change line endings from CRLF to LF Martin Kroeker 2022-11-16 21:57:42 +01:00
  • ad95ad52f2 Merge pull request #3819 from martin-frbg/lapack703 Martin Kroeker 2022-11-16 20:49:54 +01:00
  • f16aa1ce7a Merge pull request #3821 from martin-frbg/lapack651 Martin Kroeker 2022-11-16 13:59:02 +01:00
  • 2a97ca615f MSVC compatibility fixes Martin Kroeker 2022-11-16 07:36:40 +01:00
  • 9702d57b11 Update Makefile.install HellerZheng 2022-11-16 11:11:04 +08:00
  • 5d0d1c5551 Remove redundant files Heller Zheng 2022-11-15 18:22:21 -08:00
  • 5dec93e93b Complete the C conversion of the xTRSYL3 files Martin Kroeker 2022-11-15 20:36:58 +01:00
  • b6d74b7fff Add f2c-converted files for the BLAS3-based Sylvester solver Martin Kroeker 2022-11-15 16:26:44 +01:00
  • b2cc310470 Add f2c-converted versions of the new BLAS3-based Sylvester solver Martin Kroeker 2022-11-15 14:23:46 +01:00
  • 379efbe5af Fix typos Martin Kroeker 2022-11-15 11:03:12 +01:00
  • bef47917bd Initial version for riscv sifive x280 Heller Zheng 2022-11-15 00:06:25 -08:00
  • 95da5141f0 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) Martin Kroeker 2022-11-14 22:21:29 +01:00
  • 2592853fc7 Restore OpenBLAS-specific changes Martin Kroeker 2022-11-14 21:47:37 +01:00
  • 52c2a0397b Restore OpenBLAS modifications to link line Martin Kroeker 2022-11-14 17:13:08 +01:00
  • bb652f65a3 Typo fix Martin Kroeker 2022-11-14 16:35:13 +01:00
  • fb42a0cf8b Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) Martin Kroeker 2022-11-14 14:06:50 +01:00