Commit Graph

  • 0d72705adc fix segfault when executing with row major (Reference-LAPACK 768) Martin Kroeker 2023-05-24 11:41:28 +02:00
  • abbc387145 fix segfault when executing with row major Martin Kroeker 2023-05-24 11:40:25 +02:00
  • b0a72586a3 fix segfault when executing with row major Martin Kroeker 2023-05-24 11:39:20 +02:00
  • c0da3af13b Fix segfault when executing with row major Martin Kroeker 2023-05-24 11:37:32 +02:00
  • d3bf5a5401 Combine two reduction operations of zhe/symv into one, with tail undisturbed set. sh-zheng 2023-05-22 22:39:45 +08:00
  • 71e4125795 Fix syscall error on non-x86 platform Honglin Zhu 2023-05-22 21:59:59 +08:00
  • 9e80a194d6 Fix dynamic_list build and gcc version check error Honglin Zhu 2023-05-21 19:52:58 +08:00
  • d96d386d13 Add CMake dynamic build for Sapphire Rapids Honglin Zhu 2023-05-20 14:13:49 +08:00
  • a76afdc047 Compatible with older version of GNU make Honglin Zhu 2023-05-19 16:22:01 +08:00
  • 18d7afe69d Add rvv support for zsymv and activate rvv support for zhemv sh-zheng 2023-05-20 01:19:44 +08:00
  • 88c205c958 Merge pull request #4047 from martin-frbg/lapack827 Martin Kroeker 2023-05-19 07:21:22 +02:00
  • 90f041e348 Invoke the syscall to allow the use of amx tiles Honglin Zhu 2023-05-19 00:21:16 +08:00
  • 0b83088887 spr dynamic arch support Honglin Zhu 2023-05-19 00:19:50 +08:00
  • f249ccb741 Fix spr sbgemm error Honglin Zhu 2023-05-18 23:51:37 +08:00
  • a789211a2e Add interfaces for [cz]unhr_col and [sd]orhr_col (Reference-LAPACK PR 827) Martin Kroeker 2023-05-18 23:56:27 +02:00
  • 58ac660d07 Add interfaces for [cz]unhr_col and [sd]orhr_col (Reference-LAPACK PR 827) Martin Kroeker 2023-05-18 23:52:07 +02:00
  • 2182cc0ff5 Add interfaces for [cz]unhr_col and [sd]orhr_col (Reference-LAPACK PR 827) Martin Kroeker 2023-05-18 23:48:21 +02:00
  • 4f10899f5a Add interfaces for [cz]unhr_col and [sd]orhr_col (Reference-LAPACK PR 827) Martin Kroeker 2023-05-18 23:43:18 +02:00
  • bc967e7709 Add interfaces for [cz]unhr_col and [sd]orhr_col (Reference-LAPACK PR 827) Martin Kroeker 2023-05-18 23:41:46 +02:00
  • 2bbd61950e Add interfaces for [cz]unhr_col and [sd]orhr_col (Reference-LAPACK PR 827) Martin Kroeker 2023-05-18 23:35:57 +02:00
  • 941a34bb96 Merge pull request #4046 from martin-frbg/lapack795 Martin Kroeker 2023-05-18 21:55:05 +02:00
  • ca1791cfeb Extend tests for error exit sysv/sytd2/gehd2 (Reference-LAPACK PR 795) Martin Kroeker 2023-05-18 17:35:00 +02:00
  • 0c38ebd599 Extend tests for error exit sysv/sytd2/gehd2 (Reference-LAPACK PR 795) Martin Kroeker 2023-05-18 17:31:55 +02:00
  • be05ba4374 Fix typos in comments and documentation of LAPACK (Reference-LAPACK PR 820) (#4045) Martin Kroeker 2023-05-18 16:28:20 +02:00
  • 9f2233bfdf Merge pull request #4044 from martin-frbg/lapack814 Martin Kroeker 2023-05-17 16:16:21 +02:00
  • 47715b5726 Fix typos in comments (Reference-LAPACK PR 814) Martin Kroeker 2023-05-17 14:36:21 +02:00
  • b01894adcb Fix typos in comments (Reference-LAPACK PR 814) Martin Kroeker 2023-05-17 14:35:13 +02:00
  • a82c1443db Fix typos in comments (Reference-LAPACK PR 814) Martin Kroeker 2023-05-17 14:33:46 +02:00
  • 617e8bcfe7 Merge pull request #4043 from martin-frbg/lapack809-811-812 Martin Kroeker 2023-05-17 06:38:11 +02:00
  • 5fbd5f531b Fix typo in description of VR argument (Reference-LAPACK 812) Martin Kroeker 2023-05-16 20:05:05 +02:00
  • 02efa8d6be Fix typos in comments (Reference-LAPACK 811) Martin Kroeker 2023-05-16 20:01:47 +02:00
  • c5f7e46526 Fix typos and errors in comments (Reference-LAPACK 809) Martin Kroeker 2023-05-16 19:54:42 +02:00
  • 86f48997c7 CirrusCI: Add Neoverse build with OpenMP (#4042) Martin Kroeker 2023-05-16 12:01:50 +02:00
  • e2779c852f Do not build the tests when only the CBLAS interface is selected (#4041) Martin Kroeker 2023-05-15 20:49:56 +02:00
  • ccad94162a Merge pull request #4039 from klho/develop Martin Kroeker 2023-05-14 10:51:24 +02:00
  • df1b1f6a91 More detailed error message in [z]imatcopy.c. Ken Ho 2023-05-12 09:41:52 -07:00
  • 7a86c437b5 Change some "if" statements to "else if" following suggestion by @mmuetzel. Ken Ho 2023-05-10 09:13:04 -07:00
  • 33ab415f68 Bug fix and improvements for [z]imatcopy interface. Ken Ho 2023-05-08 14:43:56 -07:00
  • c74ee11376 Add an M1-based OSX crossbuild and a NeoverseN1 build to CIRRUS CI (#3997) Martin Kroeker 2023-05-08 14:24:38 +02:00
  • 65a7941aa5 Merge pull request #4036 from martin-frbg/issue4020 Martin Kroeker 2023-05-08 12:54:30 +02:00
  • c2078b2356 Mark xerbla's arguments as const Martin Kroeker 2023-05-07 20:15:13 +02:00
  • d6a42ed574 Merge pull request #4035 from martin-frbg/issue4034 Martin Kroeker 2023-05-06 15:51:07 +02:00
  • 60226b35e1 Fix (redundant) lapack-runtest target Martin Kroeker 2023-05-06 12:44:38 +02:00
  • 4e597ae00b Merge pull request #4031 from martin-frbg/issue4026 Martin Kroeker 2023-05-05 09:32:32 +02:00
  • e5538a62cb Add suggestions to NUM_THREADS/auxiliary buffer message Martin Kroeker 2023-05-04 22:56:39 +02:00
  • 6f38a946e8 Merge pull request #4028 from catap/mktemp-fix Martin Kroeker 2023-05-03 11:25:25 +02:00
  • 29c717050f Merge pull request #4022 from martin-frbg/gemmtm Martin Kroeker 2023-05-03 11:24:54 +02:00
  • b1781ad338 Do not require GNU mktemp Kirill A. Korinsky 2023-04-29 11:13:26 +02:00
  • 7b16c4c051 CI (C910V): add test Han Gao 2023-04-28 04:32:06 +00:00
  • 1f6f7328eb remove redundant declaration Martin Kroeker 2023-04-27 09:14:12 +02:00
  • 7152d6b06d fix cblas_gemmt Martin Kroeker 2023-04-27 08:36:20 +02:00
  • f2c6a34de9 fix typo lilinjie 2023-04-24 10:29:59 +08:00
  • e9a8d5b45f Merge pull request #4015 from martin-frbg/issue4013-2 Martin Kroeker 2023-04-23 18:51:12 +02:00
  • 72caceb324 Merge pull request #4009 from Mousius/sve-gemm Martin Kroeker 2023-04-22 13:56:45 +02:00
  • d1b631899b Merge pull request #4018 from mmuetzel/ci Martin Kroeker 2023-04-21 23:52:13 +02:00
  • e27e9a50b1 CI (MSYS2): Save ccache before running tests. Markus Mützel 2023-04-21 14:10:40 +02:00
  • 67d33e5b98 CI (MSYS2): Update location of compiler cache. Markus Mützel 2023-04-21 13:02:23 +02:00
  • 84bcf6639f Disable gcc's tree-vectorizer pass on all operating systems Martin Kroeker 2023-04-20 23:24:52 +02:00
  • 30a0ccbd14 Merge pull request #4014 from martin-frbg/issue4013 Martin Kroeker 2023-04-20 10:45:15 +02:00
  • c9174ae8d7 Disable gcc's tree-vectorizer pass on all operating systems Martin Kroeker 2023-04-19 23:45:44 +02:00
  • c2fe9cb91f Disable gcc's tree-vectorizer pass on all operating systems Martin Kroeker 2023-04-19 23:45:14 +02:00
  • 66b39b835c Disable gcc's tree-vectorizer pass on all operating systems Martin Kroeker 2023-04-19 23:44:45 +02:00
  • bb6d6735bf Disable gcc's tree-vectorizer pass on all operating systems Martin Kroeker 2023-04-19 23:44:15 +02:00
  • d18efaed20 Disable gcc's tree-vectorizer pass on all operating systems Martin Kroeker 2023-04-19 23:43:43 +02:00
  • 99f6d31ed5 Disable gcc's tree-vectorizer pass on all operating systems Martin Kroeker 2023-04-19 23:42:55 +02:00
  • 7de9335c56 Disable gcc's tree-vectorizer pass on all operating systems Martin Kroeker 2023-04-19 23:42:09 +02:00
  • 437c0bf2b4 Merge pull request #3843 from Mousius/switch-ratio Martin Kroeker 2023-04-19 11:51:54 +02:00
  • c628030669 Merge pull request #3855 from Mousius/more-switch-ratio-tuning Martin Kroeker 2023-04-18 22:45:51 +02:00
  • efcf71255a Merge pull request #4003 from martin-frbg/issue3995 Martin Kroeker 2023-04-18 14:55:23 +02:00
  • 51dd1339e7 Merge pull request #4010 from martin-frbg/issue3989-2 Martin Kroeker 2023-04-18 14:55:02 +02:00
  • 479509bb37 Remove any stray trailing dash from CROSS_SUFFIX (as would result from clang -arch) Martin Kroeker 2023-04-17 21:57:25 +02:00
  • ec334e69dc Use SVE kernel for SGEMM/DGEMM on Arm(R) Neoverse(TM) V1 Chris Sidebottom 2023-04-17 17:38:42 +01:00
  • 5b165420b5 SWITCH_RATIO for Arm(R) Neoverse(TM) architecture Chris Sidebottom 2022-12-05 15:17:52 +00:00
  • 32f2fafde7 Propagate SWITCH_RATIO to DYNAMIC_ARCH builds Chris Sidebottom 2022-11-24 13:38:20 +00:00
  • a5e1fdd525 Merge pull request #4007 from Mousius/update-contributors Martin Kroeker 2023-04-17 15:45:39 +02:00
  • 44164e3a3d revert "move alpha out of register 18" (out of PR scope, no SVE on Apple hw) Martin Kroeker 2023-04-17 14:23:13 +02:00
  • bfc20c2e97 Add Chris Sidebottom to CONTRIBUTORS.md Chris Sidebottom 2023-04-17 11:17:42 +01:00
  • a44422f0d5 Merge pull request #3983 from thrasibule/makeflags Martin Kroeker 2023-04-16 13:49:05 +02:00
  • 73e6fcb925 Merge pull request #4006 from martin-frbg/issue4005 Martin Kroeker 2023-04-16 13:30:17 +02:00
  • 38d7a7b562 Fix ?GEMMT Martin Kroeker 2023-04-16 00:07:58 +02:00
  • 8be68fa7f4 move declaration of sca to really keep the compiler from throwing it out (for now) Martin Kroeker 2023-04-15 12:02:39 +02:00
  • 4eac244c9a Merge pull request #4004 from martin-frbg/ccheckif Martin Kroeker 2023-04-14 22:57:18 +02:00
  • 970e611e00 fix missing blank in test Martin Kroeker 2023-04-14 19:42:34 +02:00
  • f096a339e4 Use long value fields for cpu ident on OSX Martin Kroeker 2023-04-13 18:16:09 +02:00
  • 3727672a74 Improve workaround and keep compilers from optimizing it out Martin Kroeker 2023-04-13 18:07:52 +02:00
  • 108a21e47a Move ALPHA out of register 18 (reserved on OSX) Martin Kroeker 2023-04-13 18:05:14 +02:00
  • 0b1acb0ba3 Move ALPHA_I out of register 18 (reserved on OSX) Martin Kroeker 2023-04-13 18:03:35 +02:00
  • c7bbad09ad Move ALPHA_I out of register 18 (reserved on OSX) Martin Kroeker 2023-04-13 18:00:47 +02:00
  • cda29633a3 move ALPHA_I out of register 18 (reserved on OSX) Martin Kroeker 2023-04-13 17:59:48 +02:00
  • 6f759a9ce9 Merge pull request #4002 from imzhuhl/spr_detect Martin Kroeker 2023-04-13 13:18:39 +02:00
  • ac650225c1 Fix x86 detection error Honglin Zhu 2023-04-13 00:08:27 +08:00
  • 58de28f332 Merge pull request #3999 from martin-frbg/issue3998 Martin Kroeker 2023-04-12 10:38:27 +02:00
  • 2ea00788c2 Add ?GEMMT Martin Kroeker 2023-04-11 22:46:51 +02:00
  • 6c45c98083 Add (only) the GEMMT functions Martin Kroeker 2023-04-11 22:41:18 +02:00
  • cd8eb33a9c Expose BUILD_LAPACK_DEPRECATED Martin Kroeker 2023-04-11 22:39:53 +02:00
  • 57bdc36c84 add conditionals for BUILD_LAPACK_DEPRECATED Martin Kroeker 2023-04-11 22:38:38 +02:00
  • e0f8b4fef4 Merge pull request #4000 from martin-frbg/applem2 Martin Kroeker 2023-04-11 08:28:44 +02:00
  • caa2945138 Support Apple A15/M2 cpus through the existing VORTEX target Martin Kroeker 2023-04-11 00:04:09 +02:00
  • d5fbec7c20 Export ?MIN/?MAX, ?AMIN/?AMAX, CDOT/ZDOT and ?GEMMT Martin Kroeker 2023-04-10 23:49:35 +02:00
  • fd20a2e8c6 Convert CMAKE booleans to 0/1 values for gensymbol Martin Kroeker 2023-04-10 22:28:00 +02:00