Commit Graph

  • c5c4888ed6 Merge pull request #3818 from martin-frbg/lapack665 Martin Kroeker 2022-11-14 07:27:58 +01:00
  • 92411dfecb Merge pull request #3817 from martin-frbg/lapack738742 Martin Kroeker 2022-11-13 23:49:15 +01:00
  • 13f3bbece1 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) Martin Kroeker 2022-11-13 23:18:09 +01:00
  • 92174725d9 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) Martin Kroeker 2022-11-13 23:16:12 +01:00
  • 6eb707d941 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) Martin Kroeker 2022-11-13 23:10:13 +01:00
  • 7eb2653268 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) Martin Kroeker 2022-11-13 23:07:10 +01:00
  • 4bc918a791 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) Martin Kroeker 2022-11-13 23:03:31 +01:00
  • 29dc086f38 Add macros for 32/64bit integer printf Martin Kroeker 2022-11-13 22:50:02 +01:00
  • 8b3f9715ec Add macros for 32/64bit integer printf Martin Kroeker 2022-11-13 22:48:37 +01:00
  • 35dac5677a Merge pull request #3816 from martin-frbg/lapack638 Martin Kroeker 2022-11-13 20:38:42 +01:00
  • fdb012ceed Fix implicit conversions and unused variables (Reference-LAPACK PR 703) Martin Kroeker 2022-11-13 20:37:18 +01:00
  • c99d27ae45 Fix implicit conversions and unused variables (Reference-LAPACK PR 703) Martin Kroeker 2022-11-13 20:33:20 +01:00
  • a485e4f515 Fix implicit conversions and unused variables (Reference-LAPACK PR 703) Martin Kroeker 2022-11-13 20:30:06 +01:00
  • 0c2aa0bed7 Fix implicit conversions and unused variables (Reference-LAPACK PR 703) Martin Kroeker 2022-11-13 20:29:08 +01:00
  • 2226a82f2e Fix leading dimension check of eigen-/Schur vectors (Reference-LAPACK PR 665) Martin Kroeker 2022-11-13 17:50:49 +01:00
  • 645633e321 Fix leading dimension check of eigen-/Schur vectors (Reference-LAPACK PR 665) Martin Kroeker 2022-11-13 17:48:02 +01:00
  • ee6643bc6b Merge pull request #3815 from martin-frbg/lapack690 Martin Kroeker 2022-11-13 16:26:31 +01:00
  • 90d7451df5 Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) Martin Kroeker 2022-11-13 15:10:00 +01:00
  • eba1112e38 Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) Martin Kroeker 2022-11-13 15:03:39 +01:00
  • 23cfe58ee3 Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) Martin Kroeker 2022-11-13 14:55:45 +01:00
  • 6dcf737c5d Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) Martin Kroeker 2022-11-13 14:51:39 +01:00
  • 3e2d52c502 Fix workspace calculation in GEQRF/GERQF (Reference-LAPACK PR 638) Martin Kroeker 2022-11-13 13:00:52 +01:00
  • cb48c29b6f Fix workspace calculation (Reference-LAPACK PR690) Martin Kroeker 2022-11-13 12:49:59 +01:00
  • 8c99d5d1b6 Merge pull request #3796 from martin-frbg/gemmt Martin Kroeker 2022-11-12 19:06:05 +01:00
  • b53b0f6bb6 Merge pull request #3802 from martin-frbg/relafix Martin Kroeker 2022-11-12 15:11:31 +01:00
  • 9a31faf420 Merge pull request #3811 from martin-frbg/issue3805 Martin Kroeker 2022-11-10 10:57:33 +01:00
  • e326ef9f0f Merge pull request #3812 from bartoldeman/cscal-zscal-skylakex Martin Kroeker 2022-11-10 08:00:27 +01:00
  • 827a9c6079 Merge pull request #3814 from martin-frbg/traviswait-3 Martin Kroeker 2022-11-10 08:00:02 +01:00
  • d141cf341f Increase the wait time for ppc jobs again Martin Kroeker 2022-11-09 20:31:30 +01:00
  • aad79ab516 Merge pull request #3813 from martin-frbg/azuredynosx Martin Kroeker 2022-11-09 20:29:17 +01:00
  • 09dd90ca09 Limit cpu models in OSX_dynarch_cmake Martin Kroeker 2022-11-09 15:35:57 +01:00
  • f14435cb4b Merge pull request #3810 from martin-frbg/fix3800 Martin Kroeker 2022-11-09 15:28:12 +01:00
  • 6c1043eb41 Add [cz]scal microkernels for SKYLAKEX Bart Oldeman 2022-11-09 08:28:23 -05:00
  • be546ec1ad Add gcc options for Neoverse cpus Martin Kroeker 2022-11-09 11:00:41 +01:00
  • c957ad684e Bump gcc requirement for NeoverseN2 and V1 to 10.4 Martin Kroeker 2022-11-09 10:46:43 +01:00
  • 1865b15240 Add fallbacks to RaptorLake entry Martin Kroeker 2022-11-09 10:31:30 +01:00
  • e6204d254f Update CMakeLists.txt Martin Kroeker 2022-11-08 16:21:11 +01:00
  • 2e64722681 Update Makefile.rule Martin Kroeker 2022-11-08 16:20:17 +01:00
  • aa2a2d9c01 Conditionally compile files that may get replaced by ReLAPACK Martin Kroeker 2022-11-08 12:04:46 +01:00
  • 1b77764182 Conditionally leave out bits of LAPACK to be overridden by ReLAPACK Martin Kroeker 2022-11-08 12:02:59 +01:00
  • fcda11c1ae Revert special handling of GEMMT Martin Kroeker 2022-11-05 23:48:50 +01:00
  • 4743d80c22 Merge pull request #3800 from thrasibule/raptorlake Martin Kroeker 2022-11-05 18:05:48 +01:00
  • 5d02f2e83e Merge pull request #3806 from martin-frbg/dyn_coop Martin Kroeker 2022-11-03 21:37:39 +01:00
  • da6e426b13 fix Cooperlake not selectable via environment variable Martin Kroeker 2022-11-03 18:13:35 +01:00
  • c970717157 fix missing t in xgemmt rule Martin Kroeker 2022-11-01 13:51:20 +01:00
  • 62a44c9c5d Merge pull request #3804 from martin-frbg/issue3803 Martin Kroeker 2022-10-31 20:42:33 +01:00
  • c9d78dc3b2 Remove excess initializer (leftover from rework of PR 3793) Martin Kroeker 2022-10-31 16:57:03 +01:00
  • 65338a9493 Merge pull request #3799 from bartoldeman/cscal-zscal-no-fma Martin Kroeker 2022-10-30 18:56:10 +01:00
  • ea6c5f3cf5 Add option RELAPACK_REPLACE Martin Kroeker 2022-10-30 12:55:23 +01:00
  • d39978cd7f Fix includes Martin Kroeker 2022-10-30 12:53:19 +01:00
  • ce7ea72de1 Fix include paths Martin Kroeker 2022-10-30 12:50:51 +01:00
  • 3ebf5d219d handle INCLUDE_ALL and optional function prefixes Martin Kroeker 2022-10-30 12:49:07 +01:00
  • a082d54035 Rename to avoid conflict with OpenBLAS' toplevel config.h Martin Kroeker 2022-10-30 12:47:01 +01:00
  • eeebaf2294 move INCLUDE_ALL to (c)make options Martin Kroeker 2022-10-30 12:45:54 +01:00
  • 06b022b139 Fix ReLAPACK source selection Martin Kroeker 2022-10-30 12:42:36 +01:00
  • 03bd1157d8 Merge pull request #3793 from imzhuhl/new_sbgemm Martin Kroeker 2022-10-30 12:09:46 +01:00
  • e27ad3a6cc add raptor lake ids Guillaume Horel 2022-10-28 09:10:40 -04:00
  • 79066b6bf3 Change file name to match the norm and delete useless code. Honglin Zhu 2022-10-28 17:09:39 +08:00
  • e7e3aa2948 x86_64: prevent GCC and Clang from generating FMAs in cscal/zscal. Bart Oldeman 2022-10-27 17:20:44 -04:00
  • 4989e039a5 Define SBGEMM_ALIGN_K for DYNAMIC_ARCH build Honglin Zhu 2022-10-27 14:10:26 +08:00
  • e7fd8d21a6 Add GEMMT based on looped GEMV Martin Kroeker 2022-10-26 15:33:58 +02:00
  • 843e9fd0b9 Fix typo error Honglin Zhu 2022-10-26 17:06:06 +08:00
  • b00d5b9746 New sbgemm implementation for Neoverse N2 Honglin Zhu 2022-10-19 11:36:26 +08:00
  • 8c10f0abba Merge pull request #3794 from bartoldeman/benchmark-align-malloc Martin Kroeker 2022-10-21 16:13:58 +02:00
  • 9e6b060bf3 Fix comment. Bart Oldeman 2022-10-20 20:11:09 -04:00
  • 9959a60873 Benchmarks: align malloc'ed buffers. Bart Oldeman 2022-10-20 13:28:20 -04:00
  • ad424fce08 Merge pull request #3791 from martin-frbg/issue3790 Martin Kroeker 2022-10-19 07:11:33 +02:00
  • 5f72415f10 Suffix the pkgconfig file itself in INTERFACE64 builds Martin Kroeker 2022-10-18 20:29:24 +02:00
  • 747ade5adf fix INTERFACE64/USE64BITINT reporting Martin Kroeker 2022-10-18 17:28:07 +02:00
  • 8bacea1254 Pass libsuffix to openblas.pc and fix passing of INTERFACE64/USE64BITINT flag Martin Kroeker 2022-10-18 16:18:29 +02:00
  • b2523471c9 Add libsuffix support Martin Kroeker 2022-10-18 16:16:26 +02:00
  • 11b2570c13 Merge pull request #3786 from martin-frbg/issue3784 Martin Kroeker 2022-10-13 18:34:28 +02:00
  • ab6009b0b6 Merge pull request #3773 from staticfloat/sf/openblas_default_num_threads Martin Kroeker 2022-10-13 14:15:14 +02:00
  • 32566bfb44 Disable the gfortran tree vectorizer for netlib LAPACK Martin Kroeker 2022-10-13 14:04:25 +02:00
  • 57809526c4 Disable the gfortran tree vectorizer for lapack-netlib Martin Kroeker 2022-10-13 09:12:23 +02:00
  • eece0dfd14 Merge pull request #3781 from martin-frbg/issue3779 Martin Kroeker 2022-10-01 19:26:09 +02:00
  • db50ab4a72 Add BUILD_vartype defines Martin Kroeker 2022-10-01 15:14:51 +02:00
  • a84a8a7096 Merge pull request #3778 from martin-frbg/issue3775 Martin Kroeker 2022-10-01 15:12:40 +02:00
  • 79d842047a Move Cray case after GNU as Cray builds of gfortran have both names in the version string Martin Kroeker 2022-09-30 11:58:15 +02:00
  • 5e78493d95 Move Cray case after GNU as Cray builds of gfortran have both names in the version string Martin Kroeker 2022-09-30 11:55:56 +02:00
  • d2ce93179f Add OPENBLAS_DEFAULT_NUM_THREADS Elliot Saba 2022-09-22 10:38:36 -07:00
  • 8e851160d7 Merge pull request #3772 from siko1056/develop Martin Kroeker 2022-09-29 20:22:50 +02:00
  • cf132deb14 Merge pull request #3774 from sashashura/patch-1 Martin Kroeker 2022-09-29 18:49:50 +02:00
  • 6077d81161 Merge pull request #3777 from martin-frbg/fixmips64generic2 Martin Kroeker 2022-09-29 13:50:59 +02:00
  • f6f35a4288 fix copyobj declarations to work with DYNAMIC_ARCH Martin Kroeker 2022-09-29 08:47:14 +02:00
  • c726604319 build: harden dynamic_arch.yml permissions Alex 2022-09-26 13:48:11 +02:00
  • 4de8e1b8f9 build: harden mips64.yml permissions Alex 2022-09-26 13:47:15 +02:00
  • 11cd108095 build: harden nightly-Homebrew-build.yml permissions Alex 2022-09-26 13:46:34 +02:00
  • c2892f0e31 Makefile.rule: update CONSISTENT_FPCSR documentation Kai T. Ohlhus 2022-09-22 00:25:13 +09:00
  • 84453b924f Support CONSISTENT_FPCSR on AARCH64 Kai T. Ohlhus 2022-09-22 00:20:40 +09:00
  • 667d0e0b48 Merge pull request #3771 from martin-frbg/fixmips64generic Martin Kroeker 2022-09-19 18:58:14 +02:00
  • b1d69fb3ac Add MIPS64_GENERIC as a copy of GENERIC Martin Kroeker 2022-09-17 23:52:32 +02:00
  • 63d063cb6d Merge pull request #3769 from XiWeiGu/mips64-test Martin Kroeker 2022-09-17 23:48:53 +02:00
  • edea1bcfaf MIPS64: Fixed failed utest dsdot:dsdot_n_1 when TARGET=I6500 gxw 2022-09-17 16:39:30 +08:00
  • 548a11b9d9 [WIP,Testing]: Add test for mips64 gxw 2022-09-16 09:19:54 +08:00
  • 47120f20ca Merge pull request #3768 from martin-frbg/fixwarnings Martin Kroeker 2022-09-15 13:26:21 +02:00
  • 101a2c77c3 Fix warnings Martin Kroeker 2022-09-15 09:19:19 +02:00
  • 7ee3cab4ff Merge pull request #3767 from martin-frbg/decl_adaptive Martin Kroeker 2022-09-15 07:20:07 +02:00
  • 9402df5604 Fix missing external declaration Martin Kroeker 2022-09-14 21:44:34 +02:00
  • dd846e72ed Merge pull request #3766 from martin-frbg/issue3640 Martin Kroeker 2022-09-14 20:03:57 +02:00