Commit Graph

  • 251dcd2bc5
    fix missing INTERFACE64 setting for gfortran Martin Kroeker 2024-02-21 23:41:16 +0100
  • 9b24b31419 resolve second_ conflict which breaks xlf timef frjohnst 2024-02-21 15:52:29 -0500
  • bf2310442b Fix get_num_cores for AIX. Chip-Kerchner 2024-02-21 13:26:28 -0600
  • 67e3e62592 Fix get_num_cores for AIX. Chip-Kerchner 2024-02-21 12:43:03 -0600
  • a69adbbd11 Merge branch 'develop' of https://github.com/openmathlib/openblas into develop Chip-Kerchner 2024-02-21 12:18:18 -0600
  • 99ef76f9bb
    Merge pull request #4511 from ErnstPeng/feature-branch Martin Kroeker 2024-02-21 14:25:57 +0100
  • 4787a55c64 Optimized cgemm kernel 16x4 LASX for LoongArch pengxu 2024-02-20 20:41:45 +0800
  • 5c5f1dddc6
    Update CMakeLists.txt Martin Kroeker 2024-02-20 23:39:54 +0100
  • 03fc5623fa
    Update CMakeLists.txt Martin Kroeker 2024-02-20 23:38:29 +0100
  • c0b1fdd83c
    Update apple_m.yml Martin Kroeker 2024-02-20 22:42:12 +0100
  • 085e1ca555
    Merge branch 'OpenMathLib:develop' into gh-apple-m Martin Kroeker 2024-02-20 20:53:32 +0100
  • 4254db7b78
    Add files via upload Martin Kroeker 2024-02-20 20:52:43 +0100
  • 1a2f5872d9
    Add files via upload Martin Kroeker 2024-02-20 20:52:03 +0100
  • c22c117f24 Move cirun runner to GCP Amit Kumar 2024-02-19 20:48:19 +0000
  • 586a4bd1c3 Optimized cgemm kernel 16x4 LASX for LoongArch pengxu 2024-02-19 15:05:21 +0800
  • 522c696636 Optimized cgemm kernel 16*4 LASX for LoongArch pengxu 2024-02-19 15:05:21 +0800
  • 1510f05121
    No flang-new in the homebrew LLVM Martin Kroeker 2024-02-17 14:38:56 +0100
  • ebbf5b3ea0
    Merge pull request #4504 from sergei-lewis/dev/slewis/ci Martin Kroeker 2024-02-16 22:48:28 +0100
  • 461ecabb22 add RISCV64_ZVL128B and RISCV64_ZVL256B targets to CI flows and to README.md Sergei Lewis 2024-02-16 11:33:28 +0000
  • ba17758c02 fix axpy implementations where y has a stride of 0 Sergei Lewis 2024-02-16 15:58:02 +0000
  • a75dac6401
    Link OpenMP clang+gfortran build with -lomp Martin Kroeker 2024-02-16 15:46:07 +0100
  • ba4ea7bb07
    Link OpenMP clang+gfortran builds with libomp Martin Kroeker 2024-02-16 15:41:46 +0100
  • 42d35f1084
    Update apple_m.yml Martin Kroeker 2024-02-16 11:59:02 +0100
  • 3e6e6183b0
    Update apple_m.yml Martin Kroeker 2024-02-16 09:45:45 +0100
  • 6de899e75c
    Update apple_m.yml Martin Kroeker 2024-02-16 07:48:49 +0100
  • df286958e2
    try to work around xcode15 linker bug Martin Kroeker 2024-02-15 23:42:03 +0100
  • dbfc521502
    Update apple_m.yml Martin Kroeker 2024-02-15 22:50:13 +0100
  • 2a446bd051
    Update apple_m.yml Martin Kroeker 2024-02-15 22:14:10 +0100
  • fdfc642afd
    Update apple_m.yml Martin Kroeker 2024-02-15 20:56:07 +0100
  • 648347acc7
    Update apple_m.yml Martin Kroeker 2024-02-15 20:54:20 +0100
  • 478f3bfd8a
    Add configurations Martin Kroeker 2024-02-15 18:57:47 +0100
  • 4d375cb6ca
    Merge branch 'OpenMathLib:develop' into gh-apple-m Martin Kroeker 2024-02-15 18:05:51 +0100
  • 5266998b9f
    Merge pull request #4498 from mseminatore/win_tidy Martin Kroeker 2024-02-15 14:37:37 +0100
  • ca6b4961e4
    updates to fix option conflicts and config file generation Martin Kroeker 2024-02-15 14:31:11 +0100
  • c90979d8ef
    allow for more pre- and suffixes in the name of the openblas library Martin Kroeker 2024-02-15 14:17:11 +0100
  • 3120f12e76
    allow for more pre- and suffixes in the name of the openblas library Martin Kroeker 2024-02-15 14:16:20 +0100
  • a0e3f77e0b
    add FIXED_LIBNAME, PREFIX and SUFFIX Martin Kroeker 2024-02-15 12:17:38 +0100
  • ffbfc3c692
    Add libname prefix and suffix Martin Kroeker 2024-02-15 12:16:34 +0100
  • 179527f622
    Merge branch 'OpenMathLib:develop' into issue4468 Martin Kroeker 2024-02-15 12:15:39 +0100
  • a28afac791
    Add FIXED_LIBNAME, LIBNAMEPREFIX and LIBNAMESUFFIX Martin Kroeker 2024-02-15 11:48:33 +0100
  • 57dd894af0
    Merge pull request #4502 from dmikushin/add-missing-use_gemm3m-macro Martin Kroeker 2024-02-15 11:13:36 +0100
  • b29fd48998
    Merge branch 'develop' into win_tidy Mark Seminatore 2024-02-12 10:23:17 -0800
  • 0a7ae326d2 Merge branch 'win_tidy' of https://github.com/mseminatore/OpenBLAS into win_tidy Mark Seminatore 2024-02-12 10:22:26 -0800
  • 10548a0460 update contributors Mark Seminatore 2024-02-12 10:22:12 -0800
  • d0f5dc763b Adding USE_GEMM3M macro to kernel targets, so that the *gemm3m functions and parameters can be included into the gotoblas structure. Fixes #4500 Dmitry Mikushin 2024-02-12 02:18:03 +0100
  • 8698f9e37f Adding basic support of benchmarks into CMake for single, double, single complex and double complex cases. Each benchmarking target has a suffix to identify the data type, for example ./benchmark_gemm3m_COMPLEX_DOUBLE is a gemm3m.c source compiled with COMPLEX and DOUBLE macros defined Dmitry Mikushin 2024-02-10 19:12:16 +0100
  • 7e9b1c0807 fix uninitialized data usage kseniyazaytseva 2024-02-10 00:49:42 +0300
  • c6f30fd414 check for zero inc kseniyazaytseva 2024-02-10 00:48:07 +0300
  • 5e9ead09ac fix info return kseniyazaytseva 2024-02-10 00:47:25 +0300
  • 4c554bd527 check abs zero inc kseniyazaytseva 2024-02-10 00:46:52 +0300
  • 46de7c8a2b Merge remote-tracking branch 'origin/risc-v-new-tests' into new-tests kseniyazaytseva 2024-02-09 23:52:51 +0300
  • 10ea3fb742
    fix duplication of name parts Martin Kroeker 2024-02-09 17:09:55 +0100
  • b1ae777afb
    Merge pull request #4497 from sergei-lewis/dev/slewis/zaxpy Martin Kroeker 2024-02-09 16:22:00 +0100
  • bb96e466ae
    Introduce LIBNAMEPREFIX to avoid messing with the internal LIBPREFIX Martin Kroeker 2024-02-09 15:50:11 +0100
  • 32ed6e391a Merge branch 'develop' of https://github.com/openmathlib/openblas into develop Chip-Kerchner 2024-02-09 07:25:04 -0600
  • ff1523163f Fix axpy test hangs when n==0. Reenable zaxpy_vector kernel for C910V. Sergei Lewis 2024-02-09 12:59:14 +0000
  • ba3bfe85ee
    Merge pull request #4495 from martin-frbg/update-gensymbol Martin Kroeker 2024-02-09 08:55:22 +0100
  • 93872f4681
    drop the ?laqz? symbols for now (not translatable by f2c) Martin Kroeker 2024-02-08 23:02:09 +0100
  • 98c56a7314 more cleanup Mark Seminatore 2024-02-08 13:50:15 -0800
  • 83bec51355
    Update with recently added CBLAS interfaces and LAPACK/LAPACKE functions Martin Kroeker 2024-02-08 21:23:48 +0100
  • 974f29c4e9
    Merge pull request #4494 from ChipKerchner/fixPower10CPUID Martin Kroeker 2024-02-08 21:21:32 +0100
  • d408ecedba Add environment variable to display coretype for dynamic arch. Chip Kerchner 2024-02-08 12:17:18 -0600
  • a96a04ee61
    Merge pull request #4493 from martin-frbg/issue4475-3 Martin Kroeker 2024-02-08 16:50:06 +0100
  • ac6b4b7aa4 Make sure CPU ID works for all POWER_10 conditions Chip Kerchner 2024-02-08 08:56:30 -0600
  • 500ac4de5e
    fix incompatible pointer types Martin Kroeker 2024-02-08 13:18:34 +0100
  • b3fa16345d
    fix prototype for c/zaxpby Martin Kroeker 2024-02-08 13:15:34 +0100
  • cfabc48190 Update rotg tests kseniyazaytseva 2024-02-08 00:22:15 +0300
  • ec5cfe3bc8 Fix invalid tests kseniyazaytseva 2024-02-08 00:21:38 +0300
  • ff10e6b6dc Fix zero step tests kseniyazaytseva 2024-02-08 00:19:54 +0300
  • e9cfb7fd30
    Merge pull request #4491 from martin-frbg/fixup-4488 Martin Kroeker 2024-02-07 21:34:40 +0100
  • cb9aa2a587 Merge branch 'develop' of https://github.com/openmathlib/openblas into develop Chip-Kerchner 2024-02-07 13:09:58 -0600
  • e9f480111e
    fix sbgemm bfloat16 conversion errors introduced in PR 4488 Martin Kroeker 2024-02-07 19:57:18 +0100
  • 22b487b622
    Merge pull request #4488 from martin-frbg/issue4475-2 Martin Kroeker 2024-02-07 18:40:35 +0100
  • 818bf30628
    Merge pull request #4490 from ChipKerchner/missingCPUIDsForAIX Martin Kroeker 2024-02-07 17:31:26 +0100
  • 344763331a
    Merge pull request #4484 from martin-frbg/lapack981 Martin Kroeker 2024-02-07 15:22:48 +0100
  • 574912f534 Add missing CPU ID definitions for old versions of AIX. Chip Kerchner 2024-02-07 07:54:06 -0600
  • 08ce6b1c1c Add missing CPU ID definitions for old versions of AIX. Chip Kerchner 2024-02-07 07:54:06 -0600
  • fb99fc2e6e
    fix type conversion warnings Martin Kroeker 2024-02-07 13:42:08 +0100
  • 08e479f956
    Merge pull request #4487 from ErnstPeng/feature-branch Martin Kroeker 2024-02-07 13:19:04 +0100
  • 25b300bbee
    improve internal names Martin Kroeker 2024-02-06 23:40:01 +0100
  • 9ef10ffa49
    Handle prefixed and suffixed libnames, optionally suppress softlinking Martin Kroeker 2024-02-06 23:38:19 +0100
  • 1ed69ea1c0
    improve naming Martin Kroeker 2024-02-06 23:35:12 +0100
  • d4db6a9f16
    Separate the interface for SBGEMMT from GEMMT due to differences in GEMV arguments Martin Kroeker 2024-02-06 22:23:47 +0100
  • fe3da43b7d Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch pengxu 2024-02-06 11:49:01 +0800
  • 345a408ffa Optimized cgemm kernel 16*4 LASX for LoongArch pengxu 2024-02-06 10:48:49 +0800
  • 440edfd997
    Add option to suppress versioning of the internal name Martin Kroeker 2024-02-05 21:44:50 +0100
  • 63fbffddf8
    Add option FIXED_LIBNAME to suppress versioning and softlinking Martin Kroeker 2024-02-05 21:44:03 +0100
  • e5d2725e5a
    Merge pull request #4185 from XiWeiGu/mips_enable_msa Martin Kroeker 2024-02-05 15:50:16 +0100
  • 479e4af089
    Rescale input vector more often to minimize relative error (Reference-LAPACK PR 981) Martin Kroeker 2024-02-05 15:35:24 +0100
  • a4fde2c5ac
    Merge pull request #4451 from martin-frbg/overflow_reset Martin Kroeker 2024-02-05 07:27:04 +0100
  • b537528feb
    Merge pull request #4480 from XiWeiGu/loongarch64-fixed-{s/d}amin-lsx Martin Kroeker 2024-02-05 06:24:50 +0100
  • bc7154a80d
    Merge pull request #4482 from martin-frbg/issue4476 Martin Kroeker 2024-02-04 23:13:10 +0100
  • 6d8a273cca
    Handle zero increment(s) in C910V ?AXPBY (#4483) Martin Kroeker 2024-02-04 22:07:51 +0100
  • 05cf634929
    Update zaxpby_vector.c Martin Kroeker 2024-02-04 20:50:18 +0100
  • 2cab2ca294
    Update axpby_vector.c Martin Kroeker 2024-02-04 20:49:41 +0100
  • d65c01e3d0
    Handle zero increment(s) Martin Kroeker 2024-02-04 20:27:49 +0100
  • dbcf4f8b7d
    Merge pull request #4479 from XiWeiGu/loongarch-opt-axpby Martin Kroeker 2024-02-04 19:50:28 +0100
  • dc802dd637
    Merge pull request #4474 from ChipKerchner/sgemmIncopy_PR Martin Kroeker 2024-02-04 18:51:09 +0100
  • e307675222
    Merge pull request #4478 from martin-frbg/issue4475 Martin Kroeker 2024-02-04 16:36:40 +0100
  • 033168cdf0
    Merge pull request #4481 from martin-frbg/cpuid_riscv Martin Kroeker 2024-02-04 14:09:44 +0100