Commit Graph

  • a29f91ae9a
    Merge pull request #4471 from ChipKerchner/fixMakefileAIXOpenMP Martin Kroeker 2024-02-04 12:13:26 +0100
  • e61d96303d
    Fix missing NO_AVX2 fallback for SapphireRapids Martin Kroeker 2024-02-04 10:05:20 +0100
  • d02c61e82e
    Update lowercase cpunames for RISC-V Martin Kroeker 2024-02-04 10:01:27 +0100
  • 7228c708d7
    Merge pull request #4461 from markdryan/cpuid_riscv64_crash Martin Kroeker 2024-02-04 09:57:00 +0100
  • adde725321 LoongArch64: Fixed {s/d}amin LSX optimization gxw 2024-02-04 14:43:08 +0800
  • 7bc93d95a1 LoongArch64: Opt {c/z}axpby gxw 2024-01-24 16:11:45 +0800
  • 1e1f487dc7 LoongArch64: Fixed {s/d}axpby gxw 2024-02-01 19:57:05 +0800
  • 3597827c93 utest: add axpby gxw 2024-02-01 16:33:58 +0800
  • 68d354814f
    Fix incompatible pointer type in BFLOAT16 mode Martin Kroeker 2024-02-04 01:14:22 +0100
  • 3848d4e9f4
    Merge pull request #4477 from martin-frbg/c910caxpy Martin Kroeker 2024-02-04 01:10:57 +0100
  • 4d8dee508c
    temporarily disable the CAXPY/ZAXPY kernels Martin Kroeker 2024-02-04 01:05:03 +0100
  • 27816fa929
    Merge pull request #4472 from sergei-lewis/dev/slewis/merge-from-riscv Martin Kroeker 2024-02-03 20:56:11 +0100
  • b6949ce74c add axpyc to cmake build kseniyazaytseva 2024-02-02 14:42:27 +0300
  • 441339104f fix test ext cmake build kseniyazaytseva 2024-02-02 13:49:39 +0300
  • f68e9989c4 Remove zero rows/columns matcopy tests kseniyazaytseva 2024-02-02 12:26:23 +0300
  • 87ba528d8b Changed C files to straighten out indentation. Removed commented lines from other file. austinpagan 2024-02-01 18:46:07 -0600
  • 461cf9083c Merge remote-tracking branch 'origin/develop' into cgemm_zgemm_c_code austinpagan 2024-02-01 12:40:04 -0600
  • ddac75e0ef Adding .C versions of CGEMM and ZGEMM austinpagan 2024-02-01 12:24:25 -0600
  • 2bb7ea64a1 Only vectorize 64-bit version for Power8. Chip Kerchner 2024-02-01 08:11:43 -0600
  • 3ffd6868d7 Merge branch 'develop' into dev/slewis/merge-from-riscv Sergei Lewis 2024-02-01 11:29:41 +0000
  • a3b0ef6596 Restore riscv64 fixes from develop branch: dot product double precision accumulation, zscal NaN handling Sergei Lewis 2024-02-01 10:26:02 +0000
  • ec74dcd213
    Merge pull request #4470 from martin-frbg/issue4455 Martin Kroeker 2024-01-31 23:51:01 +0100
  • 61c8e19f95 Fix Makefile to support OpenMP on AIX for xlc (clang) with xlf. Chip Kerchner 2024-01-31 15:27:50 -0600
  • 42cb567f0f more cleanup Mark Seminatore 2024-01-31 13:24:28 -0800
  • 47bd064763
    Fix names in build rules Martin Kroeker 2024-01-31 20:49:43 +0100
  • 349a4bf046
    Update f_check.cmake Martin Kroeker 2024-01-31 19:23:59 +0100
  • 7c88437070
    rename and fix reference to removed variable Martin Kroeker 2024-01-31 18:38:40 +0100
  • a7d004e820
    Fix CBLAS prototype Martin Kroeker 2024-01-31 17:55:42 +0100
  • b54cda8490
    Unify creation of CBLAS interfaces for ?AMIN/?AMAX and C/ZAXPYC between gmake and cmake builds Martin Kroeker 2024-01-31 16:00:52 +0100
  • 1a6fdb0353
    Add prototypes for extensions ?AMIN/?AMAX and CAXPYC/ZAXPYC Martin Kroeker 2024-01-31 15:57:57 +0100
  • d1343302bd
    Merge pull request #4465 from XiWeiGu/utest-zscal Martin Kroeker 2024-01-31 14:19:19 +0100
  • 218e5309a3
    CI: Add github workflow using Apple M Martin Kroeker 2024-01-31 12:31:44 +0100
  • 969601a1dc X86_64: Fixed bug in zscal gxw 2024-01-31 11:20:25 +0800
  • b21be2eda7
    Add CBLAS interfaces for the extensions ?AMIN, ?AMAX and SCAXPYC/DZAXPYC Martin Kroeker 2024-01-30 23:19:57 +0100
  • 896f0169cc
    Unify generation of (C/Z)AXPYC, CBLAS_SCA(MIN/MAX), CBLAS_DZA(MIN/MAX) Martin Kroeker 2024-01-30 23:18:35 +0100
  • 98c9ff3194
    Merge pull request #4464 from XiWeiGu/loongarch64-zscal Martin Kroeker 2024-01-30 22:53:29 +0100
  • 9f0630187a
    Merge pull request #4463 from XiWeiGu/loongarch64-zamax-zamin Martin Kroeker 2024-01-30 18:01:30 +0100
  • 09bb48d1b9 Vectorize in-copy packing/copying for SGEMM - 4X faster. Chip Kerchner 2024-01-30 09:13:16 -0600
  • bb043a021f utest: Add tests for zscal gxw 2024-01-30 17:27:59 +0800
  • 83ce97a4ca LoongArch64: Handle NAN and INF gxw 2024-01-30 16:54:14 +0800
  • 0d7fe5ea61 clean up whitespace Mark Seminatore 2024-01-29 22:33:47 -0800
  • 3d4dfd0085 Benchmark: Rename the executable file names for {sc/dz}a{min/max} gxw 2024-01-30 11:25:59 +0800
  • a79d117405 LoongArch64: Fixed bug for {s/d}amin gxw 2024-01-30 11:03:56 +0800
  • 519ea6e87a utest: Add utest for the {sc/dz}amax and {s/d/sc/dz}amin gxw 2024-01-30 10:39:22 +0800
  • 1093def0d1 Merge branch 'risc-v' into develop Sergei Lewis 2024-01-29 11:11:39 +0000
  • 8892121130
    Merge pull request #4462 from martin-frbg/issue4449 Martin Kroeker 2024-01-26 22:41:16 +0100
  • 48a4c4d454
    Use +sve in arch declarations of the fallback paths for SVE targets Martin Kroeker 2024-01-26 16:30:52 +0100
  • e0b610d01f Harmonize riscv64 LIBNAME for forced and non-forced targets Mark Ryan 2024-01-26 13:57:33 +0000
  • ec2aa32eb0 Fix crash in cpuid_riscv64.c Mark Ryan 2024-01-25 15:20:58 +0000
  • 47218d827f
    Remove erroneous early exit for alpha=(1,0) that skipped conjugation Martin Kroeker 2024-01-26 14:32:10 +0100
  • 776dbf66fc
    Add prototypes for ?GEMMT Martin Kroeker 2024-01-26 14:29:39 +0100
  • c08994be55
    Add prototypes for ?GEMMT Martin Kroeker 2024-01-26 14:26:49 +0100
  • 41515e6e7c
    Fixed handling of complex conjugate matrices and error codes for complex cases Martin Kroeker 2024-01-26 14:25:38 +0100
  • 889c5d026a
    Merge pull request #4456 from kseniyazaytseva/riscv-rvv10 risc-v Martin Kroeker 2024-01-26 13:31:09 +0100
  • 4e2a32ff51
    Merge pull request #4454 from kseniyazaytseva/riscv-rvv07 Martin Kroeker 2024-01-26 11:40:46 +0100
  • 276e3ebf9e LoongArch64: Add dzamax and dzamin opt gxw 2024-01-26 10:03:50 +0800
  • a21b2fa5e4
    Merge pull request #4452 from kseniyazaytseva/riscv-generic Martin Kroeker 2024-01-24 17:52:25 +0100
  • 73530b03fa remove RISCV64_ZVL256B additional extensions Andrey Sokolov 2024-01-24 11:38:14 +0300
  • 86943afa9c Fix x280 target include riscv_vector.h kseniyazaytseva 2024-01-24 10:53:13 +0300
  • d938aed7fe
    reset "mem structure overflowed" state on shutdown Martin Kroeker 2024-01-23 17:15:53 +0100
  • 9c49a81d54 Resolve conflicts Andrey Sokolov 2024-01-23 19:08:53 +0300
  • e1afb23811 Fix BLAS and LAPACK tests for C910V and RISCV64_ZVL256B targets kseniyazaytseva 2023-04-07 11:13:23 +0300
  • f1ff4c5c02
    Merge 76d675bd55 into d6a5174e9c Martin Kroeker 2024-01-22 11:11:12 +0300
  • d6a5174e9c
    Merge pull request #4447 from RevySR/update-thead-toolchains Martin Kroeker 2024-01-22 08:10:02 +0100
  • 304a9b60af Update T-Head toolchains v2.8.0 Han Gao/Revy/Rabenda 2024-01-21 14:32:52 +0000
  • f5de4fad27
    Merge pull request #4444 from Mousius/part-mapping Martin Kroeker 2024-01-20 15:55:07 +0100
  • aaf65210cc Add dynamic support for Arm(R) Neoverse(TM) V2 processor Chris Sidebottom 2024-01-19 19:04:21 +0000
  • 10c22f4a39
    Merge pull request #4355 from imaginationtech/img-riscv64-zvl128b Martin Kroeker 2024-01-19 13:51:07 +0100
  • ccbc3f875b [RISC-V] Add RISCV64_ZVL128B target to common_riscv64.h Octavian Maghiar 2024-01-19 12:40:00 +0000
  • deecfb1a39 Merge branch 'risc-v' into img-riscv64-zvl128b Octavian Maghiar 2024-01-19 12:26:38 +0000
  • f51d36ecb9
    Merge 75fe9c21e5 into 500442cf96 Christopher Sidebottom 2024-01-18 19:07:43 -0800
  • c99e231fc5 Fix rand_generate Andrey Sokolov 2024-01-18 23:54:51 +0300
  • bf39c0d8b5 Added new tests for BLAS-like and BLAS API in utest kseniyazaytseva 2023-06-23 14:51:39 +0300
  • 7e2c6eb8b6 Fix INTERFACE64 builds on riscv and loongarch Martin Kroeker 2022-12-15 18:52:46 +0100
  • da82ee4d97 Fix LAPACK for f2c converted sources Andrey Sokolov 2023-05-05 11:22:47 +0300
  • f89e0034a4 Fix LAPACK usage from BLAS kseniyazaytseva 2023-12-20 21:20:30 +0300
  • f7cf637d7a redo lost edit Martin Kroeker 2023-03-28 18:31:04 +0200
  • 85548e66ca Fix build failures seen with the NO_LAPACK option - cspr/csymv/csyr belong on the LAPACK list Martin Kroeker 2023-03-28 16:33:09 +0200
  • f129161453 restore C/Z SPMV, SPR, SYR,SYMV Martin Kroeker 2023-03-21 07:43:03 +0100
  • 5222b5fc18 Added axpby kernels for GENERIC RISC-V target kseniyazaytseva 2023-10-12 22:06:00 +0300
  • 1c04df20bd Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations Martin Kroeker 2023-03-20 23:04:12 +0100
  • 5b4df851d7 fix stray blank on continuation line Martin Kroeker 2023-03-21 08:29:05 +0100
  • ff41cf5c49 Fix BLAS, BLAS-like functions and Generic RISC-V kernels kseniyazaytseva 2023-03-17 14:28:26 +0300
  • 500442cf96
    Merge pull request #4442 from pbo-linaro/fix-utest-compilation Martin Kroeker 2024-01-18 20:59:13 +0100
  • b193ea3d7b Fix BLAS and LAPACK tests for RVV 1.0 target, update to 0.12.0 intrinsics kseniyazaytseva 2024-01-18 22:11:12 +0300
  • a4992e09bc Fix utest compilation Pierrick Bouvier 2024-01-18 18:20:37 +0400
  • 6f0e0e4021
    Merge pull request #4438 from Dirreke/csky-support Martin Kroeker 2024-01-18 13:04:52 +0100
  • 43cb266178
    Merge pull request #4441 from martin-frbg/gemv-threshold Martin Kroeker 2024-01-17 22:25:01 +0100
  • d2fc4f3b4d
    Increase multithreading threshold by a factor of 50 Martin Kroeker 2024-01-17 20:59:24 +0100
  • 88e994116c
    Merge pull request #4354 from imaginationtech/img-rvv-kernel-generator Martin Kroeker 2024-01-17 15:19:37 +0100
  • ec46ca7a43
    Support Arm Compiler for Linux as classic flang (#4436) Martin Kroeker 2024-01-17 07:33:10 +0100
  • e8199e4d5f
    Update f_check Martin Kroeker 2024-01-16 23:48:56 +0100
  • e3508d3713
    Merge pull request #4439 from sergei-lewis/risc-v Martin Kroeker 2024-01-16 20:35:12 +0100
  • ec89466e14 Add CSKY support Dirreke 2024-01-16 23:45:06 +0800
  • 9edb805e64 fix builds with t-head toolchains that use old versions of the intrinsics spec Sergei Lewis 2024-01-16 14:24:18 +0000
  • 452741b67f
    Merge pull request #4435 from imciner2/im/sapphire Martin Kroeker 2024-01-16 13:57:29 +0100
  • b5f19b58a7
    fix acfl string Martin Kroeker 2024-01-16 13:06:09 +0100
  • c1c957fbce Add CSKY support Dirreke 2024-01-16 16:27:12 +0800
  • 9c6b9bfb91
    Support ArmCompilerforLinux as classic flang Martin Kroeker 2024-01-16 00:58:24 +0100
  • 8f4e325ea8 Fix Clang sapphire rapids march flag Ian McInerney 2024-01-15 23:42:03 +0000