Commit Graph

  • 1dd979959d
    set version to 0.3.21.dev Martin Kroeker 2022-08-07 23:02:36 +0200
  • b89fb708ca
    Update version to 0.3.21 v0.3.21 Martin Kroeker 2022-08-07 22:36:26 +0200
  • 9a34217cc6
    Merge pull request #3717 from xianyi/develop Martin Kroeker 2022-08-07 22:35:20 +0200
  • 79f54f266d
    Update version to 0.3.21 Martin Kroeker 2022-08-07 22:32:11 +0200
  • 94cba8e3c5
    Merge pull request #3716 from martin-frbg/0321changes Martin Kroeker 2022-08-07 22:30:58 +0200
  • 25ce2e2a63
    Update with 0.3.21 changes Martin Kroeker 2022-08-07 22:21:23 +0200
  • b633eb79f2 Use $at as temporary register for mips/loongson CPUCFG read Jiaxun Yang 2022-07-31 18:42:51 +0100
  • 9f89b62b25
    Merge pull request #3715 from martin-frbg/issue3648 Martin Kroeker 2022-08-07 08:45:06 +0200
  • 9c4e91a77d
    Increase threshold Martin Kroeker 2022-08-07 00:03:50 +0200
  • 1fe06caf49
    Increase threshold Martin Kroeker 2022-08-07 00:03:20 +0200
  • ff58e9a7f1
    Merge pull request #3609 from martin-frbg/lapack3101 Martin Kroeker 2022-08-06 14:31:56 +0200
  • f6a1854ce9
    resync gensymbol with develop Martin Kroeker 2022-08-06 09:29:09 +0200
  • 2bee490287
    Merge pull request #3714 from martin-frbg/crosscmake Martin Kroeker 2022-08-04 23:58:21 +0200
  • 85fd3c4279
    Support compilation with the Cray C and Fortran compilers (#3712) Martin Kroeker 2022-08-04 20:42:18 +0200
  • 3784b3d45c
    Add more x86_64 target definitions for cross-compiling Martin Kroeker 2022-08-04 19:18:32 +0200
  • ff3c938fc9
    Fix duplicate endif and collateral damage Martin Kroeker 2022-08-04 19:03:54 +0200
  • 0301e7e9c8
    Restore cleaning of Cray LAPACK_FFLAGS and extend to regular FFLAGS as well Martin Kroeker 2022-08-04 18:36:44 +0200
  • bc7cedb8b7
    Fix placement of CRAY Fortran options block Martin Kroeker 2022-08-04 17:43:37 +0200
  • 78412c392f
    Add -hnopattern to Cray link test Martin Kroeker 2022-08-04 09:12:27 +0200
  • e145901f69
    Use -hnopattern in Cray link options test Martin Kroeker 2022-08-04 09:02:36 +0200
  • b41250efaa
    clean up LAPACK_FFLAGS for NAG and Cray compilers Martin Kroeker 2022-08-03 23:32:43 +0200
  • 4b6d9e56c5
    clean up lapack_fflags for Cray Fortran too Martin Kroeker 2022-08-03 23:22:00 +0200
  • 480f794a32
    Add support for Cray Fortran compiler Martin Kroeker 2022-08-03 19:17:09 +0200
  • 43924da28b
    Add support for Cray Fortran compiler Martin Kroeker 2022-08-03 19:10:20 +0200
  • bc3b286b05
    Add Cray Fortran compiler Martin Kroeker 2022-08-03 19:08:42 +0200
  • 096ae6f2bd
    Merge pull request #3709 from nursik/develop Martin Kroeker 2022-08-03 15:43:27 +0200
  • 19fefd100e
    Merge pull request #3703 from martin-frbg/omp_adaptive Martin Kroeker 2022-08-03 15:38:39 +0200
  • 2e51a61914
    Merge pull request #3693 from Mayank-Raj3/Mayank-Raj3-patch-1 Martin Kroeker 2022-08-03 15:38:14 +0200
  • a7ac252fd9 Add TCORE Generic in prebuild.cmake Nursultan Zarlyk 2022-08-02 10:50:58 +0200
  • 19d4f90c44 Use auvx to detect CPUCFG on mips/loongson Jiaxun Yang 2022-07-31 18:17:07 +0100
  • 648a69a67e
    Merge pull request #3707 from martin-frbg/getarch_risc Martin Kroeker 2022-07-31 10:13:38 +0200
  • ef9c976a94
    Really fix compilation; fix crash when pmodel is not present in cpuinfo Martin Kroeker 2022-07-31 00:41:04 +0200
  • f727235be4
    Merge pull request #3706 from martin-frbg/czifunding Martin Kroeker 2022-07-30 14:11:45 +0200
  • 880bc1d1db
    Acknowledge past CZI EOSS 1/EOSS 3 funding Martin Kroeker 2022-07-30 12:34:09 +0200
  • 699c57d376
    Fix ABI CFLAGS setting on new-world LoongArch WANG Xuerui 2022-05-13 16:23:46 +0800
  • 869061c783
    Probe for old-world LASX flavor on LoongArch and use generic DGEMM kernel otherwise WANG Xuerui 2022-05-13 15:53:22 +0800
  • 5af7b8638b
    Move LoongArch make rules to Makefile.loongarch64 WANG Xuerui 2022-05-13 15:26:33 +0800
  • d0ba257de0
    Merge pull request #3704 from XiWeiGu/loongarch64_dynamic_arch Martin Kroeker 2022-07-28 20:31:20 +0200
  • 78da6a750a
    Merge pull request #3705 from RajalakshmiSR/bf16ppc Martin Kroeker 2022-07-28 18:38:14 +0200
  • 1d97405c02 POWER: Enable bfloat16 kernels by default Rajalakshmi Srinivasaraghavan 2022-07-28 07:43:53 -0500
  • fbfe1daf6e LoongArch64: Add DYNAMIC_ARCH support gxw 2022-07-28 13:47:20 +0800
  • 80cdfed7b2
    Use OMP_ADAPTIVE setting to choose between static and dynamic OMP threadpool size Martin Kroeker 2022-07-27 23:43:20 +0200
  • 08e3754b39
    Add environment variable OMP_ADAPTIVE Martin Kroeker 2022-07-27 23:41:47 +0200
  • 047a279f09
    Merge pull request #3702 from martin-frbg/issue3687 Martin Kroeker 2022-07-27 20:57:50 +0200
  • 30473b6a9d
    add openblas_getaffinity() Martin Kroeker 2022-07-27 19:15:18 +0200
  • 8668571040
    add openblas_getaffinity() Martin Kroeker 2022-07-27 19:14:36 +0200
  • daca01622b
    fix detection of Neoverse V1 and user-enforced selection of N2 in ARM64 DYNAMIC_ARCH (#3700) Martin Kroeker 2022-07-27 09:17:43 +0200
  • 05a7d3cce9
    another missed declaration of NEOVERSEV1 Martin Kroeker 2022-07-26 22:49:57 +0200
  • f41730817e
    add missing define for NEOVERSEV1 Martin Kroeker 2022-07-26 22:07:55 +0200
  • 151d7fc423
    fix detection of Neoverse V1 and user-enforced selection of N2 Martin Kroeker 2022-07-26 21:25:18 +0200
  • c322aab685
    Merge pull request #3684 from imzhuhl/neoversen2_dynamic_arch Martin Kroeker 2022-07-26 20:06:26 +0200
  • cf796aee8c
    Merge pull request #3699 from martin-frbg/issue3692 Martin Kroeker 2022-07-26 16:36:43 +0200
  • 28d40ba60b
    Merge pull request #3696 from XiWeiGu/loongson2k1000 Martin Kroeker 2022-07-26 13:55:41 +0200
  • 692848d20c
    typo fix Martin Kroeker 2022-07-25 21:59:03 +0200
  • 76ea7739dd
    Merge pull request #3698 from martin-frbg/issue3697 Martin Kroeker 2022-07-25 20:25:23 +0200
  • f8c5bdfbab
    Treat Fujitsu fcc on Fugaku like clang Martin Kroeker 2022-07-25 19:48:59 +0200
  • 70001e1e9e
    Add Fujitsu compiler Martin Kroeker 2022-07-25 19:42:59 +0200
  • cf37182260
    Add Fujitsu compiler (fcc) Martin Kroeker 2022-07-25 19:39:17 +0200
  • 68d86ea150
    Add Fujitsu compiler Martin Kroeker 2022-07-25 19:34:16 +0200
  • 7aaa0ce0e8
    utest needs to be linked against libm on QNX as well Martin Kroeker 2022-07-25 17:02:16 +0200
  • cd8e57040c
    Merge pull request #3691 from martin-frbg/issue3679-sparc Martin Kroeker 2022-07-25 15:41:15 +0200
  • 3573306a69 LoongArch64: Add core LOONGSON2K1000 and LOONGSONGENERIC gxw 2022-07-22 17:23:43 +0800
  • a4303ae378
    Merge pull request #3695 from martin-frbg/ppc6nrm2 Martin Kroeker 2022-07-25 06:14:30 +0200
  • 31377d04f0
    Merge pull request #3694 from martin-frbg/traviswait Martin Kroeker 2022-07-24 22:13:08 +0200
  • 6c118b7977
    Fix DNRM2 returning INF instead of zero due to intermediate overflow Martin Kroeker 2022-07-24 17:42:31 +0200
  • b60415a347
    Add back travis_wait to keep ppc jobs from getting cancelled Martin Kroeker 2022-07-24 16:44:16 +0200
  • a9939111d7
    Update dgemv_thread_safety.cpp Mayank Raj 2022-07-24 11:51:25 +0530
  • c43ec53bdd
    Merge pull request #3690 from RajalakshmiSR/cdotp10 Martin Kroeker 2022-07-19 13:59:16 +0200
  • b7c65d08cb
    Merge pull request #3689 from RajalakshmiSR/dgemvgcc10 Martin Kroeker 2022-07-19 10:25:01 +0200
  • fcbbd8c25c
    Merge pull request #3682 from XiWeiGu/develop Martin Kroeker 2022-07-19 10:24:28 +0200
  • 06ef015234
    fix DNRM2 returning INF instead of zero due to intermediate overflow Martin Kroeker 2022-07-19 10:19:27 +0200
  • a612e78a97 POWER: Fix complex dot function failures Rajalakshmi Srinivasaraghavan 2022-07-18 14:48:43 -0500
  • 432fd99445 POWER10: dgemv builtin rename Rajalakshmi Srinivasaraghavan 2022-07-18 09:48:01 -0500
  • 4dd05e526b LoongArch64: Fix dnrm2_tiny testcase failure gxw 2022-07-15 11:18:59 +0800
  • 7da799dc66
    Merge pull request #3686 from martin-frbg/issue3685 Martin Kroeker 2022-07-13 08:24:15 +0200
  • 6e018b84c4
    Fix function prototypes and INTERFACE64 support Martin Kroeker 2022-07-12 19:37:30 +0200
  • ccd87cc472
    Fix switching between Fortran and C build Martin Kroeker 2022-07-12 19:35:31 +0200
  • d5ca477f42 Neoverse N2: DYNAMIC_ARCH Honglin Zhu 2022-07-12 00:40:22 +0800
  • cce4b1d956 MIPS64: Fix dnrm2_tiny testcase failure gxw 2022-07-07 20:39:01 +0800
  • 7918ba11c2
    Merge pull request #3680 from martin-frbg/issue3636-2 Martin Kroeker 2022-07-07 11:38:24 +0200
  • 69148ae795
    Guard against sysconf returning zero processors Martin Kroeker 2022-07-06 17:22:18 +0200
  • e9260f5451
    Guard against system call returning zero processors Martin Kroeker 2022-07-06 17:21:10 +0200
  • 4cfd6f110a
    Merge pull request #3678 from martin-frbg/issue3677 Martin Kroeker 2022-07-05 10:40:32 +0200
  • e12d474780
    Eliminate uses of CREAL on left-hand side of assignments Martin Kroeker 2022-07-05 00:01:09 +0200
  • 686e6d7c10
    Merge pull request #3676 from martin-frbg/dnrm2-utest Martin Kroeker 2022-07-04 08:37:18 +0200
  • c5041ae270
    properly embed test_dnrm2 Martin Kroeker 2022-07-03 23:48:30 +0200
  • 8e6f719ad3
    use huge_val not huge_valf for portability Martin Kroeker 2022-07-03 20:19:24 +0200
  • af88494f87
    old systems may not have inf in math.h Martin Kroeker 2022-07-03 18:23:51 +0200
  • ee41b6eb24
    Add DNRM2 regression test for issues 2998 and 3654 Martin Kroeker 2022-07-03 17:56:49 +0200
  • bf8998a9f4
    Merge pull request #3675 from martin-frbg/issue3654 Martin Kroeker 2022-07-03 08:45:45 +0200
  • 9e29598575
    workaround fault with ssq=inf,scale=0 Martin Kroeker 2022-07-02 23:47:17 +0200
  • 3df3d622eb
    Merge pull request #3672 from imzhuhl/neoversen2_bf16 Martin Kroeker 2022-07-01 12:13:42 +0200
  • 407a1a242c
    Merge pull request #3670 from martin-frbg/osxvermin Martin Kroeker 2022-06-29 08:31:04 +0200
  • ec0d5c7a2a Add gfortran parameters Honglin Zhu 2022-06-29 10:08:06 +0800
  • 123e0dfb62 Neoverse N2 sbgemm: Honglin Zhu 2022-06-22 23:00:40 +0800
  • bc3728475f format code Honglin Zhu 2022-06-16 19:36:22 +0800
  • 55d686d41e neoverse n2 sbgemm: implement ncopy tcopy kernel_8x4 Honglin Zhu 2022-06-15 14:20:25 +0800
  • 04593bb27c neoverse n2 sbgemm: init file Honglin Zhu 2022-06-13 17:05:43 +0800
  • 1fb4259077
    Merge pull request #3673 from martin-frbg/azuredynmingw Martin Kroeker 2022-06-28 23:13:11 +0200
  • 47a0e53196
    mingw-dynamic arch: drop Haswell too Martin Kroeker 2022-06-28 21:40:04 +0200