Commit Graph

  • 3d9a50e841 added optimized sswap kernel for POWER8 Werner Saar 2016-03-25 17:34:55 +0100
  • 828c849b44 added optimized ccopy kernel for POWER8 Werner Saar 2016-03-25 16:54:25 +0100
  • ecc0bc9813 added optimized scopy kernel for POWER8 Werner Saar 2016-03-25 16:06:56 +0100
  • 12f209b7b0 added optimized zswap kernel for POWER8 Werner Saar 2016-03-25 15:27:34 +0100
  • 7316a87930 added optimized dswap kernel for POWER8 Werner Saar 2016-03-25 14:35:43 +0100
  • 0bff057a87 added optimized dcopy kernel for POWER8 Werner Saar 2016-03-25 13:03:02 +0100
  • 7ee1d29dd4 Merge pull request #822 from wernsaar/develop wernsaar 2016-03-25 10:15:51 +0100
  • 1e6cf9808c added optimized dscal kernel for POWER8 Werner Saar 2016-03-25 09:42:08 +0100
  • 749e605e9d Affinity shared memory area must be at least a page Anton Blanchard 2016-03-24 06:17:06 +0000
  • 278511ad2d Cortex-A57: Fix clang compilation errors Ashwin Sekhar T K 2016-03-24 10:31:28 +0530
  • 3b5ffb49d3 Cortex-A57: Improve DGEMM 8x4 Implementation Ashwin Sekhar T K 2016-03-17 10:23:51 +0530
  • 8519e4ed9f Merge pull request #817 from wernsaar/develop wernsaar 2016-03-23 13:37:04 +0100
  • 55eda3813b added optimized zaxpy kernel for POWER8 Werner Saar 2016-03-23 11:20:23 +0100
  • 53bfc83c26 Update appveyor version. Zhang Xianyi 2016-03-22 11:37:35 -0400
  • 13ca89f6f0 Merge pull request #813 from theoractice/develop Zhang Xianyi 2016-03-22 11:31:37 -0400
  • 461cf9ea38 Merge pull request #814 from wernsaar/develop wernsaar 2016-03-22 15:24:59 +0100
  • 0664ba4c97 added optimized daxpy kernel for POWER8 Werner Saar 2016-03-22 14:50:03 +0100
  • aa744dfa59 Update memory.c Theoractice 2016-03-22 20:02:37 +0800
  • 61cf8f74d9 Fix access violation on Windows while static linking theoractice 2016-03-22 19:14:54 +0800
  • de202fa375 Merge pull request #1 from xianyi/develop Theoractice 2016-03-22 05:33:20 -0500
  • 6f93b53590 Merge pull request #812 from wernsaar/develop wernsaar 2016-03-21 13:59:44 +0100
  • 11c44dede1 added optimized sdot kernel for POWER8 Werner Saar 2016-03-21 13:18:23 +0100
  • f00d642592 Merge pull request #811 from wernsaar/develop wernsaar 2016-03-21 10:48:41 +0100
  • 9e4584d069 added optimized zdot kernel for POWER8 Werner Saar 2016-03-21 10:12:07 +0100
  • 2a5679da5f Merge branch 'release-0.2.17' into develop Zhang Xianyi 2016-03-20 20:52:43 -0400
  • a71e8c82f6 Fix change log typo. v0.2.17 Zhang Xianyi 2016-03-20 20:52:15 -0400
  • 9b987badb0 Merge branch 'master' into develop Bump to 0.2.18.dev Zhang Xianyi 2016-03-20 20:48:21 -0400
  • 1619b2f3c8 Merge branch 'release-0.2.17' Zhang Xianyi 2016-03-20 20:44:01 -0400
  • 4f3153395a Update doc for 0.2.17. Zhang Xianyi 2016-03-20 20:43:42 -0400
  • d7a1a7ff2a Merge branch 'release-0.2.17' into develop Zhang Xianyi 2016-03-20 09:24:28 -0400
  • 308e6195b7 Refs #807. Enable BUILD_LAPACK_DEPRECATED=1 by default. Zhang Xianyi 2016-03-20 09:22:56 -0400
  • 7a3d7b1f52 Merge pull request #808 from theoractice/develop Zhang Xianyi 2016-03-20 09:07:47 -0400
  • 74cc2d6623 Merge pull request #809 from wernsaar/develop wernsaar 2016-03-20 13:16:41 +0100
  • fc3a558515 Fix a minor compiler error in VisualStudio with CMake theoractice 2016-03-20 18:58:18 +0800
  • cd9fafc054 ddot for POWER8: updated licence information Werner Saar 2016-03-20 11:19:27 +0100
  • 84b92e6373 added optimized ddot kernel for POWER8 Werner Saar 2016-03-20 11:06:06 +0100
  • c279a53ed8 Merge pull request #806 from wernsaar/develop wernsaar 2016-03-18 12:46:16 +0100
  • e1df5a6e23 fixed sgemm- and strmm-kernel Werner Saar 2016-03-18 12:12:03 +0100
  • 5c658f8746 add optimized cgemm- and ctrmm-kernel for POWER8 Werner Saar 2016-03-18 08:17:25 +0100
  • ec4390a967 Bump devlop version to 0.2.17.dev. Zhang Xianyi 2016-03-15 14:52:01 -0400
  • fced5744fb Merge branch 'release-0.2.16' v0.2.16 Zhang Xianyi 2016-03-15 14:49:10 -0400
  • 8c0fb1258d Update 0.2.16 doc Zhang Xianyi 2016-03-15 14:48:41 -0400
  • aae581d004 Merge branch 'develop' into release-0.2.16 Zhang Xianyi 2016-03-15 13:56:01 -0400
  • e17303933a Merge pull request #802 from ashwinyes/develop_20160314_dgemm_optimization Zhang Xianyi 2016-03-14 20:31:03 -0400
  • f9226275f4 Merge pull request #801 from Keno/patch-3 Zhang Xianyi 2016-03-14 15:42:31 -0400
  • cf8c7e28b3 Update CONTRIBUTORS.md Ashwin Sekhar T K 2016-03-14 19:59:41 +0530
  • 5ac02f6dc7 Optimize Dgemm 4x4 for Cortex A57 Ashwin Sekhar T K 2016-03-14 19:35:23 +0530
  • 7aa1ad4923 Functional Assembly Kernels for CortexA57 Ashwin Sekhar T K 2016-03-14 19:33:21 +0530
  • dcd15b546c BUGFIX: KERNEL.POWER8 Werner Saar 2016-03-14 14:36:59 +0100
  • 96284ab295 added sgemm- and strmm-kernel for POWER8 Werner Saar 2016-03-14 13:52:44 +0100
  • d5e1255ca7 Don't pass REALNAME to `.end` Keno Fischer 2016-03-13 18:56:21 -0400
  • 587455868e Merge pull request #800 from jeromerobert/smallscaling Zhang Xianyi 2016-03-10 15:45:33 -0500
  • 323c237e7b Fix smallscaling compilation Jerome Robert 2016-03-10 20:24:41 +0100
  • faa5e2e5e3 FIX: forgot the add the files cgemv_n_4.c and cgemv_t_4.c Werner Saar 2016-03-10 11:10:38 +0100
  • 551fdf53e8 Merge pull request #799 from wernsaar/develop wernsaar 2016-03-10 10:22:08 +0100
  • fdf291be30 Added optimized cgemv_n and cgemv_t kernels for bulldozer, piledriver and steamroller Werner Saar 2016-03-10 09:42:07 +0100
  • 68eb4fa329 Add missing openblas_env makefile. Zhang Xianyi 2016-03-09 14:52:47 -0500
  • 05196a8497 Refs #716. Only call getenv at init function. Zhang Xianyi 2016-03-09 12:50:07 -0500
  • db9b611b12 Merge pull request #798 from wernsaar/develop wernsaar 2016-03-09 15:55:56 +0100
  • 2e6333f74e modified common.h for piledriver Werner Saar 2016-03-09 15:48:29 +0100
  • c99cc41cbd Added optimized zgemv_n kernel for bulldozer, piledriver and steamroller Werner Saar 2016-03-09 14:02:03 +0100
  • 711ecb8bd5 Merge pull request #797 from wernsaar/develop wernsaar 2016-03-07 16:44:17 +0100
  • 10c2ebdfc5 BUGFIX: removed fixes for bugs #148 and #149, because info for xerbla is wrong Werner Saar 2016-03-07 10:34:04 +0100
  • 26b3b3a3e6 bugfixes form lapack svn for bugs #142 - #155 Werner Saar 2016-03-07 10:10:00 +0100
  • acdff55a6a Bugfix for ztrmv Werner Saar 2016-03-07 09:39:34 +0100
  • 7d6b68eb4a Refs #786. Revert to default assembly kernel. Zhang Xianyi 2016-03-07 11:34:58 +0800
  • 0bbca5e803 removed build of smallscaling, because build on arm, arm64 and power fails Werner Saar 2016-03-06 11:54:41 +0100
  • cd5241d0cf modified KERNEL for power, to use the generic DSDOT-KERNEL Werner Saar 2016-03-06 09:07:24 +0100
  • 8d652f11e7 updated smallscaling.c to build without C99 or C11 increased the threshold value of nep.in to 40 Werner Saar 2016-03-06 08:40:51 +0100
  • 6c86570e1f Merge pull request #790 from jeromerobert/bug786 Zhang Xianyi 2016-03-05 15:25:27 -0500
  • 53ba1a77c8 ztrmv_L.c: no longer need a 4kB buffer Jerome Robert 2016-03-05 19:07:03 +0100
  • d23c7c713c Fixed #789 Fix utest/ctest.h on Mingw. Zhang Xianyi 2016-03-05 09:34:37 -0500
  • 8c43d7fa5f Merge remote-tracking branch 'origin/power8' into develop Zhang Xianyi 2016-03-05 06:03:19 -0500
  • 085f215257 Modified assembly label name, so that they are hidden. Added license informations. power8 Werner Saar 2016-03-05 10:27:27 +0100
  • 8f758eeff9 Refs #786. avoid old assembly c/zgemv kernels. Zhang Xianyi 2016-03-05 08:32:03 +0800
  • 0afc76fd65 enabled gemm_beta assembly kernels Werner Saar 2016-03-04 15:01:15 +0100
  • 91e1c5080c modified configuration, to use power6 sgemm kernel for power8 Werner Saar 2016-03-04 13:38:57 +0100
  • 73f04c2c72 enabled hemv assemly function for power8 Werner Saar 2016-03-04 13:20:50 +0100
  • 3e633152c6 enabled symv assembly kernels on power8 Werner Saar 2016-03-04 13:08:18 +0100
  • d5130ce7e3 enabled gemv assembly on power8 Werner Saar 2016-03-04 12:53:31 +0100
  • 4824b88fcb enabled all level1 assembly kernels for power8 Werner Saar 2016-03-04 12:35:25 +0100
  • cc26d888b8 BUGFIX: increased BUFFER_SIZE for POWER8 Werner Saar 2016-03-04 10:26:53 +0100
  • 8577be2a95 Modify travis script. Zhang Xianyi 2016-03-04 04:24:43 +0800
  • 1edf30b790 Change Opteron(SSE3) to Opteron_SSE3 at dyanmaic core name. Zhang Xianyi 2016-03-01 20:13:08 +0800
  • b752858d6c added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 Werner Saar 2016-03-01 07:33:56 +0100
  • 4fc8c937d4 Refs #695 add testcase. Zhang Xianyi 2016-03-01 01:05:56 -0500
  • efa4f5c936 Refs #695 #783. Replace default x86_64 cgemv_t asm kernel by C kernel. Zhang Xianyi 2016-03-01 11:18:56 +0800
  • 17d655fa64 Merge pull request #784 from peterph/develop Zhang Xianyi 2016-02-27 11:24:20 -0500
  • f68141cf1d collected usage notes Petr Cerny 2016-02-27 16:57:22 +0100
  • aa90518201 Update Changelog for 0.2.16.rc1. Zhang Xianyi 2016-02-24 15:21:22 -0500
  • 6b85dbb6dc Refs #696. Turn off stack limit setting on Linux. Zhang Xianyi 2016-02-24 14:18:39 -0500
  • a0debd4293 Refs #696. Turn off stack limit setting on Linux. Zhang Xianyi 2016-02-24 14:18:39 -0500
  • 937493bfeb Release 0.2.16 rc1 v0.2.16.rc1 Zhang Xianyi 2016-02-23 18:29:21 -0500
  • 74b0672223 Fix c/zaxpyc kernel bug on Cortex-A57. Zhang Xianyi 2016-02-23 22:47:53 +0000
  • 6e7be06e07 Refs JuliaLang/julia#5728. Fix gemv performance bug on Haswell Mac OSX. Zhang Xianyi 2016-02-19 17:56:07 -0500
  • a04d0555ba [av skip] Fix utest makefile bug on travis ci. Zhang Xianyi 2016-02-20 00:21:43 +0800
  • 3761c30ba4 Fix makefile bug for utest. Zhang Xianyi 2016-02-18 17:01:48 -0500
  • 38593cd3a3 Fix compiling bug on ARM Cortex-A57. Zhang Xianyi 2016-02-13 15:38:52 +0000
  • e3b7781c2b Update readme. Zhang Xianyi 2016-02-13 00:33:53 +0800
  • 5e6965ea47 Run utest when building. Zhang Xianyi 2016-02-13 00:33:31 +0800