Commit Graph

  • 5af06c764a Fix MingW build Martin Köhler 2016-10-26 16:03:00 +0200
  • 7cd26f7e38 Update gitignore Martin Köhler 2016-10-26 15:19:40 +0200
  • 77006cc2a3 Import LAPACK: top directory Martin Köhler 2016-10-26 15:14:13 +0200
  • 57eee3fa43 Import LAPACK: TESTING directory Martin Köhler 2016-10-26 15:13:03 +0200
  • 92a858e69e Import LAPACK: SRC directory Martin Köhler 2016-10-26 15:12:09 +0200
  • 13d40e7591 Import LAPACK: LAPACKE directory Martin Köhler 2016-10-26 15:06:08 +0200
  • 4c29d20108 Import LAPACK: INSTALL directory Martin Köhler 2016-10-26 15:04:39 +0200
  • 1357b8d93b Import LAPACK: DOCS directory Martin Köhler 2016-10-26 15:03:51 +0200
  • fb7057babe Import LAPACK: CMAKE directory Martin Köhler 2016-10-26 15:03:16 +0200
  • 358ee318ed Import LAPACK: CBLAS directory Martin Köhler 2016-10-26 15:02:41 +0200
  • 4c024b85e4 Import LAPACK: BLAS directory Martin Köhler 2016-10-26 15:02:09 +0200
  • e9ccbe738c Add CMAKE install target Martin Kroeker 2016-10-19 15:27:22 +0200
  • e54b6ddaa0 Merge pull request #986 from ksraste/develop Zhang Xianyi 2016-10-18 12:38:52 +0800
  • bcfc298c38 Merge pull request #987 from Sbte/master Zhang Xianyi 2016-10-18 12:38:33 +0800
  • ce7c6c6b2d Fix HASWELL capitalization in kernel cmake file Sven Baars 2016-10-17 16:32:34 +0200
  • f3419e634c SGEMM, DGEMM, CGEMM, ZGEMM functions data prefetch kaustubh 2016-10-17 18:29:38 +0530
  • 7472c79ea6 Merge pull request #984 from ksraste/develop Zhang Xianyi 2016-10-17 11:33:16 +0800
  • 66c9a9b33d Merge pull request #981 from howard0su/develop Zhang Xianyi 2016-10-17 11:32:57 +0800
  • 3705f5675a Merge pull request #982 from martin-frbg/develop Zhang Xianyi 2016-10-17 11:32:20 +0800
  • bce2b34f7a Merge pull request #1 from martin-frbg/martin-frbg-patch-1 Martin Kroeker 2016-10-16 22:51:42 +0200
  • da83ec94d1 Merge pull request #2 from martin-frbg/martin-frbg-patch-1-1 Martin Kroeker 2016-10-16 22:48:58 +0200
  • 3409bccb21 Update cpuid_x86.c Martin Kroeker 2016-10-16 22:45:44 +0200
  • 8a8f3932eb Update dynamic.c Martin Kroeker 2016-10-16 22:40:00 +0200
  • 90e2321ac3 STRSM, DTRSM functions data prefetch kaustubh 2016-10-14 16:41:28 +0530
  • 4998e19869 Change file comments to work around clang 3.9 assembler bug Martin Kroeker 2016-10-13 16:51:08 +0200
  • ff1da01476 USE NPROCESSOR_CONF instaed of NPORCESSOR_ONLN Howard Su 2016-10-13 12:37:50 +0000
  • ef52a9266b Fixed #979. Patch for NetBSD. Zhang Xianyi 2016-10-13 10:17:07 +0800
  • 4f38ae3199 Merge pull request #970 from martin-frbg/develop Zhang Xianyi 2016-10-13 10:13:56 +0800
  • 4baf0c7cfc Merge pull request #980 from kiwifb/utest_ldflags Zhang Xianyi 2016-10-13 10:13:12 +0800
  • 595a0224e4 Merge pull request #973 from vladimir-ch/fix-lapacke-xlarfb Zhang Xianyi 2016-10-13 10:12:35 +0800
  • f124ffab47 make utest/Makefile respect LDFLAGS François Bissey 2016-10-13 09:32:25 +1300
  • 91610f3835 Update zdot_msa.c Martin Kroeker 2016-10-05 18:59:09 +0200
  • 6e22ecf102 Update zdot.c Martin Kroeker 2016-10-05 18:58:03 +0200
  • 6221d6df5f Update zdot.c Martin Kroeker 2016-10-05 18:57:14 +0200
  • 117d3371d4 LAPACKE: fix wrong direction check in LAPACKE_?larfb_work Vladimir Chalupecky 2016-10-01 05:31:30 +0900
  • 16446d1d23 Remove explicit include of complex.h Martin Kroeker 2016-09-29 23:45:56 +0200
  • a6e9e0b94b Remove explicit include of complex.h Martin Kroeker 2016-09-29 23:43:28 +0200
  • 3178e4fea0 Remove explicit include of complex.h Martin Kroeker 2016-09-29 23:41:43 +0200
  • 95c245ddb0 Remove explicit include of complex.h Martin Kroeker 2016-09-29 23:40:36 +0200
  • 4b1b27347f Remove explicit include of complex.h Martin Kroeker 2016-09-29 23:39:35 +0200
  • 161c927071 Merge pull request #968 from buffer51/develop Zhang Xianyi 2016-09-22 11:34:57 -0400
  • 662f89f059 Merge pull request #969 from sva-img/develop Zhang Xianyi 2016-09-22 11:33:51 -0400
  • 54747fe24a DGEMM function split and data prefech Shivraj Patil 2016-09-22 17:25:46 +0530
  • 157ee498ac Updated CROSS_SUFFIX regex to work with CC containing arguments Paul MUSTIÈRE 2016-09-14 11:42:22 -0700
  • b09cc3b9bb Merge pull request #958 from intelfx/remove-stabs Zhang Xianyi 2016-09-13 16:15:37 -0400
  • b3656610f1 Remove unnecessary const from cblas.h Phil Ruffwind 2016-09-11 08:49:46 -0400
  • 6c0862a94f common_arm.h, common_mips.h: get rid of .func directives Ivan Shapovalov 2016-09-09 03:36:49 +0300
  • 842d842751 Update develop for 0.2.20.dev. Zhang Xianyi 2016-09-01 00:01:23 -0400
  • 85636ff1a0 Merge branch 'develop' v0.2.19 Zhang Xianyi 2016-08-31 23:58:42 -0400
  • 821affb9a0 Update doc for 0.2.19. Zhang Xianyi 2016-08-31 23:58:29 -0400
  • 515bc56ea9 Refs #946. Use nrm2 reference implementation for Power8. Zhang Xianyi 2016-08-18 18:59:43 -0700
  • ae70b916f4 Refs #929. Deal with zero and NaNs for scale. Zhang Xianyi 2016-08-18 10:24:42 -0700
  • 9ea0144482 Merge pull request #941 from sva-img/develop Zhang Xianyi 2016-08-18 09:31:31 -0400
  • 1f217a6175 Merge pull request #943 from ibmsoe/IBMMASS_Support Zhang Xianyi 2016-08-12 17:20:59 -0400
  • 78348a2853 Added support of IBM's MASS library that optimizes performance on Power architectures nishidha@us.ibm.com 2016-08-11 14:43:26 +0530
  • 9687437928 MIPS n32 ABI and build time mips simd support check Shivraj Patil 2016-08-10 17:44:22 +0530
  • d1c6469283 MIPS n32 ABI support, MSA support detection and rename ARCH, ARCHFLAGS Shivraj Patil 2016-08-08 11:58:01 +0530
  • b544be914d Merge pull request #933 from ashwinyes/develop_aarch64_20160726_Dgemm_8x4_Opts Zhang Xianyi 2016-07-26 09:54:31 -0400
  • c54a29bb48 Cortex A57: Improvements to DGEMM 8x4 kernel Ashwin Sekhar T K 2016-07-25 14:33:25 +0530
  • ff4c5deafa Merge pull request #930 from sva-img/develop Zhang Xianyi 2016-07-22 11:42:30 -0400
  • 22b9c2747d P6600/I6400 Build fix. Reverted the changes which was done to support for MIPS n32 ABI Shivraj Patil 2016-07-22 18:45:06 +0530
  • 27b5211ccd Merge pull request #927 from sva-img/develop Zhang Xianyi 2016-07-15 11:17:30 -0400
  • beb1d076a4 Added MSA optimization for GEMV_N, GEMV_T, ASUM, DOT functions Shivraj Patil 2016-07-15 18:38:25 +0530
  • 9e44f3ddd0 Refs #917 Avoid detecting gfortran bug on IBM POWER + Ubuntu Zhang Xianyi 2016-07-14 13:09:36 -0700
  • eece9fd889 Merge pull request #926 from vriera/develop Zhang Xianyi 2016-07-14 15:49:33 -0400
  • 5dfa0712c3 Merge pull request #925 from martin-frbg/develop Zhang Xianyi 2016-07-14 15:48:58 -0400
  • 8a592ee386 Merge pull request #924 from ashwinyes/develop_aarch64_improvements_20160714 Zhang Xianyi 2016-07-14 15:47:55 -0400
  • 7f2409a8e1 Merge pull request #918 from sva-img/develop Zhang Xianyi 2016-07-14 15:45:39 -0400
  • 7f28cd1f88 Complete support for MIPS n32 ABI Vicente Olivert Riera 2016-07-14 17:20:51 +0100
  • 154729908e Update cpuid_x86.c Martin Kroeker 2016-07-14 17:29:34 +0200
  • 97bd1e42c8 Update cpuid_x86.c Martin Kroeker 2016-07-14 12:25:17 +0200
  • 7de829f713 Update dynamic.c Martin Kroeker 2016-07-14 12:22:55 +0200
  • 9b69d8a8e5 Update zgetrf2.f Martin Kroeker 2016-07-14 11:41:57 +0200
  • 0a5ff9f9f9 Improvements to TRMM and GEMM kernels Ashwin Sekhar T K 2016-07-14 13:51:17 +0530
  • 8a40f1355e Improvements to GEMV kernels Ashwin Sekhar T K 2016-07-14 13:50:38 +0530
  • 78782485b6 Improvements to COPY and IAMAX kernels Ashwin Sekhar T K 2016-07-14 13:49:15 +0530
  • 8d86d14d3f Add time prints in benchmark output Ashwin Sekhar T K 2016-07-14 13:48:13 +0530
  • 925d4e1dc6 Add IAMAX and NRM2 benchmarks Ashwin Sekhar T K 2016-07-14 13:46:01 +0530
  • 57df7956ee Added CGEMM, ZGEMM, STRMM, DTRMM, CTRMM, ZTRMM. Updated macros in SGEMM, DGEMM, STRMM. Shivraj Patil 2016-06-28 17:51:10 +0530
  • 437c7d64f2 Merge pull request #913 from dpfoose/develop Zhang Xianyi 2016-06-27 10:05:30 -0400
  • ca5c25c870 Merge pull request #907 from jeromerobert/bug786 Zhang Xianyi 2016-06-27 10:04:54 -0400
  • 4a30a2584a Merge pull request #897 from ksraste/develop Zhang Xianyi 2016-06-27 10:04:18 -0400
  • a722056e76 change old GNU keyword __real__ __imag__ to CREAL, CIMAG macro mdong 2016-06-24 17:06:46 -0400
  • 098d8ec5d6 remove input from clobbered list mdong 2016-06-24 16:37:58 -0400
  • a94f2b7848 Change to allow compiling with USE_OPENMP on MSVC Daniel Patrick Foose 2016-06-14 14:37:28 -0400
  • d346c533b1 Fix z/ctrmv stack allocation on AMD bulldozer and barcelona target Jerome Robert 2016-06-07 16:11:09 +0200
  • f04af36ad0 Merge pull request #898 from wernsaar/develop Werner Saar 2016-05-31 14:13:52 +0200
  • 41000c8443 added directory for optimized lapack fortan codes and added dlaqr5.f Werner Saar 2016-05-31 12:53:07 +0200
  • 011431b9d7 STRSM optimized for MSA Kaustubh Raste 2016-05-31 10:17:23 +0530
  • 36a86945a2 Merge c8a7860eb3 into 2daad2bcb5 ksraste 2016-05-30 15:37:41 +0000
  • c8a7860eb3 STRSM optimized Kaustubh Raste 2016-05-30 21:17:00 +0530
  • 2daad2bcb5 Merge pull request #893 from biddisco/develop Zhang Xianyi 2016-05-30 14:52:58 +0800
  • bac478d17e Merge pull request #891 from rndfax/develop Zhang Xianyi 2016-05-30 14:52:40 +0800
  • 053044ae4d Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR John Biddiscombe 2016-05-25 09:13:28 +0200
  • fca66262c4 mips64/axpy: fix error when INCY == 0 Aleksey Kuleshov 2016-05-23 13:24:15 +0300
  • 412bcd187a optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S Werner Saar 2016-05-23 11:20:41 +0200
  • bd06b246cc Merge pull request #890 from wernsaar/develop Werner Saar 2016-05-22 16:01:35 +0200
  • 8b140220c8 optimized dtrsm_kernel_LT for POWER8 Werner Saar 2016-05-22 15:20:04 +0200
  • 318cad9c37 added trsm bencharks for POWER8 to benchmark/Makefile Werner Saar 2016-05-22 13:51:47 +0200
  • 8fb5a1aaff added optimized dtrsm_LT kernel for POWER8 Werner Saar 2016-05-22 13:09:05 +0200