Commit Graph

  • f09a9afa03 Merge pull request #1107 from quickwritereader/develop Martin Kroeker 2017-02-26 09:49:01 +0100
  • 0d96b0e2a7 Merge branch 'z13' into develop Abdurrauf 2017-02-26 06:17:33 +0400
  • 848cb27b1e ztrmm kernel. Abdurrauf 2017-02-26 05:59:24 +0400
  • dc34a0da96 Merge pull request #915 from mdong/small_fix_for_icc Martin Kroeker 2017-02-23 20:00:22 +0100
  • a3935f0dfb THUNDERX2T99: Add Optimized D/Z NRM2 Implementation Ashwin Sekhar T K 2017-02-22 02:26:51 -0800
  • 47e9fe0bb4 Merge pull request #1105 from martin-frbg/testing-eig-typos Martin Kroeker 2017-02-22 22:42:52 +0100
  • c7bc0ee823 Remove spurious names from EXTERNAL list Martin Kroeker 2017-02-22 21:48:35 +0100
  • 6bdee6d50a Remove spurious names from EXTERNAL list Martin Kroeker 2017-02-22 21:45:27 +0100
  • 009c0d2e5a Fix typo in EXTERNAL declaration Martin Kroeker 2017-02-22 21:41:07 +0100
  • 4d88e1a4ad Merge pull request #1104 from martin-frbg/lapack-comma Martin Kroeker 2017-02-22 10:31:39 +0100
  • 0958b49811 Fix missing comma on continued line Martin Kroeker 2017-02-22 08:40:39 +0100
  • 09b240f1ef Fix missing comma on continued line Martin Kroeker 2017-02-22 08:39:06 +0100
  • 69f4e8b86c Fix missing comma on continued line Martin Kroeker 2017-02-22 08:34:20 +0100
  • e072e68aa0 Fix missing comma in continued line Martin Kroeker 2017-02-22 08:32:20 +0100
  • 738628e9a8 ARM64: Remove unused code Ashwin Sekhar T K 2017-02-21 21:42:32 -0800
  • e527dbffaa Merge pull request #1103 from vladimir-ch/fix-lapacke-ormbr Martin Kroeker 2017-02-21 22:58:30 +0100
  • eeaee46e86 LAPACKE: fix wrong matrix size in ?ormbr Vladimir Chalupecky 2017-02-21 21:57:18 +0100
  • 040672ecf6 Merge pull request #1098 from martin-frbg/amodra-power8 Martin Kroeker 2017-02-21 15:26:14 +0100
  • c8ce9e4377 Merge pull request #1101 from martin-frbg/martin-frbg-patch-1 Martin Kroeker 2017-02-21 15:19:56 +0100
  • ab3ffab96a THUNDERX2T99: Add Optimized C/Z DOT Implementation Ashwin Sekhar T K 2017-02-21 03:25:00 -0800
  • f036be9ce2 THUNDERX2T99: Add Optimized SDOT Implementation Ashwin Sekhar T K 2017-02-19 23:12:27 -0800
  • 69fa4eb701 Merge b8c0a1f7e2 into 39eecfd20c Martin Kroeker 2017-02-21 07:33:37 +0000
  • 39eecfd20c Merge pull request #1102 from brada4/develop Martin Kroeker 2017-02-21 08:26:39 +0100
  • 5088523786 detect apollo lake for real Andrew 2017-02-20 23:54:59 +0100
  • 3f7720ec4b LAPACKE: fix wrong number of columns in ?ormlq Martin Kroeker 2017-02-20 16:20:43 +0100
  • faba876fda THUNDERX2T99: Bug fix in C/Z IAMAX Ashwin Sekhar T K 2017-02-19 23:11:50 -0800
  • 172a62d73e THUNDERX2T99: Add Optimized C/Z IAMAX Implementation Ashwin Sekhar T K 2017-02-17 03:06:32 -0800
  • e545a66a5b Merge pull request #1091 from staticfloat/sf/corei5_7600k Martin Kroeker 2017-02-17 10:30:09 +0100
  • 228c75a69c THUNDERX2T99: Add parallel SCNRM2 Implementation Ashwin Sekhar T K 2017-02-14 04:10:06 -0800
  • 9e2f316ede Power8 inline assembly fixes Martin Kroeker 2017-02-13 23:38:50 +0100
  • e2489c9a92 Merge pull request #1096 from martin-frbg/pkg-config Martin Kroeker 2017-02-12 17:00:17 +0100
  • c4ea9eea67 Add cmake template for openblas.pc Martin Kroeker 2017-02-12 14:38:32 +0100
  • cd8f80634f Create and install openblas.pc in cmake builds Martin Kroeker 2017-02-12 14:37:33 +0100
  • faf06f0d8b Create and install only a single openblas.pc file Martin Kroeker 2017-02-12 14:35:48 +0100
  • c6fa4aef0c Rename blas.pc.in to openblas.pc.in Martin Kroeker 2017-02-12 14:34:03 +0100
  • 1029dcd60d Merge pull request #1095 from martin-frbg/lapack370-cmake Martin Kroeker 2017-02-12 14:30:29 +0100
  • d12c8bbcbb Add zlasyf_aa to lapack.cmake Martin Kroeker 2017-02-12 13:49:49 +0100
  • 15f0d65010 Add another bunch of lapack 3.7 functions to cmake list Martin Kroeker 2017-02-12 01:59:30 +0100
  • 7d831af1ba Add LAPACK 3.7 files not mentioned in announcement Martin Kroeker 2017-02-12 01:37:35 +0100
  • ee3e87cf46 Update cmake file list for lapacke 3.7.0 Martin Kroeker 2017-02-12 00:40:16 +0100
  • 8772c00bb0 Update cmake file list for lapack 3.7.0 Martin Kroeker 2017-02-11 23:11:26 +0100
  • 0a4a7e18f6 Merge pull request #1094 from martin-frbg/cmake-1 Martin Kroeker 2017-02-11 20:48:41 +0100
  • 357ef3cd8c Reflect name change of lapacke_mangling.h template Martin Kroeker 2017-02-11 19:56:02 +0100
  • 002e646476 Add new functions from LAPACK 3.6.1 Martin Kroeker 2017-02-11 19:54:02 +0100
  • 3dad87bbb5 Merge pull request #1093 from martin-frbg/restore-cmakeinstall Martin Kroeker 2017-02-11 17:41:39 +0100
  • bdd51cdabc Add cmake install target Martin Kroeker 2017-02-11 16:43:46 +0100
  • 1d8ab99e09 Add `exfamily == 9` case (Kaby Lake) to dynamic arch detection Elliot Saba 2017-02-10 15:23:34 -0800
  • 04b2b06665 CPUID mappings for Core i5-7600K (Kaby Lake) Elliot Saba 2017-02-10 14:47:10 -0800
  • 8a83daf4bf Merge pull request #1084 from isuruf/develop Martin Kroeker 2017-02-08 01:01:18 +0100
  • 39abb079fb Merge pull request #1087 from grisuthedragon/enable-a12 Martin Kroeker 2017-02-08 01:00:32 +0100
  • 76c6e33e54 Enable EXCAVATOR kernels for A12-9800 Martin Koehler 2017-02-07 21:38:28 +0100
  • 6204d8596d Merge a31511dcea into a9594e8072 Martin Köhler 2017-02-07 16:24:56 +0000
  • a31511dcea Add the A12-9800 to the dynamic arch detection. Martin Köhler 2017-02-07 17:23:40 +0100
  • a9594e8072 Merge pull request #1085 from vladimir-ch/lapacke_laswp_work Martin Kroeker 2017-02-07 11:40:41 +0100
  • 8e89668f62 THUNDERX2T99: Fix bug in SNRM2 Ashwin Sekhar T K 2017-02-07 02:14:33 -0800
  • f63deae9de THUNDERX2T99: Add Optimized S/D IAMAX Implementation Ashwin Sekhar T K 2017-02-05 20:57:54 -0800
  • d711588cf2 Enable the EXCAVATOR kernels on AMD PRO A12-9800 Martin Köhler 2017-02-07 09:52:02 +0100
  • 4c2b713ce5 LAPACKE: fix incorrect value of lda_t in lapacke_?laswp_work Vladimir Chalupecky 2017-02-07 09:21:46 +0100
  • cdc954675c Install pkg-config files Isuru Fernando 2017-02-06 11:59:48 +0530
  • 60eea75409 Merge pull request #1076 from ashwinyes/develop_20170130_thunderx2t99 Martin Kroeker 2017-02-04 17:25:43 +0100
  • b8c0a1f7e2 Fix register clobbers Martin Kroeker 2017-02-03 21:17:33 +0100
  • 071a830e8b THUNDERX2T99: Add optimized S/D/C/Z SWAP Implementations Ashwin Sekhar T K 2017-02-03 02:09:17 -0800
  • 46573a7c2d dgesvd_ bechmark Andrew 2017-02-03 01:46:12 +0100
  • d09f88192c THUNDERX2T99: Add optimized S/D/C/Z COPY Implementations Ashwin Sekhar T K 2017-02-01 22:10:35 -0800
  • e58233460a THUDNERX2T99: Add optimized D/C/Z ASUM Implementations Ashwin Sekhar T K 2017-01-31 23:25:41 -0800
  • 3918d17025 LAPACK: Fix lapack-test errors in ARM64 threaded version Ashwin Sekhar T K 2017-01-31 11:40:45 +0530
  • 99bd2892bf THUNDERX2T99: Add optimized CASUM Implementation Ashwin Sekhar T K 2017-01-27 01:26:00 -0800
  • ff6f572f2e THUNDERX2T99: Rename labels in for DDOT and SNRM2 Ashwin Sekhar T K 2017-01-30 12:09:04 +0530
  • e0dc5f58c5 THUNDERX2T99: Remove Duplicate Code Ashwin Sekhar T K 2017-01-27 01:11:58 -0800
  • 2757b49767 THUNDERX2T99: Add Optimized CGEMM Implementation Ashwin Sekhar T K 2017-01-25 03:14:59 -0800
  • ff41e13385 Merge pull request #1074 from ashwinyes/develop_20170116_thunderx2t99_sgemm Zhang Xianyi 2017-01-25 22:17:05 +0800
  • 1de6fa0f50 Update .gitignore Ashwin Sekhar T K 2017-01-24 23:14:09 -0800
  • efda640723 Benchmark: Add MFlops print in iamax benchmark Ashwin Sekhar T K 2017-01-24 23:13:47 -0800
  • 1530e78cfe Benchmarks: Avoid building lapack benchmarks when NO_LAPACK=1 Ashwin Sekhar T K 2017-01-24 20:50:23 -0800
  • 907e286eb6 THUNDERX2T99: Add threaded SNRM2 Implementation Ashwin Sekhar T K 2017-01-24 21:39:29 +0530
  • cde3aee08b ARM64: Rename kernel files to have consistent naming Ashwin Sekhar T K 2017-01-24 14:49:49 +0530
  • ee6ea7e988 THUNDERX2T99: Add Optimized CNRM2 Implementation Ashwin Sekhar T K 2017-01-19 15:57:13 +0530
  • ca0b36b012 THUNDERX2T99: Add Optimized SNRM2 Implementation Ashwin Sekhar T K 2017-01-19 00:57:02 -0800
  • 01e1d85339 Update .gitignore Ashwin Sekhar T K 2017-01-18 00:39:04 -0800
  • d0a79ca6e0 THUNDERX2T99: Add threaded DDOT Implementation Ashwin Sekhar T K 2017-01-19 10:56:17 +0530
  • 0c07003ccf THUNDERX2T99: Add Optimized DDOT Implementation Ashwin Sekhar T K 2017-01-19 10:53:48 +0530
  • f33fcedb30 THUNDERX2T99: Improve SGEMM Ashwin Sekhar T K 2017-01-18 00:57:11 -0800
  • 0f1d6e8b39 THUNDERX2T99: Improve DGEMM Ashwin Sekhar T K 2017-01-16 23:16:23 -0800
  • 981064acc6 THUNDERX2T99: Add Optimized DAXPY Implementation Ashwin Sekhar T K 2017-01-17 00:28:54 -0800
  • ab2033f2db Merge pull request #1068 from sva-img/develop Zhang Xianyi 2017-01-17 22:02:21 +0800
  • a4d97d980f Added rot functions. Shivraj Patil 2017-01-17 12:15:07 +0530
  • f279ff4789 THUNDERX2T99: Add Optimized SGEMM Implementation Ashwin Sekhar T K 2017-01-11 15:07:11 +0530
  • 759f37feba ARM64: Let target VULCAN inherit THUNDERX2T99 properties Ashwin Sekhar T K 2017-01-11 13:17:10 +0530
  • e8d0e66982 Merge pull request #1067 from martin-frbg/msysinst Martin Kroeker 2017-01-16 16:03:53 +0100
  • 331fd51260 Fix DESTDIR support for cygwin/msys2 install Martin Kroeker 2017-01-16 15:15:46 +0100
  • 0863a0d4b4 Merge pull request #1061 from ashwinyes/develop_aarch64_vulcan_thunderx_patch Zhang Xianyi 2017-01-16 13:20:10 +0800
  • fb31c8154d Replace sched_yield on Linux with nop instruction Martin Kroeker 2017-01-11 23:36:04 +0100
  • 2e5f906f41 Update Makefile.install (#1064) Martin Kroeker 2017-01-11 17:40:06 +0100
  • 6881d9d1bc Update Makefile.install Martin Kroeker 2017-01-11 16:55:39 +0100
  • 513b4cb364 Update Makefile.install Martin Kroeker 2017-01-11 16:06:40 +0100
  • d1a97bad39 Merge pull request #1063 from wernsaar/develop Werner Saar 2017-01-11 12:37:45 +0100
  • 28e2fab33e prepared kernel/setparam-ref.c for UNROLL values, that are not a power of two Werner Saar 2017-01-11 11:56:50 +0100
  • 752fdc6f82 Merge pull request #1062 from wernsaar/develop Werner Saar 2017-01-11 10:30:46 +0100
  • c1c5a63d3c prepared parameter.c for UNROLL values, that are not a power of two Werner Saar 2017-01-11 09:50:28 +0100
  • 209b63197e prepared lapack/lauum for UNROLL values, that are not a power of two Werner Saar 2017-01-11 07:29:17 +0100