Commit Graph

  • 010d59bfee Merge pull request #1973 from martin-frbg/issue1464 Martin Kroeker 2019-01-20 20:30:11 +01:00
  • 83b5c6b92d Fix compilation with NO_AVX=1 set Martin Kroeker 2019-01-20 12:18:53 +01:00
  • bbfdd6c0fe Increase Zen SWITCH_RATIO to 16 Martin Kroeker 2019-01-19 23:01:31 +01:00
  • cda81cfae0 Shift transition to multithreading towards larger matrix sizes Martin Kroeker 2019-01-19 00:10:01 +01:00
  • 32b0f1168e Fix declaration of input arguments in the Sandybridge GER microkernels (#1967) Martin Kroeker 2019-01-18 08:11:39 +01:00
  • b495e54310 Fix declaration of input arguments in the x86_64 SCAL microkernels (#1966) Martin Kroeker 2019-01-18 08:11:07 +01:00
  • d5e6940253 Fix declaration of input arguments in the x86_64 microkernels for DOT and AXPY (#1965) Martin Kroeker 2019-01-17 23:20:32 +01:00
  • 24e697eadb Merge pull request #1970 from quickwritereader/develop Martin Kroeker 2019-01-17 16:42:11 +01:00
  • 3e9fd6359d Bump xcode version to 10.1 to make sure it handles AVX512 Martin Kroeker 2019-01-17 16:19:03 +01:00
  • 43a4572038 crot fix Ubuntu 2019-01-17 14:45:31 +00:00
  • 256eb588bb Merge pull request #1963 from quickwritereader/develop Martin Kroeker 2019-01-16 18:41:03 +01:00
  • a034e65512 Merge branch 'develop' into develop Abdelrauf 2019-01-16 19:25:13 +04:00
  • 8c3386be87 Added missing Blas1 single fp {saxpy, caxpy, cdot, crot (refactored version of srot), isamax, isamin, icamax, icamin}; fixed idamin, icamin choosing the first occurrence index of equal minimals Ubuntu 2019-01-16 15:16:21 +00:00
  • 3e601bd419 disable NaN checks before BLAS calls dgemm.R Andrew 2019-01-16 11:54:22 +02:00
  • 478d3c4569 disable NaN checks before BLAS calls deig.R (shorten matrix def) Andrew 2019-01-16 11:41:46 +02:00
  • 3afceb6c2a disable NaN checks before BLAS calls deig.R Andrew 2019-01-16 11:38:14 +02:00
  • 7af8b21dbb disable NaN checks before BLAS calls dsolve.R (shorter formula) Andrew 2019-01-16 11:34:46 +02:00
  • 1e3ada6db4 Merge pull request #1960 from cnjsdfcy/Hygon Martin Kroeker 2019-01-16 10:27:14 +01:00
  • 2777a7f506 disable NaN checks before BLAS calls dsolve.R (shorter config part) Andrew 2019-01-16 11:23:51 +02:00
  • b70fd23836 disable NaN checks before BLAS calls dsolve.R Andrew 2019-01-16 11:18:54 +02:00
  • def0385caa init Andrew 2019-01-16 09:51:29 +02:00
  • 29dc72889f Add support for Hygon Dhyana caiyu 2019-01-16 14:25:19 +08:00
  • b815a04c87 [ZARCH] fix a bug in max/min functions maamountki 2019-01-15 21:04:22 +02:00
  • dbc9a060ef Fix missing braces in support_av() call Martin Kroeker 2019-01-14 22:41:31 +01:00
  • 00401489c2 Fix missing braces in support_avx() Martin Kroeker 2019-01-14 22:38:32 +01:00
  • 1a7925b3a3 [ZARCH] Update dgemv_n_4.c maamountki 2019-01-11 17:43:11 +02:00
  • 406f835f00 [ZARCH] update cgemv_n_4.c maamountki 2019-01-11 17:39:17 +02:00
  • 621dedb37b [ZARCH] Update cgemv_t_4.c maamountki 2019-01-11 17:37:11 +02:00
  • b731e8246f Update sgemv_t_4.c maamountki 2019-01-11 17:14:04 +02:00
  • ecc31b743f Update dgemv_t_4.c maamountki 2019-01-11 17:13:02 +02:00
  • 5d89d6b143 [ZARCH] fix sgemv_n_4.c maamountki 2019-01-11 17:08:24 +02:00
  • 67432b23c2 [ZARCH] fix cgemv_n_4.c maamountki 2019-01-11 16:44:46 +02:00
  • 21c0f2af7b Merge pull request #1957 from martin-frbg/issue1954 Martin Kroeker 2019-01-10 12:04:08 +01:00
  • ad2c386d6a Move TLS key deletion to openblas_quit Martin Kroeker 2019-01-10 00:32:50 +01:00
  • be66f5d5c2 [ZARCH] fix data prefetch type in sdot maamountki 2019-01-09 16:50:07 +02:00
  • c2ffef8156 [ZARCH] fix data prefetch type in ddot maamountki 2019-01-09 16:49:44 +02:00
  • e7455f500c [ZARCH] fix dsdot.c maamountki 2019-01-09 16:33:54 +02:00
  • 3eafcfa650 [ZARCH] fix cgemv_n_4.c maamountki 2019-01-09 07:43:45 +02:00
  • 8d99dba86b Merge pull request #1949 from martin-frbg/issue1947 Martin Kroeker 2019-01-08 20:44:08 +01:00
  • 1650311246 Bump xcode to 8.3 Martin Kroeker 2019-01-08 14:43:45 +01:00
  • cf5d48e833 Update OSX environment to Sierra Martin Kroeker 2019-01-08 14:41:48 +01:00
  • 191677b902 Add travis_wait to the OSX brew install phase Martin Kroeker 2019-01-08 10:46:47 +01:00
  • 31ed19e8b9 Add message for SkylakeX and KNL fallbacks to Haswell Martin Kroeker 2019-01-05 19:41:13 +01:00
  • e1574fa2b4 Add xcr0 (os support) check Martin Kroeker 2019-01-05 18:08:02 +01:00
  • 68eb3146ce Add xcr0 (os support) check Martin Kroeker 2019-01-05 18:07:14 +01:00
  • 0afaae4b23 Query AVX2 and AVX512VL capability in x86 cpu detection Martin Kroeker 2019-01-05 16:58:56 +01:00
  • ae1d1f74f7 Query AVX2 and AVX512 capability for runtime cpu selection Martin Kroeker 2019-01-05 16:55:33 +01:00
  • 94cd946b96 [ZARCH] fix cgemv_n_4.c maamountki 2019-01-04 17:45:56 +02:00
  • ed01f4932a Merge pull request #1946 from martin-frbg/issue1908 Martin Kroeker 2019-01-04 01:37:37 +01:00
  • 1aa840a0a2 [ZARCH] fix sgemv_t_4.c maamountki 2019-01-04 01:38:18 +02:00
  • 802f0dbde1 More fixes for cross-compiling ARM64 targets Martin Kroeker 2019-01-03 22:17:31 +01:00
  • 20d1aad13f Fix missing quotes around thunderx targets Martin Kroeker 2019-01-02 20:15:35 +01:00
  • d11554c88f Validate user supplied TARGET (#1941) TiborGY 2018-12-31 23:19:44 +01:00
  • ed704185ab Increment version to 0.3.6.dev Martin Kroeker 2018-12-31 23:11:37 +01:00
  • 2940798ea7 Increment version to 0.3.6.dev Martin Kroeker 2018-12-31 23:10:59 +01:00
  • eebc189287 Version 0.3.5 (tag: v0.3.5) Martin Kroeker 2018-12-31 23:09:59 +01:00
  • 9185d419d3 Version 0.3.5 Martin Kroeker 2018-12-31 23:09:20 +01:00
  • 4cf9d32694 Merge pull request #1945 from xianyi/develop Martin Kroeker 2018-12-31 23:08:25 +01:00
  • 1c75b65d53 Merge branch 'release-0.3.0' into develop Martin Kroeker 2018-12-31 23:07:53 +01:00
  • 13d006339b Update ChangeLog.txt with changes from 0.3.5 Martin Kroeker 2018-12-31 23:00:46 +01:00
  • bf76162635 Merge pull request #1944 from hartzell/patch-1 Martin Kroeker 2018-12-31 18:36:18 +01:00
  • 0d52aefc6b Typo: Skyalke -> Skylake George Hartzell 2018-12-30 14:55:34 -08:00
  • a6787b0f81 Merge pull request #1939 from TiborGY/patch-2 Martin Kroeker 2018-12-30 20:10:05 +01:00
  • 8643521127 Merge pull request #1943 from martin-frbg/issue1748 Martin Kroeker 2018-12-30 20:07:01 +01:00
  • 5a720cf9ca Re-enable loop unrolling in trmv and remove the scary warning Martin Kroeker 2018-12-30 15:22:37 +01:00
  • ccd5945d38 Merge pull request #1942 from martin-frbg/issue1720 Martin Kroeker 2018-12-30 14:47:05 +01:00
  • 9f80e0f5fc Remove stray include of complex.h Martin Kroeker 2018-12-30 14:39:18 +01:00
  • bba1e67269 Delete the pthread key on cleanup in TLS mode Martin Kroeker 2018-12-29 21:59:31 +01:00
  • 93240f489e Fix wrong case in TARGET setting for Alpine Martin Kroeker 2018-12-29 18:12:54 +01:00
  • 7cbc2c37d6 Update cpuid_mips64.c TiborGY 2018-12-28 14:36:39 +01:00
  • c329de2931 Update Makefile TiborGY 2018-12-28 14:35:41 +01:00
  • 187233953c Update cpuid_mips.c TiborGY 2018-12-28 14:34:38 +01:00
  • 09170268a3 Update cpuid_arm.c TiborGY 2018-12-28 14:33:18 +01:00
  • 211120c508 Fix typo in UNKNOWN core name TiborGY 2018-12-27 23:09:21 +01:00
  • 9e4d190f4f Merge pull request #1932 from martin-frbg/issue1915 Martin Kroeker 2018-12-24 23:48:33 +01:00
  • fe02ba86a4 Remove unnecessary change again Martin Kroeker 2018-12-24 20:46:04 +01:00
  • 284fb00971 Merge pull request #1934 from fenrus75/betagoof Martin Kroeker 2018-12-24 19:53:50 +01:00
  • 795285c587 Fix thinko in skylake beta handling Arjan van de Ven 2018-12-24 18:49:50 +00:00
  • d6818777d1 Make sure that -fPIC is present if needed Martin Kroeker 2018-12-23 23:47:37 +01:00
  • 5bd21ab6e1 Make sure that -fPIC is present when needed Martin Kroeker 2018-12-23 23:46:48 +01:00
  • e1eab96502 Merge pull request #1931 from martin-frbg/pr1921 Martin Kroeker 2018-12-23 23:15:54 +01:00
  • 76b4b8980f Use -dumpversion with gcc only Martin Kroeker 2018-12-23 19:08:19 +01:00
  • 49e0f485da Add -mavx2 for TARGET=HASWELL if compiler supports and requires it Martin Kroeker 2018-12-23 17:26:09 +01:00
  • 43c2b0eb55 Add -mavx2 to TARGET=HASWELL builds Martin Kroeker 2018-12-23 17:16:43 +01:00
  • 942e229ed5 Merge pull request #1930 from martin-frbg/issue1908 Martin Kroeker 2018-12-23 15:06:33 +01:00
  • 26a3402773 Reflect ARMV8 target definition changes from PR1876 Martin Kroeker 2018-12-23 12:26:01 +01:00
  • 20033f992a Merge pull request #1929 from martin-frbg/issue1924 Martin Kroeker 2018-12-23 09:03:58 +01:00
  • f343ed65b5 Avoid taking the root of a negative number Martin Kroeker 2018-12-22 22:30:29 +01:00
  • a5a1118527 Merge pull request #1 from xianyi/develop Martin Kroeker 2018-12-22 22:13:44 +01:00
  • e23366e860 Merge pull request #1921 from fenrus75/haswelldgemm Martin Kroeker 2018-12-17 08:39:20 +01:00
  • b28f75cd7e set GEMM_PREFERED_SIZE for HASWELL Arjan van de Ven 2018-12-16 23:08:31 +00:00
  • d321448a63 dgemm: use dgemm_ncopy_8_skylakex.c also for Haswell Arjan van de Ven 2018-12-16 23:06:58 +00:00
  • c43331ad0a dgemm: Use the skylakex beta function also for haswell Arjan van de Ven 2018-12-16 22:59:02 +00:00
  • e8ca5a59a9 Merge pull request #1919 from fenrus75/haswelltuning Martin Kroeker 2018-12-16 20:11:05 +01:00
  • c4e23dd016 Update Makefile Martin Kroeker 2018-12-16 18:14:40 +01:00
  • cfc4acc221 typo Martin Kroeker 2018-12-16 16:19:51 +01:00
  • 545c2b1bbb Add -mavx2 on Haswell only if the compiler supports it Martin Kroeker 2018-12-16 13:09:19 +01:00
  • 69d206440a Make the skylakex/haswell sgemm code compile and run even with compilers without avx2 support Arjan van de Ven 2018-12-16 00:19:41 +00:00
  • 3843e3e017 use -maxv2 on haswell Martin Kroeker 2018-12-15 23:30:31 +01:00
  • fbcb14a74b should be core-avx2 Martin Kroeker 2018-12-15 20:18:59 +01:00