wjc404
f6fcbd7906
Fix performance bug when LDC is a multiple of 1024
2020-02-22 23:37:45 +08:00
Martin Kroeker
1e8410f18c
Merge pull request #2441 from martin-frbg/ismin2
...
Add proper defaults for the IxMIN/IxMAX kernels on mips64 and power
2020-02-22 11:21:03 +01:00
Martin Kroeker
07454bf4d5
Add proper defaults for IxMIN/IxMAX kernels
...
the fallbacks from Makefile.L1 assume a combined source for absolute value and non-absolute (with ifdef USE_ABS) but here we have separate implementations
2020-02-21 11:58:15 +01:00
Martin Kroeker
4046985913
Add proper defaults for IxMIN/IxMAX kernels
...
the fallbacks from Makefile.L1 assume a combined source for absolute value and non-absolute (with ifdef USE_ABS) but here we have separate implementations
2020-02-21 11:55:52 +01:00
Martin Kroeker
75577f95a7
Merge pull request #33 from xianyi/develop
...
rebase
2020-02-21 09:56:05 +01:00
Martin Kroeker
33d92c7a37
Merge pull request #2435 from martin-frbg/issue2433
...
Fix handling of ppc endianness
2020-02-21 00:01:58 +01:00
Martin Kroeker
e57b11acca
Add preliminary support for EMAG8180
2020-02-19 19:00:28 +01:00
Martin Kroeker
71e5669c3e
Add preliminary support for EMAG8180 ARMV8 processor
2020-02-19 18:57:26 +01:00
Martin Kroeker
e8d82c01d4
Recognize Ampere EMAG8180
2020-02-19 18:49:13 +01:00
Martin Kroeker
0b39cf95b0
Fix endianness conditionals
2020-02-19 18:09:54 +01:00
Martin Kroeker
76b2cec6ce
Get endianness into Makefile variable
2020-02-19 18:08:20 +01:00
Martin Kroeker
276c1791ea
Merge pull request #32 from xianyi/develop
...
rebase
2020-02-19 18:06:39 +01:00
Martin Kroeker
c5bbfd8fee
Merge pull request #2432 from isuruf/install_name
...
Fix install name on osx again
2020-02-19 08:14:28 +01:00
Isuru Fernando
130c1741e5
Fix install name on osx again
2020-02-18 10:22:49 -08:00
Martin Kroeker
8f782f0673
Merge pull request #2426 from zbeekman/nightly-homebrew-check
...
Nightly homebrew check
2020-02-18 12:09:15 +01:00
Martin Kroeker
6a517dcb6a
Merge pull request #2427 from martin-frbg/powermin
...
Fix ISMIN and ISMAX kernel choices for POWER8
2020-02-18 08:15:02 +01:00
Martin Kroeker
9f39f0a2c3
Specify ismin/ismax assembly kernels for POWER8 directly
...
to fix utest failure in new ismin test - Makefile.L1 defaults look wrong
2020-02-17 19:55:39 +01:00
Izaak Beekman
1a88c4ab26
Fix bottle upload problem & typo
2020-02-17 13:36:17 -05:00
Izaak Beekman
0b44802164
Test push & PRs only when workflow file changes
...
Also, add comments to clarify what the test is testing
2020-02-17 13:22:09 -05:00
Izaak Beekman
2c242b4cef
Add Github Action to build development branch nightly with Homebrew
2020-02-17 12:36:37 -05:00
Martin Kroeker
0bfb7336d2
Merge pull request #2424 from isuruf/osx
...
Fix building on osx
2020-02-17 17:00:08 +01:00
Martin Kroeker
403cde104e
Merge pull request #30 from xianyi/develop
...
rebase
2020-02-17 14:53:46 +01:00
Martin Kroeker
634f2bddda
Merge pull request #2414 from marxin/fix-iamax_sse-implementation
...
Fix iamax sse implementation and add utests
2020-02-17 14:50:18 +01:00
Martin Liska
aeea14ee40
Come up with LOAD_AND_COMPARE_TO_MXX macro in iamax_sse.S.
2020-02-17 09:01:53 +01:00
Martin Liska
18bcc36a69
Fix implementation of iamax_sse.S as reported in #2116 .
...
The was a typo in iamax_sse.S where one of the comparison
was cmpeqps instead of cmpeqss. That misdetected index
for sequences where the minimum value was 0.
2020-02-17 09:01:53 +01:00
Martin Liska
0e7f43c898
Add missing USE_MIN in kernel/CMakeLists.txt.
2020-02-17 09:01:53 +01:00
Martin Kroeker
79e201fbba
Merge pull request #2423 from xianyi/issue2419
...
Restore -march flag for Android builds
2020-02-17 07:24:02 +01:00
Isuru Fernando
4326dcb460
Pass CFLAGS from env to Makefile.prebuild and remove iOS hack
2020-02-16 15:13:01 -06:00
Martin Kroeker
e32f3b1447
Restore -march flag for Android builds
...
fixes #2419 - renewed discussion in #2112 suggests removal of the option was primarily aimed at non-Android builds
2020-02-16 17:32:13 +01:00
Martin Kroeker
d483e9270a
Update KERNEL.POWER8
2020-02-16 17:29:35 +01:00
Martin Kroeker
01834aee33
Merge pull request #29 from xianyi/develop
...
rebase
2020-02-16 17:28:10 +01:00
wjc404
b0558c11b9
Update param.h
2020-02-16 23:01:31 +08:00
wjc404
f566787e6e
Update KERNEL.SKYLAKEX
2020-02-16 22:58:44 +08:00
wjc404
e3368cbf18
AVX512 STRMM kernel
2020-02-16 22:58:00 +08:00
Martin Kroeker
d92bd5be24
Update KERNEL.POWER8
2020-02-15 23:07:50 +01:00
Martin Kroeker
46e4b12946
Update KERNEL.POWER8
2020-02-15 23:06:51 +01:00
Martin Kroeker
5e94aa4877
Merge pull request #2417 from marxin/make-ctest-verbose-for-drone
...
Make ctest verbose for drone
2020-02-15 21:57:41 +01:00
Martin Kroeker
93f3e27574
Merge pull request #2415 from marxin/add-cmake-to-gitignore
...
Add CMake related files to .gitignore.
2020-02-15 21:57:03 +01:00
Martin Kroeker
785c389b0e
Merge pull request #2420 from martin-frbg/issue2396
...
Correct generation of GETRF files by the CMAKE build
2020-02-15 21:56:16 +01:00
Martin Kroeker
c222b25b81
Correct generation of GETRF files by the CMAKE build
...
fixes #2396
2020-02-15 19:29:14 +01:00
Martin Kroeker
221da8bf05
Merge pull request #2411 from martin-frbg/fix2254-038
...
Fix pre-processed POWER8 codes and wrong conditionals in the POWER8,PPC440 and PPC970 KERNEL files
2020-02-14 23:07:43 +01:00
Martin Liska
eb285b4d20
Make ctest verbose for drone builder.
2020-02-14 10:46:37 +01:00
Martin Kroeker
cafdd999b8
Update caxpy_power8.S
2020-02-13 22:44:09 +01:00
Martin Kroeker
92ca92a46c
Update caxpy_power8.S
2020-02-13 21:24:54 +01:00
Martin Kroeker
486c35c5dc
Update icamin_power8.S
2020-02-13 18:38:43 +01:00
Martin Liska
0e05ea9bac
Add CMake related files to .gitignore.
2020-02-13 14:51:55 +01:00
Martin Kroeker
5ba3699f41
Update isamin_power8.S
2020-02-13 00:00:32 +01:00
Martin Kroeker
8eefa530cd
Update isamax_power8.S
2020-02-12 23:59:50 +01:00
Martin Kroeker
de40d47edf
Update isamin_power8.S
2020-02-12 23:57:48 +01:00
Martin Kroeker
7c162b8a21
Update isamax_power8.S
2020-02-12 23:56:57 +01:00