Martin Kroeker
|
f1a4071d8c
|
Clean up STACKSIZE redefinition
|
2020-10-18 19:41:43 +02:00 |
Martin Kroeker
|
97cf10062f
|
Clean up STACKSIZE redefinition
|
2020-10-18 19:39:18 +02:00 |
Martin Kroeker
|
17e288e18d
|
Clean up STACKSIZE redefinition
|
2020-10-18 19:37:04 +02:00 |
Martin Kroeker
|
c1422f3e46
|
Clean up STACKSIZE redefinition
|
2020-10-18 19:31:01 +02:00 |
Martin Kroeker
|
d85b24e103
|
Clean up STACKSIZE redefinition
|
2020-10-18 19:29:45 +02:00 |
Martin Kroeker
|
7d6c85f9da
|
Add compiler option -mmma for POWER10
|
2020-10-18 19:27:51 +02:00 |
Martin Kroeker
|
2e7ee7c716
|
Fix naming of L2 cache size item reported for Vortex
|
2020-10-18 19:22:05 +02:00 |
Martin Kroeker
|
efd47b0104
|
Merge pull request #2909 from isuruf/patch-1
Need a space when redirecting to file
|
2020-10-18 19:16:08 +02:00 |
Martin Kroeker
|
f5902ab0a1
|
Support cross-compiling for Apple Vortex
|
2020-10-18 19:10:58 +02:00 |
Martin Kroeker
|
1a0c185122
|
Support cross-compiling for Apple Vortex
|
2020-10-18 18:54:54 +02:00 |
Martin Kroeker
|
89eea6b455
|
Merge pull request #102 from xianyi/develop
rebase
|
2020-10-18 18:49:59 +02:00 |
Isuru Fernando
|
a5c667b55c
|
Need a space when redirecting to file
The following two commands have completely different meanings:
perl ./gensymbol objcopy x86_64 _ 0 0 0 0 0 0 "" "64_" 1 0 1 1 1 1 > objcopy.def
perl ./gensymbol objcopy x86_64 _ 0 0 0 0 0 0 "" "64_" 1 0 1 1 1 1> objcopy.def
|
2020-10-18 09:40:31 -05:00 |
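The commit message above turns on a shell parsing rule: a digit written immediately before `>` is read as a file descriptor number, not as an argument. A minimal sketch of the effect, assuming a POSIX-compatible shell; `count_args` is a hypothetical stand-in for `perl ./gensymbol` and the file names are illustrative only:

```shell
# Hypothetical helper: reports how many arguments it received and what they were.
count_args() { echo "$# args: $*"; }

tmpdir=$(mktemp -d)

# With a space, the trailing "1" is an ordinary argument and stdout is redirected.
count_args 1 0 1 1 1 1 > "$tmpdir/with_space.txt"

# Without a space, "1>" means "redirect file descriptor 1", so the last "1"
# is consumed by the redirection and never reaches the command.
count_args 1 0 1 1 1 1> "$tmpdir/no_space.txt"

with_space=$(cat "$tmpdir/with_space.txt")
no_space=$(cat "$tmpdir/no_space.txt")
echo "$with_space"
echo "$no_space"

rm -rf "$tmpdir"
```

In both cases output ends up in the file, which is why the missing argument is easy to overlook.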
Martin Kroeker
|
0ac6102708
|
Update version string to 0.3.11.dev
|
2020-10-17 22:40:47 +02:00 |
Martin Kroeker
|
26a701f4ad
|
Update version string to 0.3.11.dev
|
2020-10-17 22:40:06 +02:00 |
Martin Kroeker
|
fcd0fa1a3a
|
Merge pull request #2908 from xianyi/release-0.3.0
Synchronize tag with release 0.3.11
|
2020-10-17 22:38:58 +02:00 |
Martin Kroeker
|
51c22612eb
|
Merge pull request #2907 from xianyi/develop
Update from develop for 0.3.11
|
2020-10-17 22:14:12 +02:00 |
Martin Kroeker
|
b8f689200e
|
Update version number to 0.3.11
|
2020-10-17 22:11:34 +02:00 |
Martin Kroeker
|
fe9015b619
|
Update version for 0.3.11 release
|
2020-10-17 22:10:50 +02:00 |
Martin Kroeker
|
f99b8c1502
|
Merge pull request #2906 from martin-frbg/changelog-0311
Update Changelog.txt with the 0.3.11 changes
|
2020-10-17 22:07:14 +02:00 |
Martin Kroeker
|
5381a18056
|
Update Changelog.txt with the 0.3.11 changes
|
2020-10-17 22:05:36 +02:00 |
Martin Kroeker
|
e35576c6fc
|
Merge pull request #2905 from martin-frbg/aocc-clang
Add -mavx for clang & aocc
|
2020-10-17 09:45:22 +02:00 |
Martin Kroeker
|
f1bb85d378
|
Add AVX flags for clang/aocc as well
|
2020-10-16 20:52:15 +02:00 |
Martin Kroeker
|
25907e672b
|
Merge pull request #101 from xianyi/develop
rebase
|
2020-10-16 20:48:58 +02:00 |
Zhang Xianyi
|
d7ba7679b6
|
Merge branch 'develop' into risc-v
|
2020-10-16 23:27:38 +08:00 |
Martin Kroeker
|
9789375389
|
Merge pull request #2900 from martin-frbg/fixcmake_sse
Add compiler options for SSE to the cmake support files
|
2020-10-16 16:17:36 +02:00 |
Martin Kroeker
|
f64243ff57
|
Add compiler options for sse/sse2/ssse3/sse4.1
|
2020-10-16 10:47:06 +02:00 |
Martin Kroeker
|
786c0a3ce8
|
Add sse options for use of intrinsics with older compilers
|
2020-10-16 10:41:53 +02:00 |
Martin Kroeker
|
df70667043
|
fix core list for sse/sse2
|
2020-10-16 09:55:48 +02:00 |
Martin Kroeker
|
e6c5b13a18
|
Merge pull request #2898 from martin-frbg/morefixes
More pre-release fixes
|
2020-10-16 07:26:39 +02:00 |
Martin Kroeker
|
f071d1207a
|
add sse2
|
2020-10-15 22:10:32 +02:00 |
Martin Kroeker
|
dc6cefd2f5
|
Expressly enable -msse for 32bit DYNAMIC_ARCH kernels
|
2020-10-15 20:16:15 +02:00 |
Martin Kroeker
|
c339c40c01
|
Silence a redefinition warning
|
2020-10-15 19:08:12 +02:00 |
Martin Kroeker
|
ac8af9cec6
|
Add -msse where supported, apparently required for older gcc
|
2020-10-15 19:06:45 +02:00 |
Martin Kroeker
|
10379fc83b
|
Use ifdef instead of if
|
2020-10-15 19:05:37 +02:00 |
Martin Kroeker
|
a85ac71633
|
Merge pull request #100 from xianyi/develop
rebase
|
2020-10-15 18:54:20 +02:00 |
Martin Kroeker
|
4c25910da0
|
Merge pull request #2896 from martin-frbg/intrin-double
Add compiler flag for SSE4 where available
|
2020-10-15 11:12:35 +02:00 |
damonyu
|
ef8e7d0279
|
Add the support for RISC-V Vector.
Change-Id: Iae7800a32f5af3903c330882cdf6f292d885f266
|
2020-10-15 16:09:02 +08:00 |
Martin Kroeker
|
9b9ee92d5f
|
Merge pull request #2897 from Qiyu8/usimd-double
Add double precision universal intrinsics for X86/ARM
|
2020-10-15 08:38:24 +02:00 |
Martin Kroeker
|
ae6ac83991
|
Revert "add double precision SSE"
|
2020-10-15 08:37:02 +02:00 |
Qiyu8
|
4fac91ef37
|
adapt arm platform
|
2020-10-15 11:08:10 +08:00 |
Qiyu8
|
bfdf4b56da
|
Add double precision universal intrinsics for X86/ARM
|
2020-10-15 10:29:42 +08:00 |
Martin Kroeker
|
ebf0470fc2
|
add sse4.1 for DYNAMIC_ARCH kernels
|
2020-10-14 20:34:33 +02:00 |
Martin Kroeker
|
ca160bb440
|
Add -msse4.1 when SSE4.1 is supported
|
2020-10-14 19:18:07 +02:00 |
Martin Kroeker
|
c9c3ae07af
|
Add double precision operations
|
2020-10-14 18:10:45 +02:00 |
Martin Kroeker
|
a897bc3bd2
|
Merge pull request #99 from xianyi/develop
rebase
|
2020-10-14 18:09:20 +02:00 |
Martin Kroeker
|
756802df61
|
Merge pull request #2890 from martin-frbg/s-d-sum
Revert special handling of Windows xNRM2 and enable C+intrinsics kern…
|
2020-10-14 09:02:03 +02:00 |
Martin Kroeker
|
01492decf4
|
Merge pull request #2895 from martin-frbg/sb-tests
Fix remaining build errors related to bfloat16 and cmake
|
2020-10-14 09:01:16 +02:00 |
Martin Kroeker
|
bd0752444a
|
Merge pull request #2894 from RajalakshmiSR/bf16_packing
POWER10: Change the packing format for bfloat16
|
2020-10-14 08:12:08 +02:00 |
Martin Kroeker
|
c1f4f5d4e7
|
Replace Makefile with simplified version again
|
2020-10-14 01:08:50 +02:00 |
Martin Kroeker
|
75e3a92df6
|
Add express -mavx and -msse options (and fix a stray = for cooperlake)
|
2020-10-14 01:01:58 +02:00 |