Martin Kroeker
5dd14e3d48
Make building the bfloat16 functions conditional on option BUILD_HALF ( #2590 )
...
* make building the bfloat16 BLAS functions conditional on BUILD_HALF
* pass the BUILD_HALF option to gensymbol
* Pass BUILD_HALF as a compiler define for dynamic_arch builds
2020-05-01 09:58:30 +02:00
Martin Kroeker
a54e35e780
Merge pull request #2586 from martin-frbg/miscfixes
...
Trivial fix for compiler warnings
2020-04-29 22:01:41 +02:00
Rajalakshmi Srinivasaraghavan
564b0d39ef
Add test for shgemm
...
This patch has Makefile changes to add test for shgemm which
compares sgemm and shgemm result.
2020-04-29 13:40:34 -05:00
Martin Kroeker
5d58b11101
Merge pull request #52 from xianyi/develop
...
rebase
2020-04-29 14:36:15 +02:00
Martin Kroeker
d394d4e677
Merge pull request #2585 from martin-frbg/mips64fix
...
Increase default BUFFER_SIZE on MIPS64
2020-04-28 19:47:55 +02:00
Martin Kroeker
f4248af26e
Fix compiler warnings
2020-04-28 10:43:12 +02:00
Martin Kroeker
2d89603e9d
Increase BUFFER_SIZE on mips64 to match SGEMM parameters
2020-04-28 10:40:40 +02:00
Martin Kroeker
26bc15258a
Merge pull request #51 from xianyi/develop
...
rebase
2020-04-28 10:38:50 +02:00
Martin Kroeker
141998dce2
Merge pull request #2584 from martin-frbg/issue2583
...
[WIP] Have CMAKE parse conditional lines in KERNEL files
2020-04-28 10:35:12 +02:00
Martin Kroeker
3bd56846bb
Silence a debug message
2020-04-27 16:27:09 +02:00
Martin Kroeker
e7bbdfdf84
Have CMAKE parse conditional lines in KERNEL files
...
Supports ifeq and ifneq, but requires both to have an else branch
2020-04-27 15:20:03 +02:00
Martin Kroeker
b6795db731
Merge pull request #2582 from martin-frbg/mips32fix
...
Increase BUFFER_SIZE on MIPS32 to accomodate SGEMM requirements
2020-04-27 09:18:34 +02:00
Martin Kroeker
5e0dbf8dfe
Increase default BUFFER_SIZE to accomodate SGEMM parameters
...
in response to compile-time warning from #2551
2020-04-26 22:21:05 +02:00
Martin Kroeker
955d73127f
Merge pull request #50 from xianyi/develop
...
rebase
2020-04-26 22:17:56 +02:00
Martin Kroeker
a8c1bea7ae
Merge pull request #2581 from martin-frbg/raji
...
Fix travis configuration and update CONTRIBUTORS.md
2020-04-25 19:57:10 +02:00
Martin Kroeker
e43b49e064
Drop the set -e from travis scripts
2020-04-25 16:18:54 +02:00
Martin Kroeker
3e28db7f38
Update CONTRIBUTORS.md
2020-04-25 13:51:44 +02:00
Martin Kroeker
4b69ee31af
Merge pull request #2580 from martin-frbg/issue2538-3
...
Increase POWER8 ZGEMM_R and use same R values for POWER9
2020-04-25 00:28:18 +02:00
Martin Kroeker
03ff213c51
Increase POWER8 ZGEMM_R and use same R values for POWER9
...
fixes lapack-test zger failures seen in #2299 after application of my PR #2551
2020-04-24 21:46:54 +02:00
Martin Kroeker
299d1c8de0
Merge pull request #2578 from martin-frbg/issue2576
...
Quote getarch include paths in prebuild.cmake
2020-04-24 14:32:46 +02:00
Martin Kroeker
70869d571f
Quote include paths for getarch to protect any embedded spaces
2020-04-24 10:30:44 +02:00
Martin Kroeker
cba87222b2
Merge pull request #49 from xianyi/develop
...
rebase
2020-04-24 10:21:48 +02:00
Martin Kroeker
f80dd2151e
xcode 11.4.1 for homebrew ?
2020-04-23 14:31:09 +02:00
Martin Kroeker
4412ee1754
Switch homebrew build env to new xcode 11.4
...
default 11.3.1 in the github image is causing brew to fail with "outdated xcode" message
2020-04-23 10:54:46 +02:00
Martin Kroeker
f6104b68c1
Merge pull request #2571 from martin-frbg/issue2299
...
Work around IDAMAX/IZAMAX bugs on POWER8BE with ELFv2 FreeBSD
2020-04-22 18:27:13 +02:00
Martin Kroeker
84f2c71e93
Merge pull request #2573 from martin-frbg/issue2572
...
Enable cblas interfaces to GEMM3M in CMAKE builds
2020-04-22 15:04:49 +02:00
Martin Kroeker
06208c8d01
Limit this fix to ELFv2 builds
2020-04-22 14:16:40 +02:00
Martin Kroeker
c90b28dee6
Export ELF_VERSION for use in powerpc kernel configurations
2020-04-22 14:14:20 +02:00
Martin Kroeker
6275b43918
Avoid duplicate printout of byte order and report ELF_VERSION
2020-04-22 14:12:27 +02:00
Martin Kroeker
2db5178e2d
enable cblas interfaces to GEMM3M in CMAKE builds
2020-04-22 11:01:28 +02:00
Martin Kroeker
57549f5c92
Merge pull request #2569 from martin-frbg/issue2472-2
...
Fix linker option passing for MSVS and ReLAPACK
2020-04-21 20:26:53 +02:00
Martin Kroeker
f5c4c28b98
Work around POWER8BE bugs on FreeBSD (ELFv2)
...
for #2299
2020-04-21 17:17:17 +02:00
Martin Kroeker
239282d5e2
Use CMAKE_SHARED_LINKER_FLAGS to pass MSVC linker option
...
target_link_libraries does not work here according to issue 2472
2020-04-20 22:30:51 +02:00
Martin Kroeker
568674477c
Merge pull request #48 from xianyi/develop
...
rebase
2020-04-20 21:51:59 +02:00
Martin Kroeker
fa42588e1f
Merge pull request #2565 from martin-frbg/mips24k
...
Support MIPS32 24K family as P5600
2020-04-20 17:13:53 +02:00
Martin Kroeker
8a6d26458b
Merge pull request #2559 from RajalakshmiSR/shgemm
...
Add half precision gemm for bfloat16 in OpenBLAS
2020-04-19 22:09:55 +02:00
Martin Kroeker
db86f516b9
Merge pull request #2568 from martin-frbg/azure-win
...
Add a Windows/CL build job to the Azure CI
2020-04-19 19:06:33 +02:00
Martin Kroeker
aec353b5a7
Add a Windows/CL build to the Azure Ci configuration
2020-04-19 19:04:33 +02:00
Martin Kroeker
c62fbefad4
Merge pull request #2567 from xianyi/revert-2566-azurewin
...
Revert "Add Windows build job on Azure CI"
2020-04-19 19:01:58 +02:00
Martin Kroeker
04706e760d
Revert "Add Windows build job on Azure CI ( #2566 )"
...
This reverts commit e1e543b145
.
2020-04-19 19:00:37 +02:00
Martin Kroeker
e1e543b145
Add Windows build job on Azure CI ( #2566 )
...
* Add Windows-CL build job on Azure
2020-04-19 16:16:15 +02:00
Martin Kroeker
e55ec82bb9
Delete KERNEL.1004K
2020-04-19 15:44:30 +02:00
Martin Kroeker
7353ea5afc
Delete KERNEL.24K
2020-04-19 15:44:19 +02:00
Martin Kroeker
6a04efb122
Rename KERNEL files to include MIPS prefix
2020-04-19 15:43:54 +02:00
Martin Kroeker
5afb66812f
Update getarch.c
2020-04-19 14:55:31 +02:00
Martin Kroeker
0d18f231fc
Update getarch.c
2020-04-19 13:52:58 +02:00
Martin Kroeker
2f4a8e5bc4
Rename the FORCE entries for 24K and 1004K to include the MIPS prefix
2020-04-19 13:22:19 +02:00
Martin Kroeker
4f70512b97
Update kernel.cmake
2020-04-19 08:10:26 +02:00
Martin Kroeker
8792fc4d5f
Disable RPCC macro on MIPS24K
2020-04-19 07:21:48 +02:00
Martin Kroeker
577c5d9f8f
Update README.md
2020-04-19 06:54:52 +02:00