Werner Saar
|
c81dc6322f
|
prepared lapack/potrf functions for UNROLL values, that are not a power of two
|
2017-01-10 10:50:28 +01:00 |
Andrew Pinski
|
8fdb0655e9
|
THUNDERX: Add an optimized version of ddot
|
2017-01-10 15:01:37 +05:30 |
Andrew Pinski
|
fb200c7245
|
ARM64: Add Cavium THUNDERX Target
|
2017-01-10 15:01:37 +05:30 |
Ashwin Sekhar T K
|
0b8e876d89
|
VULCAN: Add optimized DGEMM implementation
|
2017-01-10 15:01:37 +05:30 |
Ashwin Sekhar T K
|
4713e7c47f
|
ARM64: Add the VULCAN Target
|
2017-01-10 15:01:17 +05:30 |
Ashwin Sekhar T K
|
6085386b10
|
CORTEXA57: Add assembly kernels for copy routines
|
2017-01-10 15:01:05 +05:30 |
Zhang Xianyi
|
002b41f024
|
Merge pull request #1055 from ksraste/develop
Add msa optimization for AXPY, COPY, SCALE, SWAP
|
2017-01-10 13:58:26 +08:00 |
jiahaipeng
|
84b8170bfb
|
Adding multi-threading for copy, dot, rot, and asum functions
|
2017-01-10 11:48:58 +08:00 |
jiahaipeng
|
1aa1e6cb54
|
modify blas_l1_thread.c to support multi-threading for L1 functions with a return value
|
2017-01-10 11:47:06 +08:00 |
Martin Kroeker
|
cbd2bf1f6e
|
Merge pull request #1057 from martin-frbg/lapacke-mingw
Split the obj list of LAPACKE 3.7.0
|
2017-01-09 20:45:26 +01:00 |
Martin Kroeker
|
9f5cfd43dc
|
Split the obj list of LAPACKE 3.7.0
Split obj list to allow building with mingw (argument list too long for the msys ar)
|
2017-01-09 18:29:53 +01:00 |
kaustubh
|
1480f3df71
|
Add msa optimization for AXPY, COPY, SCALE, SWAP
Signed-off-by: kaustubh <kaustubh.raste@imgtec.com>
|
2017-01-09 18:27:23 +05:30 |
kaustubh
|
88afb3bc94
|
Add msa optimization for AXPY, COPY, SCALE, SWAP
Signed-off-by: kaustubh <kaustubh.raste@imgtec.com>
|
2017-01-09 18:22:09 +05:30 |
Werner Saar
|
2ffbbb54f6
|
Merge pull request #1054 from wernsaar/develop
prepared lapack/getrf functions for UNROLL values, that are not a pow…
|
2017-01-09 13:38:56 +01:00 |
Werner Saar
|
3e1bbd6b5f
|
prepared lapack/getrf functions for UNROLL values, that are not a power of two
|
2017-01-09 12:57:26 +01:00 |
Zhang Xianyi
|
b678471d65
|
Merge branch 'z13' into develop
Conflicts:
CONTRIBUTORS.md
|
2017-01-09 05:52:42 -05:00 |
Zhang Xianyi
|
864e202afd
|
Add USE_TRMM=1 for IBM z13 in kernel/Makefile.L3
|
2017-01-09 05:48:09 -05:00 |
Werner Saar
|
b9bb009236
|
Merge pull request #1053 from wernsaar/develop
prepared driver/level3 functions for UNROLL values, that are not a po…
|
2017-01-09 11:17:38 +01:00 |
Werner Saar
|
a2672d5589
|
prepared driver/level3 functions for UNROLL values, that are not a power of two
|
2017-01-09 10:38:15 +01:00 |
Zhang Xianyi
|
c2496d8f48
|
Merge pull request #1050 from martin-frbg/fflags
Apply COMMON_OPT to default FFLAGS
|
2017-01-09 16:23:22 +08:00 |
Zhang Xianyi
|
fb0afdaf99
|
Merge pull request #1052 from martin-frbg/locking
Fix thread data races detected by helgrind 3.12
|
2017-01-09 16:22:58 +08:00 |
Martin Kroeker
|
51aa157e64
|
Relocate declaration of alloc_lock outside ifdef block
|
2017-01-09 01:10:43 +01:00 |
Martin Kroeker
|
87c7d10b34
|
Fix thread data races detected by helgrind 3.12
Ref. #995, may possibly help solve issues seen in #660, #883
|
2017-01-08 23:33:51 +01:00 |
Martin Kroeker
|
d0035b857d
|
Apply COMMON_OPT to default FFLAGS to avoid building non-optimized LAPACK by mistake
|
2017-01-08 21:17:22 +01:00 |
Werner Saar
|
c61a7cd293
|
Merge pull request #1049 from wernsaar/develop
removed blas_thread_shutdown from gensymbol
|
2017-01-08 09:30:19 +01:00 |
Werner Saar
|
a8bb5003de
|
removed blas_thread_shutdown from gensymbol
|
2017-01-08 08:51:30 +01:00 |
Zhang Xianyi
|
9a48adff3f
|
Merge pull request #1047 from brada4/erre
Improve R benchmark timing
|
2017-01-08 11:19:06 +08:00 |
Zhang Xianyi
|
823a40a110
|
Merge pull request #1040 from martin-frbg/develop
Use appropriate int32/int64 format for error number in message string
|
2017-01-08 11:18:38 +08:00 |
Zhang Xianyi
|
0bd706ac8d
|
Merge pull request #1036 from sva-img/develop
Added prefetch to CGEMV and ZGEMV.
|
2017-01-08 11:18:05 +08:00 |
Andrew
|
8379550076
|
anti GC and reflow
|
2017-01-07 19:01:42 +01:00 |
Andrew
|
fc148b7e4d
|
init
|
2017-01-07 19:01:21 +01:00 |
Werner Saar
|
5bb2b91a03
|
Merge pull request #1046 from wernsaar/develop
updated lapack to version 3.7.0 with latest patches from git
|
2017-01-07 15:09:56 +01:00 |
Werner Saar
|
abc3304587
|
fix for appveyor test
|
2017-01-07 14:27:08 +01:00 |
Werner Saar
|
a836fe8ec1
|
updated exports/gensymbol for lapack-3.7.0
|
2017-01-07 13:20:28 +01:00 |
Werner Saar
|
1153e3ac39
|
filtered out -fopenmp and fix for mingw
|
2017-01-07 08:41:42 +01:00 |
Werner Saar
|
7c2c488c23
|
removed xerbla and lsame for Makefile
|
2017-01-06 16:35:20 +01:00 |
Werner Saar
|
ae4ac6f984
|
removed obj-files, that are moved to lapack 3.7.0
|
2017-01-06 16:14:53 +01:00 |
Werner Saar
|
4494d03a21
|
filtered out optimized functions
|
2017-01-06 13:42:31 +01:00 |
Werner Saar
|
d35baf30cf
|
added lapack 3.7.0 with latest patches from git
|
2017-01-06 11:48:40 +01:00 |
Werner Saar
|
24efbbd339
|
removed lapack-devel.log
|
2017-01-06 11:46:58 +01:00 |
Werner Saar
|
8cd46acebb
|
removed lapack 3.6.0
|
2017-01-06 11:44:57 +01:00 |
Martin Kroeker
|
9e4b6971e2
|
Merge pull request #1043 from quickwritereader/z13
Z13
|
2017-01-05 19:15:36 +01:00 |
Martin Kroeker
|
0ef7841473
|
Update xerbla.c
|
2017-01-04 23:16:48 +01:00 |
Abdurrauf
|
7f2a959e3e
|
Update README.md
|
2017-01-04 19:41:24 +04:00 |
Abdurrauf
|
6418667818
|
dtrmm and dgemm for z13
|
2017-01-04 19:32:33 +04:00 |
Martin Kroeker
|
104ad066af
|
Use appropriate int32/int64 format for error number in message string
|
2016-12-30 00:45:59 +01:00 |
Shivraj Patil
|
a9bf8a781a
|
Added prefetch to CGEMV and ZGEMV.
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
|
2016-12-27 11:33:51 +05:30 |
Zhang Xianyi
|
8f9975e013
|
Merge pull request #1032 from kiwifb/OSX_target
Do not override MACOSX_DEPLOYMENT_TARGET if it is already defined.
|
2016-12-18 14:48:22 +08:00 |
Zhang Xianyi
|
e7bd736802
|
Merge pull request #1025 from mfoster96/develop
Fix for issue #1024: arm-linux-androideabi-g++ Compiler Error in /cpu…
|
2016-12-18 14:47:59 +08:00 |
Zhang Xianyi
|
14571d8b08
|
Merge pull request #1031 from kiwifb/make
Never use "make" in makefiles. Only $(MAKE).
|
2016-12-18 14:46:52 +08:00 |