wernsaar
339ab34c4c
added additional test value to dstest.in
2014-07-13 18:29:19 +02:00
wernsaar
7424e2b609
added additional test value
2014-07-13 18:26:38 +02:00
wernsaar
73594cff73
segment violation in x86_64 sgemv kernels
2014-07-13 10:49:43 +02:00
wernsaar
880597b301
segment violation in sgemv kernels
2014-07-13 10:46:14 +02:00
wernsaar
9c835431d0
modified pathes to atlas, mkl and acml
2014-07-12 16:20:29 +02:00
wernsaar
1d4ffddf69
added conf option for number of loops
2014-07-12 11:54:39 +02:00
wernsaar
b0e7810a6b
added her2k benchmark
2014-07-11 16:31:05 +02:00
wernsaar
2b92a8c499
added herk benchmark
2014-07-11 16:16:48 +02:00
wernsaar
274b8dc91a
add hemm benchmark
2014-07-11 15:26:34 +02:00
wernsaar
74b237ca22
added syr2k benchmark
2014-07-11 14:48:25 +02:00
wernsaar
c353abd38c
added syrk benchmark
2014-07-11 14:21:25 +02:00
wernsaar
0acce17979
added trsm benchmark
2014-07-11 13:51:08 +02:00
wernsaar
2016a685e6
added trmm benchmark
2014-07-11 13:20:42 +02:00
wernsaar
1b9a6aac30
added benchmark for symm
2014-07-11 12:47:48 +02:00
wernsaar
e27433ab6a
added gemm benchmark and modified Makefile for benchmark
2014-07-11 11:09:47 +02:00
Zhang Xianyi
7961404a40
Merge pull request #411 from wernsaar/develop
...
Lapack-test on x86 32bit now runs without errors.
2014-07-10 22:38:15 +08:00
wernsaar
cedc1f4b14
Ref #410 : disabled optimized potri functions ( single threading bug)
2014-07-10 13:42:32 +02:00
wernsaar
0884b73c69
Lapack-test Windows 32bit now error free
2014-07-10 11:01:47 +02:00
wernsaar
9bd9472ae9
Lapack-test: cleanup of x86 32bit KERNEL file
2014-07-09 16:08:19 +02:00
Zhang Xianyi
2e2473f390
Merge pull request #409 from wernsaar/develop
...
some fixes for Lapack and ARM platform
2014-07-09 21:11:00 +08:00
wernsaar
c4a423a642
bugfixes for lapack on ARM Platform
2014-07-09 12:21:39 +02:00
Zhang Xianyi
f9991fd5f6
Merge branch 'develop'
2014-07-09 08:48:00 +08:00
Zhang Xianyi
47688e24e9
OpenBLAS 0.2.10 rc2 version.
2014-07-09 08:47:36 +08:00
wernsaar
61ef0c3419
added cross compiler examples for 32bit and 64bit ARM
2014-07-08 12:55:18 +02:00
Zhang Xianyi
698e77dba4
Refs #406 . Fixed utest building bug.
2014-07-08 17:26:49 +08:00
wernsaar
2081f6e8ff
Lapack bug114: replaced cgesvd.f and zgesvd.f
2014-07-08 10:21:10 +02:00
wernsaar
dc6b809f15
Lapack bug117: replaced zstemr.f
2014-07-08 10:08:34 +02:00
wernsaar
0f08684649
Lapack bug118: replaced clanhf.f and zlanhf.f
2014-07-08 09:57:40 +02:00
Zhang Xianyi
552119c484
Fixed #407 . Support outputing the CPU corename on runtime.
...
The user can use char * openblas_get_config() or char * openblas_get_corename().
2014-07-08 12:48:08 +08:00
Zhang Xianyi
94d3cfaa10
Merge pull request #404 from wernsaar/develop
...
A lot of fixes for v0.2.10-rc2
2014-07-07 00:39:33 +08:00
wernsaar
13348b2137
removed reference to daxpy_bulldozer kernel (Windows bug in lapack-test)
2014-07-06 16:39:32 +02:00
wernsaar
783a7d2202
bugfix for fortran compiler
2014-07-06 13:33:42 +02:00
wernsaar
50e99a52ea
added definitions for PILEDRIVER and HASWELL
2014-07-06 12:08:27 +02:00
wernsaar
9964ed2f79
bugfix for CORE2
2014-07-06 11:47:28 +02:00
wernsaar
d5b976f92d
fallback to zgemm_kernel_4x2_sse.S
2014-07-06 11:05:28 +02:00
wernsaar
f7267d9b0e
added missing definition for DUNNINGTON
2014-07-06 10:17:07 +02:00
wernsaar
e0c080a28c
removed reference to zgemm_kernel_4x2_sse3.S (bug in lapack-test)
2014-07-05 16:13:17 +02:00
wernsaar
e80b144932
enabled compiling of *3M functions
2014-07-02 14:11:53 +02:00
wernsaar
02a504c0b8
fixed my bug in ger.c
2014-07-02 10:39:33 +02:00
wernsaar
be94db096c
disabled *3M functions for x86_64 platforms
2014-07-01 16:18:05 +02:00
wernsaar
b079df9ef4
added optimized sdot- and dsdot-kernel, written in C
2014-06-30 14:46:38 +02:00
wernsaar
aee61456a4
disabled SMP for sbmv and zsbmv again
2014-06-29 21:18:38 +02:00
wernsaar
01a119abfc
enabled SMP for sbmv and zsbmv, but only for 64bit binaries
2014-06-29 20:35:56 +02:00
wernsaar
1fad2b759f
enabled smp for ger.c and zger.c, but only for 64bit binaries
2014-06-29 16:43:04 +02:00
wernsaar
e1e83a1b71
modification, to run blas-test on Windows
2014-06-29 10:15:29 +02:00
Zhang Xianyi
da3d70420a
Merge branch 'develop'
2014-06-29 10:46:22 +08:00
Zhang Xianyi
1127f5a2d7
OpenBLAS 0.2.10 rc1 version.
2014-06-29 10:45:50 +08:00
Zhang Xianyi
0ae4cc2803
Merge branch 'wernsaar-develop' into develop
2014-06-29 10:40:54 +08:00
Zhang Xianyi
99efbbbad5
Fixed #395 . Enable optimized cgemm for Sandybridge. Added optimized sdot kernel.
...
Fixed c/zgemm, zgemv computational error of haswell, piledriver, bullldozer, and
barcelona on Windows.
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/Makefile.L1
kernel/x86_64/KERNEL
param.h
2014-06-29 10:34:51 +08:00
wernsaar
22e5aee2dd
fixed zgemv bug for older AMD Processors
2014-06-28 19:04:49 +02:00