wernsaar
|
6acbafe45b
|
added sgemv_n microkernel for haswell
|
2014-07-20 14:52:25 +02:00 |
wernsaar
|
5392d11b04
|
optimized sgemv_n_microk_sandy.c
|
2014-07-20 14:08:04 +02:00 |
wernsaar
|
c0fe95fb72
|
added sgemv_n microkernel for sandybridge
|
2014-07-20 13:17:47 +02:00 |
wernsaar
|
d9d4077c93
|
added sgemv_t microkernel for haswell
|
2014-07-20 11:30:32 +02:00 |
wernsaar
|
02eb72ac42
|
bugfix in sgemv_t_microk_sandy.c
|
2014-07-20 10:48:41 +02:00 |
wernsaar
|
c06f9986d4
|
added sgemv_t microkernel for sandybridge
|
2014-07-20 10:21:08 +02:00 |
wernsaar
|
2cce125c79
|
added optimized sgemv_t for bulldozer and piledriver
|
2014-07-19 15:48:07 +02:00 |
wernsaar
|
b3938fe371
|
don't use this sgemv_n on Windows
|
2014-07-19 07:15:34 +02:00 |
wernsaar
|
c8a4a56177
|
performance optimizations for sgemv_n
|
2014-07-18 11:25:21 +02:00 |
wernsaar
|
3c5732615d
|
added blocked sgemv_n and microkernel for bulldozer and piledriver
|
2014-07-17 23:15:07 +02:00 |
wernsaar
|
7ceb25d7b3
|
changed string GFORTRAN to lowercase
|
2014-07-16 17:08:43 +02:00 |
wernsaar
|
51413925bd
|
adjust number of threads for small size in cgemv and zgemv
|
2014-07-15 16:27:02 +02:00 |
wernsaar
|
b985cea65d
|
adjust number of threads for sgemv and dgemv
|
2014-07-15 16:04:46 +02:00 |
wernsaar
|
d286daa2ba
|
adjusted number of threads for small size
|
2014-07-15 14:41:35 +02:00 |
wernsaar
|
bcb115b55b
|
added benchmark for gemv
|
2014-07-15 13:35:36 +02:00 |
wernsaar
|
7424e2b609
|
added additional test value
|
2014-07-13 18:26:38 +02:00 |
wernsaar
|
880597b301
|
segment violation in sgemv kernels
|
2014-07-13 10:46:14 +02:00 |
wernsaar
|
9c835431d0
|
modified paths to atlas, mkl and acml
|
2014-07-12 16:20:29 +02:00 |
wernsaar
|
1d4ffddf69
|
added conf option for number of loops
|
2014-07-12 11:54:39 +02:00 |
wernsaar
|
b0e7810a6b
|
added her2k benchmark
|
2014-07-11 16:31:05 +02:00 |
wernsaar
|
2b92a8c499
|
added herk benchmark
|
2014-07-11 16:16:48 +02:00 |
wernsaar
|
274b8dc91a
|
add hemm benchmark
|
2014-07-11 15:26:34 +02:00 |
wernsaar
|
74b237ca22
|
added syr2k benchmark
|
2014-07-11 14:48:25 +02:00 |
wernsaar
|
c353abd38c
|
added syrk benchmark
|
2014-07-11 14:21:25 +02:00 |
wernsaar
|
0acce17979
|
added trsm benchmark
|
2014-07-11 13:51:08 +02:00 |
wernsaar
|
2016a685e6
|
added trmm benchmark
|
2014-07-11 13:20:42 +02:00 |
wernsaar
|
1b9a6aac30
|
added benchmark for symm
|
2014-07-11 12:47:48 +02:00 |
wernsaar
|
e27433ab6a
|
added gemm benchmark and modified Makefile for benchmark
|
2014-07-11 11:09:47 +02:00 |
Zhang Xianyi
|
7961404a40
|
Merge pull request #411 from wernsaar/develop
Lapack-test on x86 32bit now runs without errors.
|
2014-07-10 22:38:15 +08:00 |
wernsaar
|
cedc1f4b14
|
Ref #410: disabled optimized potri functions (single threading bug)
|
2014-07-10 13:42:32 +02:00 |
wernsaar
|
0884b73c69
|
Lapack-test Windows 32bit now error free
|
2014-07-10 11:01:47 +02:00 |
wernsaar
|
9bd9472ae9
|
Lapack-test: cleanup of x86 32bit KERNEL file
|
2014-07-09 16:08:19 +02:00 |
Zhang Xianyi
|
2e2473f390
|
Merge pull request #409 from wernsaar/develop
some fixes for Lapack and ARM platform
|
2014-07-09 21:11:00 +08:00 |
wernsaar
|
c4a423a642
|
bugfixes for lapack on ARM Platform
|
2014-07-09 12:21:39 +02:00 |
Zhang Xianyi
|
47688e24e9
|
OpenBLAS 0.2.10 rc2 version.
|
2014-07-09 08:47:36 +08:00 |
wernsaar
|
61ef0c3419
|
added cross compiler examples for 32bit and 64bit ARM
|
2014-07-08 12:55:18 +02:00 |
Zhang Xianyi
|
698e77dba4
|
Refs #406. Fixed utest building bug.
|
2014-07-08 17:26:49 +08:00 |
wernsaar
|
2081f6e8ff
|
Lapack bug114: replaced cgesvd.f and zgesvd.f
|
2014-07-08 10:21:10 +02:00 |
wernsaar
|
dc6b809f15
|
Lapack bug117: replaced zstemr.f
|
2014-07-08 10:08:34 +02:00 |
wernsaar
|
0f08684649
|
Lapack bug118: replaced clanhf.f and zlanhf.f
|
2014-07-08 09:57:40 +02:00 |
Zhang Xianyi
|
552119c484
|
Fixed #407. Support outputting the CPU corename at runtime.
The user can call char * openblas_get_config() or char * openblas_get_corename().
|
2014-07-08 12:48:08 +08:00 |
Zhang Xianyi
|
94d3cfaa10
|
Merge pull request #404 from wernsaar/develop
A lot of fixes for v0.2.10-rc2
|
2014-07-07 00:39:33 +08:00 |
wernsaar
|
13348b2137
|
removed reference to daxpy_bulldozer kernel (Windows bug in lapack-test)
|
2014-07-06 16:39:32 +02:00 |
wernsaar
|
783a7d2202
|
bugfix for fortran compiler
|
2014-07-06 13:33:42 +02:00 |
wernsaar
|
50e99a52ea
|
added definitions for PILEDRIVER and HASWELL
|
2014-07-06 12:08:27 +02:00 |
wernsaar
|
9964ed2f79
|
bugfix for CORE2
|
2014-07-06 11:47:28 +02:00 |
wernsaar
|
d5b976f92d
|
fallback to zgemm_kernel_4x2_sse.S
|
2014-07-06 11:05:28 +02:00 |
wernsaar
|
f7267d9b0e
|
added missing definition for DUNNINGTON
|
2014-07-06 10:17:07 +02:00 |
wernsaar
|
e0c080a28c
|
removed reference to zgemm_kernel_4x2_sse3.S (bug in lapack-test)
|
2014-07-05 16:13:17 +02:00 |
wernsaar
|
e80b144932
|
enabled compiling of *3M functions
|
2014-07-02 14:11:53 +02:00 |