Hank Anderson
|
67e39bd8fb
|
Added mangled complex filenames to interface and lapack CMakeLists.txt.
|
2015-02-17 13:12:30 -06:00 |
Hank Anderson
|
9eb1499095
|
Added another param to GenerateNamedObjects to mangle complex source names.
There are a lot of sources for complex float types that are the same
names as the real sources, except with z prepended.
|
2015-02-17 10:30:28 -06:00 |
Hank Anderson
|
4662a0b13a
|
Changed generate functions to iterate through a list of float types.
This will generate obj files for SINGLE/DOUBLE/COMPLEX/DOUBLE COMPLEX.
|
2015-02-15 17:44:37 -06:00 |
Hank Anderson
|
e74462a3f5
|
Moved declarations to start of functions to satisfy MSVC C89 implementation.
|
2015-02-11 11:16:57 -06:00 |
Hank Anderson
|
e8c39138c6
|
Removed return value from GenerateNamedObjects.
It sets DBLAS_OBJS directly to save a bunch of list appending in the
CMakeLists.txt files.
|
2015-02-09 12:28:09 -06:00 |
Hank Anderson
|
58cff2fed8
|
Added CBLAS define/naming convention to GenerateNamedObjects.
|
2015-02-04 11:30:15 -06:00 |
Hank Anderson
|
5690cf3f0e
|
Added override for function names in GenerateNamedObjects.
The BLAS interface folder should now be generated the correct objects
for the DOUBLE case.
|
2015-02-04 10:52:19 -06:00 |
Hank Anderson
|
a0aeda6187
|
Added function to set defines for the object names (e.g. -DNAME=dgemm).
|
2015-02-04 10:37:34 -06:00 |
Hank Anderson
|
20e593a44a
|
Added cblas_ objects to interface CMakeLists.
Naming isn't right, though, not seeing cblas_xxxx exports in the
resulting library.
|
2015-02-02 16:25:30 -06:00 |
Hank Anderson
|
9e154aba58
|
Added LAPACK object files to interface CMakeLists.
|
2015-02-02 12:31:15 -06:00 |
Hank Anderson
|
5057a4b4df
|
Added openblas add_library call that uses DBLAS_OBJS ojbects.
|
2015-01-30 15:21:21 -06:00 |
Hank Anderson
|
a6cf8aafc0
|
Updated level3/CMakeLists with correct defines using all combos.
|
2015-01-30 11:21:50 -06:00 |
Hank Anderson
|
5eefe18ae4
|
Added CMakeLists.txt for the first of the BLAS folders.
It only does the double precision compile currently.
I realized I didn't finish converting Makefile.system yet, so I made
a note of that.
|
2015-01-27 16:17:17 -06:00 |
wernsaar
|
9e829ce98f
|
enabled cblas gemm3m functions
|
2014-09-20 17:20:02 +02:00 |
wernsaar
|
d49fd33885
|
disabled SYMM3M and HEMM3M functions because segment violations
|
2014-09-20 15:27:40 +02:00 |
wernsaar
|
7aae4a62e7
|
enabled use of GEMM3M functions
|
2014-09-20 14:27:10 +02:00 |
wernsaar
|
3300f5ebff
|
optimized multithreading lower limits
|
2014-09-15 11:38:25 +02:00 |
wernsaar
|
fd2478c9e2
|
optimized interface/zgemv.c for multithreading
|
2014-09-12 19:18:23 +02:00 |
Zhang Xianyi
|
1cba8e7b11
|
Merge pull request #446 from grisuthedragon/cblas_matcopy
Add a CBLAS interface for the BLAS extension s/d/c/z*matcopy routines.
|
2014-09-10 16:31:31 +08:00 |
Martin Koehler
|
a057e5434d
|
add CBLAS interface for s/d/c/zimatcopy
|
2014-09-09 09:52:13 +02:00 |
Martin Köhler
|
7794766d3c
|
Add cblas_(s/d/c/z)omatcopy in order to have cblas interface for them.
|
2014-09-08 17:57:44 +02:00 |
wernsaar
|
f511807fc0
|
modified multithreading threshold
|
2014-09-08 12:27:32 +02:00 |
wernsaar
|
d1800397f5
|
optimized interface/gemv.c for multithreading
|
2014-09-02 17:36:07 +02:00 |
wernsaar
|
f4ff889491
|
updated interface/gemv.c for multithreading
|
2014-09-02 16:30:04 +02:00 |
wernsaar
|
51413925bd
|
adjust number of threads for small size in cgemv and zgemv
|
2014-07-15 16:27:02 +02:00 |
wernsaar
|
b985cea65d
|
adjust number of threads for sgemv and dgemv
|
2014-07-15 16:04:46 +02:00 |
wernsaar
|
d286daa2ba
|
adjusted number of threads for small size
|
2014-07-15 14:41:35 +02:00 |
wernsaar
|
cedc1f4b14
|
Ref #410: disabled optimized potri functions ( single threading bug)
|
2014-07-10 13:42:32 +02:00 |
wernsaar
|
02a504c0b8
|
fixed my bug in ger.c
|
2014-07-02 10:39:33 +02:00 |
wernsaar
|
be94db096c
|
disabled *3M functions for x86_64 platforms
|
2014-07-01 16:18:05 +02:00 |
wernsaar
|
aee61456a4
|
disabled SMP for sbmv and zsbmv again
|
2014-06-29 21:18:38 +02:00 |
wernsaar
|
01a119abfc
|
enabled SMP for sbmv and zsbmv, but only for 64bit binaries
|
2014-06-29 20:35:56 +02:00 |
wernsaar
|
1fad2b759f
|
enabled smp for ger.c and zger.c, but only for 64bit binaries
|
2014-06-29 16:43:04 +02:00 |
Timothy Gu
|
6c2ead30f0
|
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
|
2014-06-27 12:05:18 -07:00 |
wernsaar
|
15d5dfa92c
|
fixed compiler warnings
|
2014-06-25 11:32:44 +02:00 |
wernsaar
|
86d8c8978b
|
Ref #391: disabled SMP in ger.c and zger.c
|
2014-06-22 12:01:24 +02:00 |
wernsaar
|
a19d209005
|
Ref #103: enhancement for small matrix dimensions
|
2014-06-18 15:04:11 +02:00 |
wernsaar
|
faeab93df0
|
Ref #51: added blas extensions simatcopy, dimatcopy, cimatcopy, zimatcopy
|
2014-06-10 16:14:34 +02:00 |
wernsaar
|
cee257f384
|
Ref #51: added blas extensions zomatcopy and comatcopy
|
2014-06-10 10:34:54 +02:00 |
wernsaar
|
7bfb3011e8
|
Ref #51: added blas extension somatcopy
|
2014-06-09 20:21:13 +02:00 |
wernsaar
|
8c8f596238
|
Ref #51: added blas extension domatcopy as not opimized reference
|
2014-06-09 17:11:07 +02:00 |
wernsaar
|
bff575d0b1
|
Ref #375: added workaround for small sizes to scal.c and zscal.c
|
2014-06-08 13:49:19 +02:00 |
wernsaar
|
faf3ac0aad
|
Ref #285: added axpby kernels
|
2014-06-08 11:54:24 +02:00 |
Zhang Xianyi
|
b31ec99372
|
Fixed #374.
Merge branch 'TimothyGu-develop' into develop
|
2014-06-05 17:01:44 +08:00 |
wernsaar
|
25e899b60b
|
fixed function profile in zpotri.c
|
2014-05-25 09:15:22 +02:00 |
wernsaar
|
89da450800
|
enabled and tested optimized potri lapack functions
|
2014-05-23 12:14:30 +02:00 |
wernsaar
|
c26bbee489
|
enabled abd tested optimized trtri lapack functions
|
2014-05-23 10:55:39 +02:00 |
Timothy Gu
|
ced13574a0
|
Random "walk (a)round" --> "work-around" typo fixes
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
|
2014-05-22 18:11:52 -07:00 |
wernsaar
|
a748d3a75d
|
enabled optimized trti2 lapack functions again
|
2014-05-21 11:02:07 +02:00 |
wernsaar
|
a5ab231ad4
|
enabled optimized complex lauum lapack functions again
|
2014-05-21 10:35:28 +02:00 |