Commit Graph

395 Commits

Author SHA1 Message Date
Xianyi Zhang fa8e4fd879 Fixed #26 the wrong result of rotmg. Used fabs() instead of abs(). 2011-05-11 01:12:32 +08:00
traz 432c309f63 Finish dtrsm_kernel_Lx.S on Loongson3A. 2011-05-10 12:48:43 +00:00
traz d2f351d819 Modify dtrsm compiler options 2011-05-09 17:31:58 +00:00
traz 5a991b7149 Fixed #24 drmm error on Loongson3A 2011-05-09 17:28:20 +00:00
Xianyi Zhang 417b8ec792 Added openblas_set_num_threads for Fortran. 2011-05-06 17:03:35 +08:00
Xianyi Zhang 7dcf4eeee7 Fixed #23. Fixed a bug of f_check script about generating link flags. 2011-05-04 13:03:10 +08:00
Xianyi Zhang 1acf5ace29 Fixed a bug when detecting Intel CPU. 2011-05-03 17:19:36 +08:00
traits fcf9b82f14 Fixed a build bug with NO_LAPACK=1 and SANNITY_CHECK=1. 2011-05-03 14:42:11 +08:00
Xianyi Zhang 2aab238c61 Fixed #16. Print the user-friendly message when detecting CPU failed. 2011-04-22 22:14:06 +08:00
Xianyi Zhang b8d93812f0 Added docs for make TARGET=your_cpu_target. 2011-04-22 22:07:46 +08:00
Xianyi Zhang ff6ae89d3e Fixed #19. Provided an error msg when the arch is not supported. 2011-04-22 20:21:42 +08:00
Xianyi Zhang 0a45e5495f Fixed #21. Added extern C to support C++. Thank Tasio for the patch. 2011-04-20 13:41:38 +08:00
traz 9320933520 Completely dtrmm function. 2011-04-17 20:26:49 +00:00
traz 921caefa56 Increased handling trmm part, no edge handling. Test size(M and N) must be a multiple of 4 . 2011-04-15 21:56:25 +00:00
traz ecd4c1f3d9 Modify prefetching C. 2011-04-11 22:46:36 +00:00
traz ab9e4ce351 Adjust kc size from 112 to 116 . 2011-04-11 22:17:57 +00:00
Xianyi Zhang 921e040b15 Changed default page size to 16KB on Loongson 3A. 2011-04-11 21:46:48 +00:00
Xianyi Zhang 00ef0cd434 Supported goto_set_num_threads & openblas_set_num_threads functions when USE_OPENMP=1. 2011-04-07 14:52:35 +08:00
Xianyi Zhang 989c6f8b06 Fixed #14 the SEGFAULT bug on 64 cores. On SMP server, the number of CPUs or cores should be less than or equal to 64. 2011-04-07 14:48:10 +08:00
Xianyi Zhang 552f31dbbd Fixed #13. Fixed blasint undefined bug in <cblas.h> file. 2011-04-07 14:48:10 +08:00
Xianyi Zhang 5452ba3850 Updated the developing version to v0.1 alpha2. 2011-04-07 14:48:10 +08:00
Xianyi Zhang 54745902b8 Init Changelog file for next release version(v0.1alpha2). 2011-04-07 14:48:10 +08:00
traz 1aa9a298e1 Change BLOCK SIZE of LOONGSON3A TARGET. 2011-04-06 10:39:31 +00:00
traz 782205a693 Add dgemm compiler Options in KERNEL.LOONGSON3A. 2011-04-06 10:38:34 +00:00
traz ac494c0d04 New kernel in LOONGSON3A. 2011-04-06 10:36:44 +00:00
Xianyi Zhang 85f99d4769 Fixed #14 the SEGFAULT bug on 64 cores. On SMP server, the number of CPUs or cores should be less than or equal to 64. 2011-03-28 10:58:39 +08:00
Xianyi Zhang 5e7f29b19e Fixed #13. Fixed blasint undefined bug in <cblas.h> file. 2011-03-25 01:17:27 +08:00
Xianyi Zhang 141091f528 Merge branch 'master' of github.com:xianyi/OpenBLAS into x86 2011-03-22 14:16:18 +08:00
Xianyi Zhang e4bb6f2482 Fixed the detecting bug on Intel Core i5. Thank ggl329 for the patch. 2011-03-22 14:09:47 +08:00
Xianyi Zhang 0edcdd470e Updated the developing version to v0.1 alpha2. 2011-03-20 23:35:31 +08:00
Xianyi Zhang d672491122 Init Changelog file for next release version(v0.1alpha2). 2011-03-20 23:30:09 +08:00
Xianyi Zhang 972062903c OpenBLAS 0.1 alpha version 1. 2011-03-20 22:44:57 +08:00
Xianyi Zhang d9aa359e69 Merge remote branch 'origin/loongson3a' into x86 2011-03-20 21:57:58 +08:00
Xianyi Zhang 04769bdf54 Merge remote branch 'origin/loongson3a' into x86 2011-03-20 21:57:09 +08:00
Xianyi Zhang 6f058487ab Detect Intel Core Clarkdale & Arrandale 2011-03-20 21:56:40 +08:00
Xianyi Zhang f405b5bcc5 Fixed the bug about Loongson3A gsLQC1 & gsSQC1 instructions in daxpy kernel. Now daxpy is correct. 2011-03-18 23:05:56 +00:00
Xianyi Zhang 2b8643e0de Merge branch 'loongson3a' of github.com:xianyi/OpenBLAS into loongson3a 2011-03-18 01:20:15 +00:00
Xianyi Zhang c84f8be453 Supported detecting new kernel(2.6.36) & new Loongson3A03 CPU. 2011-03-18 01:10:58 +00:00
Wang Qian d5cffd506a Modified the default kernel makefile in MIPS64 arch. 2011-03-07 11:23:12 +00:00
Xianyi Zhang 5838f12995 Support unalign address in daxpy on loongson3a simd.. 2011-03-05 10:17:10 +08:00
Xianyi Zhang 5444a3f8f7 Unroll to 16 in daxpy on loongson3a. 2011-03-04 17:50:17 +08:00
Xianyi Zhang 88cbfcc5b5 Merge commit 'origin/x86' into loongson3a 2011-03-04 14:11:52 +00:00
Xianyi Zhang ce78abe37e Merge branch 'x86' of github.com:xianyi/OpenBLAS into x86 2011-03-04 11:53:04 +08:00
Xianyi Zhang 8f1090d32a Support NO_LAPACK=1 to build the lib without LAPACK functions. 2011-03-04 11:51:32 +08:00
Xianyi 272f62a2b6 Changed movlps macro name in capital in x86/zdot_sse2.S file. 2011-03-03 00:46:39 +08:00
Xianyi 36016fe349 On x86 32bits, gcc 4.4.3 generated wrong codes (movsd) from movlps in zdot_sse2.S line 191.
This would casue zdotu & zdotc failures. Instead, use movlpd to walk around it. Fixed #8. Fixed #9.
2011-03-02 18:45:43 +08:00
Xianyi Zhang 44acb7503e Added zdotu with x & y offset=1 test case. 2011-03-02 18:03:40 +08:00
Xianyi Zhang 6eb02bbb9c Merge remote branch 'origin/x86' into loongson3a 2011-03-02 13:52:05 +08:00
Xianyi Zhang 0e782b9bd3 updated the changelog. 2011-03-02 13:40:55 +08:00
Xianyi Zhang 588737210d Fixed randomly SEGFAULT when nodemask==NULL with above Linux 2.6.34. Fixed #12. Thank Mr.Ei-ji Nakama providing this patch. 2011-03-02 13:38:32 +08:00