Commit Graph

5621 Commits

Author SHA1 Message Date
Xianyi Zhang c642b61d4d Merge branch 'master' of github.com:xianyi/OpenBLAS into develop 2011-06-19 11:59:38 +08:00
Xianyi Zhang aeed8d6225 Fixed #27. Temporarily work around axpy's low performance issue with small input size & multithreads. 2011-06-19 11:55:29 +08:00
Xianyi Zhang 1a4181afd0 Merge pull request #36 from pipping/master: Fixed the bug about USE_OPENMP=0 enabling OpenMP 2011-06-11 05:59:00 -07:00
Elias Pipping 49742cb2d3 Make USE_OPENMP=0 disable openmp 2011-06-11 14:38:10 +02:00
Xianyi Zhang b3d1887745 Fixed #35 a build bug with NO_LAPACK=1 DYNAMIC_ARCH=1 FC=gfortran. I forgot to test it with gfortran in last bug fixed commit. 2011-06-09 22:59:49 +08:00
Xianyi Zhang 8d50a9fd1a Fixed #35 a build bug with NO_LAPACK=1 & DYNAMIC_ARCH=1. 2011-06-09 11:38:59 +08:00
Xianyi Zhang 1496383224 Print the wall time (cycles) with enabling FUNCTION_PROFILE. 2011-06-09 10:40:15 +08:00
Wang Qian 4335bca2f7 Fixed #33 ztrmm bug on Nehalem. 2011-06-07 12:53:25 +08:00
Xianyi 31040e4d80 Fixed #32 a SEGFAULT bug with gcc-4.6. According to i386 calling convention, the called function should remove the hidden return value address from the stack. 2011-06-03 13:19:54 +08:00
Xianyi Zhang 3d7e62eb8b Fixed #31 Shared library placement on Mac. Thank Mr.Viral B. Shah for this patch. 2011-05-30 12:42:17 +08:00
traz 88d94d0ec8 Fixed #30 strmm computational error on Loongson3A. 2011-05-28 09:48:34 +00:00
Xianyi Zhang af40551c9f Fixed the makefile bug about openblas_set_num_threads. 2011-05-27 21:15:30 +08:00
Xianyi Zhang c30c22a76c Fixed a bug about detecting underscore prefix in c_check. 2011-05-27 18:16:19 +08:00
Xianyi Zhang cc09e6ef3a Ignore *.obj files in git. 2011-05-27 18:12:45 +08:00
traz fc84909115 Modify single precision compiler conditions, increasing single precision kernel code on Loongson3a. 2011-05-27 09:47:17 +00:00
traz 5ca4e51df0 Remove the useless code, modify code comments and format. 2011-05-18 10:54:51 +00:00
Xianyi Zhang fcb5ce011b Fixed #28. Convert the result to double precision in MIPS64 dsdot_k kernel. 2011-05-17 21:24:00 +00:00
traz a9320f896e Fixed #25 dtrmm and dtrsm computational error on Loongson3A. 2011-05-14 22:00:57 +00:00
Xianyi Zhang 830a823be1 Added missed testing codes for dsdot. 2011-05-13 02:41:39 +08:00
Xianyi Zhang b206fc7075 Fixed #28. Convert the result to double precision in the end of dsdot kernel. 2011-05-13 02:34:30 +08:00
Xianyi Zhang 1d60510959 Added the unit testcase for dsdot. 2011-05-13 02:19:55 +08:00
Xianyi Zhang 03272a606d Added the unit test for drotmg. 2011-05-13 01:21:39 +08:00
Xianyi Zhang 0dc9eca36f Merge branch 'hotfix-readme_about_branches' into develop 2011-05-12 19:06:31 +08:00
Xianyi Zhang 8cc628a953 Merge branch 'hotfix-readme_about_branches' 2011-05-12 19:06:02 +08:00
Xianyi Zhang bbc517292a Added the spec of git branches about this project. 2011-05-12 19:05:20 +08:00
traz 29dce62b8f Finish dtrsm_kernel_Rx.S on Loongson3A. 2011-05-11 10:44:23 +00:00
Xianyi Zhang fa8e4fd879 Fixed #26 the wrong result of rotmg. Used fabs() instead of abs(). 2011-05-11 01:12:32 +08:00
traz 432c309f63 Finish dtrsm_kernel_Lx.S on Loongson3A. 2011-05-10 12:48:43 +00:00
traz d2f351d819 Modify dtrsm compiler options 2011-05-09 17:31:58 +00:00
traz 5a991b7149 Fixed #24 drmm error on Loongson3A 2011-05-09 17:28:20 +00:00
Xianyi Zhang 417b8ec792 Added openblas_set_num_threads for Fortran. 2011-05-06 17:03:35 +08:00
Xianyi Zhang 7dcf4eeee7 Fixed #23. Fixed a bug of f_check script about generating link flags. 2011-05-04 13:03:10 +08:00
Xianyi Zhang 1acf5ace29 Fixed a bug when detecting Intel CPU. 2011-05-03 17:19:36 +08:00
traits fcf9b82f14 Fixed a build bug with NO_LAPACK=1 and SANNITY_CHECK=1. 2011-05-03 14:42:11 +08:00
Xianyi Zhang 2aab238c61 Fixed #16. Print the user-friendly message when detecting CPU failed. 2011-04-22 22:14:06 +08:00
Xianyi Zhang b8d93812f0 Added docs for make TARGET=your_cpu_target. 2011-04-22 22:07:46 +08:00
Xianyi Zhang ff6ae89d3e Fixed #19. Provided an error msg when the arch is not supported. 2011-04-22 20:21:42 +08:00
Xianyi Zhang 0a45e5495f Fixed #21. Added extern C to support C++. Thank Tasio for the patch. 2011-04-20 13:41:38 +08:00
traz 9320933520 Completely dtrmm function. 2011-04-17 20:26:49 +00:00
traz 921caefa56 Increased handling trmm part, no edge handling. Test size(M and N) must be a multiple of 4 . 2011-04-15 21:56:25 +00:00
traz ecd4c1f3d9 Modify prefetching C. 2011-04-11 22:46:36 +00:00
traz ab9e4ce351 Adjust kc size from 112 to 116 . 2011-04-11 22:17:57 +00:00
Xianyi Zhang 921e040b15 Changed default page size to 16KB on Loongson 3A. 2011-04-11 21:46:48 +00:00
Xianyi Zhang 00ef0cd434 Supported goto_set_num_threads & openblas_set_num_threads functions when USE_OPENMP=1. 2011-04-07 14:52:35 +08:00
Xianyi Zhang 989c6f8b06 Fixed #14 the SEGFAULT bug on 64 cores. On SMP server, the number of CPUs or cores should be less than or equal to 64. 2011-04-07 14:48:10 +08:00
Xianyi Zhang 552f31dbbd Fixed #13. Fixed blasint undefined bug in <cblas.h> file. 2011-04-07 14:48:10 +08:00
Xianyi Zhang 5452ba3850 Updated the developing version to v0.1 alpha2. 2011-04-07 14:48:10 +08:00
Xianyi Zhang 54745902b8 Init Changelog file for next release version(v0.1alpha2). 2011-04-07 14:48:10 +08:00
traz 1aa9a298e1 Change BLOCK SIZE of LOONGSON3A TARGET. 2011-04-06 10:39:31 +00:00
traz 782205a693 Add dgemm compiler Options in KERNEL.LOONGSON3A. 2011-04-06 10:38:34 +00:00