Commit Graph

395 Commits

Author SHA1 Message Date
Xianyi Zhang 33313b0221 Merge branch 'develop' into loongson3a 2011-07-07 14:25:51 +08:00
traits a5300420e2 Merge branch 'hotfix-0.1alpha2.1' into develop 2011-06-28 15:46:55 +08:00
traits 9b46bf1eb4 Merge branch 'hotfix-0.1alpha2.1' 2011-06-28 15:43:08 +08:00
traits c06b7be32f Refs #42. Output the error message when detecting fortran compiler failed. 2011-06-28 15:42:09 +08:00
traz 68532fa9ec Merge branch 'loongson3a' of github.com:xianyi/OpenBLAS into loongson3a 2011-06-24 09:28:12 +00:00
traz 708d2b6255 Fix compute error in ztrmm. 2011-06-24 09:27:41 +00:00
traz e72113f06a Add ztrmm and ztrsm part on loongson3a. The average performance is 2.2G. 2011-06-23 21:11:00 +00:00
traz 14f81da375 Change prefetch length of A and B, the performance is 2.1G now. 2011-06-23 10:46:58 +00:00
Xianyi Zhang fc21f7ad28 Merge branch 'release-v0.1alpha2' into loongson3a 2011-06-23 16:08:23 +08:00
Xianyi Zhang ca8bf5abb0 Merge branch 'release-v0.1alpha2' into develop 2011-06-23 16:07:34 +08:00
traits 4a73f5c5ea Merge branch 'release-v0.1alpha2' 2011-06-23 15:18:40 +08:00
traits 6a0762949d Fixed #38. Released v0.1 alpha2. 2011-06-23 15:16:24 +08:00
traits 859b71645a Refs #37. Updated REAME about the compatible issue with EKOPath compiler. 2011-06-23 15:09:34 +08:00
Xianyi Zhang 078bfd0b4f Refs #39. Moved the shared lib (dll) to top directory in MingW64 compiler environment. 2011-06-22 13:19:39 +08:00
traz 1c96d345e2 Improve zgemm performance from 1G to 1.8G, change block size in param.h. 2011-06-21 22:16:23 +00:00
Xianyi Zhang 82f5274828 Refs #39. It's unnecessary to include sys/mman.h file in blas_server_omp.c. 2011-06-22 01:52:20 +08:00
Xianyi Zhang e568df0dae Refs #38. Prepare the docs with v0.1alpha2. 2011-06-21 18:06:13 +08:00
Xianyi Zhang c4efde7713 Merge branch 'loongson3a' into release-v0.1alpha2 2011-06-21 17:50:00 +08:00
Xianyi Zhang 7a1e6202e1 Merge branch 'add_install_target' into develop 2011-06-21 17:40:16 +08:00
Xianyi Zhang 32353a9d30 Refs #20. Fixed the installation bug with DYNAMIC_ARCH=1. 2011-06-21 17:39:08 +08:00
Xianyi Zhang 2e6e9272fe Merge branch 'add_install_target' into develop
Conflicts:
	Changelog.txt
2011-06-20 18:40:05 +08:00
Xianyi Zhang d978436c4b Refs #20. Updated the docs. 2011-06-20 18:36:29 +08:00
Xianyi Zhang fab36f1adb Fixed #20. Added install target in makefile. You can use "make install PREFIX=your_installation_directory". 2011-06-20 18:35:35 +08:00
Xianyi Zhang 7945919f22 Updated gitignore file. 2011-06-19 12:07:31 +08:00
Xianyi Zhang c642b61d4d Merge branch 'master' of github.com:xianyi/OpenBLAS into develop 2011-06-19 11:59:38 +08:00
Xianyi Zhang aeed8d6225 Fixed #27. Temporarily walk around axpy's low performance issue with small imput size & multithreads. 2011-06-19 11:55:29 +08:00
Xianyi Zhang 1a4181afd0 Merge pull request #36 from pipping/master
Fixed the bug about USE_OPENMP=0 enabling OpenMP
2011-06-11 05:59:00 -07:00
Elias Pipping 49742cb2d3 Make USE_OPENMP=0 disable openmp 2011-06-11 14:38:10 +02:00
Xianyi Zhang b3d1887745 Fixed #35 a build bug with NO_LAPACK=1 DYNAMIC_ARCH=1 FC=gfortran. I forgot to test it with gfortran in last bug fixed commit. 2011-06-09 22:59:49 +08:00
Xianyi Zhang 8d50a9fd1a Fixed #35 a build bug with NO_LAPACK=1 & DYNAMIC_ARCH=1. 2011-06-09 11:38:59 +08:00
Xianyi Zhang 1496383224 Print the wall time (cycles) with enabling FUNCTION_PROFILE. 2011-06-09 10:40:15 +08:00
Wang Qian 4335bca2f7 Fixed #33 ztrmm bug on Nehalem. 2011-06-07 12:53:25 +08:00
Xianyi 31040e4d80 Fixed #32 a SEGFAULT bug with gcc-4.6. According to i386 calling convention, The called funtion should remove the hidden return value address from the stack. 2011-06-03 13:19:54 +08:00
Xianyi Zhang 3d7e62eb8b Fixed #31 Shared library placement on Mac. Thank Mr.Viral B. Shah for this patch. 2011-05-30 12:42:17 +08:00
traz 88d94d0ec8 Fixed #30 strmm computational error on Loongson3A. 2011-05-28 09:48:34 +00:00
Xianyi Zhang af40551c9f Fixed the makefile bug about openblas_set_num_threads. 2011-05-27 21:15:30 +08:00
Xianyi Zhang c30c22a76c Fixed a bug about detecting underscore prefix in c_check. 2011-05-27 18:16:19 +08:00
Xianyi Zhang cc09e6ef3a Ingnore *.obj files in git. 2011-05-27 18:12:45 +08:00
traz fc84909115 Modify single precision compiler conditions, increasing single precision kernel code on Loongson3a. 2011-05-27 09:47:17 +00:00
traz 5ca4e51df0 Remove the useless code, modify code comments and format. 2011-05-18 10:54:51 +00:00
Xianyi Zhang fcb5ce011b Fixed #28. Convert the result to double precision in MIPS64 dsdot_k kernel. 2011-05-17 21:24:00 +00:00
traz a9320f896e Fixed #25 dtrmm and dtrsm computational error on Loongson3A. 2011-05-14 22:00:57 +00:00
Xianyi Zhang 830a823be1 Added missed testing codes for dsdot. 2011-05-13 02:41:39 +08:00
Xianyi Zhang b206fc7075 Fixed #28. Convert the result to double precision in the end of dsdot kernel. 2011-05-13 02:34:30 +08:00
Xianyi Zhang 1d60510959 Added the unit testcase for dsdot. 2011-05-13 02:19:55 +08:00
Xianyi Zhang 03272a606d Added the unit test for drotmg. 2011-05-13 01:21:39 +08:00
Xianyi Zhang 0dc9eca36f Merge branch 'hotfix-readme_about_branches' into develop 2011-05-12 19:06:31 +08:00
Xianyi Zhang 8cc628a953 Merge branch 'hotfix-readme_about_branches' 2011-05-12 19:06:02 +08:00
Xianyi Zhang bbc517292a Added the spec of git branches about this project. 2011-05-12 19:05:20 +08:00
traz 29dce62b8f Finish dtrsm_kernel_Rx.S on Loongson3A. 2011-05-11 10:44:23 +00:00