Commit Graph

1318 Commits

Author SHA1 Message Date
wangqian e4a27d194e Merge branch 'sandybridge' of github.com:xianyi/OpenBLAS into sandybridge 2012-06-20 19:54:57 +08:00
wangqian d34fce56e4 Refs #83 Fixed S/DGEMM calling conventions bug on windows. 2012-06-20 19:53:18 +08:00
Xianyi Zhang b8b922d334 Fixed #106. Use fetch instead of curl on FreeBSD. 2012-06-20 11:07:36 +08:00
Zhang Xianyi 88c272f6a7 Refs #83. Added the missing ALIGN_5 macro on Mac OSX. However, it still exists SEGFAULT bug. 2012-06-20 09:20:20 +08:00
wangqian 6cfcb54a28 Fixed align problem in S and C precision GEMM kernels. 2012-06-20 07:38:39 +08:00
wangqian 3ef96aa567 Fixed bug in MOVQ redefine and ALIGN SIZE problem. 2012-06-19 20:37:22 +08:00
Xianyi Zhang 996dc6d1c8 Fixed dynamic_arch building bug. 2012-06-19 17:29:06 +08:00
Xianyi Zhang 61b1c2db5b Merge branch 'develop' into sandybridge 2012-06-19 17:06:44 +08:00
Xianyi Zhang 34fd3b85a8 Refs #113. Fixed BOBCATE typo in dynamic arch building. 2012-06-19 17:05:16 +08:00
wangqian f76f952547 Refs #83 #53. Adding Intel Sandy Bridge (AVX supported) kernel codes for BLAS level 3 functions. 2012-06-19 16:37:12 +08:00
Zhang Xianyi 11b4a0e4b6 Merge branch 'develop' into sandybridge 2012-06-16 22:07:17 +08:00
Zhang Xianyi eefd30881c Refs #113. Fixed the build bug on AMD Bobcat 64-bit OS. 2012-06-02 21:34:23 +08:00
Zhang Xianyi d3b67d0bd8 Refs #113. Fixed the typo BOBCATE -> BOBCAT 2012-05-31 22:40:15 +08:00
Zhang Xianyi d6cab3f37e Refs #113. Support AMD Bobcate using Barcelona kernel codes. Replace 3DNow! with MMX. 2012-05-31 18:17:45 +08:00
Zhang Xianyi 37edae1c90 Refs #75. Check ffreep macro before the define. 2012-05-31 17:17:02 +08:00
Zhang Xianyi 90d6ad569d Merge branch 'sandybridge' into develop
Just copy the kernel codes from Nehalem. The optimization is ongoing.
2012-05-31 12:44:55 +08:00
Xianyi Zhang a4daa34db7 Refs #75. Use ffreep opcode directly. Please check out http://www.sandpile.org/x86/opc_fpu.htm . 2012-05-30 20:25:01 +08:00
Xianyi Zhang cfc2940412 Merge branch 'fix-64-cores-limitation' into develop 2012-05-30 16:43:45 +08:00
Xianyi Zhang a6adbb299d Refs #112. Improved setting thread affinity in Linux. Remove the limit (64) about the number of CPU cores. 2012-05-29 15:23:52 +08:00
Xianyi Zhang 8cc7f86cf7 Detect Intel Core i7 3000 with Sandybridge. 2012-05-25 23:20:29 +08:00
Xianyi Zhang a53c6e2440 Merge branch 'develop' into sandybridge 2012-05-25 23:16:44 +08:00
Zaheer Chothia 5199809bba Fix typo: OS_CYGWIN -> OS_CYGWIN_NT, OS_INERIX -> OS_INTERIX 2012-05-23 00:23:25 +02:00
Zaheer Chothia a431042475 Fix inconsistent case for OS_* macros (Refs pull request #111) 2012-05-23 00:01:14 +02:00
zchothia 33941033e2 Merge pull request #111 from nolta/develop
fix a couple of FreeBSD issues
2012-05-21 14:44:20 -07:00
Mike Nolta 4e29b6ffc0 FreeBSD: fix OS_FreeBSD -> OS_FREEBSD typos 2012-05-21 16:57:19 -04:00
Mike Nolta 839b18aa26 FreeBSD: allow CC & FC to have different versions 2012-05-21 16:56:28 -04:00
Zaheer Chothia a27339b244 DLL: replace FEXTRALIB -> EXTRALIB (for consistency) 2012-05-21 12:25:12 +02:00
Zaheer Chothia 10e25690b4 Fix FreeBSD build (undefined reference to `pthread_create') 2012-05-21 12:10:26 +02:00
Zhang Xianyi 660d9e09f8 Fixed #110. Merge branch 'patch-2' of https://github.com/nolta/OpenBLAS into develop 2012-05-21 13:01:37 +08:00
Mike Nolta e9be1fdd2b FreeBSD: replace EXTRALIB -> FEXTRALIB 2012-05-20 21:44:15 -03:00
Zaheer Chothia 44124d3055 Fix Fortran compiler detection
- Test with '-x' operator to ensure file is executable.
- 'break' is not a valid Perl keyword.
2012-05-20 18:11:34 +02:00
Zaheer Chothia 14c3511e92 Respect C compiler set on the command line or inherited from the environment 2012-05-20 18:09:35 +02:00
Zhang Xianyi d647f751ee Merge branch 'patch-1' of https://github.com/nolta/OpenBLAS into develop 2012-05-20 12:06:04 +08:00
Mike Nolta be1692d64f fix 'sched_yield' warnings on FreeBSD,NetBSD 2012-05-20 00:49:38 -03:00
Zaheer Chothia f404a17787 Symbol list: document how LAPACKE exports are derived and synchronize with lapack-3.4.1
This change adds the missing LAPACKE_[zc]syr routines but does not remove any exported functions.
2012-05-16 11:28:45 +02:00
Zaheer Chothia fc4927fa0f Fixed #107. Export missing LAPACK auxiliary routines (ALLAUX, SCLAUX, DZLAUX)
Added some documentation on how the symbol list is derived and synchronized with
lapack-3.4.1 to minimize the differences.
2012-05-16 00:27:48 +02:00
Zhang Xianyi 06e208c5c3 Refs #106. Fixed wget and md5 bug on FreeBSD and NetBSD. 2012-05-13 11:43:29 +08:00
Xianyi Zhang 52485e5fd0 Refs #105. Export missing LAPACK functions in shared library.
They are as following,
slabad, dlabad,
slacpy, dlacpy,
slamch, dlamch,
slartg, slartgp, slartgs, dlartg, dlartgp, dlartgs,
slascl, dlascl,
slaset, dlaset.
2012-05-10 13:01:35 +08:00
Xianyi Zhang dee74174ff Refs #85 #104. Use patch instead of git to apply this segfaults.patch. 2012-05-08 23:50:46 +08:00
Xianyi Zhang e7846547be Refs #85 #104. Disable my_bind to fix this segfault issue. 2012-05-07 16:41:22 +08:00
Xianyi Zhang 7f89edee3e refs #103 Increase GEMM_MULTITHREAD_THRESHOLD to 50. 2012-05-03 20:05:34 +08:00
Xianyi Zhang 11cc9dc151 Merge pull request #104 from aeberspaecher/develop
Fixed #85. Add the patch for segfaults on kernel 2.6.32 and add documentation accordingly.
2012-05-03 05:00:40 -07:00
Alexander Eberspächer 4236d0d938 Add note on compiler warnings for the segfaults patch. 2012-05-02 12:03:07 +02:00
Alexander Eberspächer 8218cbea2a Add Xianyi's patch for segfaults on kernel 2.6.32 and add documentation
accordingly.
2012-05-02 11:33:06 +02:00
Xianyi Zhang 5b7f443cf4 Merge branch 'develop' 2012-04-30 13:07:14 +08:00
Xianyi Zhang 0e39699c8c Fixed #102. Export the missing LAPACK functions (slapy2,slapy3,dlapy2,dlapy3) in shared library. 2012-04-30 13:03:34 +08:00
Xianyi Zhang 60a58e03ea Merge branch 'release-0.1.1' into develop 2012-04-29 18:47:26 +08:00
Xianyi Zhang e154920bc6 Merge branch 'release-0.1.1' 2012-04-29 18:41:21 +08:00
Xianyi Zhang fcb89ad94d Refs #91. Updated the doc for 0.1.1 version. 2012-04-29 18:40:24 +08:00
Xianyi Zhang d48a1d1928 Fixed #101. Install the missing lapacke header with LAPACK-3.4.1. Thank Zaheer for this patch. 2012-04-28 12:33:56 +08:00