wangqian
|
e4a27d194e
|
Merge branch 'sandybridge' of github.com:xianyi/OpenBLAS into sandybridge
|
2012-06-20 19:54:57 +08:00 |
wangqian
|
d34fce56e4
|
Refs #83 Fixed S/DGEMM calling conventions bug on windows.
|
2012-06-20 19:53:18 +08:00 |
Xianyi Zhang
|
b8b922d334
|
Fixed #106. Use fetch instead of curl on FreeBSD.
|
2012-06-20 11:07:36 +08:00 |
Zhang Xianyi
|
88c272f6a7
|
Refs #83. Added the missing ALIGN_5 macro on Mac OSX. However, the SEGFAULT bug still exists.
|
2012-06-20 09:20:20 +08:00 |
wangqian
|
6cfcb54a28
|
Fixed align problem in S and C precision GEMM kernels.
|
2012-06-20 07:38:39 +08:00 |
wangqian
|
3ef96aa567
|
Fixed bug in MOVQ redefinition and ALIGN SIZE problem.
|
2012-06-19 20:37:22 +08:00 |
Xianyi Zhang
|
996dc6d1c8
|
Fixed dynamic_arch building bug.
|
2012-06-19 17:29:06 +08:00 |
Xianyi Zhang
|
61b1c2db5b
|
Merge branch 'develop' into sandybridge
|
2012-06-19 17:06:44 +08:00 |
Xianyi Zhang
|
34fd3b85a8
|
Refs #113. Fixed BOBCATE typo in dynamic arch building.
|
2012-06-19 17:05:16 +08:00 |
wangqian
|
f76f952547
|
Refs #83 #53. Adding Intel Sandy Bridge (AVX supported) kernel codes for BLAS level 3 functions.
|
2012-06-19 16:37:12 +08:00 |
Zhang Xianyi
|
11b4a0e4b6
|
Merge branch 'develop' into sandybridge
|
2012-06-16 22:07:17 +08:00 |
Zhang Xianyi
|
eefd30881c
|
Refs #113. Fixed the build bug on AMD Bobcat 64-bit OS.
|
2012-06-02 21:34:23 +08:00 |
Zhang Xianyi
|
d3b67d0bd8
|
Refs #113. Fixed the typo BOBCATE -> BOBCAT
|
2012-05-31 22:40:15 +08:00 |
Zhang Xianyi
|
d6cab3f37e
|
Refs #113. Support AMD Bobcat using Barcelona kernel codes. Replace 3DNow! with MMX.
|
2012-05-31 18:17:45 +08:00 |
Zhang Xianyi
|
37edae1c90
|
Refs #75. Check ffreep macro before the define.
|
2012-05-31 17:17:02 +08:00 |
Zhang Xianyi
|
90d6ad569d
|
Merge branch 'sandybridge' into develop
Just copied the kernel codes from Nehalem. The optimization is ongoing.
|
2012-05-31 12:44:55 +08:00 |
Xianyi Zhang
|
a4daa34db7
|
Refs #75. Use ffreep opcode directly. Please check out http://www.sandpile.org/x86/opc_fpu.htm .
|
2012-05-30 20:25:01 +08:00 |
Xianyi Zhang
|
cfc2940412
|
Merge branch 'fix-64-cores-limitation' into develop
|
2012-05-30 16:43:45 +08:00 |
Xianyi Zhang
|
a6adbb299d
|
Refs #112. Improved setting thread affinity in Linux. Removed the limit (64) on the number of CPU cores.
|
2012-05-29 15:23:52 +08:00 |
Xianyi Zhang
|
8cc7f86cf7
|
Detect Intel Core i7 3000 with Sandybridge.
|
2012-05-25 23:20:29 +08:00 |
Xianyi Zhang
|
a53c6e2440
|
Merge branch 'develop' into sandybridge
|
2012-05-25 23:16:44 +08:00 |
Zaheer Chothia
|
5199809bba
|
Fix typo: OS_CYGWIN -> OS_CYGWIN_NT, OS_INERIX -> OS_INTERIX
|
2012-05-23 00:23:25 +02:00 |
Zaheer Chothia
|
a431042475
|
Fix inconsistent case for OS_* macros (Refs pull request #111)
|
2012-05-23 00:01:14 +02:00 |
zchothia
|
33941033e2
|
Merge pull request #111 from nolta/develop
fix a couple of FreeBSD issues
|
2012-05-21 14:44:20 -07:00 |
Mike Nolta
|
4e29b6ffc0
|
FreeBSD: fix OS_FreeBSD -> OS_FREEBSD typos
|
2012-05-21 16:57:19 -04:00 |
Mike Nolta
|
839b18aa26
|
FreeBSD: allow CC & FC to have different versions
|
2012-05-21 16:56:28 -04:00 |
Zaheer Chothia
|
a27339b244
|
DLL: replace FEXTRALIB -> EXTRALIB (for consistency)
|
2012-05-21 12:25:12 +02:00 |
Zaheer Chothia
|
10e25690b4
|
Fix FreeBSD build (undefined reference to `pthread_create')
|
2012-05-21 12:10:26 +02:00 |
Zhang Xianyi
|
660d9e09f8
|
Fixed #110. Merge branch 'patch-2' of https://github.com/nolta/OpenBLAS into develop
|
2012-05-21 13:01:37 +08:00 |
Mike Nolta
|
e9be1fdd2b
|
FreeBSD: replace EXTRALIB -> FEXTRALIB
|
2012-05-20 21:44:15 -03:00 |
Zaheer Chothia
|
44124d3055
|
Fix Fortran compiler detection
- Test with '-x' operator to ensure file is executable.
- 'break' is not a valid Perl keyword.
|
2012-05-20 18:11:34 +02:00 |
Zaheer Chothia
|
14c3511e92
|
Respect C compiler set on the command line or inherited from the environment
|
2012-05-20 18:09:35 +02:00 |
Zhang Xianyi
|
d647f751ee
|
Merge branch 'patch-1' of https://github.com/nolta/OpenBLAS into develop
|
2012-05-20 12:06:04 +08:00 |
Mike Nolta
|
be1692d64f
|
fix 'sched_yield' warnings on FreeBSD,NetBSD
|
2012-05-20 00:49:38 -03:00 |
Zaheer Chothia
|
f404a17787
|
Symbol list: document how LAPACKE exports are derived and synchronize with lapack-3.4.1
This change adds the missing LAPACKE_[zc]syr routines but does not remove any exported functions.
|
2012-05-16 11:28:45 +02:00 |
Zaheer Chothia
|
fc4927fa0f
|
Fixed #107. Export missing LAPACK auxiliary routines (ALLAUX, SCLAUX, DZLAUX)
Added some documentation on how the symbol list is derived and synchronized with
lapack-3.4.1 to minimize the differences.
|
2012-05-16 00:27:48 +02:00 |
Zhang Xianyi
|
06e208c5c3
|
Refs #106. Fixed wget and md5 bug on FreeBSD and NetBSD.
|
2012-05-13 11:43:29 +08:00 |
Xianyi Zhang
|
52485e5fd0
|
Refs #105. Export missing LAPACK functions in shared library.
They are as follows:
slabad, dlabad,
slacpy, dlacpy,
slamch, dlamch,
slartg, slartgp, slartgs, dlartg, dlartgp, dlartgs,
slascl, dlascl,
slaset, dlaset.
|
2012-05-10 13:01:35 +08:00 |
Xianyi Zhang
|
dee74174ff
|
Refs #85 #104. Use patch instead of git to apply this segfaults.patch.
|
2012-05-08 23:50:46 +08:00 |
Xianyi Zhang
|
e7846547be
|
Refs #85 #104. Disable my_bind to fix this segfault issue.
|
2012-05-07 16:41:22 +08:00 |
Xianyi Zhang
|
7f89edee3e
|
refs #103 Increase GEMM_MULTITHREAD_THRESHOLD to 50.
|
2012-05-03 20:05:34 +08:00 |
Xianyi Zhang
|
11cc9dc151
|
Merge pull request #104 from aeberspaecher/develop
Fixed #85. Add the patch for segfaults on kernel 2.6.32 and add documentation accordingly.
|
2012-05-03 05:00:40 -07:00 |
Alexander Eberspächer
|
4236d0d938
|
Add note on compiler warnings for the segfaults patch.
|
2012-05-02 12:03:07 +02:00 |
Alexander Eberspächer
|
8218cbea2a
|
Add Xianyi's patch for segfaults on kernel 2.6.32 and add documentation
accordingly.
|
2012-05-02 11:33:06 +02:00 |
Xianyi Zhang
|
5b7f443cf4
|
Merge branch 'develop'
|
2012-04-30 13:07:14 +08:00 |
Xianyi Zhang
|
0e39699c8c
|
Fixed #102. Export the missing LAPACK functions (slapy2,slapy3,dlapy2,dlapy3) in shared library.
|
2012-04-30 13:03:34 +08:00 |
Xianyi Zhang
|
60a58e03ea
|
Merge branch 'release-0.1.1' into develop
|
2012-04-29 18:47:26 +08:00 |
Xianyi Zhang
|
e154920bc6
|
Merge branch 'release-0.1.1'
|
2012-04-29 18:41:21 +08:00 |
Xianyi Zhang
|
fcb89ad94d
|
Refs #91. Updated the doc for 0.1.1 version.
|
2012-04-29 18:40:24 +08:00 |
Xianyi Zhang
|
d48a1d1928
|
Fixed #101. Install the missing lapacke header with LAPACK-3.4.1. Thanks to Zaheer for this patch.
|
2012-04-28 12:33:56 +08:00 |