Commit Graph

7586 Commits

Author SHA1 Message Date
Shiyou Yin
13b8c44b44 loongarch: Add optimization for dsdot kernel. 2023-11-28 20:24:16 +08:00
Shiyou Yin
3def6a8143 loongarch: Add LASX optimization for dot. 2023-11-28 20:24:14 +08:00
Shiyou Yin
1310a0931b loongarch: Refine build control for loongarch64.
1. Use getauxval instead of cpucfg to test hardware capability.
2. Remove unnecessary code and option for compiler check in c_check.
2023-11-28 20:23:55 +08:00
Shiyou Yin
f745f02f35 benchmark: Fix missing colons in outputs of ./strsv.goto 2023-11-24 14:55:18 +08:00
Martin Kroeker
d36b86a794 Merge pull request #4320 from ChipKerchner/fixOldGCCPower
Fix older versions of gcc - missing __has_builtin, cpuid and no support of P10.
2023-11-15 08:48:17 +01:00
Chip-Kerchner
d99aad8ee3 Fix older version of gcc - missing __has_builtin, cpuid and no support of P10. 2023-11-14 11:07:08 -06:00
Martin Kroeker
46440a0486 Merge pull request #4317 from OpenMathLib/release-0.3.0
Merge release 0.3.25 back into develop to copy tag
2023-11-12 23:09:47 +01:00
Martin Kroeker
f4cc1b7a6f Update version to 0.3.25.dev 2023-11-12 23:07:19 +01:00
Martin Kroeker
dff686a86c Update version to 0.3.25.dev 2023-11-12 23:06:46 +01:00
Martin Kroeker
5e1a429eab Merge pull request #4316 from OpenMathLib/develop
Merge develop into release-0.3.0 for 0.3.25
v0.3.25
2023-11-12 22:55:00 +01:00
Martin Kroeker
64c96716f7 Merge branch 'release-0.3.0' into develop 2023-11-12 22:54:42 +01:00
Martin Kroeker
0e54cbd18c Update version to 0.3.25 2023-11-12 22:52:05 +01:00
Martin Kroeker
f1940010e4 Update version to 0.3.25 2023-11-12 22:51:26 +01:00
Martin Kroeker
a47ceda465 Merge pull request #4315 from martin-frbg/m3_cpufamily
Add OSX hw.cpufamily autodetection for Apple M3 as VORTEX
2023-11-12 22:49:58 +01:00
Martin Kroeker
e1f529d024 Add OSX hw.cpufamily value for Apple M3 2023-11-12 22:37:11 +01:00
Martin Kroeker
c245c12dc2 Update Changelog for 0.3.25 (#4314)
* Update Changelog.txt for 0.3.25
2023-11-12 22:17:39 +01:00
Martin Kroeker
fa615967cd Merge pull request #4312 from martin-frbg/fixotherproto
Fix empty function prototypes
2023-11-12 21:10:27 +01:00
Martin Kroeker
9b5f8eb33a Fix empty function prototypes 2023-11-12 19:35:53 +01:00
Martin Kroeker
ecaaece695 Merge pull request #4311 from martin-frbg/lapack930
Make vector orthogonalization more reliable (Reference-LAPACK PR 930)
2023-11-12 18:42:32 +01:00
Martin Kroeker
6f094c35ee Merge pull request #4305 from rgommers/ci-limit-runs
Limit CI runs to pushes and pull requests on main repo
2023-11-12 18:39:27 +01:00
Martin Kroeker
3d38da2bc4 Make vector orthogonalization more reliable (Reference-LAPACK PR 930) 2023-11-12 16:50:52 +01:00
Martin Kroeker
d58c88cf42 Merge pull request #4310 from martin-frbg/lapack904
Apply rounding up to workspace calculations done with reals (Reference-LAPACK PR 904)
2023-11-12 16:45:10 +01:00
Martin Kroeker
feeb10435b Merge pull request #4309 from martin-frbg/lapack926
Change ?GECON to return INFO=1 if RCOND is NaN (Reference-LAPACK PR 926)
2023-11-12 15:28:16 +01:00
Martin Kroeker
2ce67e2ada Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 14:42:52 +01:00
Martin Kroeker
f5664740cd Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 14:29:04 +01:00
Martin Kroeker
71fbdd908d Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 14:10:16 +01:00
Martin Kroeker
c9378badd9 Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 13:56:06 +01:00
Martin Kroeker
225036fd92 Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 13:43:22 +01:00
Martin Kroeker
eef4d15369 Merge pull request #4308 from martin-frbg/issue4277-2
Add workaround for omp_get_max_threads hanging on FreeBSD/LLVM14
2023-11-12 13:08:43 +01:00
Martin Kroeker
58427ff74d Deprecate ?GELQS and ?GEQRS from TESTING/LIN (Reference-LAPACK PR 900) (#4307)
* Move ?GELQS and ?GEQRS from TESTING/LIN to DEPRECATED (Reference-LAPACK PR 900)

* Add f2c-converted versions of ?GELQS and ?GEQRS
2023-11-12 10:54:39 +01:00
Martin Kroeker
b6144f70ff Change ?GECON to return INFO=1 if RCOND is NaN (Reference-LAPACK PR 926) 2023-11-11 23:41:18 +01:00
Martin Kroeker
00ef1bb58a Merge pull request #4306 from angsch/develop
Improve matcopy interface
2023-11-11 23:19:10 +01:00
Martin Kroeker
9324520d0e typo fix 2023-11-11 23:14:58 +01:00
Martin Kroeker
ff6437f2d7 Add workaround for omp_get_max_threads hanging on FreeBSD with libomp from LLVM14 2023-11-11 21:30:32 +01:00
Martin Kroeker
9c3c1cfbd6 Merge pull request #4304 from martin-frbg/issue4277
Move clang/gfortran OpenMP dependency rewriting out of f_check
2023-11-11 20:58:21 +01:00
Martin Kroeker
cad10a3caa Merge pull request #4303 from martin-frbg/ryzen-avx512
Enable autodetection of Zen 3/4 cpus as their AVX512 Intel counterparts
2023-11-11 18:36:24 +01:00
Martin Kroeker
95ed8125fa Merge pull request #4290 from martin-frbg/issue4275-2
Put more build information into Makefile.conf_last
2023-11-11 15:28:57 +01:00
Angelika Schwarz
5ffbe646e1 Improve matcopy interface
* rows = 0 or cols = 0 is now a legal input and
  takes quick return path
* Follow BLAS/LAPACK convention that the leading
  dimensions must be at least 1.
2023-11-11 11:16:10 +01:00
Ralf Gommers
8613632dc5 Trigger AWS Graviton 3 CI job only for develop and release branches 2023-11-10 16:34:47 +01:00
Ralf Gommers
2418a20f1f Cancel running CI jobs when new changes are pushed to a PR
The `group` expression ensures that the cancel-in-progress
behavior is to only cancel if a new commit is pushed to the PR for
which the job is running, not other PRs.

This is a fairly standard snippet, used also in CI jobs for NumPy
and other projects.
2023-11-10 16:34:47 +01:00
Ralf Gommers
1d4ed20c2f Add conditions to all CI jobs to only run on main repo by default
This is a follow-up to gh-4271. At the moment, when a contributor
pushes the latest `develop` to their own branch to bring their own
fork in sync with `main`, or if they push another branch, this triggers
30 CI jobs to run. Most will complete silently and only burn CPU
time unnecessarily. If there's a failure, this may result in unexpected
failure notifications. And the AWS Graviton3 run won't complete at all
and time out, since the Cirun hook will only work when triggered from
the main repo.
2023-11-10 16:34:47 +01:00
Martin Kroeker
1a308a0066 Move OpenMP dependency handling for clang/gfortran combo 2023-11-10 15:27:46 +01:00
Martin Kroeker
3ad27007fc rebase 2023-11-10 15:19:11 +01:00
Martin Kroeker
778e3b746a Enable autodetection of current AMD cpus as their AVX512 Intel counterparts 2023-11-10 11:55:29 +01:00
Martin Kroeker
273f4e8794 Merge pull request #4300 from ChipKerchner/powerFortranAIXMakefiles
Fix FCOMMON_OPT for power.  Error out for certain C and Fortran compiler combos on AIX
2023-11-08 23:33:17 +01:00
Chip Kerchner
3ada8fbf8a Merge remote-tracking branch 'origin/develop' into powerFortranAIXMakefiles 2023-11-08 11:17:45 -06:00
Martin Kroeker
0a609cc689 Merge pull request #4298 from ChipKerchner/fixBuiltinsDynamicArch
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-08 15:57:57 +01:00
Chip-Kerchner
4eecccd49b Fix __builtin_cpu_is for AIX. 2023-11-08 07:12:21 -06:00
Chip-Kerchner
5e31c57083 Only define __builtin_cpu_is and __builtin_cpu_supports if not present. 2023-11-07 20:58:34 -06:00
Chip Kerchner
206e76187e Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos in AIX. 2023-11-07 18:08:57 -06:00