Commit Graph

8393 Commits

Author SHA1 Message Date
Martin Kroeker 3cf57a61d5
Merge pull request #4609 from yu-chen-surf/develop
Get the l2 cache size via environment variable on confidential VM
2024-04-06 19:02:00 +02:00
Martin Kroeker fbd42e9e0e
Merge pull request #4616 from MehdiChinoune/patch-1
Don't pass `-exhaustive-register-search` directly to clang compiler
2024-04-06 14:12:10 +02:00
Martin Kroeker 03ff65190d
Merge pull request #4614 from martin-frbg/issue4449-2
Retain the bf16 in fallback versions of the NeoverseN2 -march flag
2024-04-06 12:29:28 +02:00
Martin Kroeker 12650c912c
Merge pull request #4613 from martin-frbg/issue4612
Do not run the CBLAS_?GEMM3M tests when cross-compiling with gmake
2024-04-06 11:11:27 +02:00
Martin Kroeker 4eb4b033e5
Merge pull request #4610 from martin-frbg/issue4608
Make the new ZSCAL utest not require CBLAS
2024-04-06 08:38:36 +02:00
cyk 95315f95d2
Update user_manual.md for static linker
When linking statically, the build fails with an undefined reference to pthread_create, so -lpthread should be added.
2024-04-06 13:51:01 +08:00
مهدي شينون (Mehdi Chinoune) cda55f2fd2
Don't pass `-exhaustive-register-search` directly to clang compiler
`-exhaustive-register-search` is an LLVM code generation flag that shouldn't be passed directly to clang compiler.
2024-04-06 05:54:48 +01:00
Martin Kroeker 14e71c249d
retain the bf16 capability in fallback versions of the -march option for NeoverseN2 2024-04-05 23:01:34 +02:00
Martin Kroeker 48e017de09
fix position of endif - gemm3m tests should not be run in cross-compiles 2024-04-05 22:54:34 +02:00
Martin Kroeker 9c86838279
use blasint for INTERFACE64 compatibility 2024-04-05 22:48:08 +02:00
Martin Kroeker d3f93c6015
fix arguments of zscal 2024-04-05 07:23:41 +02:00
Martin Kroeker 1f080b9328
Update test_zscal.c 2024-04-05 07:09:34 +02:00
Martin Kroeker ec8e9451f0
make independent of CBLAS 2024-04-05 06:45:59 +02:00
Chen Yu 8e39c05efd
Get the l2 cache size via environment variable on confidential VM
CPUID (leaf 2 or leaf 0x80000006) is not supported on some confidential
VMs. As a result, get_l2_size() returns the default 512M, which causes
performance issues.

Introduce the user-provided environment variable OPENBLAS_L2_SIZE as a way
to obtain the L2 cache size.

Suggested-by: "Keshavamurthy, Anil S" <anil.s.keshavamurthy@intel.com>
Signed-off-by: Chen Yu <yu.c.chen@intel.com>
2024-04-05 11:39:01 +08:00
Martin Kroeker bebe5e5399
Merge pull request #4562 from honno/mkdocs-wiki
Fold wiki contents into formal documentation, build-able with `mkdocs`
2024-04-04 23:05:25 +02:00
Martin Kroeker 1c7d27c750
Update version to 0.3.27.dev 2024-04-04 23:02:29 +02:00
Martin Kroeker 17ab724da9
Update version to 0.3.27.dev 2024-04-04 23:01:32 +02:00
Martin Kroeker 5f204bb008
Merge pull request #4607 from OpenMathLib/release-0.3.0
merge back from release branch to copy tag
2024-04-04 23:00:33 +02:00
Martin Kroeker ce3f668c99
Update version to 0.3.27 2024-04-04 22:26:04 +02:00
Martin Kroeker 8f3bb62254
Merge pull request #4606 from OpenMathLib/develop
Merge develop branch for 0.3.27
2024-04-04 22:24:56 +02:00
Martin Kroeker c17f5bee81
Merge branch 'release-0.3.0' into develop 2024-04-04 22:24:40 +02:00
Martin Kroeker 0475716e2e
Update version to 0.3.27 2024-04-04 22:03:18 +02:00
Martin Kroeker 1dcbc4e0bb
Merge pull request #4605 from martin-frbg/changelog0327
Update Changelog.txt for 0.3.27
2024-04-04 21:35:47 +02:00
Martin Kroeker c5184078b4
Update Changelog.txt for 0.3.27 2024-04-04 19:07:51 +02:00
Martin Kroeker f5e5109318
Merge pull request #4604 from martin-frbg/zenprefsize
Adjust SWITCH_RATIO for ZEN and apply GEMM_PREFERRED_SIZE
2024-04-04 18:53:41 +02:00
Martin Kroeker ba6d485102
Adjust SWITCH_RATIO for ZEN and apply GEMM_PREFERRED_SIZE 2024-04-04 18:52:38 +02:00
Martin Kroeker ffedd8a2cb
Merge pull request #4603 from martin-frbg/cleanup4043
Clean up misplaced LAPACK files from PR4043 (in-code documentation changes only)
2024-04-04 13:45:16 +02:00
Martin Kroeker 5e1937531f
Merge pull request #4602 from martin-frbg/gitign_3m
Add GEMM3M tests and logs to .gitignore
2024-04-04 12:36:41 +02:00
Martin Kroeker 20145ca868
Delete misplaced file (move to SRC) 2024-04-04 11:48:54 +02:00
Martin Kroeker 45164fe406
Delete misplaced file (move to SRC) 2024-04-04 11:48:26 +02:00
Martin Kroeker f58f097a51
Delete misplaced file (move to SRC) 2024-04-04 11:48:00 +02:00
Martin Kroeker 099f10b706
Delete misplaced file (move to SRC) 2024-04-04 11:47:34 +02:00
Martin Kroeker bdcb5a23f6
Delete misplaced file (move to SRC) 2024-04-04 11:47:06 +02:00
Martin Kroeker 5e510a1289
Delete misplaced file (move to SRC) 2024-04-04 11:46:40 +02:00
Martin Kroeker 34fcd687eb
Delete misplaced file (move to SRC) 2024-04-04 11:46:12 +02:00
Martin Kroeker 587b193e23
Delete misplaced file (move to SRC) 2024-04-04 11:45:43 +02:00
Martin Kroeker e4b695d798
Fix typos and errors in comments (Reference-LAPACK 809) 2024-04-04 11:44:55 +02:00
Martin Kroeker bd07a6e7b2
Delete misplaced (and obsoleted) file 2024-04-04 11:41:49 +02:00
Martin Kroeker 18d9759650
Delete misplaced (and obsoleted) file 2024-04-04 11:41:20 +02:00
Martin Kroeker edcd5b1797
Delete misplaced (and obsoleted) file 2024-04-04 11:40:46 +02:00
Martin Kroeker a721cac76a
Add GEMM3M tests and logs 2024-04-04 11:28:49 +02:00
Martin Kroeker b41088bef7
Merge pull request #4601 from martin-frbg/clapacklogical
Fix typedef of logical in the f2c-converted LAPACK C files to match INTERFACE64
2024-04-04 11:21:05 +02:00
Martin Kroeker 0bd0df8418
fix typedef of logical to support INTERFACE64 2024-04-04 02:00:24 +02:00
Martin Kroeker d93944466c
fix typedef of logical to support INTERFACE64 2024-04-04 01:54:47 +02:00
Martin Kroeker 03698f4c1d
fix typedef of logical to support INTERFACE64 2024-04-04 01:46:46 +02:00
Martin Kroeker 353b1180c1
fix typedef of logical to support INTERFACE64 2024-04-04 01:38:10 +02:00
Martin Kroeker 8bed05a936
fix typedef of logical to support INTERFACE64 2024-04-04 01:32:37 +02:00
Martin Kroeker 466fb61c95
fix typedef of logical to support INTERFACE64 2024-04-04 01:27:57 +02:00
Martin Kroeker 7e470400c9
fix typedef of logical to support INTERFACE64 2024-04-04 01:24:27 +02:00
Martin Kroeker 373e36377c
fix typedef of logical to support INTERFACE64 2024-04-04 01:19:55 +02:00