Commit Graph

  • 63a103ba6e sbgemm: spr: disable small matrix path by default Wangyang Guo 2021-10-12 01:18:37 -0700
  • 82194ea9d2 sbgemm: spr: implement otcopy_16 Wangyang Guo 2021-09-23 01:08:40 -0700
  • 8632380a96 sbgemm: spr: reuse ncopy_16 from cooperlake as incopy Wangyang Guo 2021-09-18 01:11:31 -0700
  • 6bc8204ce5 sbgemm: spr: optimization for tmp_c buffer Wangyang Guo 2021-09-17 23:59:32 -0700
  • f018aa342a sbgemm: spr: kernel handle alpha != 1.0 Wangyang Guo 2021-09-17 00:48:52 -0700
  • a52456b168 sbgemm: spr: oncopy: use tile load/store instead Wangyang Guo 2021-09-16 20:08:42 -0700
  • f2485352a6 sbgemm: spr: only load A once in tail_k handling Wangyang Guo 2021-09-16 01:04:01 -0700
  • 9ab33228bb sbgemm: spr: process k2 and odd k at the same time Wangyang Guo 2021-09-15 23:59:38 -0700
  • 7b2f5cb3b7 sbgemm: spr: enlarge P to 256 for performance Wangyang Guo 2021-09-15 20:29:49 -0700
  • 10d52646e2 sbgemm: spr: oncopy: avoid handling too much pointer at a time Wangyang Guo 2021-09-15 19:36:02 -0700
  • 88154ed02d sbgemm: spr: reduce tile conf loading by seperate tail k handling Wangyang Guo 2021-09-15 01:11:15 -0700
  • 0abbcd19c1 sbgemm: spr: tuning for blocking params Wangyang Guo 2021-09-13 01:44:53 -0700
  • a70bfb52d5 sbgemm: spr: kernel works for NN case when alpha is 1.0 Wangyang Guo 2021-09-12 19:22:58 -0700
  • 6051c86741 sbgemm: spr: kernel works for m32 in NN case Wangyang Guo 2021-09-10 01:14:05 -0700
  • d0b253ac6e sbgemm: spr: implement oncopy_16 Wangyang Guo 2021-09-08 19:41:12 -0700
  • 1d48b7cb16 sbgemm: spr: add dummy source files Wangyang Guo 2021-09-06 19:48:23 -0700
  • 6adea52436
    Add initial support for Fujitsu A64FX as generic ARMV8 Martin Kroeker 2021-10-18 00:28:57 +0200
  • 3cb1ec2a96
    Add basic autodetection support for Fujitsu A64FX Martin Kroeker 2021-10-18 00:27:54 +0200
  • b57acdf2d3
    Add march/mtune flags for clang builds on ARM64 as well (#3414) Martin Kroeker 2021-10-18 00:26:14 +0200
  • 02ea3db8e7
    Merge pull request #3404 from guowangy/spr-build Martin Kroeker 2021-10-17 23:05:11 +0200
  • 4e4f78442e
    Merge pull request #3413 from MehdiChinoune/cmake-readibiltiy Martin Kroeker 2021-10-17 22:46:48 +0200
  • 556788281d [NFC] Improve CMakeLists.txt file readibility Mehdi Chinoune 2021-10-17 05:19:30 +0100
  • f348506463
    Merge pull request #3411 from MehdiChinoune/both_shared_static Martin Kroeker 2021-10-17 20:07:14 +0200
  • 8c6673c276
    fix typo and remove clang entry for EMAG8180 for now Martin Kroeker 2021-10-17 16:26:38 +0200
  • eeb76d63b3
    Fix syntax Martin Kroeker 2021-10-17 15:46:44 +0200
  • f06c3c7d2b
    Add march/mtune flags for clang as well Martin Kroeker 2021-10-17 14:57:35 +0200
  • 059e972d3e
    Update azure-pipelines.yml Martin Kroeker 2021-10-16 22:09:17 +0200
  • 28a77a8698 Support building both static and shared libraries Mehdi Chinoune 2021-10-16 08:33:47 +0100
  • 3791d320d2
    Update azure-pipelines.yml Martin Kroeker 2021-10-16 16:45:14 +0200
  • a663567307
    Update azure-pipelines.yml Martin Kroeker 2021-10-16 15:30:15 +0200
  • 3f7232d8bd
    Update azure-pipelines.yml Martin Kroeker 2021-10-16 13:58:47 +0200
  • 481b3dc4b4
    Merge pull request #3410 from MehdiChinoune/mingw-clang-64 Martin Kroeker 2021-10-16 13:52:41 +0200
  • a7ba58ee2e Silent warning from cmake about version<2.8.12 Mehdi Chinoune 2021-10-16 09:25:26 +0100
  • efd7ac241d Fix MinGW/Clang 64 bits detection. مهدي شينون (Mehdi Chinoune) 2021-10-16 07:55:10 +0100
  • 6c92f09215
    Update azure-pipelines.yml Martin Kroeker 2021-10-15 16:24:40 +0200
  • a5c733f9a9
    Update azure-pipelines.yml Martin Kroeker 2021-10-15 15:08:38 +0200
  • 5264ea396f
    Update azure-pipelines.yml Martin Kroeker 2021-10-15 13:55:02 +0200
  • bed8e250ba
    Fix missing gfortran in OSX cmake-clang-gfortran job Martin Kroeker 2021-10-15 12:35:22 +0200
  • 4f1aae7eda
    Update param.h Martin Kroeker 2021-10-15 09:40:58 +0200
  • 95ce6d31fb
    Update param.h Martin Kroeker 2021-10-14 23:52:56 +0200
  • 763042b980
    Update param.h Martin Kroeker 2021-10-14 22:16:47 +0200
  • 9991bb578e
    Update param.h Martin Kroeker 2021-10-14 17:45:14 +0200
  • f7193770ad
    Update param.h Martin Kroeker 2021-10-14 16:25:33 +0200
  • d2df5bd72c
    Update param.h Martin Kroeker 2021-10-14 14:51:04 +0200
  • af4d4e55d1
    Update param.h Martin Kroeker 2021-10-14 13:51:19 +0200
  • f5e7fe0ec4
    Update param.h Martin Kroeker 2021-10-14 10:37:09 +0200
  • 93cec29c8f
    Update param.h Martin Kroeker 2021-10-14 08:59:14 +0200
  • fca8259062
    Update param.h Martin Kroeker 2021-10-13 22:30:26 +0200
  • 656b17b4bf
    Update param.h Martin Kroeker 2021-10-13 21:43:47 +0200
  • c684cae97c
    Update param.h Martin Kroeker 2021-10-13 20:59:13 +0200
  • a7a05b78fe
    Update param.h Martin Kroeker 2021-10-13 14:52:56 +0200
  • 49878cad51
    Update param.h Martin Kroeker 2021-10-13 13:59:16 +0200
  • bb05550b13
    Update .travis.yml Martin Kroeker 2021-10-13 11:40:03 +0200
  • 699c0a0365
    Update param.h Martin Kroeker 2021-10-12 15:01:05 +0200
  • 3ce413d1db
    Update param.h Martin Kroeker 2021-10-12 13:30:58 +0200
  • 1049dfefa1
    Update param.h Martin Kroeker 2021-10-12 11:17:52 +0200
  • 1eca91f315 Fix build error in legacy gcc Wangyang Guo 2021-10-12 02:01:20 -0700
  • 4280dff103 Add NO_AVX=1 fallbacks to Sapphire Rapids build Wangyang Guo 2021-10-12 01:39:09 -0700
  • 3e409b156d
    Update param.h Martin Kroeker 2021-10-12 10:32:24 +0200
  • 3dc6052c7e initial support for Sapphire Rapids platform Wangyang Guo 2021-09-03 00:39:50 -0700
  • 4217096c92
    Update param.h Martin Kroeker 2021-10-11 22:15:41 +0200
  • ceb535c1ea
    Update param.h Martin Kroeker 2021-10-11 21:55:37 +0200
  • 2b3d2ef789
    Update param.h Martin Kroeker 2021-10-11 21:30:54 +0200
  • 17376df24f
    Update param.h Martin Kroeker 2021-10-11 21:08:00 +0200
  • 2cc76cc843
    Update param.h Martin Kroeker 2021-10-11 18:51:59 +0200
  • 1489e977bf
    Update param.h Martin Kroeker 2021-10-11 17:56:47 +0200
  • 0a92a783b1
    Update param.h Martin Kroeker 2021-10-11 17:32:54 +0200
  • 4224f7ee5d
    Update param.h Martin Kroeker 2021-10-11 17:19:25 +0200
  • 98548457e8
    Update param.h Martin Kroeker 2021-10-11 17:07:51 +0200
  • eda222a144
    Update .travis.yml Martin Kroeker 2021-10-11 14:55:04 +0200
  • fa7e4d86fc
    try 512/512 for neoverse dgemm Martin Kroeker 2021-10-11 13:03:24 +0200
  • 8a87e80c74
    Update conda in Appveyor CI and move jobs from Appveyor to Azure (#3400) Martin Kroeker 2021-10-10 23:24:52 +0200
  • 7e0c935854
    Add clang/flang and mingw builds on Windows Martin Kroeker 2021-10-10 22:19:48 +0200
  • 7241283198
    Fix clang/cl builds and move them to Azure Martin Kroeker 2021-10-10 22:18:51 +0200
  • b83b0808db
    Update appveyor.yml Martin Kroeker 2021-10-10 18:07:53 +0200
  • 8789d85022
    Update appveyor.yml Martin Kroeker 2021-10-10 16:00:32 +0200
  • 32dab3e0b4
    Update appveyor.yml Martin Kroeker 2021-10-10 15:51:31 +0200
  • d4ff5eafb3
    Update appveyor.yml Martin Kroeker 2021-10-10 15:48:12 +0200
  • 17c237c5a0
    Update appveyor.yml Martin Kroeker 2021-10-10 15:36:04 +0200
  • 58455df95f
    Update azure-pipelines.yml Martin Kroeker 2021-10-10 15:34:39 +0200
  • 0c9efc81dd
    Update azure-pipelines.yml Martin Kroeker 2021-10-10 14:07:49 +0200
  • f42e745369
    Update CMakeLists.txt Martin Kroeker 2021-10-10 13:07:21 +0200
  • 5173b33275
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 23:28:07 +0200
  • dd41bbe267
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 23:03:54 +0200
  • bb9c3eb32a
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 22:22:34 +0200
  • 47ecf8add4
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 21:45:25 +0200
  • 1cc1498e6c
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 19:27:25 +0200
  • 4726668688
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 16:31:19 +0200
  • 01eeb18691
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 15:59:16 +0200
  • 536c9eabc5
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 13:45:00 +0200
  • 238d207753
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 13:07:29 +0200
  • 12a4e194e5
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 12:37:44 +0200
  • 75fbfa379f
    Update azure-pipelines.yml Martin Kroeker 2021-10-09 10:25:44 +0200
  • d00f836206 Remove test branch from travis.yml Tanveen Bharaj 2021-10-08 14:39:04 -0500
  • 5b4f511011 Update travis.yml to build arm binaries Tanveen Bharaj 2021-10-06 15:02:00 -0500
  • 8381667002
    Update azure-pipelines.yml Martin Kroeker 2021-10-08 14:36:46 +0200
  • c72e6321ba
    Update azure-pipelines.yml Martin Kroeker 2021-10-08 14:22:11 +0200
  • 1e03613cfc
    Update CMakeLists.txt Martin Kroeker 2021-10-08 13:57:46 +0200
  • c9a936d257
    Update CMakeLists.txt Martin Kroeker 2021-10-08 13:40:07 +0200
  • 4bd03d4876
    Add Fortran to language list Martin Kroeker 2021-10-08 11:49:04 +0200