c0b5992fab289b6c5dcf1f4335f9db9ccd5ea0ab
added axpy kernel with prefetch for Loongson3A. To-Do: tuning prefetch distance & instruction order.
added axpy kernel with prefetch for Loongson3A. To-Do: tuning prefetch distance & instruction order.
OpenBLAS Readme 1.Introduction OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version. OpenBLAS is an open source project supported by Lab of Parallel Software and Computational Science, ISCAS.(http://www.rdcps.ac.cn) 2.Intallation check out codes from git://github.com/xianyi/OpenBLAS.git 1)Normal compile Please read GotoBLAS_02QuickInstall.txt or type "make" 2)Cross compile Please set CC and FC with the cross toolchains. Then, set HOSTCC with your host C compiler. At last, set TARGET explicitly. examples: On X86 box, compile this library for loongson3a CPU. make BINARY=64 CC=mips64el-unknown-linux-gnu-gcc FC=mips64el-unknown-linux-gnu-gfortran HOSTCC=gcc TARGET=LOONGSON3A 3.Support CPU & OS Please read GotoBLAS_01Readme.txt 4.Usages Link with libopenblas.a or -lopenblas for shared library. Set the number of threads. for example, export OPENBLAS_NUM_THREADS=4 or export OMP_NUM_THREADS=4 OPENBLAS_NUM_THREAD is prior to OMP_NUM_THREADS. 5.Report Bugs Please add a issue in https://github.com/xianyi/OpenBLAS/issues 6.To-Do List: Support ICT Loongson 3A CPU 7.Contact OpenBLAS users mailing list: http://list.rdcps.ac.cn/mailman/listinfo/openblas
Description
Languages
C
51.1%
Fortran
29.9%
Assembly
17.5%
Makefile
0.5%
C++
0.4%
Other
0.4%