2.2 KiB

Raw Blame History

title	sidebar_label
部署 Time-MoE 模型	部署 Time-MoE 模型

准备环境

为了使用时间序列基础模型，需要在本地部署环境支持其运行。首先需要准备 Python 环境。使用 PiPy 安装必要的依赖包：

pip install torch==2.4.1+cpu -f https://download.pytorch.org/whl/torch_stable.html
pip install flask==3.0.3
pip install transformers==4.40.0
pip install accelerate

设置服务端口和 URL 地址

TDgpt 安装根目录下的 ./lib/taosanalytics/time-moe.py 文件负责 Time-MoE 模型的部署和服务，修改该问题设置合适的服务 URL 和服务端口即可。

@app.route('/ds_predict', methods=['POST'])
def time_moe():
...

修改 ds_predict 为需要开启的 URL 服务地址，或者使用默认路径即可。

    app.run(
            host='0.0.0.0',
            port=5001,
            threaded=True,  
            debug=False     
        )

其中的 port 修改为希望开启的端口，重启脚本即可。

启动部署 Python 脚本

nohup Python time-moe.py > custom_output.out 2>&1 &

第一次启动脚本会从 huggingface 自动加载2亿参数时序数据基础模型。该模型是 Time-MoE 200M参数版本，如果您需要部署参数规模更小的版本请将 time-moe.py 文件中 'Maple728/TimeMoE-200M' 修改为 Maple728/TimeMoE-50M，此时将加载 0.5亿参数版本模型。

如果加载失败，请尝试执行如下命令切换为国内镜像下载模型。

export HF_ENDPOINT=https://hf-mirror.com

再执行脚本：

nohup Python time-moe.py > custom_output.out 2>&1 &

显示如下，则说明加载成功

Running on all addresses (0.0.0.0)
Running on http://127.0.0.1:5001

验证服务是否正常

使用 Shell 命令可以验证服务是否正常

curl 127.0.0.1:5001/ds_predict

如果看到如下返回表明服务正常

<!doctype html>
<html lang=en>
<title>405 Method Not Allowed</title>
<h1>Method Not Allowed</h1>
<p>The method is not allowed for the requested URL.</p>

2.2 KiB Raw Blame History Unescape Escape

准备环境

设置服务端口和 URL 地址

启动部署 Python 脚本

验证服务是否正常

2.2 KiB

Raw Blame History