# --- Environment setup ---
# Activate the conda environment.
conda activate fastchat

# Install PyTorch with CUDA 11.6 builds.
pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 \
  --extra-index-url https://download.pytorch.org/whl/cu116

# Verify the install in an interactive Python session:
#   >>> import torch
#   >>> print(torch.__version__)   # expect: 1.13.1+cu116
#   >>> print(torch.version.cuda)  # expect: 11.6
#   >>> exit()

# Install FastChat (Tsinghua PyPI mirror).
pip install fschat -i https://pypi.tuna.tsinghua.edu.cn/simple

# After installing FastChat, protobuf must be pinned back to 3.20.0.
pip install protobuf==3.20.0 -i https://pypi.tuna.tsinghua.edu.cn/simple
# Convert the raw 7B LLaMA checkpoint into Hugging Face format.
# NOTE(fix): the output dir was transformer_model_13b, contradicting
# --model_size 7B and the --base path the later apply_delta step reads;
# renamed to transformer_model_7b so the steps line up.
python /home/hcx/transformers-main/src/transformers/models/llama/convert_llama_weights_to_hf.py \
    --input_dir /home/hcx/LLaMA --model_size 7B --output_dir /home/hcx/out/model/transformer_model_7b
# Build the FastChat Vicuna model by applying the delta weights on top of the converted base.
export https_proxy=http://192.168.12.65:1984 # route the delta download through the VPN proxy
python -m fastchat.model.apply_delta --base /home/hcx/out/model/transformer_model_7b --target /home/hcx/out/model/vicuna-7b --delta lmsys/vicuna-7b-delta-v1.1
  • --base:表示转换为HF格式后的原始LLaMA模型的路径
  • --target:表示生成后的vicuna模型的路径
  • --delta:表示要应用的delta权重(如 lmsys/vicuna-7b-delta-v1.1,未下载时会自动从Hugging Face拉取)
  • 启动小羊驼

    # Launch Vicuna in terminal/CLI mode on the server (2 GPUs).
    CUDA_VISIBLE_DEVICES='4,5'  python -m fastchat.serve.cli --model-path /home/hcx/out/model/vicuna-7b --num-gpus 2
    
    # Launch Vicuna behind the web GUI (three cooperating services):
    # Step 1: start the controller.
    python3 -m fastchat.serve.controller
    # Step 2: start the model worker (registers itself with the controller).
    CUDA_VISIBLE_DEVICES='1,2' python -m fastchat.serve.model_worker --model-path /home/hcx/out/model/vicuna-7b --num-gpus 2
    # Step 2.1: verify the controller and worker can talk to each other.
    python3 -m fastchat.serve.test_message --model-name vicuna-7b
    # Step 3: start the Gradio web server.
    python -m fastchat.serve.gradio_web_server
    # Then browse to http://<server-IP>:7860