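feat(chatglm_int8_onnx): pure-CPU inference that needs at most 8 GB of RAM; inference speed not yet benchmarked, token count is limited, and streaming output is not yet supported #1008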
11
request_llm/requirements_chatglm_onnx.txt
Normal file
@@ -0,0 +1,11 @@
protobuf
transformers==4.27.1
cpm_kernels
torch>=1.10
mdtex2html
sentencepiece
numpy
onnxruntime
sentencepiece
streamlit
streamlit-chat
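Review note: the added file lists `sentencepiece` twice, and only `transformers` and `torch` carry version constraints. The sketch below is one way to check this mechanically. It is not part of the commit; the helper names `parse_requirements` and `find_duplicates` are hypothetical, and the parser only handles simple `name` or `name<specifier>` lines like the ones in this file.

```python
import re
from collections import Counter

# Contents of request_llm/requirements_chatglm_onnx.txt as added by this commit.
REQUIREMENTS = """\
protobuf
transformers==4.27.1
cpm_kernels
torch>=1.10
mdtex2html
sentencepiece
numpy
onnxruntime
sentencepiece
streamlit
streamlit-chat
"""


def parse_requirements(text):
    """Hypothetical helper: return (name, specifier) pairs, one per entry."""
    entries = []
    for raw in text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        # Separate the package name from an optional specifier such as ==4.27.1.
        match = re.match(r"^([A-Za-z0-9._-]+)\s*(.*)$", line)
        if match:  # anything more complex than name/specifier is skipped
            entries.append((match.group(1).lower(), match.group(2).strip()))
    return entries


def find_duplicates(entries):
    """Hypothetical helper: return sorted names that appear more than once."""
    counts = Counter(name for name, _ in entries)
    return sorted(name for name, n in counts.items() if n > 1)


entries = parse_requirements(REQUIREMENTS)
duplicates = find_duplicates(entries)
# Packages that carry any version constraint.
pinned = {name: spec for name, spec in entries if spec}

print("entries:", len(entries))
print("duplicates:", duplicates)
print("constrained:", pinned)
```

The duplicate entry is harmless to pip, since both lines name the same unconstrained package, but removing one of them would keep the file tidy.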