modelscope transformers_stream_generator auto-gptq optimum