binary-husky 5fcd02506c
version 3.75 (#1702)
Co-authored-by: Menghuan1918 <menghuan2003@outlook.com>
Co-authored-by: Skyzayre <120616113+Skyzayre@users.noreply.github.com>
Co-authored-by: XIao <46100050+Kilig947@users.noreply.github.com>
Co-authored-by: Yuki <903728862@qq.com>
Co-authored-by: zyren123 <91042213+zyren123@users.noreply.github.com>
Co-authored-by: zym9804 <ren990603@gmail.com>
2024-04-30 20:37:41 +08:00


Using TTS (Text-to-Speech)

1. Using EDGE-TTS (simple)

Change this project's configuration options as follows:

TTS_TYPE = "EDGE_TTS"
EDGE_TTS_VOICE = "zh-CN-XiaoxiaoNeural"
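
To pick a different value for EDGE_TTS_VOICE, it may help to list the voices available for a locale. The sketch below assumes the third-party `edge-tts` package from PyPI is installed and that its `list_voices()` coroutine returns dictionaries with `ShortName` and `Locale` keys; the helper names `short_names_for_locale`, `fetch_voices`, and `print_voices` are hypothetical and not part of this project.

```python
import asyncio


def short_names_for_locale(voices, locale):
    """Return the sorted ShortName of every voice entry whose Locale matches."""
    return sorted(v["ShortName"] for v in voices if v.get("Locale") == locale)


async def fetch_voices():
    # Imported lazily so the pure helper above works without the package.
    # Assumption: `pip install edge-tts` has been run.
    import edge_tts

    return await edge_tts.list_voices()


def print_voices(locale="zh-CN"):
    """Fetch the voice list (needs network access) and print matching names."""
    voices = asyncio.run(fetch_voices())
    for name in short_names_for_locale(voices, locale):
        print(name)
```

Calling `print_voices("zh-CN")` prints candidate names; any of them can then be copied into EDGE_TTS_VOICE.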

2. Using SoVITS (requires a GPU)

First start the SoVITS service API using the docker-compose.yml file below.

  1. Create the following folder structure
    .
    ├── docker-compose.yml
    └── reference
        ├── clone_target_txt.txt
        └── clone_target_wave.mp3
    
  2. The docker-compose.yml file contains
    version: '3.8'
    services:
      gpt-sovits:
        image: fuqingxu/sovits_gptac_trim:latest
        container_name: sovits_gptac_container
        working_dir: /workspace/gpt_sovits_demo
        environment:
          - is_half=False
          - is_share=False
        volumes:
          - ./reference:/reference
        ports:
          - "19880:9880"  # 19880 is the exposed port of the sovits api; remember it
        shm_size: 16G
        deploy:
          resources:
            reservations:
              devices:
              - driver: nvidia
                count: "all"
                capabilities: [gpu]
        command: bash -c "python3 api.py"
    
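  3. clone_target_wave.mp3 is the audio of the character voice to clone, and clone_target_txt.txt is the text transcript of that audio (https://wiki.biligame.com/ys/%E8%A7%92%E8%89%B2%E8%AF%AD%E9%9F%B3)
  4. Run docker-compose up
  5. Change this project's configuration options as follows (19880 is the exposed port of the sovits api and matches the port in docker-compose.yml)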
    TTS_TYPE = "LOCAL_SOVITS_API"
    GPT_SOVITS_URL = "http://127.0.0.1:19880"
    
  6. Start this project
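
Before starting the project, the steps above can be sanity-checked with a short preflight script. This is a minimal sketch using only the Python standard library: it checks that the two reference files exist and that something is listening on the host and port given in GPT_SOVITS_URL. The function names are hypothetical, and a successful TCP connection only shows that the port is open, not that the SoVITS API itself is healthy.

```python
import socket
from pathlib import Path
from urllib.parse import urlsplit

# File names taken from the folder structure in step 1.
REQUIRED_FILES = ("clone_target_txt.txt", "clone_target_wave.mp3")


def missing_reference_files(base_dir):
    """List the required files that are absent from <base_dir>/reference."""
    ref = Path(base_dir) / "reference"
    return [name for name in REQUIRED_FILES if not (ref / name).is_file()]


def host_port(url):
    """Split a URL such as GPT_SOVITS_URL into a (host, port) pair."""
    parts = urlsplit(url)
    if parts.hostname is None or parts.port is None:
        raise ValueError(f"expected http://host:port, got {url!r}")
    return parts.hostname, parts.port


def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, `port_is_open(*host_port("http://127.0.0.1:19880"))` should return True once `docker-compose up` has finished starting the container, and `missing_reference_files(".")` should return an empty list when run next to docker-compose.yml.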