GPT-SoVITS

 
git clone https://github.com/73xuetu/GPT-SoVITS

Features

 
1. Zero-shot TTS:
Provide a 5-second voice sample and get text-to-speech output in that voice right away.

2. Few-shot TTS:
Fine-tune the model with just 1 minute of training data for improved voice similarity and realism.

3. Cross-lingual Support:
Run inference in a language different from that of the training dataset; English, Japanese, Korean, Cantonese, and Chinese are currently supported.

4. WebUI Tools:
Integrated tools include vocal/accompaniment separation, automatic training set segmentation, Chinese ASR, and text labeling, which help beginners create training datasets and GPT/SoVITS models.
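
Since few-shot fine-tuning is described above as needing about 1 minute of audio, it can help to total the duration of the collected clips before training. The following is a minimal sketch, assuming `ffprobe` (shipped with ffmpeg) is on the PATH. The function name `total_duration` and the `dataset/*.wav` layout in the example comment are hypothetical and not part of GPT-SoVITS.

```shell
#!/bin/bash
# Hypothetical helper, not part of GPT-SoVITS: print the summed duration
# (in seconds) of the audio files passed as arguments.
total_duration() {
    local total=0 d f
    for f in "$@"; do
        # ffprobe prints the container duration in seconds, e.g. "12.345000"
        d=$(ffprobe -v error -show_entries format=duration \
            -of default=noprint_wrappers=1:nokey=1 "$f") || return 1
        # Shell arithmetic is integer-only, so add the floats with awk
        total=$(awk -v a="$total" -v b="$d" 'BEGIN { printf "%.2f", a + b }')
    done
    echo "$total"
}

# Example (assumed directory layout):
#   total_duration dataset/*.wav
```

A result of roughly 60 or more would match the 1-minute figure quoted for few-shot fine-tuning.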
  

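Key points

 
Vocal/accompaniment separation,
automatic training set segmentation,
Chinese ASR and text labeling
help beginners create training datasets.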

conda create -n GPTSoVits python=3.9
conda activate GPTSoVits
bash install.sh
  

 
$ cat install.sh
#!/bin/bash
conda install -c conda-forge gcc
conda install -c conda-forge gxx
conda install ffmpeg cmake
conda install pytorch==2.1.1 torchvision==0.16.1 torchaudio==2.1.1 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install -r requirements.txt
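
After `install.sh` finishes, a quick check that the tools it installs are visible inside the activated environment can catch a broken install early. This is only a sketch: `missing_tools` is a hypothetical helper, and the command names in the example comment are assumed to be what the packages listed in the script provide.

```shell
#!/bin/bash
# Hypothetical helper: print each named command that is not on PATH and
# return non-zero if any of them is missing.
missing_tools() {
    local rc=0 tool
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool"
            rc=1
        fi
    done
    return "$rc"
}

# Example (run inside the activated GPTSoVits environment; names assumed):
#   missing_tools gcc g++ ffmpeg cmake python pip
```

If anything is reported missing, re-running the corresponding `conda install` line from the script is the obvious first step.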
  

References