git clone https://github.com/73xuetu/GPT-SoVITS

Features
1. Zero-shot TTS: Input a 5-second vocal sample and experience instant text-to-speech conversion.
2. Few-shot TTS: Fine-tune the model with just 1 minute of training data for improved voice similarity and realism.
3. Cross-lingual Support: Inference in languages different from the training dataset, currently supporting English, Japanese, Korean, Cantonese, and Chinese.
4. WebUI Tools: Integrated tools include voice accompaniment separation, automatic training set segmentation, Chinese ASR, and text labeling, assisting beginners in creating training datasets and GPT/SoVITS models.

Key points: voice accompaniment separation, automatic training set segmentation, and Chinese ASR with text labeling help beginners create training datasets.
conda create -n GPTSoVits python=3.9
conda activate GPTSoVits
bash install.sh

$ cat install.sh
#!/bin/bash
conda install -c conda-forge gcc
conda install -c conda-forge gxx
conda install ffmpeg cmake
conda install pytorch==2.1.1 torchvision==0.16.1 torchaudio==2.1.1 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install -r requirements.txt
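
After install.sh finishes, it can be useful to confirm that the tools it installs are actually reachable from the activated environment. The following is a minimal sketch, not part of the repository: the function names have_tool and check_tools are hypothetical, and it assumes the conda packages expose the command names gcc, g++, ffmpeg, and cmake on PATH.

```shell
#!/bin/bash
# Hypothetical helper (not shipped with GPT-SoVITS): report whether the
# tools that install.sh is expected to provide are reachable on PATH.

# Return 0 if the named command is found on PATH, non-zero otherwise.
have_tool() {
    command -v "$1" >/dev/null 2>&1
}

# Print one status line per tool; the return code is the number of
# missing tools (0 means everything was found).
check_tools() {
    local missing=0
    local tool
    for tool in "$@"; do
        if have_tool "$tool"; then
            echo "ok       $tool"
        else
            echo "MISSING  $tool"
            missing=$((missing + 1))
        fi
    done
    return "$missing"
}
```

With the GPTSoVits environment active, one possible invocation is: check_tools gcc g++ ffmpeg cmake python. To check the PyTorch build as well, python -c "import torch; print(torch.__version__, torch.cuda.is_available())" prints the installed version and whether a CUDA device is visible.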