VALL-E X: Multilingual Text-to-Speech Synthesis and Voice Cloning.

Input

A sentence for text to speech
An audio file and its transcript for voice cloning

Output

The synthesized speech is saved as a .wav file at the path defined by SAVE_WAV_PATH in vall-e-x.py.
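To sanity-check the generated file, a minimal sketch using Python's standard `wave` module is shown below. It assumes the output is an uncompressed PCM .wav (the `wave` module cannot read other encodings); the helper name `describe_wav` and the file name in the usage comment are illustrative and not part of this repository.

```python
import wave


def describe_wav(path):
    """Return basic properties of a PCM .wav file (hypothetical helper)."""
    with wave.open(path, "rb") as wf:
        frames = wf.getnframes()
        rate = wf.getframerate()
        return {
            "channels": wf.getnchannels(),
            "sample_rate": rate,
            "duration_sec": frames / rate,
        }


# Usage (assumed file name; substitute the path you passed as the save path):
# print(describe_wav("output.wav"))
```

A duration of zero or a read error from `wave` would suggest that synthesis did not complete or that the file is not plain PCM.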

Requirements

This model requires pyopenjtalk for grapheme-to-phoneme (g2p) conversion.

pip3 install -r requirements.txt
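After installing, you may want to confirm that the g2p dependency is importable before running the model. The sketch below uses only the standard library; `missing_modules` is a hypothetical helper, and it checks top-level module names only.

```python
import importlib.util


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [n for n in names if importlib.util.find_spec(n) is None]


# Usage: an empty list means pyopenjtalk can be imported.
# print(missing_modules(["pyopenjtalk"]))
```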

Usage

The ONNX and prototxt files are downloaded automatically on the first run, so an Internet connection is required at that time.

To run with the sample sentence:

python3 vall-e-x.py

To synthesize your own sentence, put the text after the --input option. Use the --savepath option to change the path of the output file.

python3 vall-e-x.py --input "Hello world." --savepath SAVE_WAV_PATH

To clone a voice, pass an audio prompt with --audio and its transcript with --transcript:

python3 vall-e-x.py -i "音声合成のテストを行なっています。" --audio BASIC5000_0001.wav --transcript "水をマレーシアから買わなくてはならないのです" -e 1
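When synthesizing many sentences, it can help to assemble the command line in Python. The sketch below uses only the options that appear in the commands above (`--input`, `--savepath`, `--audio`, `--transcript`). The helper `build_command` is hypothetical, and the rule that an audio prompt and its transcript must be supplied together is an assumption based on the Input section, not something the script is confirmed to enforce.

```python
import subprocess


def build_command(text, savepath=None, audio=None, transcript=None,
                  script="vall-e-x.py"):
    """Build an argv list for vall-e-x.py (hypothetical helper)."""
    # Assumption: voice cloning needs both the audio prompt and its transcript.
    if (audio is None) != (transcript is None):
        raise ValueError("audio and transcript must be given together")
    cmd = ["python3", script, "--input", text]
    if savepath is not None:
        cmd += ["--savepath", savepath]
    if audio is not None:
        cmd += ["--audio", audio, "--transcript", transcript]
    return cmd


def synthesize(text, **kwargs):
    """Run the script once; raises CalledProcessError on a non-zero exit."""
    subprocess.run(build_command(text, **kwargs), check=True)
```

Passing the arguments as a list avoids shell quoting problems with sentences that contain spaces or non-ASCII text, for example `synthesize("Hello world.", savepath="hello.wav")`.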

Reference

VALL-E-X

Framework

PyTorch 2.2.0.dev20230910

Model Format

ONNX opset = 15
