This is a deep-learning-based tool to clone the voice of a singer/narrator from a source video.
It uses vocal-remover to separate the voice from the rest of the audio in the source video, and then uses SoftVC VITS Singing Voice Conversion (so-vits-svc) to convert the voice.
Clone this repository with submodules
git clone --recursive https://gogs.justprojects.de/subDesTagesMitExtraKaese/video-dubbing-svc.git
cd video-dubbing-svc
Create the data folder
mkdir -p data/output data/ingest data/models
Download the pretrained so-vits-svc model and place it in the data/models folder
The G_*.pth file should be given as the MODEL_PATH environment variable.
The config.json file should be given as the MODEL_CONFIG_PATH environment variable.
Download the vocal-remover release and copy the pretrained model models/baseline.pth into the vocal-remover/models folder
curl -L https://github.com/tsurumeso/vocal-remover/releases/download/v5.1.0/vocal-remover-v5.1.0.zip -o /tmp/vocal-remover.zip
unzip /tmp/vocal-remover.zip -d /tmp/vocal-remover
cp /tmp/vocal-remover/models/baseline.pth vocal-remover/models/
rm -rf /tmp/vocal-remover /tmp/vocal-remover.zip
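Before building, it can help to confirm that the model files from the steps above are where they are expected. The following is a minimal sketch, not part of the repository: the function name check_models is hypothetical, and it assumes config.json sits next to the G_*.pth checkpoint in data/models.

```shell
# Report whether the model files named in the setup steps are in place.
# Usage: check_models [repo_root]   (defaults to the current directory)
# check_models is a hypothetical helper, not something the repository ships.
check_models() {
  root="${1:-.}"
  missing=0
  # so-vits-svc checkpoint: any file matching G_*.pth
  if ! ls "$root"/data/models/G_*.pth >/dev/null 2>&1; then
    echo "missing: data/models/G_*.pth"
    missing=1
  fi
  # Assumption: config.json is stored alongside the checkpoint.
  for f in data/models/config.json vocal-remover/models/baseline.pth; do
    if [ ! -f "$root/$f" ]; then
      echo "missing: $f"
      missing=1
    fi
  done
  # Exit status 0 when everything was found, 1 otherwise.
  return "$missing"
}
```

Running check_models from the repository root prints one line per missing file and returns a non-zero status if anything is absent.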
Build the docker image
docker compose build
Insert your source video into the data/ingest folder
Run the docker image
docker compose up
The output video will be in the data/output folder.
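The MODEL_PATH and MODEL_CONFIG_PATH variables mentioned in the model download step have to reach the container somehow. One possible way, assuming docker-compose.yml substitutes them from the shell environment (this is an assumption; they may instead need to be set directly in that file), is to export them before running docker compose up. The checkpoint name G_10000.pth below is hypothetical, and the source does not say whether the values should be host paths or paths inside the container.

```shell
# Hypothetical values: replace with the actual checkpoint and config names.
# Assumption: docker-compose.yml reads these from the shell environment.
export MODEL_PATH="data/models/G_10000.pth"
export MODEL_CONFIG_PATH="data/models/config.json"
```

If the compose file does not pick these up, the same two names would go into its environment section instead.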