This is a deep-learning-based tool to clone the voice of a singer/narrator from a source video

subDesTagesMitExtraKaese 129c5c98a2 remove ingested files 11 luni în urmă
app 129c5c98a2 remove ingested files 11 luni în urmă
vocal-remover @ 8a02fc5fdc 479173ae2c init 11 luni în urmă
.dockerignore 479173ae2c init 11 luni în urmă
.gitignore 479173ae2c init 11 luni în urmă
.gitmodules 479173ae2c init 11 luni în urmă
Dockerfile 479173ae2c init 11 luni în urmă
README.md 980fb5c436 typo 11 luni în urmă
docker-compose.yml 02682d4767 disable f0 auto predict 11 luni în urmă

README.md

Video Dubbing with SoftVC VITS Singing Voice Conversion

This is a deep-learning-based tool to clone the voice of a singer/narrator from a source video.

It uses vocal-remover to remove the voice from the source video, and then uses SoftVC VITS Singing Voice Conversion to convert the voice.

Installation

Requirements

Setup

  1. Clone this repository with submodules

    git clone --recursive https://gogs.justprojects.de/subDesTagesMitExtraKaese/video-dubbing-svc.git
    cd video-dubbing-svc
    
  2. Create the data folder

    mkdir -p data/output data/ingest data/models
    
  3. Download the pretrained so-vits-svc model and place it in the data/models folder

  4. Download the vocal-remover release and copy the pretrained model models/baseline.pth into the vocal-remover/models folder

    curl https://github.com/tsurumeso/vocal-remover/releases/download/v5.1.0/vocal-remover-v5.1.0.zip -o /tmp/vocal-remover.zip
    unzip /tmp/vocal-remover.zip -d /tmp/vocal-remover
    cp /tmp/vocal-remover/models/baseline.pth vocal-remover/models/
    rm -rf /tmp/vocal-remover /tmp/vocal-remover.zip
    
  5. Build the docker image

    docker compose build
    
  6. Insert your source video into the data/ingest folder

  7. Run the docker image

    docker compose up
    
  8. The output video will be in the data/output folder