Retrieval-based Voice Conversion WebUI

Retrieval-based Voice Conversion WebUI is an open-source framework that facilitates voice conversion using retrieval-based techniques. It leverages VITS and allows users to train voice conversion models with limited voice data (<= 10 minutes). The system operates by replacing input source features with training set features using top1 retrieval, mitigating voice leakage. It offers a user-friendly web interface built with Gradio. Key features include fast training on modest hardware, model merging for voice alteration, UVR5 model integration for vocal and instrumental separation, and RMVPE for advanced pitch extraction to eliminate silent sounds. A-card and I-card acceleration are supported.

About Retrieval-based Voice Conversion WebUI

Core Capabilities

Main Tasks

Synthesize speech

Voice Cloning

Key Features

Top1 Retrieval Feature Replacement

Model Merging

UVR5 Integration

RMVPE Pitch Extraction

A/I Card Acceleration

Use Cases

Creating custom voice for a game character

Generating AI covers of songs

Voice cloning for virtual assistants

Creating audiobooks with unique voices

Real-time voice changing for streaming

Quick Start Guide

Pros

Cons

Frequently Asked Questions

Reviews & Ratings

AI Verdict

Write a Review

Feedback & Questions

User Comments

Free

Specs

Core Tasks

Data Interface

Analytics

Categories

Use Retrieval-based Voice Conversion WebUI For

Alternative Tools

Supertone

ElevenLabs

Musicfy

Podcastle

Piper

Acapela Voice Banking

Supertone

Lalals