Our revolutionary new model redefines audio separation quality. Experience crystal-clear vocals and precise instrument isolation.
Multi Stem HQ is built upon the advanced Band-Split architecture. Unlike traditional models that process the full spectrogram as a single image, this technology divides the audio into multiple frequency bands.
This allows the model to learn distinct features for each frequency range, capturing the deep resonance of the bass independently from the intricate harmonics of vocals. The result is a dramatic reduction in spectral leakage and artifacts.
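To make the band-split idea concrete, here is a minimal sketch in Python with NumPy. It only shows the splitting step: a spectrogram is cut along its frequency axis into sub-bands that could each be processed separately, then stitched back together. The function names and the band edges are hypothetical choices for illustration, not the actual configuration of Multi Stem HQ.

```python
import numpy as np

def split_into_bands(spec, band_edges):
    """Cut a (freq_bins, frames) spectrogram into consecutive frequency bands."""
    return [spec[lo:hi] for lo, hi in zip(band_edges[:-1], band_edges[1:])]

def merge_bands(bands):
    """Stack the bands back along the frequency axis."""
    return np.concatenate(bands, axis=0)

# Random complex data standing in for a real STFT (1025 bins, 100 frames).
rng = np.random.default_rng(0)
spec = rng.standard_normal((1025, 100)) + 1j * rng.standard_normal((1025, 100))

# Hypothetical band edges: narrow bands at low frequencies, wider ones higher up.
band_edges = [0, 64, 256, 512, 1025]

bands = split_into_bands(spec, band_edges)
restored = merge_bands(bands)

for i, band in enumerate(bands):
    print(f"band {i}: {band.shape[0]} bins x {band.shape[1]} frames")
```

In a real band-split model, each band would pass through its own learned layers before the merge. Here the merge simply reproduces the input, which shows that the split itself loses no information.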
Furthermore, the architecture integrates Rotary Position Embeddings (RoPE). This mathematical innovation allows the transformer to understand the relative position of audio features across time more effectively than standard absolute position embeddings. It enables the model to maintain coherence over longer musical phrases, ensuring that transient sounds like drum hits are sharp and sustained notes are smooth.
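The relative-position property can be illustrated with a small NumPy sketch. It uses the common "rotate-half" formulation of RoPE with a base of 10000, both of which are assumptions here; the page does not say which variant Multi Stem HQ uses. The function name `rope` is hypothetical.

```python
import numpy as np

def rope(x, base=10000.0):
    """Apply rotary position embeddings to x of shape (seq_len, dim), dim even."""
    seq_len, dim = x.shape
    half = dim // 2
    # One rotation frequency per feature pair (assumed standard schedule).
    freqs = base ** (-np.arange(half) / half)
    angles = np.arange(seq_len)[:, None] * freqs[None, :]
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, :half], x[:, half:]
    # Rotate each (x1[i], x2[i]) pair by an angle proportional to its position.
    return np.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=1)

rng = np.random.default_rng(0)
seq_len, dim = 8, 16

# Use the same query and key vector at every time step, so any difference
# in attention score comes from position alone.
q = np.tile(rng.standard_normal(dim), (seq_len, 1))
k = np.tile(rng.standard_normal(dim), (seq_len, 1))

rq, rk = rope(q), rope(k)
scores = rq @ rk.T  # scores[m, n] = <rope(q) at step m, rope(k) at step n>

# Both pairs are two steps apart, so their scores should match.
print(np.isclose(scores[2, 0], scores[5, 3]))
```

Because each pair of features is rotated by an angle proportional to its time step, the score between two steps depends only on how far apart they are, not on where they sit in the track.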
| Model | Bass | Drums | Instrumental | Vocals |
|---|---|---|---|---|
| Multi Stem HQ | 10.52 ★ | 13.19 ★ | 19.01 ★ | 12.22 ★ |
| Vocals HQ | - | - | 18.21 | 11.53 |
| Hybrid | 8.98 | 10.51 | 14.36 | 8.75 |
* SDR (Signal-to-Distortion Ratio) in dB. Higher is better.
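For readers unfamiliar with the metric, the sketch below computes SDR as a plain energy ratio between the reference stem and the separation error. This is an assumption: the page does not state which SDR variant or evaluation toolkit produced the numbers above, and other definitions (for example scale-invariant SDR) give different values. The function name `sdr_db` is hypothetical.

```python
import numpy as np

def sdr_db(reference, estimate, eps=1e-12):
    """Signal-to-Distortion Ratio in dB: reference energy over error energy."""
    signal_power = np.sum(reference ** 2)
    error_power = np.sum((reference - estimate) ** 2)
    # eps guards against division by zero for silent or perfect signals.
    return 10.0 * np.log10((signal_power + eps) / (error_power + eps))

# One second of a 440 Hz tone at 44.1 kHz as a stand-in for a clean stem.
t = np.arange(44100) / 44100.0
reference = np.sin(2 * np.pi * 440.0 * t)

# An estimate whose error is 10% of the reference amplitude.
estimate = 1.1 * reference

score = sdr_db(reference, estimate)
print(f"SDR: {score:.2f} dB")  # approximately 20 dB
```

Since the scale is logarithmic, every 10 dB of SDR corresponds to a tenfold reduction in error energy relative to the signal.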
Upgrade your workflow with the cleanest separation technology available today.
Start Separating Now

©2026 VocalRemover. All rights reserved.