Skip to main content
Chi Siamo

Chi è StemSplit

Rendere la separazione audio professionale accessibile a tutti.

La nostra missione

Crediamo che tutti dovrebbero avere accesso a strumenti audio professionali. Che tu sia un produttore amatoriale, un appassionato di karaoke o un creatore di contenuti, StemSplit ti dà risultati di qualità studio senza il budget da studio. Stiamo costruendo pubblicamente, e i tuoi suggerimenti plasmano direttamente la nostra roadmap.

Chi serviamo

  • Produttori musicali e DJ
  • Appassionati di karaoke
  • Creatori di contenuti e YouTuber
  • Podcaster ed editor audio
  • Educatori e studenti di musica
  • Sviluppatori che creano app audio
Under the Hood

Built on Serious Technology

StemSplit uses HTDemucs — a state-of-the-art hybrid transformer model developed by Meta Research and published at NeurIPS 2022. It's the same model researchers use, made accessible through a clean web interface.

HTDemucs Model

HTDemucs is a hybrid transformer/convolutional model that works in both the waveform and spectrogram domain simultaneously. This dual-domain approach is what gives it its accuracy advantage over older CNN-only models like Demucs v3 or Spleeter.

ONNX Export

We export the model to ONNX format — an open standard for machine learning interoperability. This means the model runs without PyTorch, making it lighter, faster, and portable. We've published the ONNX weights on HuggingFace for anyone to use.

Up to 6 Stems

Standard mode separates audio into vocals, drums, bass, and other. The 6-stem model additionally separates guitar and piano. Fine-tuned variants (htdemucs_ft) are available for single-stem extraction with higher accuracy.

No GPU Required

Processing happens entirely in the cloud. You upload a file, we run inference server-side, and you download the separated stems. No software to install, no GPU needed on your end.

The Story

Who Built This

StemSplit was built by a solo developer who kept running into the same frustration: existing vocal removal tools were either buried behind monthly subscriptions you'd forget to cancel, or they used credits that expired before you needed them again.

The goal was simple — build the tool that should already exist. One where you pay a fair price for what you actually use, your balance never expires, and the output quality is good enough that you don't need to try three different services.

It's an indie product. There's no VC money, no growth team, no sales calls. Just a tool that works, priced honestly, maintained by one person who uses it too. The roadmap is driven by users — if you have a suggestion, it genuinely gets read.

Built in public

Development happens openly. Packages are published on GitHub, PyPI, npm, and HuggingFace. The benchmark methodology is public. When things break, they get fixed and the fix is visible.

Research & Benchmarks

Open Methodology

We built and published a benchmark dataset on HuggingFace that evaluates multiple HTDemucs model variants across real music tracks. The evaluation uses Signal-to-Distortion Ratio (SDR) — the standard academic metric for source separation quality — alongside listening tests across different musical genres and stem types.

View the benchmark dataset on HuggingFace

Evaluation metric

What is SDR?

SDR (Signal-to-Distortion Ratio) measures how cleanly a model separates a target source from the mix, in decibels. Higher is better. It's the same metric used in the SiSEC Music Separation benchmark, the standard academic evaluation for this problem.

Typical SDR scores (HTDemucs)

Vocals~9 dB
Drums~8.5 dB
Bass~8 dB
Other~6 dB

Source: StemSplit benchmark dataset on HuggingFace. Higher SDR = cleaner separation.

Pricing Philosophy

Why We Charge the Way We Do

Most audio tools use subscriptions because subscriptions are good for the business — they generate predictable revenue even from users who barely log in. That's not how we want to operate.

StemSplit uses a pay-as-you-go credit model. You buy minutes of processing time and they never expire. If you separate one song a month, you pay for one song. If you batch-process an album, you pay for that. There's no plan to upgrade to, no features gated behind a higher tier.

The free tier gives every new user 5 minutes to try the product for real — not a watermarked preview or a 30-second clip. Five full minutes of actual output.

How it works

  • Credits never expire — ever
  • No subscription required
  • 5 free minutes on signup, no card needed
  • Same quality at every price point
  • Full API access included

In cosa crediamo

Semplicità prima di tutto

La tecnologia complessa dovrebbe essere semplice da usare. Carica, elabora, scarica — è tutto.

Prezzi trasparenti

Niente abbonamenti, niente crediti che scadono. Paga quello che usi, quando lo usi.

Costruzione in pubblico

Stiamo costruendo pubblicamente con la nostra comunità. I tuoi suggerimenti e feedback non sono solo ascoltati — sono la nostra roadmap.

Common Questions

Pronto a provare?

Scopri perché migliaia di utenti scelgono StemSplit.