How it works

  1. Load an instrumental (or skip it). Drop a beat or backing track up to 200 MB — or pick "Process a vocal recording" to work on a voice file alone. Everything is decoded and processed locally.
  2. Bring your voice. Record it on the spot — headphones on, 3-2-1 countdown, the instrumental plays while you sing — or import an existing take. Then slide the timing control until your voice locks onto the beat.
  3. Mix and export. Balance voice and instrumental, shape the vocal with compressor, EQ, reverb, delay and more (live A/B against the dry voice), and export WAV or MP3 — an offline render of exactly what you heard.

Features

  • Record a cover on any instrumental. Headphone monitoring while you record, a countdown to get ready, and a ±300 ms timing slider to line your take up with the beat — it's your real voice, not an AI one.
  • 100% in your browser, no upload. The effects chain runs locally with the Web Audio API — your voice never leaves your computer.
  • Six studio effects in one rack. Compressor, 3-band EQ, chorus, flanger, convolution reverb (six spaces from Room to Cathedral) and delay, chained in a fixed studio-style order.
  • Real-time preview with A/B. Every slider is live during playback, and one click switches between the dry and the processed voice.
  • Record and mix. Record your voice from the microphone, clean it up locally, and mix an optional instrumental raw under the processed vocal.

FAQ

Can I record a cover over an instrumental?

Yes — that's the main flow. Load an instrumental (from your files, or one you separated with AudioKit's Voice Remover), then bring your voice: record it on the spot — headphones on, 3-2-1 countdown, the backing track plays while you sing — or import an existing take (a previous recording, or vocals extracted with Voice Remover). Either way, your voice then lines up with a ±300 ms timing slider, gets its own volume against the instrumental, and can be polished with the effects chain before exporting the full mix.

Is Vocal Studio free?

Yes. The real-time preview is unlimited: load a voice, move the sliders and listen as long as you like. Only exporting the processed file counts toward the 5 free uses per day shared across AudioKit's free tools — no account needed. If you need more, AudioKit Premium removes the daily limit. There's no watermark and no feature lock.

Is my voice recording uploaded to a server?

No. The whole effects chain runs in your browser with the Web Audio API: your file is decoded, processed and exported on your own machine and never leaves it. The only network call is a tiny anonymous counter that tracks your daily free quota. The one exception is the optional premium "Enhance with AI" button, which does send your voice to our server for AI processing.

Which audio formats are supported?

MP3, WAV, FLAC, AIFF, OGG and M4A files up to 200 MB. You can also skip the file entirely and record your voice straight from the microphone (up to 10 minutes on the free plan), then send the take into the effects chain. An optional instrumental can be loaded alongside and mixed raw under your processed voice.

What format is the exported file?

You can export to WAV (lossless, ideal if the vocal goes back into a DAW) or MP3 at 192 kbps (smaller, fine for sharing a demo). The export is an offline render of exactly what you hear in the preview — same effects chain, same settings, including the voice/instrumental balance if you loaded a backing track.

How much reverb should I put on vocals?

Less than you think. Start around 20–30% mix with a short pre-delay (10–30 ms) so the dry consonants stay intelligible before the tail blooms. Pick the space to match the song: Room or Vocal for pop and rap, Hall or Cathedral for ballads. Then A/B against the dry voice — if you only notice the reverb when you switch it off, the amount is about right.