Processed in your browser · no upload

Text to speech (free read-aloud)

Turn text into a natural voice — generate a downloadable WAV locally, with multiple voices, free and unlimited.

Turn a script, an article or a voiceover draft into something you can actually hear — that's what text-to-speech is for.

Text to read aloud

0 / 2000

Voice

First run downloads the Kokoro voice model (~80MB), then it is cached in your browser for instant reuse. Everything runs on your device; the text is never uploaded.

How to use text to speech

1Paste or type the text you want read aloud (up to ~2,000 characters per run).
2Pick a voice (different genders, US/UK accents).
3Click Generate speech. The first run downloads the voice model (~80MB, then cached); synthesis is quick.
4Listen in the browser, then download it as a WAV file.

Why use ConvertMeow's Text to speech?

Text stays on your machine: synthesis runs in your browser, so scripts, drafts and private notes never touch a server.
Free, unlimited, no watermark: generate as many clips as you like — nothing is stamped on the audio and there's no upgrade nag.
Apache-licensed, commercial-friendly model: it uses Kokoro-82M (Apache-2.0), so the speech you generate is safe to use in video voiceovers, podcasts and more.

Frequently asked questions

Yes. ConvertMeow uses the Kokoro-82M model under the Apache-2.0 license (which permits commercial use), synthesis happens locally, and the WAV output is yours to use freely. As always, whether your final content is compliant still depends on the text you choose to read.

The first time, the ~80MB voice model is downloaded to your browser (then cached, so you never re-download it). After that, every synthesis is fast. Chrome / Edge with WebGPU are noticeably quicker; other browsers automatically fall back to WASM and still work.

The current voices are English (US/UK) and work best on English text. Other languages aren't its strength yet. For long passages, split them into sentences or paragraphs and generate in chunks, then join the clips.

No. The whole process runs in your browser on your device — neither the text nor the generated audio is sent to any server, so nothing is collected or used for training.

Updated 2026-06-09 · ConvertMeow team

Sources, review and limits

Last verified

2026-06-17

Author

ConvertMeow editorial desk

Reviewer

Browser media tooling review

Primary sources

Browser File, Canvas, Audio and Video APIs
Open-source client-side conversion libraries where a format needs a parser or encoder
User-provided files processed in the browser

Conversion output depends on the original file, browser support and codec limits. Use the exported file for convenience, and verify mission-critical media in your own workflow.