Question 1

Is ResponsiveVoice free to use?

Accepted Answer

Yes. The ResponsiveVoice library is open source and free to use from npm or a CDN. Create a free account at responsivevoice.org/register — it takes a few seconds — to unlock free server voices for your site. Without an account the library runs in demo mode. Paid plans add features such as streaming, and premium voice providers (Microsoft Azure, OpenAI, Google Cloud) are supported via Bring Your Own Key (BYOK).

Question 2

Which browsers and runtimes are supported?

Accepted Answer

ResponsiveVoice runs in all evergreen browsers and in Node.js, using the native Web Speech API where available and server voices (with an account) otherwise. See the Browser Support guide for the full compatibility matrix.

Question 3

Do I need an API key?

Accepted Answer

Yes, to use server voices — and it's free. Register an account to get one. The key is a website identity, not a secret: it's tied to your registered domain, so it's safe to include in client-side code. Without a key, the library runs in demo mode.

Question 4

Does ResponsiveVoice support streaming audio?

Accepted Answer

Yes — on higher-tier plans. Audio is delivered as it's synthesized via HTTP audio streaming or WebSocket streaming, so playback can start before the full clip is ready. Other tiers return the complete audio in a single response.

Question 5

How many voices and languages are available?

Accepted Answer

The base catalog includes around 100 voices across many languages and genders, chosen through the voice resolution chain (native Web Speech or fallback). Bring Your Own Key (BYOK) providers add their own voices on top — growing the catalog to thousands.

Question 6

Does ResponsiveVoice work on iOS?

Accepted Answer

Yes. iOS (and some mobile browsers) require a user gesture before audio can play, and ResponsiveVoice handles that automatically — it shows a built-in permission prompt that captures the first tap and unlocks audio, so you don't need to add your own button. The prompt is customizable, or you can disable it and trigger speech from your own UI instead.

Question 7

How can I improve speech quality?

Accepted Answer

Punctuation shapes pacing and emphasis — add commas and periods for natural pauses. For tricky pronunciations, respell a word phonetically, add hyphens between syllables, or spell it out letter by letter. You can also configure text replacements for consistent pronunciation of names and domain terms.

Question 8

Can I change the speaking rate, pitch, and volume?

Accepted Answer

Yes. Set rate and pitch (0–2, default 1) and volume (0–1, default 1) per request. Native browser voices (Web Speech API) apply them directly; for voices that synthesize server-side — API-only voices, or when the browser lacks the requested voice — how each adjustment applies depends on that provider.

Question 9

Does ResponsiveVoice support SSML?

Accepted Answer

SSML (voice markup) is part of the Web Speech API specification, but no current browser actually implements it, and there's no announced commitment to add it. So ResponsiveVoice takes plain text — shape delivery with the rate, pitch, and volume parameters, plus punctuation and text replacements for pacing and pronunciation. If browsers add SSML support, ResponsiveVoice will adopt it.

Frequently Asked Questions