Polly

Definition

AWS Polly is a text-to-speech service that converts written text into lifelike spoken audio, enhancing accessibility and user engagement in applications.

Use Cases

Provider Equivalents

Frequently Asked Questions

What's the difference between Amazon Polly and Amazon Transcribe?
Polly converts text into speech (text-to-speech). Amazon Transcribe does the opposite: it converts speech audio into text (speech-to-text). Use Polly when you need a voice to read text; use Transcribe when you need written text from recordings or live audio.
When should I use Amazon Polly?
Use Polly when you need to generate spoken audio from text, such as reading articles aloud, adding voice prompts to an IVR/contact center, creating accessibility features for users with visual impairments, generating audio for e-learning, or producing voiceovers for apps where recording human narration for every change would be slow or expensive.
How much does Amazon Polly cost?
Polly pricing is typically based on the number of characters you convert to speech, and the rate depends on the voice type (for example, standard vs neural). Costs also include any related services you use to store and deliver audio (like Amazon S3 and CloudFront). For exact rates and free tier details, check the current Amazon Polly pricing page because prices can change by region and over time.

Category: ai-ml

Difficulty: basic

Related Terms

See Also