🗣️ 03. Amazon Polly
Amazon Polly is a Text-to-Speech (TTS) service that uses deep learning to convert text into natural, lifelike speech.
🔍 Key Features
- Converts text → speech in real-time
- Offers dozens of natural voices in multiple languages
- Supports Speech Marks (timing, viseme data for lip-sync)
- Enables SSML (Speech Synthesis Markup Language) for fine control
- Neural TTS (NTTS) for higher-quality, human-like voices
💡 Use Cases
- Create talking applications and chatbots
- Add voice to news readers or e-learning content
- Generate audiobooks or automated announcements
⚙️ Integrations
- Works with Amazon S3, Lambda, Lex, and Translate
| Aspect | Details |
|---|---|
| Service Type | Text-to-Speech (AI / ML) |
| Input | Text |
| Output | Audio (MP3, OGG, PCM) |