Skip to main content

🗣️ 03. Amazon Polly

Amazon Polly is a Text-to-Speech (TTS) service that uses deep learning to convert text into natural, lifelike speech.


🔍 Key Features

  • Converts text → speech in real-time
  • Offers dozens of natural voices in multiple languages
  • Supports Speech Marks (timing, viseme data for lip-sync)
  • Enables SSML (Speech Synthesis Markup Language) for fine control
  • Neural TTS (NTTS) for higher-quality, human-like voices

💡 Use Cases

  • Create talking applications and chatbots
  • Add voice to news readers or e-learning content
  • Generate audiobooks or automated announcements

⚙️ Integrations

  • Works with Amazon S3, Lambda, Lex, and Translate

AspectDetails
Service TypeText-to-Speech (AI / ML)
InputText
OutputAudio (MP3, OGG, PCM)