Text-to-Speech
AI technology that converts written text into natural-sounding human speech. Modern TTS systems can generate speech with realistic intonation and emotion, and some can clone specific voices.
Why It Matters
TTS makes content accessible to visually impaired users, powers voice assistants, and enables audio content creation at scale without recording studios.
Example
ElevenLabs or Amazon Polly converting a blog post into a podcast-quality audio narration with natural intonation and pacing.
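As a rough illustration of the blog-post scenario, the sketch below splits a long post into sentence-aligned chunks and sends each one to Amazon Polly through boto3's `synthesize_speech` call. The helper names `split_into_chunks` and `narrate`, the 3000-character chunk size, the "Joanna" voice, and the approach of appending MP3 segments into one file are assumptions made for this example, not a recommended production pipeline. Running `narrate` requires the third-party boto3 package and configured AWS credentials.

```python
import re

# Assumption: keep each request at or below 3000 characters, a conservative
# size chosen for this sketch; check the current Polly limits before relying on it.
MAX_CHARS = 3000


def split_into_chunks(text, max_chars=MAX_CHARS):
    """Split text at sentence boundaries into chunks of at most max_chars."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        # A single sentence longer than the limit is cut at fixed offsets.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


def narrate(text, out_path, voice_id="Joanna", client=None):
    """Hypothetical helper: synthesize each chunk and append it to one MP3 file."""
    if client is None:
        import boto3  # third-party; imported lazily so the splitter works without it
        client = boto3.client("polly")
    with open(out_path, "wb") as audio_file:
        for chunk in split_into_chunks(text):
            response = client.synthesize_speech(
                Text=chunk, OutputFormat="mp3", VoiceId=voice_id
            )
            # AudioStream is a streaming body; read() returns the audio bytes.
            audio_file.write(response["AudioStream"].read())
    return out_path
```

A call such as `narrate(blog_post_text, "post.mp3")` would write the narration to disk. Splitting at sentence boundaries keeps each request from cutting a sentence in half, which tends to preserve more natural pacing across chunk joins.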
Think of it like...
Like a professional voice actor who can read any script perfectly on the first take, in any language, with any emotional tone you specify.
Related Terms
Speech-to-Text
AI technology that converts spoken audio into written text (also called automatic speech recognition or ASR). Modern systems handle accents, background noise, and multiple speakers.
Natural Language Processing
The branch of AI that deals with the interaction between computers and human language. NLP enables machines to read, interpret, and generate human language in useful ways.
Voice Cloning
AI technology that creates a synthetic replica of a specific person's voice from a small sample of their speech. A cloned voice can speak any text with the original person's vocal characteristics.