OCI Speech

Definition

Oracle Cloud Infrastructure (OCI) service that converts speech to text and text to speech. Think of it as a bridge between spoken audio and written text.

Use Cases

Provider Equivalents

Frequently Asked Questions

What's the difference between OCI Speech and OCI Language?
OCI Speech turns audio into text (speech to text) and turns text into audio (text to speech). OCI Language analyzes text you already have, for example by detecting sentiment, key phrases, or the language used. If your input starts as audio, OCI Language typically runs after speech-to-text.
When should I use OCI Speech?
Use OCI Speech when you need to transcribe calls or meetings, add live captions, build voice-controlled apps, create voice bots, or generate spoken audio from text for accessibility (screen-reader-like experiences), IVR prompts, or narrated content.
How much does OCI Speech cost?
Pricing is usage-based and depends on factors like how much audio you transcribe (speech to text) and how many characters of text you convert to audio (text to speech). Costs can also vary by features such as real-time vs batch processing and the selected voice or language. For exact rates, check the OCI Speech pricing page for your region and estimate based on expected audio minutes and characters.
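As a rough illustration of estimating from audio minutes and characters, here is a minimal Python sketch. The billing units (per hour of audio, per million characters) and the rates are placeholder assumptions, not published OCI prices. SpeechRates and estimate_monthly_cost are hypothetical names invented for this example. Substitute the units and rates from the pricing page for your region.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeechRates:
    """Hypothetical unit prices; replace with the published rates for your region."""
    per_transcription_hour: float   # assumed unit: price per hour of audio transcribed
    per_million_characters: float   # assumed unit: price per 1,000,000 characters synthesized


def estimate_monthly_cost(audio_minutes: float, tts_characters: int, rates: SpeechRates) -> float:
    """Return a rough estimate from expected audio minutes and text-to-speech characters."""
    if audio_minutes < 0 or tts_characters < 0:
        raise ValueError("usage figures must be non-negative")
    stt_cost = (audio_minutes / 60.0) * rates.per_transcription_hour
    tts_cost = (tts_characters / 1_000_000) * rates.per_million_characters
    return round(stt_cost + tts_cost, 2)


# Placeholder rates chosen only to make the arithmetic easy to follow.
example_rates = SpeechRates(per_transcription_hour=1.0, per_million_characters=10.0)
print(estimate_monthly_cost(audio_minutes=600, tts_characters=500_000, rates=example_rates))
```

With these placeholder rates, 600 minutes of audio is 10 hours and 500,000 characters is half a million, so the call combines 10 hours at the hourly rate with 0.5 million characters at the per-million rate. The sketch ignores the feature-based differences mentioned above, such as real-time vs batch processing.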

Category: ai-ml

Difficulty: basic

Related Terms

See Also