Question 1

What's the difference between OCI Speech and OCI Language?

Accepted Answer

OCI Speech turns audio into text (speech to text) and turns text into audio (text to speech). OCI Language analyzes text you already have—for example detecting sentiment, key phrases, or language—so it typically comes after speech-to-text if your input starts as audio.

Question 2

When should I use OCI Speech?

Accepted Answer

Use OCI Speech when you need to transcribe calls or meetings, add live captions, build voice-controlled apps, create voice bots, or generate spoken audio from text for accessibility (screen-reader-like experiences), IVR prompts, or narrated content.

Question 3

How much does OCI Speech cost?

Accepted Answer

Pricing is usage-based and depends on factors like how many minutes of audio you transcribe (speech to text) and how much text you convert to audio (text to speech). Costs can also vary by features such as real-time vs batch processing and the selected voice or language. For exact rates, check the OCI Speech pricing page for your region and estimate based on expected audio minutes and characters.

OCI Speech

Definition

Use Cases

Provider Equivalents

Frequently Asked Questions

Related Terms

See Also