Google's service for converting audio to text with high accuracy. It is like having a professional transcriptionist who works almost instantly and supports many languages.
Video conferencing apps can use Speech-to-Text to show real-time captions for accessibility and to produce transcripts for meeting notes.
All four are managed speech recognition services that convert audio to text. They commonly support both batch transcription and real-time streaming, offer a range of languages, and provide features such as word timestamps and speaker labeling, often called diarization (availability and naming vary by provider). The choice often depends on where your data already lives, latency needs, supported languages and models, and compliance requirements.
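To make the timestamp and speaker features more concrete, here is a minimal sketch of how word-level output might be grouped into caption lines. The `Word` record, the `to_captions` helper, and the `max_gap` threshold are all hypothetical names chosen for illustration; they do not mirror any provider's actual response schema, so real results would need to be mapped into this shape first.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    """Hypothetical word-level result (not a real provider schema)."""
    text: str
    start: float   # start time in seconds
    end: float     # end time in seconds
    speaker: int   # speaker label assigned by the service


def _format(group: List[Word]) -> str:
    """Render one caption line with its time range and speaker label."""
    text = " ".join(w.text for w in group)
    return (
        f"[{group[0].start:.1f}s-{group[-1].end:.1f}s] "
        f"Speaker {group[0].speaker}: {text}"
    )


def to_captions(words: List[Word], max_gap: float = 1.0) -> List[str]:
    """Group consecutive words into caption lines.

    A new line starts when the speaker changes or when the pause
    between two words is longer than max_gap seconds (an assumed default).
    """
    captions: List[str] = []
    current: List[Word] = []
    for word in words:
        if current and (
            word.speaker != current[-1].speaker
            or word.start - current[-1].end > max_gap
        ):
            captions.append(_format(current))
            current = []
        current.append(word)
    if current:
        captions.append(_format(current))
    return captions


# Made-up sample data standing in for a transcription result.
sample = [
    Word("Good", 0.0, 0.3, 1),
    Word("morning", 0.3, 0.8, 1),
    Word("Hi", 1.0, 1.2, 2),
    Word("everyone", 1.2, 1.8, 2),
]

for line in to_captions(sample):
    print(line)
```

With the sample data, the speaker change after "morning" starts a second caption line. In a streaming setup the same grouping could be applied to each finalized result as it arrives, though how interim results are handled differs between providers.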