Transcription
All articles
OpenAI Whisper - Open-Source Speech Recognition
Whisper is an open-source automatic speech recognition model by OpenAI that provides robust, multilingual …Azure Speech Services - Speech-to-Text, Text-to-Speech, and Translation
Azure Speech Services provides cloud-based APIs for speech recognition, speech synthesis, real-time …Audio Transcription Pipeline Patterns
End-to-end patterns for audio transcription at scale. Pre-processing, model selection, speaker diarization, …AI Spark: Automated Meeting Action Item Tracker
Extract action items from meeting transcripts and track them to completion using AI-powered follow-up …Speech-to-Text (STT)
What speech-to-text technology is, how AWS Transcribe, Azure Speech, and GCP Speech-to-Text compare, and key …Amazon Transcribe - Speech-to-Text for Enterprise
Amazon Transcribe capabilities, accuracy characteristics, pricing, and the integration patterns that work well …AI-Powered Accessibility for Broadcasters and Media
Automated subtitle generation, audio descriptions, sign language overlay detection, and WCAG compliance …AI Spark: Never Write Meeting Notes Again
Automate meeting summaries and action item extraction using transcription and LLM post-processing - a …AI Audio Analysis - Multi-Track Selection and Quality Enhancement
Automated best-mic selection from multi-track recordings, noise reduction, speaker isolation, and quality …
Open source projects