# Speech

Perform speech recognition on live or prerecorded audio, and receive transcriptions, alternative interpretations, and confidence levels of the results.

## Overview

Use the Speech framework to recognize spoken words in recorded or live audio. The keyboard’s dictation support uses speech recognition to translate audio content into text. This framework provides a similar behavior, except that you can use it without the presence of the keyboard. For example, you might use speech recognition to recognize verbal commands or to handle text dictation in other parts of your app.

The [`SpeechTranscriber`](/documentation/Speech/SpeechTranscriber) class and other module classes provide specific services. The [`AssetInventory`](/documentation/Speech/AssetInventory) class ensures that the system has the assets necessary to support those classes. The [`SpeechAnalyzer`](/documentation/Speech/SpeechAnalyzer) class manages an analysis session that uses those classes. For a general understanding of how you use these classes together, see [`SpeechAnalyzer`](/documentation/Speech/SpeechAnalyzer).

## Topics

### Essentials

- [Bringing advanced speech-to-text capabilities to your app](/documentation/Speech/bringing-advanced-speech-to-text-capabilities-to-your-app): Learn how to incorporate live speech-to-text transcription into your app with SpeechAnalyzer.
- [`SpeechAnalyzer`](/documentation/Speech/SpeechAnalyzer): Analyzes spoken audio content in various ways and manages the analysis session.
- [`AssetInventory`](/documentation/Speech/AssetInventory): Manages the assets that are necessary for transcription or other analyses.

### Modules

- [`SpeechTranscriber`](/documentation/Speech/SpeechTranscriber): A speech-to-text transcription module that’s appropriate for normal conversation and general purposes.
- [`DictationTranscriber`](/documentation/Speech/DictationTranscriber): A speech-to-text transcription module that’s similar to system dictation features and compatible with older devices.
- [`SpeechDetector`](/documentation/Speech/SpeechDetector): A module that performs a voice activity detection (VAD) analysis.
- [`SpeechModule`](/documentation/Speech/SpeechModule): Protocol that all analyzer modules conform to.
- [`LocaleDependentSpeechModule`](/documentation/Speech/LocaleDependentSpeechModule): A module that requires locale-specific assets.

### Input and output

- [`AnalyzerInput`](/documentation/Speech/AnalyzerInput): Time-coded audio data.
- [`SpeechModuleResult`](/documentation/Speech/SpeechModuleResult): Protocol that all module results conform to.

### Custom vocabulary

- [`AnalysisContext`](/documentation/Speech/AnalysisContext): Contextual information that may be shared among analyzers.
- [`SFSpeechLanguageModel`](/documentation/Speech/SFSpeechLanguageModel): A language model built from custom training data.
- [`SFSpeechLanguageModel.Configuration`](/documentation/Speech/SFSpeechLanguageModel/Configuration): An object describing the location of a custom language model and specialized vocabulary.
- [`SFCustomLanguageModelData`](/documentation/Speech/SFCustomLanguageModelData): An object that generates and exports custom language model training data.

### Asset and resource management

- [`AssetInstallationRequest`](/documentation/Speech/AssetInstallationRequest): An object that describes, downloads, and installs a selection of assets.
- [`SpeechModels`](/documentation/Speech/SpeechModels): Namespace for methods related to model management.

### Legacy API

- [Speech Recognition in Objective-C](/documentation/Speech/speech-recognition-in-objc): Use these classes to perform speech recognition in Objective-C code.

---

Copyright © 2026 Apple Inc. All rights reserved.