Rate this page:

Google

List of Google Speech-to-Text models. The enhanced models cost more than the standard rate.
Add the following line to your scenario code to use the enum:

require(Modules.ASR);

Constants

command_and_search_enhanced

Copy URL

Command_and_search model with more accurate recognition.

default

Copy URL

Best for audio that is not one of the specific audio models. For example, long-form audio. Ideally the audio is high-fidelity, recorded at a 16khz or greater sampling rate.

default_enhanced

Copy URL

Default model with more accurate recognition.

phone_call

Copy URL

Best for audio that originated from a phone call (typically recorded at a 8khz sampling rate).

phone_call_enhanced

Copy URL

Phone_call model with more accurate recognition.

video

Copy URL

Best for audio that originated from video or includes multiple speakers. Ideally the audio is recorded at a 16khz or greater sampling rate.

video_enhanced

Copy URL

Video model with more accurate recognition.