Languages
To choose a transcription model, refer to Models.
The languages, packs, and options on this page apply to the Enhanced and Standard models. The Melia 1 model is multilingual: it transcribes the individual languages listed here and switches between them automatically, without language selection. You can pass the codes of those languages as language hints. Melia 1 does not support the auto option, the bilingual and multi-language pack codes, or translation. For details on Melia 1, refer to Models.
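As a rough illustration of the Melia 1 restrictions above, the following Python sketch checks a request config before it is sent. It is not part of any Speechmatics SDK. The function name is hypothetical, the assumption that auto is passed as the language value and that translation is requested through a top-level translation_config key is ours, and the default pack identifiers are only the two that appear in the examples on this page.

```python
def melia_config_problems(
    config: dict,
    pack_languages: frozenset = frozenset({"cmn_en"}),    # pack code from this page
    pack_domains: frozenset = frozenset({"bilingual-en"}),  # pack domain from this page
) -> list:
    """Return the reasons a config would not suit Melia 1 (hypothetical helper)."""
    transcription_config = config.get("transcription_config", {})
    language = transcription_config.get("language")
    domain = transcription_config.get("domain")
    problems = []

    # Assumption: the auto option is expressed as "language": "auto".
    if language == "auto":
        problems.append("the auto option is not supported")

    # Bilingual and multi-language packs are selected by language or domain.
    if language in pack_languages or domain in pack_domains:
        problems.append("bilingual and multi-language packs are not supported")

    # Assumption: translation is requested with a "translation_config" key.
    if "translation_config" in config:
        problems.append("translation is not supported")

    return problems


candidate = {
    "type": "transcription",
    "transcription_config": {"language": "es", "domain": "bilingual-en"},
}
for problem in melia_config_problems(candidate):
    print(problem)
```

An empty list means none of the three restrictions was triggered. It does not confirm that the config is otherwise valid.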
Transcription languages
To automatically identify the language in an audio file, use the Language Identification feature.
To dynamically update your system with the latest languages and features offered by Speechmatics, use the Feature Discovery endpoint.
Speechmatics supports the following languages. Your ability to use any or all of them depends on the languages you are contracted to use.
Speechmatics takes a global-first approach to languages. A single language pack supports many accents and dialects, so you do not need to know which accent is in your audio before selecting a language. This approach achieves high accuracy compared to accent-specific language packs.
Each language is uniquely identified by a two-letter code (ISO 639-1) or three-letter code (ISO 639-3) in API requests and responses.
Translation languages
Translation is available with the Enhanced and Standard models. It is supported for most Speechmatics languages, with the supported translation pairs listed below. For more details, see Translation.
Bilingual and multi-language packs
These packs handle a fixed set of languages that you select in advance. To transcribe audio without selecting languages, including spontaneous switching across all supported languages, use the Melia 1 multilingual model. Refer to Models.
The Enhanced and Standard models can transcribe a selected combination of languages in one media file or stream, including speech from speakers who switch between the languages in that pack. You select a pack with the language property, and some packs also require the domain property.
Supported packs are:
This config selects the Mandarin and English pack:
{
  "type": "transcription",
  "transcription_config": {
    "language": "cmn_en"
  }
}
This config selects the Spanish and English pack, which requires the domain property:
{
  "type": "transcription",
  "transcription_config": {
    "language": "es",
    "domain": "bilingual-en"
  }
}
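If you build these configs in code, a small helper can keep the two pack styles consistent. The Python sketch below only assembles dictionaries with the same shape as the JSON examples above. The function name build_transcription_config is hypothetical and not part of any Speechmatics SDK, and the sketch does not send a request.

```python
import json
from typing import Optional


def build_transcription_config(language: str, domain: Optional[str] = None) -> dict:
    """Assemble a config dict shaped like the JSON examples (hypothetical helper)."""
    transcription_config = {"language": language}
    # Only packs such as Spanish and English need the domain property.
    if domain is not None:
        transcription_config["domain"] = domain
    return {"type": "transcription", "transcription_config": transcription_config}


# Mandarin and English pack: selected by the language code alone.
mandarin_english = build_transcription_config("cmn_en")

# Spanish and English pack: language "es" plus the "bilingual-en" domain.
spanish_english = build_transcription_config("es", domain="bilingual-en")

print(json.dumps(mandarin_english, indent=2))
print(json.dumps(spanish_english, indent=2))
```

Leaving domain out of the dictionary when it is not needed keeps the output identical to the first example, rather than sending an empty or null value.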
Healthcare domain
Speechmatics offers a medical domain that provides high accuracy for healthcare use cases such as ambient scribes and dictation tools. The medical domain is available with the Enhanced model only. It does not apply to the Standard or Melia 1 models.
The medical domain is kept up to date using officially maintained data sources. This improves recognition of medical terminology such as procedures, medications, conditions, and anatomy.
For languages without medical domain support, the Enhanced model still gives high accuracy for healthcare use cases.
Set the domain property to medical:
{
  "type": "transcription",
  "transcription_config": {
    "model": "enhanced",
    "language": "en",
    "domain": "medical"
  }
}
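Because the medical domain is available with the Enhanced model only, a client may want to reject other model values before sending a request. The Python sketch below shows one way to do that. The function name build_medical_config is hypothetical, and the value "standard" used in the usage lines is an assumed identifier for the Standard model. The sketch does not check whether the chosen language has medical domain support.

```python
def build_medical_config(language: str, model: str = "enhanced") -> dict:
    """Assemble a medical-domain config, refusing non-Enhanced models (hypothetical helper)."""
    # The medical domain applies to the Enhanced model only.
    if model != "enhanced":
        raise ValueError(
            f"the medical domain requires the enhanced model, got {model!r}"
        )
    return {
        "type": "transcription",
        "transcription_config": {
            "model": model,
            "language": language,
            "domain": "medical",
        },
    }


english_medical = build_medical_config("en")
print(english_medical)

try:
    # "standard" is an assumed model identifier, used here only to show the check.
    build_medical_config("en", model="standard")
except ValueError as error:
    print(error)
```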