Audio File Recognition Component (Sber SaluteSpeech)

Table of Contents

Description

Performs caller speech recognition into text form.
Sber SaluteSpeech on-premise service is used, channel GRPC.

The component is designed to recognize pre-recorded files. It supports recognizing files with arbitrary duration.

Pre-packages files into a format PCM.

Configuring connection points in domain settings, field 'sber_salute'.

Table 1. System Characteristics

Index

149

Short title

asr_sber

Types of scenarios

All of them

Starter module

era_script_component_asr_sber

Mode

Asynchronous

Icon

149

Branching pattern

Branching, closing

Properties

Table 2. Properties
Specification Description

Title: Audio File Path
Code: filePath
Visibility: no
Default: — 

Argument containing the path to a wav file in wav format PCM

Title: Account Sber Salute
Code: accountKey
Visibility: no
Default: default

Specifies the account that defines the connection points to the service Sber SaluteSpeech.
The list includes the 'default' value that sets the root 'speech' fields in the object to be used 'settings.sber_salute'.
Additionally, the keys of the 'settings.sber_salute.accounts' object are listed, each of which also has an object with separately configured access parameters.

Title: Set of Grammars
Code: model
Visibility: no
Default: callcenter

Sber SaluteSpeech recognition service parameter: grammar set name.

Title: Language
Code: lang
Visibility: no
Default: ru-RU

Sber SaluteSpeech recognition service parameter: recognition language.
Possible options:

  • ru-RU (0) – Russian language

  • en-US (1) – English language

  • kk-KZ (2) – Kazakh language

Title: `A profanity filter
Code: profanityFilter
Visibility: no
Default: `Disable'

Sber SaluteSpeech Recognition Service Parameter: switch for profanity filter.

Title: Some Suggestions
Code: multiUtterance
Visibility: no
Default: `Enable'

Sber SaluteSpeech recognition service parameter: waiting for multiple sentences.

If off, the response comes quickly along with detecting the end of the first sentence.
If enabled, all results are collected and glued together.".

"Title: Waiting for the beginning of the speech, s
Code: noSpeechTimeoutSec
Visibility: no
Default: 7

Sber SaluteSpeech recognition service parameter: Speech timeout.

Title: Sentence length maximum, s
Code: maxSpeechTimeoutSec
Visibility: no
Default: 20

Sber SaluteSpeech recognition service parameter: maximum sentence length.

Title: In Cyrillic
Code: forceCyrillic
Visibility: no
Default: `Off'

Sber SaluteSpeech Recognition Service Parameter: Force conversion of result to Cyrillic.

Title: Response timeout, s
Code: responseTimeoutSec
Visibility: no
Default: 5

Waiting timeout for response from the recognition service Yandex Cloud SpeechKit.
When the timeout expires, control is passed to the next component on the Time branch.

Title: Result to variable
Code: varText
Visibility: no
Default: — 

Variable to save the text result of recognition.

Title: Normalized result into a variable
Code: varNormText
Visibility: no
Default: — 

Variable to save the normalized text result of the recognition.

Title: Response code to variable
Code: varCode
Visibility: no
Default: — 

Variable to store the response code of the recognition service (the code is emulated similar to the HTTP: 200, 408, 500).

Title: Response body to variable
Code: varBody
Visibility: no
Default: — 

Variable to store the full content of the recognition service response.
If one statement - an object, if multiple statements - an array of objects.

Title: Transition
Code: transfer
Visibility: no
Default: — 

Component to which control is passed in case of successful completion of the operation.

Title: Transition, Time
Code: transferTimeout
Visibility: no
Default: — 

The component to which control is passed in case the timeout period for HTTP response from the recognition service has expired.

Title: Transition, Error
Code: transferError
Visibility: no
Default: —