This module contains a Google Cloud Translate Speech operator.

Module Contents



Recognizes speech in audio input and translates it.

class*, audio, config, target_language, format_, source_language, model, project_id=None, gcp_conn_id='google_cloud_default', impersonation_chain=None, **kwargs)[source]

Bases: airflow.models.BaseOperator

Recognizes speech in audio input and translates it.

Note that it uses the first result from the recognition api response - the one with the highest confidence In order to see other possible results please use CloudSpeechToTextRecognizeSpeechOperator and CloudTranslateTextOperator separately

See also

For more information on how to use this operator, take a look at the guide: CloudTranslateSpeechOperator


Execute method returns string object with the translation

This is a list of dictionaries queried value. Dictionary typically contains three keys (though not all will be present in all cases).

  • detectedSourceLanguage: The detected language (as an ISO 639-1 language code) of the text.

  • translatedText: The translation of the text into the target language.

  • input: The corresponding input value.

  • model: The model used to translate the text.

Dictionary is set as XCom return value.

  • audio ( – audio data to be recognized. See more:

  • config ( – information to the recognizer that specifies how to process the request. See more:

  • target_language (str) – The language to translate results into. This is required by the API and defaults to the target language of the current instance. Check the list of available languages here:

  • format – (Optional) One of text or html, to specify if the input text is plain text or HTML.

  • source_language (str | None) – (Optional) The language of the text to be translated.

  • model (str) – (Optional) The model used to translate the text, such as 'base' or 'nmt'.

  • project_id (str | None) – Optional, Google Cloud Project ID where the Compute Engine Instance exists. If set to None or missing, the default project_id from the Google Cloud connection is used.

  • gcp_conn_id (str) – Optional, The connection ID used to connect to Google Cloud. Defaults to ‘google_cloud_default’.

  • impersonation_chain (str | Sequence[str] | None) – Optional service account to impersonate using short-term credentials, or chained list of accounts required to get the access_token of the last account in the list, which will be impersonated in the request. If set as a string, the account must grant the originating account the Service Account Token Creator IAM role. If set as a sequence, the identities from the list must grant Service Account Token Creator IAM role to the directly preceding identity, with first account from the list granting this role to the originating account (templated).

template_fields :Sequence[str] = ['target_language', 'format_', 'source_language', 'model', 'project_id', 'gcp_conn_id',...[source]

This is the main method to derive when creating an operator. Context is the same dictionary used as when rendering jinja templates.

Refer to get_template_context for more context.

Was this entry helpful?