Google Cloud Speech to Text Operators¶

Prerequisite Tasks¶

To use these operators, you must do a few things:

Select or create a Cloud Platform project using the Cloud Console.

Enable billing for your project, as described in the Google Cloud documentation.

Enable the API, as described in the Cloud Console documentation.
Install API libraries via pip.
pip install 'apache-airflow[google]'
Detailed information is available for Installation.
Setup a Google Cloud Connection.

CloudSpeechToTextRecognizeSpeechOperator¶

Recognizes speech in audio input and returns text.

For parameter definition, take a look at CloudSpeechToTextRecognizeSpeechOperator

Arguments¶

config and audio arguments need to be dicts or objects of corresponding classes from google.cloud.speech_v1.types module

for more information, see: https://googleapis.github.io/google-cloud-python/latest/speech/gapic/v1/api.html#google.cloud.speech_v1.SpeechClient.recognize

tests/system/google/cloud/speech_to_text/example_speech_to_text.py[source]

INPUT = {"text": "Sample text for demo purposes"}
VOICE = {"language_code": "en-US", "ssml_gender": "FEMALE"}
AUDIO_CONFIG = {"audio_encoding": "LINEAR16"}

filename is a simple string argument:

tests/system/google/cloud/speech_to_text/example_speech_to_text.py[source]

CONFIG = RecognitionConfig({"encoding": "LINEAR16", "language_code": "en_US"})
AUDIO = RecognitionAudio({"uri": f"gs://{BUCKET_NAME}/{FILE_NAME}"})

Using the operator¶

tests/system/google/cloud/speech_to_text/example_speech_to_text.py[source]

speech_to_text_recognize_task = CloudSpeechToTextRecognizeSpeechOperator(
    config=CONFIG, audio=AUDIO, task_id="speech_to_text_recognize_task"
)

Templating¶

template_fields: Sequence[str] = (
    "audio",
    "config",
    "project_id",
    "gcp_conn_id",
    "timeout",
    "impersonation_chain",
)

Reference¶

For further information, look at: