Google Cloud Speech to Text Operators
Prerequisite Tasks
To use these operators, you must do a few things:
Select or create a Cloud Platform project using the Cloud Console.
Enable billing for your project, as described in the Google Cloud documentation.
Enable the API, as described in the Cloud Console documentation.
Install API libraries via pip.
pip install 'apache-airflow[google]'
Detailed information is available in the Installation documentation.
CloudSpeechToTextRecognizeSpeechOperator
Recognizes speech in audio input and returns text.
For parameter definitions, see CloudSpeechToTextRecognizeSpeechOperator.
Arguments
The config and audio arguments must be dicts or objects of the corresponding classes from the google.cloud.speech_v1.types module.
For more information, see: https://googleapis.github.io/google-cloud-python/latest/speech/gapic/v1/api.html#google.cloud.speech_v1.SpeechClient.recognize
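The following values are arguments for the text-to-speech synthesis step of the example and are not passed to this operator:
INPUT = {"text": "Sample text for demo purposes"}
VOICE = {"language_code": "en-US", "ssml_gender": "FEMALE"}
AUDIO_CONFIG = {"audio_encoding": "LINEAR16"}
The config and audio arguments for this operator are defined as:
CONFIG = {"encoding": "LINEAR16", "language_code": "en-US"}
AUDIO = {"uri": f"gs://{BUCKET_NAME}/{FILE_NAME}"}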
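A minimal sketch of assembling the two operator arguments from a bucket and object name. The helper name build_recognize_args and the sample bucket and path are hypothetical and not part of the provider; the dict keys mirror CONFIG and AUDIO above.

```python
from typing import Dict, Tuple


def build_recognize_args(
    bucket_name: str, file_name: str, language_code: str = "en-US"
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Return (config, audio) dicts shaped like CONFIG and AUDIO above.

    Hypothetical helper for illustration; it only builds plain dicts
    and does not call any Google Cloud API.
    """
    if not bucket_name or not file_name:
        raise ValueError("bucket_name and file_name must be non-empty")
    config = {"encoding": "LINEAR16", "language_code": language_code}
    # Strip a leading slash so the URI does not contain "//" after the bucket.
    audio = {"uri": f"gs://{bucket_name}/{file_name.lstrip('/')}"}
    return config, audio


# Example values are made up for this sketch.
CONFIG, AUDIO = build_recognize_args("my-bucket", "audio/sample.wav")
```

Keeping both dicts in one helper makes it harder for the encoding in config to drift from the file that audio points to.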
Using the operator
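from airflow.providers.google.cloud.operators.speech_to_text import CloudSpeechToTextRecognizeSpeechOperator

speech_to_text_recognize_task = CloudSpeechToTextRecognizeSpeechOperator(
    config=CONFIG, audio=AUDIO, task_id="speech_to_text_recognize_task"
)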
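A downstream task usually needs only the recognized text. The sketch below assumes the task's result is a dict that mirrors the API's recognize response (a "results" list whose items each hold an "alternatives" list with "transcript" entries); check the shape returned by your provider version before relying on it. The helper name extract_transcripts and the sample response are hypothetical.

```python
from typing import Any, Dict, List


def extract_transcripts(response: Dict[str, Any]) -> List[str]:
    """Collect the first alternative's transcript from each result.

    Assumes a dict shaped like the recognize response; missing keys are
    tolerated so an empty response yields an empty list.
    """
    transcripts = []
    for result in response.get("results", []):
        alternatives = result.get("alternatives", [])
        if alternatives:
            # The API documents alternatives as ordered most probable first.
            transcripts.append(alternatives[0].get("transcript", ""))
    return transcripts


# Made-up response used only to illustrate the assumed shape.
sample_response = {
    "results": [
        {
            "alternatives": [
                {"transcript": "sample text for demo purposes", "confidence": 0.9}
            ]
        }
    ]
}
transcripts = extract_transcripts(sample_response)
print(transcripts)  # ['sample text for demo purposes']
```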
Templating
template_fields: Sequence[str] = (
    "audio",
    "config",
    "project_id",
    "gcp_conn_id",
    "timeout",
    "impersonation_chain",
)
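Because audio and config are template fields, string values nested inside those dicts can contain Jinja expressions, which Airflow renders before the task runs. A sketch, assuming an Airflow Variable named speech_bucket and a DAG param named language_code (both hypothetical):

```python
# Plain strings here; Airflow renders the Jinja expressions at run time
# because "audio" and "config" are listed in template_fields.
TEMPLATED_AUDIO = {
    # {{ ds }} is the built-in logical date (YYYY-MM-DD);
    # speech_bucket is a hypothetical Airflow Variable.
    "uri": "gs://{{ var.value.speech_bucket }}/recordings/{{ ds }}.wav",
}
TEMPLATED_CONFIG = {
    "encoding": "LINEAR16",
    # language_code is a hypothetical entry in the DAG's params.
    "language_code": "{{ params.language_code }}",
}
```

These dicts would be passed as audio=TEMPLATED_AUDIO and config=TEMPLATED_CONFIG, in the same way as AUDIO and CONFIG in the operator example.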