airflow.providers.google.cloud.hooks.dataform¶
Module Contents¶
Classes¶
| Hook for Google Cloud DataForm APIs. | 
- class airflow.providers.google.cloud.hooks.dataform.DataformHook(**kwargs)[source]¶
- Bases: - airflow.providers.google.common.hooks.base_google.GoogleBaseHook- Hook for Google Cloud DataForm APIs. - get_dataform_client()[source]¶
- Retrieves client library object that allow access to Cloud Dataform service. 
 - wait_for_workflow_invocation(workflow_invocation_id, repository_id, project_id, region, wait_time=10, timeout=None)[source]¶
- Helper method which polls a job to check if it finishes. - Parameters
- workflow_invocation_id (str) – Id of the Workflow Invocation 
- repository_id (str) – Id of the Dataform repository 
- project_id (str) – Required. The ID of the Google Cloud project the cluster belongs to. 
- region (str) – Required. The Cloud Dataproc region in which to handle the request. 
- wait_time (int) – Number of seconds between checks 
- timeout (int | None) – How many seconds wait for job to be ready. Used only if - asynchronousis False
 
 
 - create_compilation_result(project_id, region, repository_id, compilation_result, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Creates a new CompilationResult in a given project and location. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project that the task belongs to. 
- region (str) – Required. The ID of the Google Cloud region that the task belongs to. 
- repository_id (str) – Required. The ID of the Dataform repository that the task belongs to. 
- compilation_result (CompilationResult | dict) – Required. The compilation result to create. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - get_compilation_result(project_id, region, repository_id, compilation_result_id, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Fetches a single CompilationResult. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project that the task belongs to. 
- region (str) – Required. The ID of the Google Cloud region that the task belongs to. 
- repository_id (str) – Required. The ID of the Dataform repository that the task belongs to. 
- compilation_result_id (str) – The Id of the Dataform Compilation Result 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - create_workflow_invocation(project_id, region, repository_id, workflow_invocation, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Creates a new WorkflowInvocation in a given Repository. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project that the task belongs to. 
- region (str) – Required. The ID of the Google Cloud region that the task belongs to. 
- repository_id (str) – Required. The ID of the Dataform repository that the task belongs to. 
- workflow_invocation (WorkflowInvocation | dict) – Required. The workflow invocation resource to create. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - get_workflow_invocation(project_id, region, repository_id, workflow_invocation_id, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Fetches a single WorkflowInvocation. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project that the task belongs to. 
- region (str) – Required. The ID of the Google Cloud region that the task belongs to. 
- repository_id (str) – Required. The ID of the Dataform repository that the task belongs to. 
- workflow_invocation_id (str) – Required. The workflow invocation resource’s id. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - cancel_workflow_invocation(project_id, region, repository_id, workflow_invocation_id, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Requests cancellation of a running WorkflowInvocation. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project that the task belongs to. 
- region (str) – Required. The ID of the Google Cloud region that the task belongs to. 
- repository_id (str) – Required. The ID of the Dataform repository that the task belongs to. 
- workflow_invocation_id (str) – Required. The workflow invocation resource’s id. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - create_repository(*, project_id, region, repository_id, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Creates repository - Parameters
- project_id (str) – Required. The ID of the Google Cloud project where repository should be. 
- region (str) – Required. The ID of the Google Cloud region where repository should be. 
- repository_id (str) – Required. The ID of the new Dataform repository. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - delete_repository(*, project_id, region, repository_id, force=True, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Deletes repository. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project where repository located. 
- region (str) – Required. The ID of the Google Cloud region where repository located. 
- repository_id (str) – Required. The ID of the Dataform repository that should be deleted. 
- force (bool) – If set to true, any child resources of this repository will also be deleted. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - create_workspace(*, project_id, region, repository_id, workspace_id, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Creates workspace. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project where workspace should be. 
- region (str) – Required. The ID of the Google Cloud region where workspace should be. 
- repository_id (str) – Required. The ID of the Dataform repository where workspace should be. 
- workspace_id (str) – Required. The ID of the new Dataform workspace. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - delete_workspace(*, project_id, region, repository_id, workspace_id, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Deletes workspace. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project where workspace located. 
- region (str) – Required. The ID of the Google Cloud region where workspace located. 
- repository_id (str) – Required. The ID of the Dataform repository where workspace located. 
- workspace_id (str) – Required. The ID of the Dataform workspace that should be deleted. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - write_file(*, project_id, region, repository_id, workspace_id, filepath, contents, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Writes a new file to the specified workspace. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project where workspace located. 
- region (str) – Required. The ID of the Google Cloud region where workspace located. 
- repository_id (str) – Required. The ID of the Dataform repository where workspace located. 
- workspace_id (str) – Required. The ID of the Dataform workspace where files should be created. 
- filepath (str) – Required. Path to file including name of the file relative to workspace root. 
- contents (bytes) – Required. Content of the file to be written. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - make_directory(*, project_id, region, repository_id, workspace_id, path, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Makes new directory in specified workspace. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project where workspace located. 
- region (str) – Required. The ID of the Google Cloud region where workspace located. 
- repository_id (str) – Required. The ID of the Dataform repository where workspace located. 
- workspace_id (str) – Required. The ID of the Dataform workspace where directory should be created. 
- path (str) – Required. The directory’s full path including new directory name, relative to the workspace root. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - remove_directory(*, project_id, region, repository_id, workspace_id, path, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Removes directory in specified workspace. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project where workspace located. 
- region (str) – Required. The ID of the Google Cloud region where workspace located. 
- repository_id (str) – Required. The ID of the Dataform repository where workspace located. 
- workspace_id (str) – Required. The ID of the Dataform workspace where directory located. 
- path (str) – Required. The directory’s full path including directory name, relative to the workspace root. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - remove_file(*, project_id, region, repository_id, workspace_id, filepath, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Removes file in specified workspace. - Parameters
- project_id (str) – Required. The ID of the Google Cloud project where workspace located. 
- region (str) – Required. The ID of the Google Cloud region where workspace located. 
- repository_id (str) – Required. The ID of the Dataform repository where workspace located. 
- workspace_id (str) – Required. The ID of the Dataform workspace where directory located. 
- filepath (str) – Required. The full path including name of the file, relative to the workspace root. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 - install_npm_packages(*, project_id, region, repository_id, workspace_id, retry=DEFAULT, timeout=None, metadata=())[source]¶
- Installs npm dependencies in the provided workspace. Requires “package.json” to be created in workspace - Parameters
- project_id (str) – Required. The ID of the Google Cloud project where workspace located. 
- region (str) – Required. The ID of the Google Cloud region where workspace located. 
- repository_id (str) – Required. The ID of the Dataform repository where workspace located. 
- workspace_id (str) – Required. The ID of the Dataform workspace. 
- retry (Retry | _MethodDefault) – Designation of what errors, if any, should be retried. 
- timeout (float | None) – The timeout for this request. 
- metadata (Sequence[tuple[str, str]]) – Strings which should be sent along with the request as metadata. 
 
 
 
