airflow.providers.google.cloud.hooks.mlengine¶
This module contains a Google ML Engine Hook.
Module Contents¶
- 
airflow.providers.google.cloud.hooks.mlengine._poll_with_exponential_delay(request, execute_num_retries, max_n, is_done_func, is_error_func)[source]¶
- 
Execute request with exponential delay.
- This method is intended to handle and retry in case of api-specific errors, such as 429 “Too Many Requests”, unlike the request.execute which handles lower level errors like ConnectionError/socket.timeout/ssl.SSLError. - Parameters
- request (googleapiclient.http.HttpRequest) – request to be executed. 
- execute_num_retries (int) – num_retries for request.execute method. 
- max_n (int) – number of times to retry request in this method. 
- is_done_func (callable) – callable to determine if operation is done. 
- is_error_func (callable) – callable to determine if operation is failed. 
 
- Returns
- response 
- Return type
- httplib2.Response 
 
- 
class airflow.providers.google.cloud.hooks.mlengine.MLEngineHook[source]¶
- Bases: - airflow.providers.google.common.hooks.base_google.GoogleBaseHook- Hook for Google ML Engine APIs. - All the methods in the hook where project_id is used must be called with keyword arguments rather than positional. - 
get_conn(self)[source]¶
- Retrieves the connection to MLEngine. - Returns
- Google MLEngine services object. 
 
 - 
create_job(self, job: dict, project_id: str, use_existing_job_fn: Optional[Callable] = None)[source]¶
- Launches a MLEngine job and wait for it to reach a terminal state. - Parameters
- project_id (str) – The Google Cloud project id within which MLEngine job will be launched. If set to None or missing, the default project_id from the Google Cloud connection is used. 
- job (dict) – - MLEngine Job object that should be provided to the MLEngine API, such as: - { 'jobId': 'my_job_id', 'trainingInput': { 'scaleTier': 'STANDARD_1', ... } } 
- use_existing_job_fn (function) – In case that a MLEngine job with the same job_id already exist, this method (if provided) will decide whether we should use this existing job, continue waiting for it to finish and returning the job object. It should accepts a MLEngine job object, and returns a boolean value indicating whether it is OK to reuse the existing job. If ‘use_existing_job_fn’ is not provided, we by default reuse the existing MLEngine job. 
 
- Returns
- The MLEngine job object if the job successfully reach a terminal state (which might be FAILED or CANCELLED state). 
- Return type
 
 - 
cancel_job(self, job_id: str, project_id: str)[source]¶
- Cancels a MLEngine job. - Parameters
- Returns
- Empty dict if cancelled successfully 
- Return type
- Raises
- googleapiclient.errors.HttpError 
 
 - 
_get_job(self, project_id: str, job_id: str)[source]¶
- Gets a MLEngine job based on the job id. - Parameters
- Returns
- MLEngine job object if succeed. 
- Return type
- Raises
- googleapiclient.errors.HttpError 
 
 - 
_wait_for_job_done(self, project_id: str, job_id: str, interval: int = 30)[source]¶
- Waits for the Job to reach a terminal state. - This method will periodically check the job state until the job reach a terminal state. - Parameters
- project_id (str) – The project in which the Job is located. If set to None or missing, the default project_id from the Google Cloud connection is used. (templated) 
- job_id (str) – A unique id for the Google MLEngine job. (templated) 
- interval (int) – Time expressed in seconds after which the job status is checked again. (templated) 
 
- Raises
- googleapiclient.errors.HttpError 
 
 - 
create_version(self, model_name: str, version_spec: Dict, project_id: str)[source]¶
- Creates the Version on Google Cloud ML Engine. - Parameters
- version_spec (dict) – A dictionary containing the information about the version. (templated) 
- model_name (str) – The name of the Google Cloud ML Engine model that the version belongs to. (templated) 
- project_id (str) – The Google Cloud project name to which MLEngine model belongs. If set to None or missing, the default project_id from the Google Cloud connection is used. (templated) 
 
- Returns
- If the version was created successfully, returns the operation. Otherwise raises an error . 
- Return type
 
 - 
set_default_version(self, model_name: str, version_name: str, project_id: str)[source]¶
- Sets a version to be the default. Blocks until finished. - Parameters
- model_name (str) – The name of the Google Cloud ML Engine model that the version belongs to. (templated) 
- version_name (str) – A name to use for the version being operated upon. (templated) 
- project_id (str) – The Google Cloud project name to which MLEngine model belongs. If set to None or missing, the default project_id from the Google Cloud connection is used. (templated) 
 
- Returns
- If successful, return an instance of Version. Otherwise raises an error. 
- Return type
- Raises
- googleapiclient.errors.HttpError 
 
 - 
list_versions(self, model_name: str, project_id: str)[source]¶
- Lists all available versions of a model. Blocks until finished. - Parameters
- Returns
- return an list of instance of Version. 
- Return type
- List[Dict] 
- Raises
- googleapiclient.errors.HttpError 
 
 - 
delete_version(self, model_name: str, version_name: str, project_id: str)[source]¶
- Deletes the given version of a model. Blocks until finished. - Parameters
- Returns
- If the version was deleted successfully, returns the operation. Otherwise raises an error. 
- Return type
- Dict 
 
 - 
create_model(self, model: dict, project_id: str)[source]¶
- Create a Model. Blocks until finished. - Parameters
- Returns
- If the version was created successfully, returns the instance of Model. Otherwise raises an error. 
- Return type
- Dict 
- Raises
- googleapiclient.errors.HttpError 
 
 - 
get_model(self, model_name: str, project_id: str)[source]¶
- Gets a Model. Blocks until finished. - Parameters
- Returns
- If the model exists, returns the instance of Model. Otherwise return None. 
- Return type
- Dict 
- Raises
- googleapiclient.errors.HttpError 
 
 - 
delete_model(self, model_name: str, project_id: str, delete_contents: bool = False)[source]¶
- Delete a Model. Blocks until finished. - Parameters
- model_name (str) – The name of the model. 
- delete_contents (bool) – Whether to force the deletion even if the models is not empty. Will delete all version (if any) in the dataset if set to True. The default value is False. 
- project_id (str) – The Google Cloud project name to which MLEngine model belongs. If set to None or missing, the default project_id from the Google Cloud connection is used. (templated) 
 
- Raises
- googleapiclient.errors.HttpError 
 
 
-