Workflow Worker

`WorkflowWorker(inventory, broker: str, worker_name: str, exit_event=None, init_done_event=None, log_level: str = 'WARNING', log_queue: object = None)` ¤

Bases: NFPWorker

WorkflowWorker class for managing and executing workflows.

This class extends the NFPWorker class and provides methods to load inventory, retrieve version information, manage workflow results, and execute workflows.

Attributes:

Name	Type	Description
`init_done_event`	`Event`	Event to signal the completion of initialization.
`workflow_worker_inventory`	`dict`	Inventory loaded from the broker.

Parameters:

Name	Type	Description	Default
`inventory`		The inventory object to be used by the worker.	required
`broker`	`str`	The broker address.	required
`worker_name`	`str`	The name of the worker.	required
`exit_event`	`Event`	Event to signal the worker to exit. Defaults to None.	`None`
`init_done_event`	`Event`	Event to signal that initialization is done. Defaults to None.	`None`
`log_level`	`str`	The logging level. Defaults to "WARNING".	`'WARNING'`
`log_queue`	`object`	The logging queue. Defaults to None.	`None`

Source code in norfab\workers\workflow_worker.py

def __init__(
    self,
    inventory,
    broker: str,
    worker_name: str,
    exit_event=None,
    init_done_event=None,
    log_level: str = "WARNING",
    log_queue: object = None,
):
    super().__init__(
        inventory, broker, SERVICE, worker_name, exit_event, log_level, log_queue
    )
    self.init_done_event = init_done_event

    # get inventory from broker
    self.workflow_worker_inventory = self.load_inventory()

    self.init_done_event.set()
    log.info(f"{self.name} - Started")

`get_version()` ¤

Generate a report of the versions of specific Python packages and system information.

This method collects the version information of several Python packages and system details, including the Python version, platform, and a specified language model.

Returns:

Name	Type	Description
`Result`		An object containing a dictionary with the package names as keys and their respective version numbers as values. If a package is not found, its version will be an empty string.

Source code in norfab\workers\workflow_worker.py

def get_version(self):
    """
    Generate a report of the versions of specific Python packages and system information.

    This method collects the version information of several Python packages and system details,
    including the Python version, platform, and a specified language model.

    Returns:
        Result: An object containing a dictionary with the package names as keys and their
                respective version numbers as values. If a package is not found, its version
                will be an empty string.
    """
    libs = {
        "norfab": "",
        "python": sys.version.split(" ")[0],
        "platform": sys.platform,
    }
    # get version of packages installed
    for pkg in libs.keys():
        try:
            libs[pkg] = importlib.metadata.version(pkg)
        except importlib.metadata.PackageNotFoundError:
            pass

    return Result(result=libs)

`get_inventory()` ¤

NorFab task to retrieve the workflow's worker inventory.

Returns:

Name	Type	Description
`Result`		An instance of the Result class containing the workflow's worker inventory.

Source code in norfab\workers\workflow_worker.py

def get_inventory(self):
    """
    NorFab task to retrieve the workflow's worker inventory.

    Returns:
        Result: An instance of the Result class containing the workflow's worker inventory.
    """
    return Result(result=self.workflow_worker_inventory)

`remove_no_match_results(results: Dict) -> Dict` ¤

Remove results from the workflow results for workers that did not have any resources matched by given task.

Parameters:

Name	Type	Description	Default
`results`	`Dict`	The workflow results.	required

Returns:

Name	Type	Description
`Dict`	`Dict`	The workflow results with empty results removed.

Source code in norfab\workers\workflow_worker.py

def remove_no_match_results(self, results: Dict) -> Dict:
    """
    Remove results from the workflow results for workers that did not
    have any resources matched by given task.

    Args:
        results (Dict): The workflow results.

    Returns:
        Dict: The workflow results with empty results removed.
    """
    ret = {}
    for step, task_results in results.items():
        ret[step] = {}
        for worker_name, worker_result in task_results.items():
            # check results for tasks that did not fail
            if worker_result["failed"] is False:
                if worker_result["status"] != "no_match":
                    ret[step][worker_name] = worker_result
            # add failed tasks irregardless of status content
            else:
                ret[step][worker_name] = worker_result
    return ret

`skip_step_check(results: dict, step: str, data: dict) -> Tuple[bool, str]` ¤

Determines whether a step should be skipped based on the provided conditions.

Parameters:

Name	Type	Description	Default
`results`	`dict`	The results of previous steps.	required
`step`	`str`	The name of the current step.	required
`data`	`dict`	A dictionary containing conditions for skipping the step. Possible keys are: "run_if_fail_any": List of step names. Skip if any of the workers have failed these steps. "run_if_pass_any": List of step names. Skip if any of the workers have passed these steps. "run_if_fail_all": List of step names. Skip if all of the workers have failed these steps. "run_if_pass_all": List of step names. Skip if all of the workers have passed these steps.	required

Returns:

Name	Type	Description
`tuple`	`Tuple[bool, str]`	tuple of boolean status and message, if status is true step should be skipped.

Source code in norfab\workers\workflow_worker.py

def skip_step_check(self, results: dict, step: str, data: dict) -> Tuple[bool, str]:
    """
    Determines whether a step should be skipped based on the provided conditions.

    Args:
        results (dict): The results of previous steps.
        step (str): The name of the current step.
        data (dict): A dictionary containing conditions for skipping the step. Possible keys are:

            - "run_if_fail_any": List of step names. Skip if any of the workers have failed these steps.
            - "run_if_pass_any": List of step names. Skip if any of the workers have passed these steps.
            - "run_if_fail_all": List of step names. Skip if all of the workers have failed these steps.
            - "run_if_pass_all": List of step names. Skip if all of the workers have passed these steps.

    Returns:
        tuple: tuple of boolean status and message, if status is true step should be skipped.
    """
    if data.get("run_if_fail_any"):
        # check if have results for all needed steps
        for k in data["run_if_fail_any"]:
            if k not in results:
                return (
                    "error",
                    f"run_if_fail_any check failed for '{step}', '{k}' results not found",
                )
        # check if any of the steps failed
        for step_name in data["run_if_fail_any"]:
            for worker_name, worker_result in results[step_name].items():
                if (
                    worker_result["failed"] is True
                    and worker_result["status"] != "no_match"
                ):
                    # do not skip this step since one of the steps failed
                    return False, None
        else:
            return (
                True,  # skip this step since none of the steps failed
                f"Skipping {step}, no workers failed any of run_if_fail_any steps: {', '.join(data['run_if_fail_any'])}",
            )
    if data.get("run_if_pass_any"):
        # check if have results for all needed steps
        for k in data["run_if_pass_any"]:
            if k not in results:
                return (
                    "error",
                    f"run_if_pass_any check failed for '{step}', '{k}' results not found",
                )
        # check if any of the steps passed
        for step_name in data["run_if_pass_any"]:
            for worker_name, worker_result in results[step_name].items():
                if (
                    worker_result["failed"] is False
                    and worker_result["status"] != "no_match"
                ):
                    # do not skip this step since one of the steps passed
                    return False, None
        else:
            return (
                True,  # skip this step since none of the steps passed
                f"Skipping {step}, no workers passed any of run_if_pass_any steps: {', '.join(data['run_if_pass_any'])}",
            )
    if data.get("run_if_fail_all"):
        # check if have results for all needed steps
        for k in data["run_if_fail_all"]:
            if k not in results:
                return (
                    "error",
                    f"run_if_fail_all check failed for '{step}', '{k}' results not found",
                )
        # check if all workers failed the step(s)
        for step_name in data["run_if_fail_all"]:
            for worker_name, worker_result in results[step_name].items():
                if (
                    worker_result["failed"] is False
                    and worker_result["status"] != "no_match"
                ):
                    # skip this step since one of the workers passed this step
                    return (
                        True,
                        f"Skipping {step}, worker {worker_name} not failed one of run_if_fail_all steps: {step_name}.",
                    )
        else:
            # do not skip this step since all steps failed
            return False, None
    if data.get("run_if_pass_all"):
        # check if have results for all needed steps
        for k in data["run_if_pass_all"]:
            if k not in results:
                return (
                    "error",
                    f"run_if_pass_all check failed for '{step}', '{k}' results not found",
                )
        # check if all workers passed the step(s)
        for step_name in data["run_if_pass_all"]:
            for worker_name, worker_result in results[step_name].items():
                if (
                    worker_result["failed"] is True
                    and worker_result["status"] != "no_match"
                ):
                    # skip this step since one of the workers failed this step
                    return (
                        True,
                        f"Skipping {step}, worker {worker_name} not passed one of run_if_pass_all steps: {step_name}.",
                    )
        else:
            # do not skip this step since all steps passed
            return False, None

    return False, None  # do not skip this step

`stop_workflow_check(result: dict, step: str, data: dict) -> bool` ¤

Determines whether to stop the workflow based on the result of a specific step and provided data.

Parameters:

Name	Type	Description	Default
`result`	`dict`	The results dictionary for given step.	required
`step`	`str`	The specific step to check within the result.	required
`data`	`dict`	A dictionary containing step data, including a flag to stop if a failure occurs.	required

Returns:

Name	Type	Description
`bool`	`bool`	True if the workflow should be stopped due to a failure in the specified step and the stop_on_failure flag is set; otherwise, False.

Source code in norfab\workers\workflow_worker.py

def stop_workflow_check(self, result: dict, step: str, data: dict) -> bool:
    """
    Determines whether to stop the workflow based on the result of
    a specific step and provided data.

    Args:
        result (dict): The results dictionary for given step.
        step (str): The specific step to check within the result.
        data (dict): A dictionary containing step data, including a flag
            to stop if a failure occurs.

    Returns:
        bool: True if the workflow should be stopped due to a failure in
            the specified step and the stop_on_failure flag is set; otherwise, False.
    """
    if data.get("stop_on_failure") is True:
        for worker_name, worker_result in result.items():
            if worker_result["failed"] is True:
                return True  # stop the workflow since a failure occurred
    return False

`run(workflow: Union[str, Dict]) -> Dict` ¤

Executes a workflow defined by a dictionary.

Parameters:

Name	Type	Description	Default
`workflow`	`Union[str, Dict]`	The workflow to execute. This can be a URL to a YAML file.	required
`remove_no_match_results`	`bool`	Whether to remove empty results from the final output. Defaults to True.	required

Returns:

Name	Type	Description
`Dict`	`Dict`	A dictionary containing the results of the workflow execution.

Raises:

Type	Description
`ValueError`	If the workflow is not a valid URL or dictionary.

Source code in norfab\workers\workflow_worker.py

def run(self, workflow: Union[str, Dict]) -> Dict:
    """
    Executes a workflow defined by a dictionary.

    Args:
        workflow (Union[str, Dict]): The workflow to execute. This can be a URL to a YAML file.
        remove_no_match_results (bool, optional): Whether to remove empty results from the final output. Defaults to True.

    Returns:
        Dict: A dictionary containing the results of the workflow execution.

    Raises:
        ValueError: If the workflow is not a valid URL or dictionary.
    """
    ret = Result(task=f"{self.name}:run", result={})

    # load workflow from URL
    if self.is_url(workflow):
        workflow_name = (
            os.path.split(workflow)[-1].replace(".yaml", "").replace(".yml", "")
        )
        workflow = self.jinja2_render_templates([workflow])
        workflow = yaml.safe_load(workflow)

    # extract workflow parameters
    workflow_name = workflow.pop("name", "workflow")
    workflow_description = workflow.pop("description", "")
    remove_no_match_results = workflow.pop("remove_no_match_results", True)

    self.event(f"Starting workflow '{workflow_name}'")
    log.info(f"Starting workflow '{workflow_name}': {workflow_description}")

    ret.result[workflow_name] = {}

    # run each step in the workflow
    for step, data in workflow.items():
        # check if need to skip step based on run_if_x flags
        skip_status, message = self.skip_step_check(
            ret.result[workflow_name], step, data
        )
        if skip_status is True:
            ret.result[workflow_name][step] = {
                "all-workers": {
                    "failed": False,
                    "result": None,
                    "status": "skipped",
                    "task": data["task"],
                    "errors": [],
                    "messages": [message],
                    "juuid": None,
                }
            }
            self.event(
                f"Skipping workflow step '{step}', one of run_if_x conditions not satisfied"
            )
            continue
        # stop workflow execution on error
        elif skip_status == "error":
            ret.result[workflow_name][step] = {
                "all-workers": {
                    "failed": True,
                    "result": None,
                    "status": "error",
                    "task": data["task"],
                    "errors": [message],
                    "messages": [],
                    "juuid": None,
                }
            }
            self.event(message)
            log.error(message)
            break

        self.event(f"Doing workflow step '{step}'")

        ret.result[workflow_name][step] = self.client.run_job(
            service=data["service"],
            task=data["task"],
            workers=data.get("workers", "all"),
            kwargs=data.get("kwargs", {}),
            args=data.get("args", []),
            timeout=data.get("timeout", 600),
        )

        # check if need to stop workflow based on stop_if_fail flag
        if (
            self.stop_workflow_check(ret.result[workflow_name][step], step, data)
            is True
        ):
            break

    if remove_no_match_results:
        ret.result[workflow_name] = self.remove_no_match_results(
            ret.result[workflow_name]
        )

    return ret

Workflow Worker

WorkflowWorker(inventory, broker: str, worker_name: str, exit_event=None, init_done_event=None, log_level: str = 'WARNING', log_queue: object = None) ¤

get_version() ¤

get_inventory() ¤

remove_no_match_results(results: Dict) -> Dict ¤

skip_step_check(results: dict, step: str, data: dict) -> Tuple[bool, str] ¤

stop_workflow_check(result: dict, step: str, data: dict) -> bool ¤

run(workflow: Union[str, Dict]) -> Dict ¤

`WorkflowWorker(inventory, broker: str, worker_name: str, exit_event=None, init_done_event=None, log_level: str = 'WARNING', log_queue: object = None)` ¤

`get_version()` ¤

`get_inventory()` ¤

`remove_no_match_results(results: Dict) -> Dict` ¤

`skip_step_check(results: dict, step: str, data: dict) -> Tuple[bool, str]` ¤

`stop_workflow_check(result: dict, step: str, data: dict) -> bool` ¤

`run(workflow: Union[str, Dict]) -> Dict` ¤