Task

A task in our context refers to a graph problem. The abstraction of such a problem enables us to experiment with multiple issues and understand the efficacy of our policy in varying circumstances.

Methods

Several methods are employed to set up the task, determine the reward, verify task completion, and evaluate node performance.

Setup

This step involves defining the initial action, observations, and the starting node.

Reward

Based on the new state, we determine the reward. This measure encourages the policy to make beneficial decisions.

Terminated

This method checks whether the task has been solved. It helps us understand the progress and efficiency of the policy.

Node Correctness

This method returns a color grade to visually represent how well a node has performed in its actions.

List of Tasks

Below are some tasks of special interest that we use to test our policy.

Coloring

(To Be Described)

Distance

(To Be Described)

The description of each task will provide a clear understanding of the problem context and the expected outcomes. This documentation will be updated as soon as the task descriptions are finalized.

TODO

Task

Methods​

Setup​

Reward​

Terminated​

Node Correctness​

List of Tasks​

Coloring​

Distance​