Upasana dev by upasana27 · Pull Request #5 · StephAO/overcooked_ai

upasana27 · 2022-07-20T21:12:49Z

Added code for computing heuristics at each timestep in the step function of OvercookedEnv.

StephAO · 2022-08-02T18:32:37Z

agents/heuristics.py

@@ -0,0 +1,60 @@
+from arguments import get_arguments


Please create a PR with just the heuristics file (i.e. branch off current main, add the heuristics file, and then create the PR).

StephAO · 2022-08-02T18:34:40Z

agents/heuristics.py

+        self.grid = grid
+        self.terrain_mtx = self.grid.terrain_mtx
+        self.planner = mlam
+    def heuristic_1(self):


Please give this a better name and provide a docstring explaining the logic in English.

StephAO · 2022-08-02T18:36:25Z

agents/heuristics.py

+from torch.utils.data import Dataset, DataLoader
+from tqdm import tqdm
+
+class Heuristic():


All heuristics should return a scalar number for each of the alternatives they are ranking. In this case, we are ranking possible subtasks, so the output of each heuristic function here should be a dictionary where the keys are the subtask labels and the values are their scalar "worth". (If you prefer, an ordered list of the values is also acceptable, as long as the order is clear.)
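
A minimal sketch of the return format requested above, not the PR's actual code. The subtask labels, the target positions, and the function name `distance_worth` are hypothetical, and Manhattan distance stands in for whatever distance the planner in the PR provides.

```python
from typing import Dict, Tuple

Position = Tuple[int, int]

# Hypothetical subtask labels and target grid positions (assumptions for
# illustration; the real labels and layout are not shown in this thread).
SUBTASK_TARGETS: Dict[str, Position] = {
    "get_onion": (0, 3),
    "put_onion_in_pot": (2, 0),
    "serve_soup": (4, 2),
}


def distance_worth(agent_pos: Position,
                   subtask_targets: Dict[str, Position]) -> Dict[str, float]:
    """Rank subtasks by how close their target is to the agent.

    Each subtask gets the scalar worth 1 / (1 + d), where d is the Manhattan
    distance from the agent to the subtask's target. Closer targets score
    higher, and a target at the agent's position scores exactly 1.0.
    """
    worth = {}
    for label, (tx, ty) in subtask_targets.items():
        dist = abs(agent_pos[0] - tx) + abs(agent_pos[1] - ty)
        worth[label] = 1.0 / (1.0 + dist)
    return worth


scores = distance_worth((0, 3), SUBTASK_TARGETS)
```

Keying the result by subtask label keeps the ordering explicit, which is the concern raised for the list alternative.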

StephAO · 2022-08-02T18:36:41Z

agents/heuristics.py

+            heuristic1.append(distance_heur)
+        return heuristic1
+
+    def heuristic2(self, history0, history1):


Docstring, please.

StephAO · 2022-08-02T18:37:42Z

agents/heuristics.py

+
+    def heuristic2(self, history0, history1):
+        # dividing by history to find task probabiity
+        task_counter_sum = [ [sum(x)/ len(history1) for x in zip(*history0)], [sum(x)/ len(history1) for x in zip(*history1)]]


What are x and history here? The output should be a scalar value for each subtask, so counting how many times each has happened so far is only half the logic.
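
A hedged sketch of the counting half only, written so that the inputs are explicit. It assumes each history is a list with one entry per timestep, where each entry is a one-hot vector over subtasks; that reading is a guess based on the `zip(*history)` call in the diff, and the labels and the function name are hypothetical. Unlike the quoted line, which divides both sums by `len(history1)`, this version normalizes a history by its own length. How these frequencies should be turned into a final "worth" is left open, since the thread does not specify it.

```python
from typing import Dict, List, Sequence


def subtask_frequency(history: Sequence[Sequence[int]],
                      labels: List[str]) -> Dict[str, float]:
    """Return the fraction of past timesteps spent on each subtask.

    history: one entry per timestep; each entry is assumed to be a one-hot
             vector with one slot per subtask (an assumption, see above).
    labels:  subtask labels, in the same order as the one-hot slots.
    """
    if not history:
        # No observations yet: every subtask has frequency 0.
        return {label: 0.0 for label in labels}
    n_steps = len(history)
    # zip(*history) regroups the data so each tuple holds one subtask's
    # entries across all timesteps; summing it counts occurrences.
    return {
        label: sum(per_subtask) / n_steps
        for label, per_subtask in zip(labels, zip(*history))
    }


example_history = [[1, 0, 0], [0, 1, 0], [1, 0, 0], [1, 0, 0]]
example_labels = ["get_onion", "put_onion_in_pot", "serve_soup"]
freqs = subtask_frequency(example_history, example_labels)
```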

upasana27 added 3 commits July 3, 2022 15:42

add subtask predictor

cbad178

predict subtasks with random batches

3e8e0ef

compute heuristic for each timestep

0a7b91d

StephAO reviewed Aug 2, 2022

Create Plastic Policy agent

d32d57a

upasana27 wants to merge 4 commits into main from upasana_dev

2 participants
