feature(zc): add MetaDiffuser and prompt-dt #771
Super1ce wants to merge 17 commits into opendilab:main
Conversation
```python
) -> 'Policy':  # noqa
    """
    Overview:
        Serial pipeline entry.
```
```python
# use the original batch size per gpu and increase learning rate
# correspondingly.
cfg.policy.learn.batch_size // get_world_size(),
# cfg.policy.learn.batch_size
```
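For context, the trade-off this hunk is weighing is the linear-scaling rule: if every rank keeps the full configured batch size, the effective global batch grows with the number of GPUs, so the learning rate should be scaled up by the same factor. A minimal sketch of the two options, assuming an illustrative `learning_rate` config field (not necessarily the PR's actual field name):

```python
from easydict import EasyDict
from ding.utils import get_world_size

cfg = EasyDict(dict(policy=dict(learn=dict(batch_size=64, learning_rate=1e-4))))
world_size = get_world_size()
if world_size > 1:
    # Option in the diff: keep the global batch fixed by splitting it across ranks.
    per_gpu_batch_size = cfg.policy.learn.batch_size // world_size
    # Option in the comment: keep the configured batch size on every GPU and
    # scale the learning rate linearly with the enlarged global batch.
    # cfg.policy.learn.learning_rate *= world_size
```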
```python
for epoch in range(cfg.policy.learn.train_epoch):
    if get_world_size() > 1:
        dataloader.sampler.set_epoch(epoch)
    for i in range(cfg.policy.train_num):
```
"train_num"->"batch_size"?
```python
    (prompt_returns_embeddings, prompt_state_embeddings, prompt_action_embeddings), dim=1
).permute(0, 2, 1, 3).reshape(prompt_states.shape[0], 3 * prompt_seq_length, self.h_dim)

# prompt_stacked_attention_mask = torch.stack(
```
Remove these unused lines?
ding/model/template/diffusion.py (Outdated)
```python
self.returns_condition = returns_condition
self.condition_guidance_w = condition_guidance_w

# def get_loss_weights(self, discount: int):
```
Remove these unused lines?
```python
return model_mean + model_std * noise, y

def free_guidance_sample(
```
Add type hints for all arguments, and add an `Overview` docstring for every function and class.
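To make the request concrete, here is a hedged sketch of the expected style: type hints on every argument plus DI-engine's `Overview`/`Arguments`/`Returns` docstring sections. The signature, shapes, and the classifier-free-guidance body are assumptions for illustration, not the PR's actual code:

```python
from typing import Optional

import torch
import torch.nn as nn


class FreeGuidanceModel(nn.Module):
    """
    Overview:
        Stub illustrating the requested documentation style; wraps a noise \
        predictor and applies classifier-free guidance at sampling time.
    """

    def __init__(self, model: nn.Module, condition_guidance_w: float = 1.2) -> None:
        super().__init__()
        self.model = model
        self.condition_guidance_w = condition_guidance_w

    def free_guidance_sample(self, x: torch.Tensor, t: torch.Tensor, returns: Optional[torch.Tensor]) -> torch.Tensor:
        """
        Overview:
            Blend conditional and unconditional noise predictions with \
            weight ``condition_guidance_w`` (classifier-free guidance).
        Arguments:
            - x (:obj:`torch.Tensor`): Noised trajectory at diffusion step ``t``, shape ``(B, T, D)``.
            - t (:obj:`torch.Tensor`): Diffusion timestep indices, shape ``(B, )``.
            - returns (:obj:`Optional[torch.Tensor]`): Return-to-go condition, shape ``(B, 1)``.
        Returns:
            - eps (:obj:`torch.Tensor`): Guided noise prediction, same shape as ``x``.
        """
        eps_cond = self.model(x, t, returns)
        eps_uncond = self.model(x, t, None)  # unconditional pass: condition dropped
        return eps_uncond + self.condition_guidance_w * (eps_cond - eps_uncond)
```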
ding/model/template/diffusion.py (Outdated)
```python
self.embed = nn.Sequential(
    nn.Linear((obs_dim * 2 + action_dim + 1) * encoder_horizon, dim * 4),
    Mish(),  # nn.Mish()
```
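The `Mish(),  # nn.Mish()` pair suggests a compatibility shim: `nn.Mish` only exists since PyTorch 1.9, so older versions need a hand-rolled module. A minimal sketch of such a shim, assuming it is meant to match the built-in:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class Mish(nn.Module):
    """Mish activation, ``x * tanh(softplus(x))``; equivalent to ``nn.Mish``
    (added in PyTorch 1.9), kept as a custom module for older torch versions."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x * torch.tanh(F.softplus(x))
```

If the minimum supported torch version is at least 1.9, the shim could simply be dropped in favor of `nn.Mish()`.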
```python
self._learn_model = model_wrap(self._model, wrapper_name='base')
self._learn_model.reset()

def _forward_learn(self, data: List[torch.Tensor]) -> Dict[str, Any]:
```
`data` should be collated into a batch before entering `policy._forward_learn`; its type should be `Dict[str, torch.Tensor]`.
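Concretely, the suggestion is to move collation out of the policy: build one batched dict from the list of per-sample dicts before calling `_forward_learn`. A minimal sketch with illustrative field names, using DI-engine's `default_collate`:

```python
import torch
from ding.utils.data import default_collate

# A list of per-sample dicts, as produced by the dataset / replay buffer.
samples = [
    {'obs': torch.randn(17), 'action': torch.randn(6), 'reward': torch.randn(1)}
    for _ in range(32)
]
batch = default_collate(samples)        # -> Dict[str, torch.Tensor]
assert batch['obs'].shape == (32, 17)   # each key stacked along a new batch dim
# policy._forward_learn(batch)          # the policy now receives one collated dict
```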
```python
if self.have_train:
    if self.task_id is None:
        self.task_id = [0] * self.eval_batch_size
        # if data_id is None:
```
```python
if self._cuda:
    data = to_device(data, self._device)

p_s, p_a, p_rtg, p_t, p_mask, timesteps, states, actions, rewards, returns_to_go, \
```
`data` should be collated into a batch before entering `policy._forward_learn`; its type should be `Dict[str, torch.Tensor]`, so that each field can be accessed by name reliably.
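With that change, the long positional unpacking above collapses into named access, which cannot silently mis-assign fields if the order ever changes. A sketch of the resulting method body, with assumed key names:

```python
from typing import Any, Dict

import torch
from ding.torch_utils import to_device


def _forward_learn(self, data: Dict[str, torch.Tensor]) -> Dict[str, Any]:
    # Method-body sketch for the policy class; the keys below are illustrative.
    if self._cuda:
        data = to_device(data, self._device)
    states, actions, rewards = data['states'], data['actions'], data['rewards']
    returns_to_go, timesteps = data['returns_to_go'], data['timesteps']
    prompt = {k: data[k] for k in ('p_s', 'p_a', 'p_rtg', 'p_t', 'p_mask')}
    ...
```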
```python
self.returns_mlp = nn.Sequential(
    SinusoidalPosEmb(dim),
    nn.Linear(dim, dim * 4),
    # nn.Mish(),
```
```python
@DATASET_REGISTRY.register('meta_traj')
class MetaTraj(Dataset):

    def __init__(self, cfg):
```
Add documentation (docstring and type annotations) for this class and its config items.
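As a concrete starting point, the requested documentation could follow DI-engine's docstring convention. The config item names below are assumptions for illustration, not the PR's actual fields:

```python
from easydict import EasyDict
from torch.utils.data import Dataset
from ding.utils import DATASET_REGISTRY


@DATASET_REGISTRY.register('meta_traj')  # registration mirrors the diff above
class MetaTraj(Dataset):
    """
    Overview:
        Trajectory dataset for meta-RL: each item pairs a task-specific \
        prompt segment with a target training segment.
    Config:
        - data_dir_prefix (:obj:`str`): Path prefix of the saved trajectories.
        - context_len (:obj:`int`): Length of the sampled training segment.
        - prompt_len (:obj:`int`): Length of the sampled prompt segment.
    """

    def __init__(self, cfg: EasyDict) -> None:
        self.context_len = cfg.context_len
        self.prompt_len = cfg.prompt_len
```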
```python
        Interaction serial evaluator class, in which the policy interacts with the env. \
        This evaluator evaluates the algorithm on a list of test environments.
    Interfaces:
        __init__, reset, reset_policy, reset_env, close, should_eval, eval
```
Add MetaDiffuser and prompt-dt algorithms