Training details about MineAgent #9

Hi. Thank you for releasing this valuable benchmark! I'm implementing the PPO agent reported in the paper, but I found some discrepancies between the code and the paper.

### Trimmed action space

As mentioned in #4, the code below does not correspond to the 89 action dimensions described in Appendix G.2.

https://github.com/MineDojo/MineCLIP/blob/e6c06a0245fac63dceb38bc9bd4fecd033dae735/main/mineagent/run_env_in_loop.py#L75

### About the `compass` observation

In the paper, the compass observation has shape `(2,)`. However, the code uses an input of shape `(4,)`.

https://github.com/MineDojo/MineCLIP/blob/e6c06a0245fac63dceb38bc9bd4fecd033dae735/main/mineagent/run_env_in_loop.py#L25
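To make the question concrete, here is a minimal sketch of one way the two shapes could be related. This is purely my guess and not something I found in the paper or the code: a `(2,)` compass could hold the raw yaw and pitch angles, while a `(4,)` compass could hold the sine and cosine of each. The function names `compass_raw` and `compass_sincos` are hypothetical.

```python
import math


def compass_raw(yaw_deg, pitch_deg):
    """Hypothetical (2,) encoding: the two raw angles in degrees."""
    return (yaw_deg, pitch_deg)


def compass_sincos(yaw_deg, pitch_deg):
    """Hypothetical (4,) encoding: sin and cos of each angle.

    Assumption: this is only one plausible explanation for the shape
    mismatch, not the confirmed preprocessing used by MineAgent.
    """
    yaw = math.radians(yaw_deg)
    pitch = math.radians(pitch_deg)
    return (math.sin(yaw), math.cos(yaw), math.sin(pitch), math.cos(pitch))


if __name__ == "__main__":
    print(len(compass_raw(90.0, 0.0)))     # number of entries in the raw form
    print(len(compass_sincos(90.0, 0.0)))  # number of entries in the sin/cos form
```

If the `(4,)` input is built differently, could you clarify which quantities it contains and in what order?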

### Training on `MultiDiscrete` action space

Is the 89-dimensional action space in the paper a `MultiDiscrete` action space like the original MineDojo action space, or do you simply treat it as a single `Discrete` action space?
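To illustrate why the distinction matters for reimplementation, here is a small sketch of the two interpretations. The component sizes in `NVEC` are made up for illustration and are not the actual MineAgent or MineDojo sizes; the helper names are hypothetical as well. With a factorized `MultiDiscrete` policy the network outputs one categorical head per component, so the number of logits is the sum of the sizes. With a flattened `Discrete` policy each joint action gets its own index, so the count is the product.

```python
from math import prod

# Hypothetical component sizes, chosen only for illustration.
NVEC = (3, 3, 4)


def num_logits_multidiscrete(nvec):
    """Factorized policy: one categorical head per component."""
    return sum(nvec)


def num_actions_discrete(nvec):
    """Flattened policy: one index per joint action."""
    return prod(nvec)


def flatten_action(action, nvec):
    """Map a MultiDiscrete action tuple to a single Discrete index."""
    index = 0
    for a, n in zip(action, nvec):
        if not 0 <= a < n:
            raise ValueError(f"component {a} out of range for size {n}")
        index = index * n + a
    return index


def unflatten_action(index, nvec):
    """Inverse of flatten_action: recover the MultiDiscrete tuple."""
    action = []
    for n in reversed(nvec):
        index, a = divmod(index, n)
        action.append(a)
    return tuple(reversed(action))


if __name__ == "__main__":
    print(num_logits_multidiscrete(NVEC))  # sum of component sizes
    print(num_actions_discrete(NVEC))      # product of component sizes
    flat = flatten_action((2, 1, 3), NVEC)
    print(flat, unflatten_action(flat, NVEC))
```

Knowing whether 89 is a sum of per-component sizes or a count of joint actions would tell me which of these two policy heads to build.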

In addition, could you release the training code for the three task groups in the paper (or share it via my GitHub email)? It would be very helpful for baseline comparisons!
