查看huggingface dataset上ALFWorld和Mind2Web的训练数据,发现根据提供的指令,模型不可能产生预期的行为,比如下面两条数据,这个是符合预期的吗? <img width="480" alt="image (1)" src="https://github.com/THUDM/AgentTuning/assets/3916058/09a97d00-0716-4a18-a4f4-e9f16a17e7e1"> <img width="480" alt="image" src="https://github.com/THUDM/AgentTuning/assets/3916058/489fd8f5-5a44-4d83-9998-bec1591b4266">