Thanks for open-sourcing the AgentTuning code. I am quite interested in training the model, but I see that the training framework is not open-sourced (#1).
The discussion there mentioned that it could support P-Tuning or LoRA. I am also wondering whether it could support RLHF as well?
I recently read this paper: https://arxiv.org/abs/2312.14878, and I am curious how AgentLM would perform if we let it learn from interacting with environments (see "Finetune type II" in that paper).