Dear authors,
Thank you for your impressive work on VoxInstruct. I found the paper and demo both inspiring, especially the model’s ability to generate expressive speech from natural language instructions.
I’m very interested in reproducing your results and building upon your framework. I was wondering if you have plans to release the full training pipeline and fine-tuning scripts.
I’d greatly appreciate any updates regarding the expected release timeline.
Best regards,