Previously, I successfully trained and deployed pi0.5-SF on the simulation dataset LIBERO PLUS, improving the success rate from 61.1% for pi0.5 to 75.2%. Now, when attempting to deploy this algorithm on real hardware using the default simulation parameters, I encounter situations where tasks that pi0.5 can handle perfectly result in a 0% success rate for pi0.5-SF. I would like to know if you and your team have any suggestions on this, such as parameter selection, dataset scale, etc.?