The inference script provided in the repo doesn't seem to work in its current form. I had to make some changes to it to make it run. In doing so, I am not able to replicate the results provided in the paper. The quantitative metrics give a much lower score in comparison to what is reported in the paper.
Please help us point out the mistake we are doing or update the inference/training file with the correct one.
Thank you.