Hi, this is impressive work!
I noticed that the AIDT dataset uses InsightFace Swapper to generate synthetic images. Since the swapping model runs at 128×128 resolution, did you upscale the swapped results with standard interpolation (e.g. Lanczos or bicubic) or with an AI-based restoration method such as CodeFormer?