Thanks for releasing the code.
I’m trying to reproduce the PPA‑1 dataset described in Appendix B. However, a full re-download of some upstream sources (e.g., ENA/NCBI) seems infeasible for academic groups (potentially tens of TB). Could you share, even roughly, the subset/scope used for each data source in Table 1?
Thanks a lot!