The preceding text states 'Below we default to the long path for reproducibility,' but the code now defaults to df_synthetic. Also, the inline comment 'if you followed the pedigree path' is inaccurate for df_synthetic. Please update the prose and inline comment to reflect the new default and data source (e.g., 'synthetic data generated above').
Originally posted by @Copilot in #32