Should fairness metrics (aka de-biasing metrics) assume you're doing data augmentation?

### Environment details
* SDMetrics version: 0.21.0

### Description
We have an upcoming fairness metric called [EqualizedOddsImprovement](https://docs.sdv.dev/sdmetrics/metrics/privacy-and-fairness-metrics/equalizedoddsimprovement). It is meant to indicate whether the synthetic data improves fairness compared to the real data.

For fairness/de-biasing metrics, we should decide whether the expected use case is data augmentation or data replacement:

1. **Data augmentation** means that you're expecting to add the synthetic data to the real data. So the metric should actually be comparing _real data_ vs. _real + synthetic data_. 
    - This would make sense if, for example, you want to ultimately use the data for training an ML model. (You wouldn't want to get rid of the real data in this case, but rather add to it.)
2. **Data replacement** means that you're expecting to use the synthetic data in place of the real data. So the metric should actually be comparing _real data_ vs. _synthetic data_ only.
    - This would make sense if, for example, you want to ultimately share the synthetic data externally. (You would want to entirely replace the real data with synthetic data.)
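
The two options above differ only in which dataset is scored against the real data. Here is a minimal sketch of that difference. Everything in it is hypothetical and not part of SDMetrics: `fairness_score` is a simple stand-in (one minus the gap in positive-label rate between two groups, so it is not equalized odds), and the data is made up for illustration.

```python
def fairness_score(rows):
    """Stand-in score (NOT equalized odds): 1 - |P(label=1 | A) - P(label=1 | B)|.

    `rows` is a list of (group, label) tuples. Higher means the two groups
    have more similar positive-label rates.
    """
    rates = {}
    for group in ("A", "B"):
        labels = [label for g, label in rows if g == group]
        rates[group] = sum(labels) / len(labels)
    return 1.0 - abs(rates["A"] - rates["B"])


def replacement_improvement(real, synthetic, score=fairness_score):
    # Data replacement: compare real data vs. synthetic data only.
    return score(synthetic) - score(real)


def augmentation_improvement(real, synthetic, score=fairness_score):
    # Data augmentation: compare real data vs. real + synthetic data.
    return score(real + synthetic) - score(real)


# Made-up data: the real data favors group A, the synthetic data is balanced.
real = [("A", 1)] * 8 + [("A", 0)] * 2 + [("B", 1)] * 2 + [("B", 0)] * 8
synthetic = [("A", 1)] * 5 + [("A", 0)] * 5 + [("B", 1)] * 5 + [("B", 0)] * 5

print("replacement: ", replacement_improvement(real, synthetic))
print("augmentation:", augmentation_improvement(real, synthetic))
```

With this toy data, the replacement comparison reports a larger improvement than the augmentation comparison, because the biased real rows are still present in the augmented set. The same synthetic data can therefore score differently depending on which use case the metric assumes.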

Currently, the metric is designed with the _replacement_ strategy in mind: It computes the equalized odds on the real data, and then on the synthetic data. It then returns the difference between the two computations.
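
For reference, one common way to quantify equalized odds is to take the larger of the true-positive-rate gap and the false-positive-rate gap between two groups. The sketch below assumes that definition; it is not taken from the SDMetrics implementation, and the helper names and the toy rows are hypothetical.

```python
def positive_prediction_rate(rows, group, actual):
    """Share of rows in `group` with true label `actual` that were predicted 1.

    With actual=1 this is the true positive rate; with actual=0 it is the
    false positive rate. `rows` is a list of (group, y_true, y_pred) tuples.
    """
    preds = [y_pred for g, y_true, y_pred in rows if g == group and y_true == actual]
    return sum(preds) / len(preds)


def equalized_odds_gap(rows, group_a="A", group_b="B"):
    # 0.0 means both groups have identical TPR and FPR; larger is less fair.
    tpr_gap = abs(positive_prediction_rate(rows, group_a, 1)
                  - positive_prediction_rate(rows, group_b, 1))
    fpr_gap = abs(positive_prediction_rate(rows, group_a, 0)
                  - positive_prediction_rate(rows, group_b, 0))
    return max(tpr_gap, fpr_gap)


# Toy (group, y_true, y_pred) rows, made up for illustration.
rows = [
    ("A", 1, 1), ("A", 1, 1), ("A", 0, 0), ("A", 0, 1),
    ("B", 1, 1), ("B", 1, 0), ("B", 0, 0), ("B", 0, 0),
]

print(equalized_odds_gap(rows))
```

Under the replacement strategy, a gap like this would be computed once for the real data and once for the synthetic data; under augmentation, the second computation would use the real and synthetic data combined.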

### Additional Context
For the data augmentation case, see #779 for usage options.

Should fairness metrics (aka de-biasing metrics) assume you're doing data augmentation? #780