
A question about calculatePerplexity #5

@Ethan-Chen-plus

I know that the following code can calculate the loss:
[screenshot of the original calculatePerplexity implementation, which calls model(input_ids, labels=input_ids)]
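For reference, the screenshot presumably shows something along the following lines. This is a sketch reconstructed from the proposed variant below, not the verbatim original, and it assumes device is defined elsewhere in the script.

import torch

def calculatePerplexity(sentence, model, tokenizer):
    """
    exp(loss)
    """
    # Encode the sentence and add a batch dimension
    input_ids = torch.tensor(tokenizer.encode(sentence)).unsqueeze(0)
    input_ids = input_ids.to(device)  # device assumed to be defined elsewhere
    with torch.no_grad():
        # labels=input_ids: the model shifts the labels internally, so this
        # computes the next-token cross-entropy on the sentence itself
        outputs = model(input_ids, labels=input_ids)
    loss, logits = outputs[:2]
    return torch.exp(loss)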
However, why are the labels set to input_ids? After reading the paper, I think the code should perhaps be:

def calculatePerplexity(sentence, model1, model2, tokenizer):
    """
    exp(loss)
    """
    input_ids = torch.tensor(tokenizer.encode(sentence)).unsqueeze(0)
    input_ids = input_ids.to(device)
    # Generate a continuation with model2 to use as the target sequence
    output_ids = model2.generate(input_ids, **gen_kwargs).to(device)
    with torch.no_grad():
        # Score model1 against model2's output instead of the input itself
        outputs = model1(input_ids, labels=output_ids)
    loss, logits = outputs[:2]
    return torch.exp(loss)

This would let us test whether the outputs of the two models are the same; if they differ, one of the models may have memorized the training data.
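As a rough illustration of that comparison, here is a minimal sketch that greedily generates from both models on the same prompt and checks whether the continuations match. The model names, prompt, and generation settings are placeholders, not the repository's actual setup.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"

# Placeholder checkpoints; substitute the two models under comparison
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model1 = AutoModelForCausalLM.from_pretrained("gpt2").to(device)
model2 = AutoModelForCausalLM.from_pretrained("gpt2-medium").to(device)

prompt = "Example prompt from the candidate training data"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(device)

gen_kwargs = {"max_new_tokens": 32, "do_sample": False}  # greedy decoding

with torch.no_grad():
    out1 = model1.generate(input_ids, **gen_kwargs)
    out2 = model2.generate(input_ids, **gen_kwargs)

# If the continuations differ, one model may be reproducing memorized text
same = out1.shape == out2.shape and torch.equal(out1, out2)
print("identical continuations:", same)
print("model1:", tokenizer.decode(out1[0], skip_special_tokens=True))
print("model2:", tokenizer.decode(out2[0], skip_special_tokens=True))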
