About real metrics calculation and adaptation to other LLMs #10

I really appreciate your excellent work on LLM Compass. It provides very useful insights for model evaluation.

I have two questions about the implementation:

1. For Figure 5, could you explain how the real metrics were measured and calculated?

2. Does LLM Compass support simulating models other than GPT-3? The current implementation seems to be designed specifically for GPT-3, and adapting it to other models appears to require significant engineering effort.

Thank you for your time and consideration.
