
Support multimodal input data#278

Open
moskomule wants to merge 9 commits into main from feat/multimodal-input

Conversation

@moskomule
Member

Refer to #277.

Add parse_input_utterance and preprocessor to TemplateChatDataset to support multimodal input data.

  • parse_input_utterance parses structured content of the kind used by multimodal LMs
  • preprocessor preprocesses each item before template rendering, e.g., image resizing
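As a rough sketch of the first point, a rendered template string can be parsed into structured content with either of the two methods the PR names (literal_eval and json_loads). The function below is illustrative only; the signature and argument names are assumptions, not the actual diff.

```python
import ast
import json

def parse_input_utterance(rendered: str, method: str = "json_loads"):
    """Parse a template-rendered string into structured message content.

    Hypothetical sketch: the real TemplateChatDataset parameter may be
    wired differently, but both parsing methods behave as shown here.
    """
    if method == "literal_eval":
        # Accepts Python-literal syntax, e.g. single-quoted dicts.
        return ast.literal_eval(rendered)
    if method == "json_loads":
        # Accepts strict JSON, e.g. double-quoted keys and strings.
        return json.loads(rendered)
    raise ValueError(f"Unknown parsing method: {method}")

content = parse_input_utterance(
    '[{"type": "text", "text": "What is in this image?"}]'
)
```

The result is a list of dicts rather than a plain string, which is the shape multimodal chat APIs expect for message content.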

As an example of preprocessor, I created flexeval/multimodal/image_preprocessor.py.

I would like feedback on whether such domain-specific preprocessors should be excluded from flexeval itself (and instead placed alongside users' jsonnet files).

Contributor

Copilot AI left a comment


Pull request overview

This PR adds support for multimodal input data (e.g., text + images) to flexeval, enabling evaluation of Vision Language Models (VLMs) and other multimodal language models. The implementation introduces two key features to TemplateChatDataset: parse_input_utterance to parse structured content from templates into lists of dictionaries (as required by OpenAI's multimodal API format), and preprocessor to preprocess items before template rendering (e.g., image resizing or base64 encoding).
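For reference, the list-of-dicts shape mentioned above is the content format OpenAI's chat API uses for multimodal user messages: a list of typed parts instead of a plain string.

```python
# OpenAI-style multimodal message: "content" is a list of typed parts
# (text and image_url), which is what the parsed template must produce.
# The base64 payload is truncated for illustration.
message = {
    "role": "user",
    "content": [
        {"type": "text", "text": "Describe this image."},
        {
            "type": "image_url",
            "image_url": {"url": "data:image/png;base64,iVBORw0KGgo..."},
        },
    ],
}
```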

Changes:

  • Added parse_input_utterance parameter supporting literal_eval and json_loads parsing methods
  • Added preprocessor parameter accepting a list of preprocessor instances for item transformation
  • Created Preprocessor abstract base class defining the preprocessor interface
  • Implemented ConvertImageToBase64 as an example preprocessor for image handling
  • Added tests for the parse_input_utterance functionality
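The Preprocessor ABC and ConvertImageToBase64 from the list above might look roughly like the sketch below. The class names come from this PR, but the method signature, the item-dict shape, and the constructor parameters are assumptions; resizing is omitted here (the real implementation could use Pillow for that).

```python
import base64
from abc import ABC, abstractmethod

class Preprocessor(ABC):
    """Assumed interface: transform one dataset item dict into another."""

    @abstractmethod
    def __call__(self, item: dict) -> dict:
        """Return a transformed copy of one dataset item."""

class ConvertImageToBase64(Preprocessor):
    """Encode raw image bytes under `image_key` as a base64 data URL.

    Illustrative only: the actual flexeval implementation may differ.
    """

    def __init__(self, image_key: str = "image", mime: str = "image/png"):
        self.image_key = image_key
        self.mime = mime

    def __call__(self, item: dict) -> dict:
        item = dict(item)  # avoid mutating the source item in place
        encoded = base64.b64encode(item[self.image_key]).decode("ascii")
        item[self.image_key] = f"data:{self.mime};base64,{encoded}"
        return item

processed = ConvertImageToBase64()(
    {"image": b"\x89PNG", "question": "What is shown here?"}
)
```

Running each item through such preprocessors before template rendering lets the template simply interpolate the already-encoded data URL into the structured content.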

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 11 comments.

File Description
flexeval/core/chat_dataset/template_based.py — Core implementation of multimodal support: adds the Preprocessor ABC and the parse_input_utterance and preprocessor parameters to TemplateChatDataset and its subclasses
flexeval/multimodal/image_preprocessor.py — Example implementation of an image-to-base64 preprocessor with resizing support
flexeval/multimodal/__init__.py — Module initialization exporting ConvertImageToBase64
tests/core/chat_dataset/test_template_based.py — Test coverage for the parse_input_utterance feature with different parsing methods


moskomule and others added 5 commits February 12, 2026 15:11
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Labels

enhancement New feature or request

