Skip to content

Inquiry about the MLLM and prompts used for recaptioning #2

@w1oves

Description

@w1oves

Hi authors,
Thanks for your great work! I am currently studying your dataset and have a few questions regarding the recaptioning process mentioned:

  1. Model Selection: Could you clarify which Multimodal Large Language Model (MLLM) was used for the recaptioning pipeline?
  2. Prompt Details: What specific prompts were used to guide the model during this process?
  3. Experimental Analysis: Does the paper include any experimental analysis or ablation studies specifically focusing on the impact or quality of these recaption results? If so, could you point me to the relevant section?

Thanks in advance for your time and help!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions