Inquiry about the MLLM and prompts used for recaptioning

Hi authors,
Thanks for your great work! I am currently studying your dataset and have a few questions regarding the recaptioning process mentioned:
1. **Model Selection:** Could you clarify which Multimodal Large Language Model (MLLM) was used for the recaptioning pipeline?
2. **Prompt Details:** What specific prompts were used to guide the model during this process?
3. **Experimental Analysis:** Does the paper include any experimental analysis or ablation studies specifically focusing on the impact or quality of these recaption results? If so, could you point me to the relevant section?


Thanks in advance for your time and help!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inquiry about the MLLM and prompts used for recaptioning #2

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Inquiry about the MLLM and prompts used for recaptioning #2

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions