HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning

[Project Page] [Paper] [Model] [Data]

News

  • 2025.09.19 HoloLLM is accepted by NeurIPS 2025!🎉

Contents

  • Install
  • Models
  • Data
  • Evaluation
  • Training
  • Citation
  • Acknowledgement
  • License
  • Troubleshooting

Install

  1. Clone the repo into a local folder.
git clone https://github.com/NTUMARS/HoloLLM.git

cd HoloLLM
  2. Install packages.
conda create -n holollm python=3.9 -y
conda activate holollm

pip install -r requirements.txt
  3. Set up flash-attention-2.
# Install flash-attn-2

wget https://github.com/Dao-AILab/flash-attention/releases/download/v2.6.2/flash_attn-2.6.2+cu118torch2.0cxx11abiFALSE-cp39-cp39-linux_x86_64.whl

pip install flash_attn-2.6.2+cu118torch2.0cxx11abiFALSE-cp39-cp39-linux_x86_64.whl
  4. Install pycocoevalcap for evaluation.
cd ./eval/
git clone https://github.com/salaniz/pycocoevalcap.git
cd ./pycocoevalcap
pip install -e .

# Install java dependencies
sudo apt install openjdk-11-jre-headless
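
For reference, below is a minimal sketch of how the pycocoevalcap scorers can be applied to generated captions. The caption strings and sample id are dummy values for illustration only; this is not the project's evaluation script (see ./eval/ for that). The PTB tokenizer shells out to Java, which is why the OpenJDK package above is needed.

# Illustrative pycocoevalcap usage (dummy captions; not the official evaluation code).
from pycocoevalcap.tokenizer.ptbtokenizer import PTBTokenizer
from pycocoevalcap.bleu.bleu import Bleu
from pycocoevalcap.cider.cider import Cider

# Ground-truth and generated captions, keyed by sample id.
gts = {"0": [{"caption": "a person is waving their right hand"}]}
res = {"0": [{"caption": "a person waves the right hand"}]}

# The PTB tokenizer calls a Java jar under the hood.
tokenizer = PTBTokenizer()
gts_tok = tokenizer.tokenize(gts)
res_tok = tokenizer.tokenize(res)

for name, scorer in [("BLEU", Bleu(4)), ("CIDEr", Cider())]:
    score, _ = scorer.compute_score(gts_tok, res_tok)
    print(name, score)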

Models

  1. Universal Modality-Injection Projector (UMIP) of HoloLLM.

    Download the checkpoints of UMIP and store them in ./checkpoints.

  2. Modality-Specific Tailored Encoders.

    Download the checkpoints of modality-specific tailored encoders and store them in ./modality_specific_encoders.

  3. LLaMA2-7B

    Download the LLaMA2-7B checkpoints and store "consolidated.00.pth" in ./LLM_ckpt/llama2-7B.
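
As a quick sanity check after downloading, the snippet below verifies that the expected files and folders are in place. The UMIP checkpoint name is taken from the Evaluation section and may differ from the files you actually downloaded; adjust the paths accordingly.

# Sanity check for checkpoint locations (file names follow the Evaluation section
# and are placeholders; adjust to your downloads).
import os

paths = [
    "./checkpoints/holollm_mmfi_random.pth",      # one UMIP checkpoint
    "./modality_specific_encoders",               # tailored encoder checkpoints (directory)
    "./LLM_ckpt/llama2-7B/consolidated.00.pth",   # LLaMA2-7B weights
]
for p in paths:
    status = "found" if os.path.exists(p) else "missing"
    print(f"{status:>7}: {p}")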

Data

  1. MMFi Dataset

    Download the MMFi dataset and the corresponding RGB images.

    Organize the MMFi dataset as follows:

    ./datasets/MMFi
    |-- E01
    |   |-- S01
    |   |   |-- A01
    |   |   |   |-- rgb_img (.png)
    |   |   |   |-- rgb
    |   |   |   |-- depth
    |   |   |   |-- mmwave
    |   |   |   |-- lidar   
    |   |   |   |-- wifi-csi
    |   |   |   |-- ...
    |   |   |-- A02
    |   |   |-- ...
    |   |   |-- A27
    |   |-- S02
    |   |-- ...
    |   |-- S10
    |-- E02
    |......
    |-- E03
    |......
    |-- E04
    |......
    
  2. XRF55 Dataset

    Download the XRF55 dataset. Please note that you need to request permission from the authors of XRF55 to access the Kinect videos (RGB modality).

    Organize the XRF55 dataset as follows:

    ./datasets/XRF55
    |-- Scene1
    |   |-- Color
    |   |   |-- 03
    |   |   |   |-- 03_01_01
    |   |   |   |-- 03_01_02
    |   |   |   |-- 03_01_03
    |   |   |   |-- ... (PID_ActionID_SampleID)
    |   |   |-- 04
    |   |   |-- ...
    |   |   |-- 31
    |   |-- Depth
    |   |   |......
    |   |-- IR
    |   |   |......
    |   |-- WiFi
    |   |   |......
    |   |-- RFID
    |   |   |......
    |-- Scene2
    |   |......
    |-- Scene3
    |   |......
    |-- Scene4
    |   |......
    
  3. Textual Annotations of MMFi and XRF55

    Download the textual annotations of the MMFi and XRF55 datasets and store them in ./datasets/textual_annotations/mmfi/ and ./datasets/textual_annotations/xrf55/, respectively.
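
To check that the data is organized as expected, the hypothetical helper below walks the MMFi layout shown above and counts (environment, subject, action) folders. It is not part of the official data loader; the root path is the one used in this README.

# Hypothetical helper: count MMFi (environment, subject, action) folders
# following the layout above. Not part of the official codebase.
import os

def list_mmfi_samples(root="./datasets/MMFi"):
    samples = []
    for env in sorted(os.listdir(root)):              # E01 ... E04
        env_dir = os.path.join(root, env)
        if not os.path.isdir(env_dir):
            continue
        for subj in sorted(os.listdir(env_dir)):      # S01 ... S10
            subj_dir = os.path.join(env_dir, subj)
            if not os.path.isdir(subj_dir):
                continue
            for act in sorted(os.listdir(subj_dir)):  # A01 ... A27
                samples.append((env, subj, act))
    return samples

print(len(list_mmfi_samples()), "MMFi (environment, subject, action) folders found")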

Evaluation

  1. Evaluate HoloLLM on MMFi

    Take "Random" setting as an example. Change the settings in ./eval/holollm_eval_mmfi.py

    # line 400
    
    # cross_env, cross_sub, random
    exp_settings = "random"
    pretrained_path = "./checkpoints/holollm_mmfi_random.pth"
    llm_type = "holollm_random_mmfi"
    base_path = "model./eval/holollm_mmfi_random/"
    llama_ckpt_dir = "./LLM_ckpt/llama2-7B"
    modality_list = ["mmfi_video", "mmfi_depth", "mmfi_mmwave", "mmfi_lidar", "mmfi_wifi"]

    Then, run the following command in the terminal:

    python ./eval/holollm_eval_mmfi.py
  2. Evaluate HoloLLM on XRF55

    Take "Random" setting as an example. Change the settings in ./eval/holollm_eval_xrf55.py

    # line 285
    
    # cross_env, cross_sub, random
    exp_settings = "random"
    pretrained_path = "./checkpoints/holollm_xrf55_random.pth"
    llm_type = "holollm_random_xrf55"
    base_path = "./eval/holollm_xrf55_random/"
    llama_ckpt_dir = "./LLM_ckpt/llama2-7B"
    modality_list = ["xrf55_infra", "xrf55_wifi", "xrf55_rfid", "xrf55_depth", "xrf55_video"]

    Then, run the following command in the terminal:

    python ./eval/holollm_eval_xrf55.py
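
The same variables control the other splits and modality subsets. As a sketch, a cross-subject, WiFi-only evaluation on MMFi might look like the configuration below; the checkpoint and llm_type names are assumptions that follow the naming pattern above, so match them to your actual files.

    # Illustrative variation of the ./eval/holollm_eval_mmfi.py settings
    # (checkpoint and llm_type names are assumed; adjust to your downloads).
    exp_settings = "cross_sub"
    pretrained_path = "./checkpoints/holollm_mmfi_cross_sub.pth"
    llm_type = "holollm_cross_sub_mmfi"
    base_path = "./eval/holollm_mmfi_cross_sub/"
    llama_ckpt_dir = "./LLM_ckpt/llama2-7B"
    modality_list = ["mmfi_wifi"]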

Training

  1. Training HoloLLM on MMFi

    Take "Random" setting as an example, directly run the following command in the terminal.

    bash ./scripts_sh/holollm_mmfi_random.sh
  2. Training HoloLLM on XRF55

    Take "Random" setting as an example, directly run the following command in the terminal.

    bash ./scripts_sh/holollm_xrf55_random.sh

Citation

@article{zhou2025holollm,
  title={HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning},
  author={Zhou, Chuhao and Yang, Jianfei},
  journal={arXiv preprint arXiv:2505.17645},
  year={2025}
}

Acknowledgement

Llama2, OneLLM, Tokenpacker, Honeybee, ImageBind, LanguageBind, MM-Fi, XRF55.

License

This project is developed based on OneLLM and Llama 2; please refer to the LLAMA 2 Community License.

⚠️ Troubleshooting

  1. "FileNotFoundError: [Errno 2] No such file or directory: 'java'" when run evaluation.

    sudo apt install openjdk-11-jre-headless
