zhshj0110/README.md

Hi! 👋

  • 🌱 I’m currently studying at the School of Artificial Intelligence, Beijing University of Posts and Telecommunications.
  • 🤔 My research interests include Human Activity Analysis, Human Motion Synthesis, and Multimodal Large Language Models.
  • 📫 Email me @ zhshj0110@gmail.com

Papers

  • [ICCV 2025] Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs [paper]
  • [PR 2025] A Generically Contrastive Spatiotemporal Representation Enhancement for 3D Skeleton Action Recognition [paper]
  • [ROBIO 2024] Temporal Text Prompts for Skeleton-based Action Recognition [paper]
  • [KBS 2024] MLP-AIR: An effective MLP-based module for actor interaction relation learning in group activity recognition [paper]
  • [TCSVT 2024] SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition [paper]
  • [PR 2024] Kinematics Modeling Network for Video-based Human Pose Estimation [paper]
  • [TIP 2022] Relation-Based Associative Joint Location for Human Pose Estimation in Videos [paper]

Pinned

  1. huggingface/transformers Public

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python · 156k stars · 32.1k forks

  2. EvolvingLMMs-Lab/lmms-eval Public

    One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

    Python · 3.7k stars · 516 forks

  3. SiT-MLP Public

    [TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition"

    Python · 19 stars · 1 fork

  4. CSRE Public

    [PR 2025] The official implementation of the paper "A Generically Contrastive Spatiotemporal Representation Enhancement for 3D Skeleton Action Recognition"

    Python · 7 stars · 2 forks

  5. xiaomi-research/q-frame Public

    [ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"

    Python · 69 stars · 3 forks

  6. xiaomi-research/btl-ui Public

    [NeurIPS 2025] Implementation of the paper "BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent"

    Python · 15 stars · 1 fork