Preferred path
Export short mesh sequences as GLB/PLY and render them with Three.js or model-viewer.
1UNIST Vision and Learning Lab, UNIST 2University of Birmingham 3CJ Corporation
ECCV 2026 (accepted)
TLDR; Multi-THuMBS tackles multi-person 3D human mesh tracking in real-world videos with frequent shot changes. Existing methods often lose identity consistency when the camera view changes abruptly, especially when multiple people interact, overlap, or leave the frame. Our approach reconstructs boundary frames in a shared 3D space, registers human meshes across shots, and preserves per-person identity and motion consistency for temporally coherent 3D trajectories.
Given a video split by a shot boundary, Multi-THuMBS first estimates human meshes and camera poses for all frames. It then builds a (A) shared 3D space from the boundary frames, (B) aligns meshes across shots, (C) propagates camera and mesh trajectories, (D) links identities using spatial, pose, and appearance cues, and applies (E) temporal smoothing.
Export short mesh sequences as GLB/PLY and render them with Three.js or model-viewer.
Keep demo assets small enough for GitHub Pages; use decimated meshes and compressed textures if needed.
Use an MP4/GIF preview first, then upgrade to interactive playback after asset size and controls are stable.
Coming soon!
This work is supported by NRF grants (No. RS-2025-00521013 20%, No. RS-2025-02216916 10%) and IITP grants (No. RS-2020-II201336 Artificial intelligence graduate school program(UNIST) 10%; No. RS-2025-25442824 AI Star Fellowship Program(UNIST) 10%), funded by the Korean government (MSIT). This work is supported by CJ Corporation 50%.