Tech

AI technology reconstructs 3D hand-object interactions from video, even when elements are obscured

Share
Share
More realistic surgical simulations—AI innovation for reconstructing 3D hand-object interactions
Results of reconstructing hand-object interactions from various viewpoints using the ‘BIGS’ method. Credit: arXiv (2025). DOI: 10.48550/arxiv.2504.09097

Researchers at UNIST have developed an AI technology capable of reconstructing three-dimensional (3D) representations of unfamiliar objects manipulated with both hands, as well as simulated surgical scenes involving intertwined hands and medical instruments. This advancement enables highly accurate augmented reality (AR) visualizations, further enhancing real-time interaction capabilities.

Led by Professor Seungryul Baek of the UNIST Graduate School of Artificial Intelligence, the team introduced Bimanual Interaction 3D Gaussian Splatting (BIGS), an innovative AI model that can visualize complex interactions between hands and objects in 3D using only a single RGB video input.

This technology allows for the real-time reconstruction of intricate hand-object dynamics, even when the objects are unfamiliar or partially obscured. The study is published on the arXiv preprint server.

Traditional approaches in this domain have been limited to recognizing only one hand at a time or responding solely to pre-scanned objects, restricting their applicability in realistic AR and VR environments.

By contrast, BIGS can reliably predict full object and hand shapes, even in scenarios where parts are hidden or occluded, and can do so without the need for depth sensors or multiple cameras—relying solely on a single RGB camera.

The core of this AI model is based on 3D Gaussian Splatting, a technique that represents object shapes as a cloud of points with smooth Gaussian distributions.

Unlike point cloud methods that produce sharp boundaries, Gaussian Splatting enables natural reconstruction of contact surfaces and complex interactions.

The model further addresses occlusion challenges by aligning multiple hand instances to a canonical Gaussian structure and employs a pre-trained diffusion model for score distillation sampling (SDS), allowing it to accurately reconstruct unseen surfaces, including the backs of objects.

Extensive experiments utilizing international datasets such as ARCTIC and HO3Dv3 demonstrated that BIGS outperforms existing technologies in accurately capturing hand postures, object shapes, contact interactions, and rendering quality. These capabilities hold significant promise for applications in virtual and augmented reality, robotic control, and remote surgical simulations.

This research was conducted with contributions from first author Jeongwan On, along with Kyeonghwan Gwak, Gunyoung Kang, Junuk Cha, Soohyun Hwang, and Hyein Hwang.

Professor Baek remarked, “This advancement is expected to facilitate real-time interaction reconstruction in various fields, including VR, AR, robotic control, and remote surgical training.”

More information:
Jeongwan On et al, BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting, arXiv (2025). DOI: 10.48550/arxiv.2504.09097

Journal information:
arXiv


Provided by
Ulsan National Institute of Science and Technology


Citation:
AI technology reconstructs 3D hand-object interactions from video, even when elements are obscured (2025, June 13)
retrieved 13 June 2025
from

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
OpenAI has upgraded ChatGPT’s Projects feature, and I find it makes working way more efficient
Tech

OpenAI has upgraded ChatGPT’s Projects feature, and I find it makes working way more efficient

OpenAI has upgraded ChatGPT’s Projects feature to remember past chats, tone preferences,...

Gigabyte mixes messages with new AI PC that’s also called a gaming desktop
Tech

Gigabyte mixes messages with new AI PC that’s also called a gaming desktop

Gigabyte’s AI TOP 500 TRX50 is a desktop built for AI developers...

Old solar gadgets, rogue modules, and risky firmware could be the cyber timebomb inside your solar system
Tech

Old solar gadgets, rogue modules, and risky firmware could be the cyber timebomb inside your solar system

Forescout report finds many vulnerable solar devices run outdated firmware with known...

AMD goes all-in on AI with calculated acquisitions, but Nvidia will remain king for the foreseeable future
Tech

AMD goes all-in on AI with calculated acquisitions, but Nvidia will remain king for the foreseeable future

AMD is aggressively acquiring talent to bridge the Instinct and Blackwell GPU...