Research
My research focus lies at the intersection of Computer Vision and Deep Learning.
In particular, I am interested in the following topics:
- Egocentric 3D Vision
- Human Pose Estimation
- Human Motion Generation (Diffusion Model)
- Human Body Reconstruction with Self-Supervised Personalization from Images and Speech (LLM)
- Synthetic-to-Real Domain Adaptation
|
|
Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation
Hiroyasu Akada, Jian Wang, Vladislav Golyanik, and Christian Theobalt
Under Review, 2025
[Project page]
[Paper]
[Code (coming soon)]
Egocentric 3D full-body tracking has been studied using cameras installed in front of a head-mounted device (HMD).
While frontal placement is the optimal and the only option for some tasks, such as hand tracking, it remains unclear if the same holds for full-body tracking.
Notably, even the state-of-the-art methods often fail to estimate accurate 3D poses in many scenarios, such as when HMD users tilt their heads upward---a common motion in human activities.
Hence, this paper investigates the usefulness of rear cameras in the HMD design for full-body tracking.
We show that simply adding rear views to the frontal inputs is not optimal for existing methods and
propose a new method that refines 2D joint heatmap estimation, thereby significantly improving 3D pose tracking (>10% on MPJPE).
Furthermore, we introduce two new large-scale datasets, Ego4View-Syn and Ego4View-RW, for a rear-view evaluation.
|
|
EventEgo3D++: 3D Human Motion Capture from a Head Mounted Event Camera
Christen Millerdurai, Hiroyasu Akada, Jian Wang, Diogo Luvizon, Christian Theobalt, Vladislav
Golyanik
Under Review, 2025
[Project page]
[Paper]
[Code (coming soon)]
This paper is an extension of our previous work "EventEgo3D (CVPR 2024)" (See below).
We tackle a new problem, i.e. 3D human motion capture from an egocentric monocular event camera with
a
fisheye lens.
"EventEgo3D (CVPR 2024)" proposed the EE3D framework that is specifically tailored for learning with
event streams in the LNES
representation, enabling high 3D reconstruction accuracy.
We upgrade this framework to a new EE3D++ framework for further performance improvement.
We also introduce a new dataset, EE3D-W, in addition to EE3D-S and EE3D-R from "EventEgo3D (CVPR
2024)".
|
|
3D Human Pose Perception from Egocentric Stereo Videos
Hiroyasu Akada, Jian Wang, Vladislav Golyanik, and Christian Theobalt
Computer Vision and Pattern Recognition (CVPR), 2024, Highlight (top
3.5%)
[Project page]
[Benchmark Challenge]
[Paper]
[Code]
In this work, we propose a new transformer-based framework to improve egocentric stereo 3D human
pose
estimation, which leverages the scene information and temporal context of egocentric stereo videos.
Furthermore, we introduce two new benchmark datasets, i.e., UnrealEgo2 and UnrealEgo-RW (RealWorld).
Our extensive experiments show that the proposed approach significantly outperforms previous
methods.
|
|
EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams
Christen Millerdurai, Hiroyasu Akada, Jian Wang, Diogo Luvizon, Christian Theobalt, Vladislav
Golyanik
Computer Vision and Pattern Recognition (CVPR), 2024
[Project page]
[Paper]
[Code]
We tackle a new problem, i.e. 3D human motion capture from an egocentric monocular event camera with
a
fisheye lens.
Event streams have high temporal resolution and could provide reliable cues for 3D human motion
capture under high-speed human motions and rapidly changing illumination.
We leverage these characteristics and propose the first approach for event-based 3D human pose
estimation, EventEgo3D (EE3D).
The proposed EE3D framework is specifically tailored for learning with event streams in the LNES
representation, enabling high 3D reconstruction accuracy.
We also provide two new datasets, EE3D-S and EE3D-R.
|
|
UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture
Hiroyasu Akada, Jian Wang, Soshi Shimada, Masaki Takahashi, Christian Theobalt, and Vladislav
Golyanik
European Conference on Computer Vision (ECCV), 2022
[Project page]
[Paper]
[Code]
We present UnrealEgo, i.e. a new large-scale naturalistic dataset for egocentric 3D human
pose
estimation.
UnrealEgo is based on an advanced concept of eyeglasses equipped with two fisheye cameras that can
be
used in unconstrained environments.
UnrealEgo is the first dataset to provide in-the-wild stereo images with the largest variety of
motions among existing egocentric datasets.
We also propose a new benchmark method that achieves the state-of-the-art results on UnrealEgo.
|
|
Self-Supervised Learning of Domain Invariant Features for Depth Estimation
Hiroyasu Akada, Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022
[Paper]
[Code]
We tackle the problem of unsupervised synthetic-to-real domain adaptation for single image depth
estimation.
An essential building block of single image depth estimation is an encoder-decoder task network that
takes RGB images as input and produces depth maps as output.
In this paper, we propose a novel training strategy to force the task network to learn domain
invariant
representations in a self-supervised manner.
|
|
Dynamic Object Removal from Unpaired Images for Agricultural Autonomous Robots
Hiroyasu Akada and Masaki Takahashi
International Conference on Intelligent Autonomous Systems (IAS), 2021
[Paper]
We developed a GAN-based stem that remove dynamic objects in images. The system can be trained without using
paired images with/without the dynamic objects.
|
Education & Experience
[Sep 2022 - Present] Ph.D. student, Max Planck
Institute for Informatics (Saarbrucken, Germany)
[July 2021 - Aug 2022] Visiting researcher, Max Planck
Institute for Informatics (Saarbrucken, Germany)
[Apr 2018 - Aug 2022] Master student, Keio University
(Kanagawa, Japan)
[Sep 2020 - June 2021] Internship, KAUST (Saudi
Arabia, remote)
[July 2019 - Aug 2019] Internship, Tencent (Palo
Alto, CA, USA)
[Aug 2018 - May 2019] Student, University of California,
Berkeley
(Berkeley, CA, USA)
[Apr 2014 - Mar 2018] Bachelor student, Keio University
(Kanagawa, Japan)
|
Awards & Grants
-
Nakajima Foundation (中島記念国際交流財団), 2022-2027
Scholarship for my PhD study for 5 years, including tuition fees, living expenses, stipends, etc.
-
CREST, Japan Science and Technology Agency, 2021-2022
Financial support for my research outside of Japan for 1 year.
-
TOBITATE, Ministry of Education, Culture, Sports, Science and Technology, Japan 2018-2019
Scholarship for my study outside of Japan for 1 year.
|
Professional Services & Activities
Lab Visits:
Conference Participation:
CVPR 2024 (Seattle, USA), ECCV 2022 (Tel Aviv, Israel), WACV 2022 (Hawaii, USA), IAS (Singapore,
online)
|
|