Research
I am interested in Video Understanding and Behavioral Analysis with multi-modalities. While tons of skill-related videos are available online, how to effectively utilize them to assess human skills, generating support or feedback from AI agents, and support skill learning remains underexplored. My current research focuses on bridging knowledge from skilled instructional videos to skill learning applications.
|
|
SportSkills: Physical Skill Learning from Sports Instructional Videos
Kumar Ashutosh, Wu Chi hsuan, Kristen Grauman
arXiv 2026
project page
|
|
SkillSight: Efficient First-Person Skill Assessment with Gaze
Wu Chi hsuan, Kumar Ashutosh, Kristen Grauman
CVPR 2026
project page
|
|
Stitch-a-Demo: Video Demonstrations from Multistep Descriptions
Wu Chi hsuan*, Kumar Ashutosh*, Kristen Grauman
CVPR 2026
project page
|
|
CMOSE: Comprehensive Multi-Modality Online Student Engagement Dataset with High-Quality Labels
Wu Chi hsuan, Liu Shih Yang, Huang Xijie, Prof. Tim Kwang-Ting Cheng
CVPRW 2024
project page
|
|
It's All Relative: Interpretable Models for Scoring Bias in Documents
Aswin Suresh, Wu Chi hsuan, Prof. Matthias Grossglauser
EACL 2024
|
|
Day-Night-Transfomation-for-improving-feature-matching
Computer Vision Project
[Github]
We transformed illumination of day-night image pairs using CycleGAN to improve feature matching.
|
|
Super-Resolution on Computer Texts
Undergraduate Research Opportunity Program, Supervised by Prof. Qifeng Chen
[Github]
We designed two-stream model to simultaneuously improve text boundary clarity and colors on text images.
|
|