publications

Selected publications and preprints.

2026

  1. WebSTEP
    Where Did It Go Wrong? Process-Level Evaluation of Web Agents with Semantic State Tracking
    Preprint, under review, 2026

    Developed a controlled benchmark for diagnosing process-level failures in web agents, enabling fine-grained analysis of exploration, execution, and decision-making behaviors.

  2. 3D-PAQA
    Towards Preference-Aligned 3D Quality Assessment
    JiHyuk Byun and Seon Joo Kim
    Workshop on Image Processing and Image Understanding (IPIU), 2026

    Preference-aligned 3D quality assessment for scalable, human-aligned evaluation and curation of 3D assets.