publications

publications by category in reverse chronological order. generated by jekyll-scholar.

2026

  1. CVPR 2026
    Latent Implicit Visual Reasoning
    CVPR, 2026

2024

  1. EMNLP 2024
    TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering
    EMNLP, 2024

2023

  1. CVEU @ ICCV 2023
    LUSE: Using LLMs for Unsupervised Step Extraction in Instructional Videos
    ICCV CVEU Workshop, 2023