publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. Preprint 2025
    livr.jpg
    Latent Implicit Visual Reasoning
    Preprint, 2025

2024

  1. EMNLP 2024
    traveler.png
    TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering
    EMNLP, 2024

2023

  1. CVEU @ ICCV 2023
    LUSE.jpg
    LUSE: Using LLMs for Unsupervised Step Extraction in Instructional Videos
    ICCV CVEU Workshop, 2023