publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. CVPR 2025
    cover_metavqa.gif
    Embodied Scene Understanding for Vision Language Models via MetaVQA
    Weizhen Wang, Chenda Duan, Zhenghao Peng, and 2 more authors
    In Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
  2. arXiv Preprint
    stork_teaser.png
    STORK: Improving the Fidelity of Mid-NFE Sampling for Diffusion and Flow Matching Models
    Zheng Tan, Weizhen Wang, Andrea L. Bertozzi, and 1 more author
    2025
  3. arXiv Preprint
    cover_dreamland.gif
    Dreamland: Controllable World Creation with Simulator and Generative Models
    Sicheng Mo, Ziyang Leng, Leon Liu, and 3 more authors
    2025