publications

publications by category in reverse chronological order. generated by jekyll-scholar.

2025

  1. LoHR-Bench: A Dual-Level Benchmark for Extended Long-Horizon Robot Manipulation
    Haoran Zhang, Zhiwei Xue, Jinhang Qiu, and 7 more authors
    2025
    ICRA 2026, Under Review
  2. RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation
    Boyang Wang, Haoran Zhang, Shujie Zhang, and 8 more authors
    2025
    CVPR 2026, Under Review
  3. StarBench: A Turn-Based RPG Benchmark for Agentic Multimodal Decision-Making and Information Seeking
    Haoran Zhang, Chenhao Zhu, Sicong Guo, and 3 more authors
    2025
    AAMAS 2026, Under Review

2024

  1. IaC-Eval: A Code Generation Benchmark for Cloud Infrastructure-as-Code Programs
    Patrick Tser Jern Kon, Jiachen Liu, Yiming Qiu, and 11 more authors
    2024
    NeurIPS 2024 (Datasets and Benchmarks Track)