About Me

I am currently an algorithm engineer at Alibaba Cloud Intelligence, where my work spans three core areas: Multimodal LLM post-training, agentic reasoning, and efficient inference. I received my Master’s and Bachelor’s degrees from the School of Computer Science and Engineering, Sun Yat-sen University (SYSU). My research interests include reinforcement learning, multi-agent systems, and the broader goal of building more capable and efficient multimodal large language models.

🔥 News

  • [2026.05] 🎉 One paper submitted to ACL ARR 2026 May
  • [2026.05] 🎉 Three papers accepted to ICML 2026
  • [2026.04] One paper accepted by ACL 2026 Findings
  • [2025.08] One paper accepted by TOG (Proceedings of SIGGRAPH Asia 2025)
  • [2024.06] I joined Alibaba Cloud Intellegence as an algorithm engineer
  • [2023.05] I joined Alibaba Cloud Intellegence as an algorithm intern
  • [2023.05] One paper accepted by International Journal of Computer Vision (IJCV)
  • [2023.05] One paper accepted by TOG (Proceedings of SIGGRAPH 2023)
  • [2023.03] One paper accepted by Computer Graphics Forum

📄 Publications

IBTPO
Long Live The Balance: Information Bottleneck Driven Tree-based Policy Optimization
Jiang, H., Li, S., Bu, T., Xu, B., Liu, X., Chen, Q., Duan, H., Hu, L., Yang, B., Zhang, M.
ICML 2026 Main Conference
Paper | Code
RoTS
Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents
Bu, T.#, Liu, X.#, Chen, Q.#, Jiang, H., Li, S., Duan, H, Jiang, L., Hu, L., Yang, B., Zhang, M.
# Equal Contribution
ICML 2026 Main Conference (spotlight)
Paper | Code
D-CORE
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use
Xu, B.#, Wu, S.#, Jiang, H., Liu, K., Chen, X., Hu, L, Yang, B.
# Equal Contribution
ICML 2026 Main Conference
Paper | Code
MemTR
MemTR: Enhancing Tool-Calling Reliability via Uncertainty-Triggered FFN-Space Retracing
Duan, H., Jiang, L., Zhang, M., Zhu, X., Bu, T., Jiang, H., Wei, X., Hu, L.
ACL 2026 Findings
Paper | Code
Self-supervised Texture Filtering
Self-supervised Texture Filtering
Jiang, H., Zheng, R., Nie, Y., Xiao, C., Zheng, W., Zhang, Q.
ACM Transactions on Graphics (Proceedings of SIGGRAPH Asia 2025)
Paper | Code
Learning to Remove Shadows
Learning to Remove Shadows from a Single Image
Jiang, H., Zhang, Q., Nie, Y., Zhu, L, Zheng, W.
International Journal of Computer Vision
Paper | Code
Pyramid Texture Filtering
Pyramid Texture Filtering
Zhang, Q., Jiang, H., Nie, Y., Zheng, W.
ACM Transactions on Graphics (Proceedings of SIGGRAPH 2023)
First Student Author
Paper | Project | Code
Multi-Scale Deep Image Prior
Learning Multi-Scale Deep Image Prior for High-Quality Unsupervised Image Denoising
Jiang, H., Zhang, Q., Nie, Y., Zhu, L, Zheng, W.
Computer Graphics Forum
Paper | Code