I am currently an algorithm engineer at Alibaba Cloud Intelligence, where my work spans three core areas: Multimodal LLM post-training, agentic reasoning, and efficient inference. I received my Master’s and Bachelor’s degrees from the School of Computer Science and Engineering, Sun Yat-sen University (SYSU). My research interests include reinforcement learning, multi-agent systems, and the broader goal of building more capable and efficient multimodal large language models.
🔥 News
- [2026.05] 🎉 One paper submitted to ACL ARR 2026 May
- [2026.05] 🎉 Three papers accepted to ICML 2026
- [2026.04] One paper accepted by ACL 2026 Findings
- [2025.08] One paper accepted by TOG (Proceedings of SIGGRAPH Asia 2025)
- [2024.06] I joined Alibaba Cloud Intellegence as an algorithm engineer
- [2023.05] I joined Alibaba Cloud Intellegence as an algorithm intern
- [2023.05] One paper accepted by International Journal of Computer Vision (IJCV)
- [2023.05] One paper accepted by TOG (Proceedings of SIGGRAPH 2023)
- [2023.03] One paper accepted by Computer Graphics Forum
📄 Publications
Long Live The Balance: Information Bottleneck Driven Tree-based Policy Optimization Jiang, H., Li, S., Bu, T., Xu, B., Liu, X., Chen, Q., Duan, H., Hu, L., Yang, B., Zhang, M. ICML 2026 Main Conference Paper |
Code Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents Bu, T.#, Liu, X.#, Chen, Q.#, Jiang, H., Li, S., Duan, H, Jiang, L., Hu, L., Yang, B., Zhang, M. # Equal Contribution ICML 2026 Main Conference (spotlight) Paper |
Code D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use Xu, B.#, Wu, S.#, Jiang, H., Liu, K., Chen, X., Hu, L, Yang, B. # Equal Contribution ICML 2026 Main Conference Paper |
Code MemTR: Enhancing Tool-Calling Reliability via Uncertainty-Triggered FFN-Space Retracing Duan, H., Jiang, L., Zhang, M., Zhu, X., Bu, T., Jiang, H., Wei, X., Hu, L. ACL 2026 Findings Paper |
Code Self-supervised Texture Filtering Jiang, H., Zheng, R., Nie, Y., Xiao, C., Zheng, W., Zhang, Q. ACM Transactions on Graphics (Proceedings of SIGGRAPH Asia 2025) Paper |
Code Learning to Remove Shadows from a Single Image Jiang, H., Zhang, Q., Nie, Y., Zhu, L, Zheng, W. International Journal of Computer Vision Paper |
Code Pyramid Texture Filtering Zhang, Q., Jiang, H.♣, Nie, Y., Zheng, W. ACM Transactions on Graphics (Proceedings of SIGGRAPH 2023) ♣ First Student Author Paper |
Project |
Code Learning Multi-Scale Deep Image Prior for High-Quality Unsupervised Image Denoising Jiang, H., Zhang, Q., Nie, Y., Zhu, L, Zheng, W. Computer Graphics Forum Paper |
Code