Ming Yang PUITAR

Ming Yang

Hi there 👋 I'am Ming Yang. Welecome to my site~

I am a second-year Master at School of Data Science, Fudan University, advised by Prof. Weiguo Zheng. My research focuses on LLM related technologies, including

Retrieval-Augmented Generation (RAG) and Approximate Nearest Neighbor Search (ANNS);
LLM Application Algorithm, including RLVR, SFT, and multi-turn tool-integrated reasoning;
Agentic System, with an interest in turning foundation models into usable products.

I am a recipient of both the National Scholarship for Undergraduate and Graduate Students. With an expected graduation in July 2027, I am actively seeking opportunities to contribute to LLM systems, AI reasoning/search, and AI agent products.

News

[2026/02] One paper accepted by ACL 2026 arXiv
[2026/03] Our open-source multi-agent systems for presentation delivery is coming DeepSlide
[2025/09] New work about LLM RLVR arXiv
[2025/05] One paper accpeted by KDD 2025
[2024/09] One paper accepted by NeurIPS 2024

Selected Pulications

CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search.
Ming Yang, Yuzheng Cai, and Weiguo Zheng.
Thirty-eighth Annual Conference on Neural Information Processing Systems. NeurIPS 2024.
[link] [pdf] [code] [slides] [poster]
Hi-PNG: Efficient Interval-Filtering ANNS via Hierarchical Interval Partition Navigating Graph.
Ming Yang, Yuzheng Cai, and Weiguo Zheng.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining. KDD 2025.
[link] [code]

Experience

Ant Group — LLM Application Algorithm Intern (AI4Data, Tiansuan Lab · Jun 2025 – Jan 2026)
- Solved a critical infrastructure issue in the VeRL + vLLM training stack under none-Nvidia GPU + FSDP settings by fixing tensor-view / protobuf synchronization problems, improving stability on internal hardware.
- Led benchmark construction for related bussiness with AI search and proposed a business-oriented reinforcement strategy for AI search.
- Produced multiple research outputs, including HAMMER, DiPO, and TSPO.
Lingjun Investment — LLM Quant Application Algorithm Intern (Dec 2024 – Apr 2025)
- Built an end-to-end financial report analysis framework for long-document LLM applications
- Designed multiple information factors for financial text understanding, including confidence expression, positional signals, information density, and structure-aware indicators
- Closed the loop between data construction and model optimization; improved extraction success rate from 57.3% to 82.7% using rule-based data generation and LlamaFactory fine-tuning
CSDN — NLP Algorithm Intern (Mar 2024 – Jun 2024)
- Contributed to (CSDN Tech AI Chatbot), an AI QA product for the developer ecosystem
- Built data filtering and fine-tuning pipelines for technical QA using Qwen1.5-32B LoRA

Ongoing Research

HAMMER: Hamiltonian Curiosity Augmented Large Language Model Reinforcement (first author, under review)
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off (co-author, under review)
TSPO: Breaking the Double Homogenization Dilemma in Multi-turn Search Policy Optimization (co-author, under review)
DeepSlide: From Artifacts to Presentation Delivery (first author, project lead, under review)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ming Yang PUITAR

Achievements

Achievements

Block or report PUITAR

Ming Yang

News

Selected Pulications

Experience

Ongoing Research

Pinned Loading

Uh oh!