Skip to content
View PUITAR's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report PUITAR

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
PUITAR/README.md

Ming Yang

Hi there 👋 I'am Ming Yang. Welecome to my site~

I am a second-year Master at School of Data Science, Fudan University, advised by Prof. Weiguo Zheng. My research focuses on LLM related technologies, including

  • Retrieval-Augmented Generation (RAG) and Approximate Nearest Neighbor Search (ANNS);
  • LLM Application Algorithm, including RLVR, SFT, and multi-turn tool-integrated reasoning;
  • Agentic System, with an interest in turning foundation models into usable products.

I am a recipient of both the National Scholarship for Undergraduate and Graduate Students. With an expected graduation in July 2027, I am actively seeking opportunities to contribute to LLM systems, AI reasoning/search, and AI agent products.

News

  • [2026/02] One paper accepted by ACL 2026 arXiv
  • [2026/03] Our open-source multi-agent systems for presentation delivery is coming DeepSlide
  • [2025/09] New work about LLM RLVR arXiv
  • [2025/05] One paper accpeted by KDD 2025
  • [2024/09] One paper accepted by NeurIPS 2024

Selected Pulications

  • CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search.
    Ming Yang, Yuzheng Cai, and Weiguo Zheng.
    Thirty-eighth Annual Conference on Neural Information Processing Systems. NeurIPS 2024.
    [link] [pdf] [code] [slides] [poster]

  • Hi-PNG: Efficient Interval-Filtering ANNS via Hierarchical Interval Partition Navigating Graph.
    Ming Yang, Yuzheng Cai, and Weiguo Zheng.
    Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining. KDD 2025.
    [link] [code]

Experience

  • Ant Group — LLM Application Algorithm Intern (AI4Data, Tiansuan Lab · Jun 2025 – Jan 2026)

    • Solved a critical infrastructure issue in the VeRL + vLLM training stack under none-Nvidia GPU + FSDP settings by fixing tensor-view / protobuf synchronization problems, improving stability on internal hardware.
    • Led benchmark construction for related bussiness with AI search and proposed a business-oriented reinforcement strategy for AI search.
    • Produced multiple research outputs, including HAMMER, DiPO, and TSPO.
  • Lingjun Investment — LLM Quant Application Algorithm Intern (Dec 2024 – Apr 2025)

    • Built an end-to-end financial report analysis framework for long-document LLM applications
    • Designed multiple information factors for financial text understanding, including confidence expression, positional signals, information density, and structure-aware indicators
    • Closed the loop between data construction and model optimization; improved extraction success rate from 57.3% to 82.7% using rule-based data generation and LlamaFactory fine-tuning
  • CSDN — NLP Algorithm Intern (Mar 2024 – Jun 2024)

    • Contributed to (CSDN Tech AI Chatbot), an AI QA product for the developer ecosystem
    • Built data filtering and fine-tuning pipelines for technical QA using Qwen1.5-32B LoRA

Ongoing Research

  • HAMMER: Hamiltonian Curiosity Augmented Large Language Model Reinforcement (first author, under review)
  • DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off (co-author, under review)
  • TSPO: Breaking the Double Homogenization Dilemma in Multi-turn Search Policy Optimization (co-author, under review)
  • DeepSlide: From Artifacts to Presentation Delivery (first author, project lead, under review)

Pinned Loading

  1. CSPG CSPG Public

    The repo for the article: CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search

    Jupyter Notebook 12 1

  2. Hi-PNG Hi-PNG Public

    This is the official implementation of the paper ⌈ Hi-PNG: Efficient Interval-Filtering ANNS via Hierarchical Interval Partition Navigating Graph ⌋

    Jupyter Notebook 4

  3. harsha-simhadri/big-ann-benchmarks harsha-simhadri/big-ann-benchmarks Public

    Framework for evaluating ANNS algorithms on billion scale datasets.

    Jupyter Notebook 431 138

  4. DeepSlide DeepSlide Public

    Not just slides - delivery-ready talks

    Python 59 1