
Revert "[AMD] feat: Add mi355x distributed inference test CI workflow" #408

Merged: cquil11 merged 1 commit into main from revert-401-feat/mi355x-disagg on Jan 8, 2026



@cquil11 commented Jan 8, 2026

Reverts InferenceMAX/InferenceMAX#401


Note

Reverts the MI355x disaggregated/multinode setup and associated CI artifacts.

  • Remove dsr1-fp8-mi355x-sglang-disagg block from amd-master.yaml and delete benchmarks/dsr1_fp8_mi355x_sglang-disagg_slurm.sh
  • Prune related changelog entry in perf-changelog.yaml
  • Simplify runners/launch_mi355x-amd.sh to single-node container launch; drop SLURM multinode orchestration, log collection, and cleanup logic

Other configs (e.g., gptoss-fp4-mi355x-vllm) remain unchanged aside from surrounding context.

Written by Cursor Bugbot for commit 091f445.

@gemini-code-assist

Summary of Changes

Hello @cquil11, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request undoes the integration of the previously added CI workflow for AMD mi355x distributed inference tests, removing the associated configurations, benchmark script, and multi-node execution logic.

Highlights

  • Revert of AMD mi355x Distributed Inference CI Workflow: This pull request fully reverts the changes introduced by InferenceMAX/InferenceMAX#401, which had added a CI workflow for AMD mi355x distributed inference tests.
  • Configuration Removal: The dsr1-fp8-mi355x-sglang-disagg configuration, including its various speculative decoding and multi-node settings, has been removed from the .github/configs/amd-master.yaml file.
  • Benchmark Script Deletion: The dedicated benchmark script benchmarks/dsr1_fp8_mi355x_sglang-disagg_slurm.sh, responsible for setting up and launching multi-node SGLang distributed inference benchmarks, has been removed.
  • Runner Script Simplification: The runners/launch_mi355x-amd.sh script has been significantly simplified by removing all logic related to multi-node benchmark execution, including the scancel_sync function and the conditional block for IS_MULTINODE.
  • Changelog Update: The entry in perf-changelog.yaml describing the addition of 'PD disaggregation (1P2D) for Mi355X' has been removed.


@gemini-code-assist bot left a comment


Code Review

This pull request reverts a previous commit that added the mi355x distributed inference test CI workflow. The changes correctly remove the test configuration from .github/configs/amd-master.yaml, the associated benchmark script, the multi-node logic from the launch_mi355x-amd.sh runner, and the corresponding entry from perf-changelog.yaml. The revert appears to be complete and clean. As this is a revert consisting entirely of removals, I have no specific feedback on the code changes.

@cquil11 merged commit a00d72a into main on Jan 8, 2026
1 check passed
@cquil11 deleted the revert-401-feat/mi355x-disagg branch on January 8, 2026 at 21:27