Skip to content

Add quantization config option#433

Merged
loadams merged 6 commits intomainfrom
features/quant-fp6
Mar 8, 2024
Merged

Add quantization config option#433
loadams merged 6 commits intomainfrom
features/quant-fp6

Conversation

@mrwyattii
Copy link
Contributor

@mrwyattii mrwyattii commented Mar 6, 2024

Required changes to DS: deepspeedai/DeepSpeed#5234

loadams added a commit to deepspeedai/DeepSpeed that referenced this pull request Mar 8, 2024
The user interface: deepspeedai/DeepSpeed-MII#433
nv-a6000 ci running against the MII branch linked above is
[here](https://github.com/microsoft/DeepSpeed/actions/runs/8192124606)

Co-authored-by: Zhen Zheng
[[email protected]](mailto:[email protected])
Co-authored-by: Shiyang Chen [[email protected]](mailto:[email protected])
Co-authored-by: Arash Bakhtiari
[[email protected]](mailto:[email protected])
Co-authored-by: Haojun Xia
[[email protected]](mailto:[email protected])

---------

Co-authored-by: ZHENG, Zhen <[email protected]>
Co-authored-by: Shiyang Chen <[email protected]>
Co-authored-by: Haojun Xia <[email protected]>
Co-authored-by: Arash Bakhtiari <[email protected]>
Co-authored-by: Michael Wyatt <[email protected]>
Co-authored-by: Michael Wyatt <[email protected]>
@loadams loadams merged commit 429bc5c into main Mar 8, 2024
ShellyNR pushed a commit to ShellyNR/DeepSpeed that referenced this pull request Mar 11, 2024
The user interface: deepspeedai/DeepSpeed-MII#433
nv-a6000 ci running against the MII branch linked above is
[here](https://github.com/microsoft/DeepSpeed/actions/runs/8192124606)

Co-authored-by: Zhen Zheng
[[email protected]](mailto:[email protected])
Co-authored-by: Shiyang Chen [[email protected]](mailto:[email protected])
Co-authored-by: Arash Bakhtiari
[[email protected]](mailto:[email protected])
Co-authored-by: Haojun Xia
[[email protected]](mailto:[email protected])

---------

Co-authored-by: ZHENG, Zhen <[email protected]>
Co-authored-by: Shiyang Chen <[email protected]>
Co-authored-by: Haojun Xia <[email protected]>
Co-authored-by: Arash Bakhtiari <[email protected]>
Co-authored-by: Michael Wyatt <[email protected]>
Co-authored-by: Michael Wyatt <[email protected]>
rraminen pushed a commit to ROCm/DeepSpeed that referenced this pull request May 9, 2024
The user interface: deepspeedai/DeepSpeed-MII#433
nv-a6000 ci running against the MII branch linked above is
[here](https://github.com/microsoft/DeepSpeed/actions/runs/8192124606)

Co-authored-by: Zhen Zheng
[[email protected]](mailto:[email protected])
Co-authored-by: Shiyang Chen [[email protected]](mailto:[email protected])
Co-authored-by: Arash Bakhtiari
[[email protected]](mailto:[email protected])
Co-authored-by: Haojun Xia
[[email protected]](mailto:[email protected])

---------

Co-authored-by: ZHENG, Zhen <[email protected]>
Co-authored-by: Shiyang Chen <[email protected]>
Co-authored-by: Haojun Xia <[email protected]>
Co-authored-by: Arash Bakhtiari <[email protected]>
Co-authored-by: Michael Wyatt <[email protected]>
Co-authored-by: Michael Wyatt <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants