MegaMOE adaptation for SM90#24
Conversation
|
Hi @qiushixiaoyu, can you please rebase your PR upon |
|
Hi @qiushixiaoyu do you have any performance report for the mega MOE sm90 implementaion ? I also noticed that some modification on sm100. May I know which kind machine you are testing ? |
I test on H20. <title></title> <style type="text/css"> p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px 'Helvetica Neue'} p.p2 {margin: 0.0px 0.0px 0.0px 0.0px; text-align: right; font: 13.0px 'Helvetica Neue'} table.t1 {border-collapse: collapse} td.td1 {border-style: solid; border-width: 1.0px 1.0px 1.0px 1.0px; border-color: #9a9a9a #9a9a9a #9a9a9a #9a9a9a; padding: 1.0px 5.0px 1.0px 5.0px} </style>export PYTHONPATH=/workspace/DeepGEMM:/workspace/DeepEP:${PYTHONPATH:-}
|
Done |
|
Moved to #36 |
Add mega moe support for sm90.
Use the following command to test:
python tests/test_mega_moe_sm90.py --layers 1 2 3 --num-processes 8 --fail-fast
python tests/test_mega_moe_sm90.py --layers 4 --num-processes 8 --fail-fast
python tests/test_mega_moe_sm90.py --layers 5 --num-correctness-tests 16 --num-processes 8
Co-authored with AI