write dequantization scripts for DeepSeek V4 FP4/FP8 weights #3873

Open
snehalv2002 wants to merge 1 commit into main from snehalv-dsv4-dequantize

Conversation

@snehalv2002 snehalv2002 (Collaborator) commented May 11, 2026

Description

Adds scripts for dequantizing the DeepSeek V4 weights to bf16. DeepSeek ships the MoE weights as INT8 (each byte actually packs two FP4 values, since torch has no FP4 dtype) and the attention weights as F8_E4M3; all scaling factors are F8_E8M0. We heavily reference the Hugging Face dequantization script here.
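For reference, here is a minimal sketch of how two FP4 (E2M1) values packed into one byte can be unpacked via a lookup table. The nibble order and the E2M1 value table are assumptions about the packing, not necessarily what this script does:

```python
import torch

# FP4 code -> value lookup table (assumed standard E2M1 layout).
_FP4_E2M1 = torch.tensor(
    [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0,
     -0.0, -0.5, -1.0, -1.5, -2.0, -3.0, -4.0, -6.0],
    dtype=torch.bfloat16,
)

def unpack_fp4(packed: torch.Tensor) -> torch.Tensor:
  """Unpack two FP4 values per byte; assumes the low nibble comes first."""
  b = packed.view(torch.uint8)
  lo = b & 0x0F
  hi = (b >> 4) & 0x0F
  # Interleave low/high nibbles so the last dimension doubles in size.
  codes = torch.stack([lo, hi], dim=-1).reshape(*b.shape[:-1], -1)
  return _FP4_E2M1[codes.long()]
```

Since E8M0 scales are pure powers of two, the dequantized weight is then just this table lookup multiplied by the upcast per-block scale.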

FIXES: b/510020740

Tests

The test script runs inference on DeepSeek V4 Flash through the transformers library, loading weights from both the original DeepSeek checkpoints and the checkpoints dequantized by our script, then compares KL divergence, token outputs, etc. Test results can be seen here.
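For illustration, the KL-divergence part of such a comparison could be computed from the two models' next-token logits roughly like this (a sketch, not the actual test script linked above):

```python
import torch
import torch.nn.functional as F

def logits_kl_div(ref_logits: torch.Tensor, test_logits: torch.Tensor) -> float:
  """Mean KL(ref || test) between next-token distributions, shape (T, V)."""
  ref_logp = F.log_softmax(ref_logits.float(), dim=-1)
  test_logp = F.log_softmax(test_logits.float(), dim=-1)
  # input = log-probs of the model under test, target = reference log-probs.
  return F.kl_div(test_logp, ref_logp, log_target=True, reduction="batchmean").item()
```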

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

@codecov codecov bot commented May 11, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.


@snehalv2002 snehalv2002 force-pushed the snehalv-dsv4-dequantize branch 2 times, most recently from 9647437 to b8b85f2 on May 12, 2026 18:10
@snehalv2002 snehalv2002 force-pushed the snehalv-dsv4-dequantize branch from b8b85f2 to 8f8ce91 on May 12, 2026 22:44
Comment on lines +183 to +186
parser.add_argument("--input-path", "--input-fp8-hf-path", type=str, required=True,
help="Path to DeepSeek FP8/FP4 Hugging Face folder")
parser.add_argument("--output-path", "--output-bf16-hf-path", type=str, required=True,
help="Directory to save output BF16 weights")
Collaborator

nit: We can probably add a check that the input and output paths are not the same (to prevent overwriting the input).
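For example, a guard along these lines (a hypothetical sketch; arg names follow the parser above):

```python
import os

args = parser.parse_args()
# Resolve symlinks/relative paths before comparing.
if os.path.realpath(args.input_path) == os.path.realpath(args.output_path):
  parser.error("--input-path and --output-path must differ to avoid overwriting the input checkpoint")
```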



def weight_dequant_cpu(x: torch.Tensor, s: torch.Tensor, block_size: int = 128) -> torch.Tensor:
@parambole parambole (Collaborator) commented May 13, 2026

Rename it to dequantize_fp8?

Comment on lines +34 to +35
for i in range(0, M, block_size):
for j in range(0, N, block_size):
Collaborator

nit: Can we vectorize this operation?
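One possible vectorized form, assuming s holds one scale per block_size x block_size tile (a sketch, not the PR's implementation):

```python
def weight_dequant_cpu(x: torch.Tensor, s: torch.Tensor, block_size: int = 128) -> torch.Tensor:
  """Block-wise dequantization without the Python double loop."""
  M, N = x.shape
  # Expand each per-block scale to cover its block, trimming ragged edge blocks.
  scale = s.float().repeat_interleave(block_size, dim=0)[:M]
  scale = scale.repeat_interleave(block_size, dim=1)[:, :N]
  return (x.float() * scale).to(torch.bfloat16)
```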

@parambole parambole (Collaborator) left a comment

Thank you for adding this script. I have left a few comments, PTAL.


Labels

None yet

Projects

None yet


2 participants