Skip to content

Conversation

@MatzeB
Copy link
Contributor

@MatzeB MatzeB commented Dec 17, 2025

Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/2236

EmbeddingSpMDMNBitBenchmark uses the scale_bias_last == true && has_weight == true variant which wasn't previously specialized in the code because we did not see it in the bigger test suites. However it does not cost too much code size to add it and makes this benchmark look better... Also added block_size==576, block_size==36 and block_size==72 specialization seen in some other models.

This adds ~50K in code size (compared to currently ~150K for the existing specializations) which seems acceptable.

Differential Revision: D87289832

Summary:
X-link: facebookresearch/FBGEMM#2236

`EmbeddingSpMDMNBitBenchmark` uses the `scale_bias_last == true && has_weight == true` variant which wasn't previously specialized in the code because we did not see it in the bigger test suites. However it does not cost too much code size to add it and makes this benchmark look better... Also added block_size==576, block_size==36 and block_size==72 specialization seen in some other models.

This adds ~50K in code size (compared to currently ~150K for the existing specializations) which seems acceptable.

Differential Revision: D87289832
@meta-cla meta-cla bot added the cla signed label Dec 17, 2025
@meta-codesync
Copy link
Contributor

meta-codesync bot commented Dec 17, 2025

@MatzeB has exported this pull request. If you are a Meta employee, you can view the originating Diff in D87289832.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant