@terdogan
Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/2245

The benchmarks called std::iota() and std::shuffle() on a 4M-element vector on every batch iteration, which added up to ~80M operations per setup. Using std::sample() instead reduces the work to O(k) per batch, where k is the number of indices needed (~100-200), significantly reducing benchmark setup overhead.
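
To make the before/after concrete, here is a minimal sketch of the two index-selection patterns described above. It is not taken from the FBGEMM benchmark sources: the helper names (`make_population`, `pick_by_shuffle`, `pick_by_sample`), the `std::int64_t` index type, and the `std::mt19937` generator are all assumptions chosen for illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <iterator>
#include <numeric>
#include <random>
#include <vector>

// Hypothetical helper: build the index population 0..n-1 once, up front.
std::vector<std::int64_t> make_population(std::size_t n) {
  std::vector<std::int64_t> population(n);
  std::iota(population.begin(), population.end(), std::int64_t{0});
  return population;
}

// Old pattern (as described in the summary): refill and shuffle the whole
// container for every batch, then keep only the first k entries.
std::vector<std::int64_t> pick_by_shuffle(
    std::vector<std::int64_t>& container, std::size_t k, std::mt19937& gen) {
  std::iota(container.begin(), container.end(), std::int64_t{0});
  std::shuffle(container.begin(), container.end(), gen);
  k = std::min(k, container.size());
  return std::vector<std::int64_t>(
      container.begin(),
      container.begin() + static_cast<std::ptrdiff_t>(k));
}

// New pattern (assumed shape): draw k distinct indices with std::sample
// (C++17) from a population that is never modified.
std::vector<std::int64_t> pick_by_sample(
    const std::vector<std::int64_t>& population, std::size_t k,
    std::mt19937& gen) {
  std::vector<std::int64_t> out;
  out.reserve(std::min(k, population.size()));
  std::sample(population.begin(), population.end(), std::back_inserter(out),
              static_cast<std::ptrdiff_t>(k), gen);
  return out;
}
```

Two caveats are worth checking against the real change. With forward iterators, std::sample keeps the selected elements in their original relative order, so the indices come back sorted rather than in random order; shuffling just the k results would restore a random order if the benchmark needs one. The C++ standard also specifies std::sample as linear in the length of the input range, so in this sketch the saving comes from no longer rewriting and shuffling the whole vector on each batch.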

Differential Revision: D89337685

Tahsin Erdogan added 2 commits December 17, 2025 14:42
…rch#5253)

Summary:
X-link: facebookresearch/FBGEMM#2244


The dedup_map variable was declared but never used in the EmbeddingSpMDM benchmark files. This cleanup removes the dead code and the now-unused <map> and <set> includes that were only needed for that variable.

Reviewed By: amitaga

Differential Revision: D89335855
@meta-cla meta-cla bot added the cla signed label Dec 17, 2025
meta-codesync bot commented Dec 17, 2025

@terdogan has exported this pull request. If you are a Meta employee, you can view the originating Diff in D89337685.
