Implement Fisher Yates algorithm by kayvank · Pull Request #172 · DataHaskell/dataframe

kayvank · 2026-03-01T01:12:24Z

No description provided.

daikonradish · 2026-03-01T01:29:52Z

Thanks for submitting a PR! I'll have a good look at it later but if you're open to it, you could look at a one-time test for randomness here:

https://cnut1648.github.io/files/posts/Test_for_rand.pdf

Basically, take the indices that are output by shuffleVec and look at their distribution. But that might be overkill.
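One way to look at that distribution, sketched here as an assumption and not necessarily the test described in the linked PDF, is a chi-square uniformity check on where values land. The names `firstSlotCounts`, `chiSquare`, and the `shuffleWithSeed` hook are hypothetical; the hook stands in for whatever function turns a seed into shuffled indices of length k (k > 0).

```haskell
import qualified Data.Vector.Unboxed as VU

-- | Count how often each index value ends up in position 0 over a number of
-- trials. 'shuffleWithSeed' is a hypothetical hook (seed -> shuffled indices)
-- and must return a non-empty vector of values in [0, k).
firstSlotCounts :: (Int -> VU.Vector Int) -> Int -> Int -> VU.Vector Int
firstSlotCounts shuffleWithSeed k trials =
    VU.accum (+) (VU.replicate k 0)
        [(VU.head (shuffleWithSeed s), 1) | s <- [1 .. trials]]

-- | Pearson chi-square statistic of observed counts against a uniform
-- expectation. Assumes a non-empty vector with a positive total.
chiSquare :: VU.Vector Int -> Double
chiSquare counts = VU.sum (VU.map term counts)
  where
    n = fromIntegral (VU.sum counts) :: Double
    expected = n / fromIntegral (VU.length counts)
    term c = (fromIntegral c - expected) ^ (2 :: Int) / expected
```

For an unbiased shuffle the statistic should stay close to k - 1 (its degrees of freedom) when the number of trials is large; a value far above that suggests some positions are favoured.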

mchav · 2026-03-01T08:02:51Z

src/DataFrame/Operations/Permutation.hs

+  where
+    shuffleVec :: (RandomGen g) => g -> VU.Vector Int -> VU.Vector Int
+    shuffleVec g v = runST $ do
+        vm <- VU.thaw v


Instead of building the vector from a list just to thaw it, we can create the mutable vector directly inside the shuffleVec function.
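A minimal sketch of that suggestion, not the code that was merged: the index vector is allocated as a mutable vector inside `runST` and shuffled in place with Fisher-Yates. The helper `go` and the use of `VUM.generate` (available in recent versions of the vector package) are assumptions; `uniformR` comes from `System.Random` in random 1.2 or later.

```haskell
import Control.Monad.ST (ST, runST)
import qualified Data.Vector.Unboxed as VU
import qualified Data.Vector.Unboxed.Mutable as VUM
import System.Random (RandomGen, mkStdGen, uniformR)

-- | Shuffled indices 0 .. k-1; empty when k <= 0.
shuffledIndices :: (RandomGen g) => g -> Int -> VU.Vector Int
shuffledIndices gen k
    | k <= 0 = VU.empty
    | otherwise = runST $ do
        vm <- VUM.generate k id      -- allocate [0 .. k-1] directly as mutable
        go vm (k - 1) gen
        VU.unsafeFreeze vm           -- safe: vm is not used after this point
  where
    -- Walk from the last slot down to slot 1, swapping slot i with a
    -- uniformly chosen slot j in [0, i].
    go :: (RandomGen g) => VUM.MVector s Int -> Int -> g -> ST s ()
    go vm i g
        | i <= 0 = pure ()
        | otherwise = do
            let (j, g') = uniformR (0, i) g
            VUM.swap vm i j
            go vm (i - 1) g'

-- | Example call with a fixed seed (the seed value is arbitrary).
example :: VU.Vector Int
example = shuffledIndices (mkStdGen 42) 5
```

Allocating the mutable vector directly avoids the intermediate list and the extra copy made by `VU.thaw`.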

Issue 170, implement PR comments and clean up build warnings

mchav · 2026-03-01T21:53:17Z

@kayvank this looks good. Just add some simple tests. Mostly that indices aren't dropped or duplicated and that it doesn't fail on empty.
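The checks requested above can be expressed as a single predicate. This is only a sketch of the idea, not the tests that were added to the PR; `isPermutationOfRange` is a hypothetical helper name.

```haskell
import Data.List (sort)
import qualified Data.Vector.Unboxed as VU

-- | True when 'v' contains each of 0 .. k-1 exactly once, i.e. no index was
-- dropped or duplicated. For k <= 0 the expected range is empty, so only an
-- empty vector passes.
isPermutationOfRange :: Int -> VU.Vector Int -> Bool
isPermutationOfRange k v = sort (VU.toList v) == [0 .. k - 1]
```

A unit test could then assert `isPermutationOfRange k (shuffledIndices gen k)` for a few sizes, including k = 0 to cover the empty case.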

kayvank · 2026-03-02T02:34:19Z

src/DataFrame/Operations/Permutation.hs

 shuffledIndices :: (RandomGen g) => g -> Int -> VU.Vector Int
-shuffledIndices pureGen k = VU.fromList (shuffle' [0 .. (k - 1)] k pureGen)
+shuffledIndices pureGen k
+    | k <= 0 = VU.empty


We return an empty vector even when k is a negative number, which does not seem correct.
Should we error in the rare event that k < 0? @mchav

I think that's fine since the number is derived from the size of the dataframe. And the shuffle of an empty dataframe is an empty dataframe.
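For comparison, the stricter alternative raised in the question could look roughly like the following. This is a hypothetical sketch only; `validateCount` is not part of the PR, and the thread settles on returning an empty vector instead.

```haskell
-- | Hypothetical strict check: reject a negative count instead of treating
-- it like an empty input.
validateCount :: Int -> Either String Int
validateCount k
    | k < 0 = Left ("negative row count: " ++ show k)
    | otherwise = Right k
```

Since k is derived from the size of the dataframe, it should never be negative in practice, which is why the lenient `k <= 0` guard is enough here.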

kayvank · 2026-03-02T06:50:47Z

@kayvank this looks good. Just add some simple tests. Mostly that indices aren't dropped or duplicated and that it doesn't fail on empty.

Added two new unit tests.

kayvank marked this pull request as draft March 1, 2026 01:13

Implement Fisher Yates algorithm

7eaf4e7

kayvank force-pushed the 170/implement-Fisher-Yates-algorithm branch from 402f667 to 7eaf4e7 Compare March 1, 2026 01:14

kayvank mentioned this pull request Mar 1, 2026

Refactor shuffle to implement Fisher Yates algorithm #170

Closed

mchav requested changes Mar 1, 2026

kayvank force-pushed the 170/implement-Fisher-Yates-algorithm branch from fc6f38a to cc83918 Compare March 1, 2026 17:57

Clean up build warnings

3e69bd5

Issue 170, implement PR comments and clean up build warnings

kayvank force-pushed the 170/implement-Fisher-Yates-algorithm branch from cc83918 to 3e69bd5 Compare March 1, 2026 17:58

mchav approved these changes Mar 1, 2026

kayvank commented Mar 2, 2026

Unit tests for Fisher Yates algorithm

0b00734

kayvank force-pushed the 170/implement-Fisher-Yates-algorithm branch from ff07a4e to 0b00734 Compare March 2, 2026 06:49

kayvank marked this pull request as ready for review March 2, 2026 06:49

mchav approved these changes Mar 2, 2026

mchav merged commit c4ae5f8 into DataHaskell:main Mar 2, 2026
7 checks passed

mchav merged 3 commits into DataHaskell:main from kayvank:170/implement-Fisher-Yates-algorithm


3 participants
