cmov: impl optimized `CmovEq` for `[u8]` [BREAKING] #1356

tarcieri · 2026-01-16T17:55:08Z

Note: version bumped to v0.5.0-pre to denote breaking change (not for release)

Perhaps the first and foremost use case for a crate like this (or subtle or ctutils) is comparing byte slices in constant-time, however the existing codegen for this is bad, because it goes a byte-at-a-time, converting them to a u32oru64`, then emitting predication instructions (or using bitwise masking) on each individual byte.

Instead this removes the CmovEq impl for [T] and replaces it with an optimized impl of CmovEq for [u8], reusing the code for the optimized CmovEq impl for arrays added in #1353.

This approach goes in word-sized chunks of the slice, converting them to a word-sized integer (u32 or u64) and using the CmovEq impl on those types, which should result in much more efficient code.

With this change all of the slice chunking code is now in the slice module, which lets us move the vendored copies of [T]::as_chunks(_mut) there, get rid of a utils module, and rename it back to macros (though that's perhaps a misnomer as it contains only one macro).

A small change to the Cmov impl added in #1354: it panics if the input sizes aren't equal, using the same panic message as copy_from_slice.

Note: version bumped to v0.5.0-pre to denote breaking change (not for release) Perhaps the first and foremost use case for a crate like this (or `subtle` or `ctutils) is comparing byte slices in constant-time, however the existing codegen for this is bad, because it goes a byte-at-a-time, converting them to a `u32` or `u64`, then emitting predication instructions (or using bitwise masking) on each individual byte. Instead this removes the `CmovEq` impl for `[T]` and replaces it with an optimized impl of `CmovEq` for `[u8]`, reusing the code for the optimized `CmovEq` impl for arrays added in #1353. This approach goes in word-sized chunks of the slice, converting them to a word-sized integer (`u32` or `u64`) and using the `CmovEq` impl on those types, which should result in much more efficient code. With this change all of the slice chunking code is now in the `slice` module, which lets us move the vendored copies of `[T]::as_chunks(_mut)` there, get rid of a `utils` module, and rename it back to `macros` (though that's perhaps a misnomer as it contains only one macro). A small change to the `Cmov` impl added in #1354: it panics if the input sizes aren't equal, using the same panic message as `copy_from_slice`.

tarcieri force-pushed the ctutils/optimized-cmoveq-for-byte-slices branch 5 times, most recently from c844f00 to e3d1c1d Compare January 16, 2026 18:06

tarcieri force-pushed the ctutils/optimized-cmoveq-for-byte-slices branch from e3d1c1d to b795962 Compare January 16, 2026 18:43

tarcieri merged commit 19e042a into master Jan 16, 2026
117 checks passed

tarcieri deleted the ctutils/optimized-cmoveq-for-byte-slices branch January 16, 2026 18:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

cmov: impl optimized `CmovEq` for `[u8]` [BREAKING] #1356

cmov: impl optimized `CmovEq` for `[u8]` [BREAKING] #1356

Uh oh!

tarcieri commented Jan 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cmov: impl optimized CmovEq for [u8] [BREAKING] #1356

cmov: impl optimized CmovEq for [u8] [BREAKING] #1356

Uh oh!

Conversation

tarcieri commented Jan 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cmov: impl optimized `CmovEq` for `[u8]` [BREAKING] #1356

cmov: impl optimized `CmovEq` for `[u8]` [BREAKING] #1356