Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance regression in nightly with target-cpu=native on znver4 #139370

Open
nsarlin-zama opened this issue Apr 4, 2025 · 4 comments
Open
Labels
C-bug Category: This is a bug. I-prioritize Issue: Indicates that prioritization has been requested for this issue. needs-triage This issue may need triage. Remove it if it has been sufficiently triaged. regression-untriaged Untriaged performance or correctness regression.

Comments

@nsarlin-zama
Copy link

Code

Hi, I have a regression when I try to run benchmarks using the nightly compiler on a crate that does elliptic curve operations:
In this repo: https://github.com/zama-ai/tfhe-rs, on branch v1.0 , in the crate tfhe-zk-pok, running the bench with RUSTFLAGS="-C target-cpu=znver4" cargo +nightly-2025-02-18 bench -p tfhe-zk-pok --bench pke_v2 -- pke_zk_verify_v2::PKEV2.

The machine running the bench is an aws hpc7 with an AMD EPYC 9R14 CPU.
Depending on the bench I get around 30% slower timings.

A bisect shows that the regression was introduced with nightly 2025-02-18. Stable toolchains do not seem to be impacted. The slowdown is only visible with -C target-cpu=znver4.

Version it worked on

All stable toolchains, and nightly before 2025-02-18.

Version with regression

nightly 2025-02-18 and after. The latest one as of today (2025-04-04) also has the regression.

Thank you for your help.

@nsarlin-zama nsarlin-zama added C-bug Category: This is a bug. regression-untriaged Untriaged performance or correctness regression. labels Apr 4, 2025
@rustbot rustbot added I-prioritize Issue: Indicates that prioritization has been requested for this issue. needs-triage This issue may need triage. Remove it if it has been sufficiently triaged. labels Apr 4, 2025
@lqd
Copy link
Member

lqd commented Apr 4, 2025

2025-02-18 sounds recent enough to have a more precise bisection, to a given PR with https://github.com/rust-lang/cargo-bisect-rustc/ -- could you try to do it?

@nikic
Copy link
Contributor

nikic commented Apr 4, 2025

2025-02-18 is probably the LLVM 20 upgrade.

@lqd
Copy link
Member

lqd commented Apr 4, 2025

Ah indeed, #135763 landed on the 17th.

@nsarlin-zama
Copy link
Author

I can confirm from cargo-bisect-rustc (great tool btw!) that the regressing commit is the one introducing llvm-20: ce36a96

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
C-bug Category: This is a bug. I-prioritize Issue: Indicates that prioritization has been requested for this issue. needs-triage This issue may need triage. Remove it if it has been sufficiently triaged. regression-untriaged Untriaged performance or correctness regression.
Projects
None yet
Development

No branches or pull requests

4 participants