Conversation
Pull request overview
This PR enables advanced CPU instruction set extensions (AVX2, FMA, and F16C) in the CUDA build workflow for llama-cpp-python. These optimizations improve performance for the matrix operations common in LLM inference, since CPU-side work remains relevant even in CUDA builds.
Changes:
- Enabled AVX2, FMA, and F16C CPU instruction sets in the CUDA build configuration by changing the corresponding CMake flags from `off` to `on`
```
-DGGML_AVX2=on
-DGGML_FMA=on
-DGGML_F16C=on
```
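For context, a minimal sketch of how flags like these are typically passed to a from-source llama-cpp-python install via the `CMAKE_ARGS` environment variable (the exact flag set here assumes a CUDA-enabled build; adjust to your workflow):

```shell
# Hypothetical local reproduction of the CI build configuration:
# pass the GGML CMake options through CMAKE_ARGS when building the wheel.
CMAKE_ARGS="-DGGML_CUDA=on -DGGML_AVX2=on -DGGML_FMA=on -DGGML_F16C=on" \
  pip install --no-cache-dir llama-cpp-python
```

This is a build-configuration fragment, not something the PR itself changes; the PR flips the same `-DGGML_*` options inside the CI workflow file.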
Enabling AVX2, FMA, and F16C CPU instructions will improve performance but will break compatibility with older CPUs that don't support these instruction sets. These instructions require:
- AVX2: Intel Haswell (2013) or AMD Excavator (2015) and newer
- FMA: Same as AVX2
- F16C: Intel Ivy Bridge (2012) or AMD Bulldozer (2011) and newer
This means the built wheels will fail with "Illegal instruction" errors on older CPUs. Consider:
- Documenting these CPU requirements in the release notes or README
- Testing that the target deployment environments support these instructions
- Potentially providing separate builds for older CPUs if backward compatibility is needed
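One way to fail gracefully instead of crashing with "Illegal instruction": probe the CPU's advertised feature flags before loading the wheel. A minimal sketch for Linux, reading `/proc/cpuinfo` (the function name is hypothetical, not part of llama-cpp-python):

```python
def required_cpu_features_present(required=("avx2", "fma", "f16c")):
    """Return a dict mapping each required feature to whether the CPU
    advertises it, or None if /proc/cpuinfo is unavailable (non-Linux)."""
    try:
        with open("/proc/cpuinfo") as f:
            cpuinfo = f.read()
    except OSError:
        return None  # not Linux; cannot tell from /proc
    # Feature flags appear on lines like "flags : fpu vme ... avx2 fma f16c ..."
    flags = set()
    for line in cpuinfo.splitlines():
        if line.startswith("flags"):
            flags.update(line.split(":", 1)[1].split())
    return {feat: feat in flags for feat in required}
```

A launcher could call this at startup and print a clear error (or fall back to a generic build) when any required feature is missing, rather than letting the process die on the first AVX2 instruction.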
The trade-off is significant performance gains (especially for matrix operations used in LLM inference) versus broader CPU compatibility.