[RVV] add rvv f32 kernels for velu, vgelu, vapproxgelu#9963

Open
velonica0 wants to merge 3 commits into google:master from velonica0:rvv-elementwise

@velonica0

Part of #9954

Add RVV kernels for three elementwise activation ops:

  • f32-velu (ELU) - rr2_p6 polynomial approximation
  • f32-vgelu (GELU) - rational 12/10 approximation (div + nr variants)
  • f32-vapproxgelu - rational 12/10 approximation (div + nr variants)

Tested on SpacemiT K1 CPU and K3 CPU, both VLEN=256.
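To make the VLEN=256 figure concrete, the scalar model below sketches how many f32 lanes a vector register group holds and how many strip-mined loop iterations a batch would take. It is a simplified illustration under stated assumptions: the helper names are hypothetical, the LMUL values are examples rather than the ones these kernels use, and `model_vsetvl` always returns `min(remaining, VLMAX)`, whereas real hardware is permitted to pick a smaller `vl` when the remaining count lies between VLMAX and 2 * VLMAX.

```c
#include <stddef.h>

enum { VLEN_BITS = 256, SEW_BITS = 32 };  // VLEN from the PR; SEW for f32

// Lanes per vector register group: VLMAX = (VLEN / SEW) * LMUL.
static inline size_t vlmax_f32(size_t lmul) {
  return (size_t) (VLEN_BITS / SEW_BITS) * lmul;
}

// Simplified scalar model of vsetvl: vl = min(remaining, VLMAX).
static inline size_t model_vsetvl(size_t remaining, size_t lmul) {
  const size_t vlmax = vlmax_f32(lmul);
  return remaining < vlmax ? remaining : vlmax;
}

// Number of loop iterations a strip-mined kernel needs for n elements.
static inline size_t strip_count(size_t n, size_t lmul) {
  size_t iters = 0;
  while (n > 0) {
    n -= model_vsetvl(n, lmul);  // each pass consumes vl elements
    iters++;
  }
  return iters;
}
```

Under this model, `vlmax_f32(1)` is 8 and `vlmax_f32(4)` is 32, so a batch of 100 floats at LMUL=4 takes four passes (32, 32, 32, and a 4-element tail).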

Move f32-velu, f32-vgelu, f32-vapproxgelu entries from end of
PROD/NON_PROD lists to alphabetical positions to avoid merge conflicts
with other RVV PRs.

2 participants