Eval bug: Gemma MoE model crashing

### Name and Version

version: 9459 (07ac3cec6)
built with Clang 19.1.5 for Windows x86_64 (latest version)


### Operating systems

Windows

### GGML backends

CUDA

### Hardware

RTX 5060 Ti

### Models

_No response_

### Problem description & steps to reproduce


D:/a/beellama.cpp/beellama.cpp/ggml/src/ggml-backend.cpp:272: GGML_ASSERT(offset + size <= ggml_nbytes(tensor) && "tensor read out of bounds") failed 

In the previous version it was working, and I was getting a 1.5x+ speed up on the MoE model too, but now it seems only dense is working.

### First Bad Commit

_No response_

### Relevant log output

<details>
<summary>Logs</summary>


```console

```
</details>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Eval bug: Gemma MoE model crashing #34

Name and Version

Operating systems

GGML backends

Hardware

Models

Problem description & steps to reproduce

First Bad Commit

Relevant log output

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Eval bug: Gemma MoE model crashing #34

Description

Name and Version

Operating systems

GGML backends

Hardware

Models

Problem description & steps to reproduce

First Bad Commit

Relevant log output

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions