Skip to content

Sweep for auxillary loss and top-k #165

@aaronkl

Description

@aaronkl

Sweep over auxiliary loss and top-k for the 200M-A50M model. Also try it with different number of experts

Metadata

Metadata

Assignees

Type

No type

Projects

Status

No status

Relationships

None yet

Development

No branches or pull requests

Issue actions