Commit c68bf05
authored
feat: lora for accelerated MoE - limited (#141)
* limit changes
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* lora filtering
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* naming
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* fix: hardcodes
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* fix: target modules
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* remove lora config from expert weights
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* fix: pass in lora config
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* fix: requires grad
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* fix: don't turn off requires grad
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
---------
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>1 parent 2a3a10f commit c68bf05
File tree
5 files changed
+95
-42
lines changed- plugins/accelerated-moe
- src/fms_acceleration_moe
- utils
5 files changed
+95
-42
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
281 | 281 | | |
282 | 282 | | |
283 | 283 | | |
284 | | - | |
| 284 | + | |
285 | 285 | | |
286 | 286 | | |
287 | 287 | | |
| |||
Lines changed: 2 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
77 | 77 | | |
78 | 78 | | |
79 | 79 | | |
| 80 | + | |
80 | 81 | | |
81 | 82 | | |
82 | 83 | | |
| |||
97 | 98 | | |
98 | 99 | | |
99 | 100 | | |
| 101 | + | |
100 | 102 | | |
101 | 103 | | |
102 | 104 | | |
| |||
Lines changed: 90 additions & 32 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
53 | 53 | | |
54 | 54 | | |
55 | 55 | | |
| 56 | + | |
| 57 | + | |
56 | 58 | | |
57 | 59 | | |
58 | 60 | | |
| |||
110 | 112 | | |
111 | 113 | | |
112 | 114 | | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
113 | 120 | | |
114 | | - | |
115 | | - | |
116 | | - | |
117 | | - | |
118 | | - | |
119 | | - | |
120 | | - | |
121 | | - | |
122 | | - | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
123 | 139 | | |
124 | 140 | | |
125 | 141 | | |
| |||
467 | 483 | | |
468 | 484 | | |
469 | 485 | | |
| 486 | + | |
470 | 487 | | |
471 | | - | |
472 | | - | |
473 | | - | |
474 | | - | |
475 | | - | |
476 | | - | |
477 | | - | |
478 | | - | |
479 | | - | |
480 | | - | |
481 | | - | |
482 | | - | |
483 | | - | |
484 | | - | |
485 | | - | |
486 | | - | |
487 | | - | |
488 | | - | |
| 488 | + | |
| 489 | + | |
| 490 | + | |
| 491 | + | |
| 492 | + | |
| 493 | + | |
| 494 | + | |
| 495 | + | |
| 496 | + | |
489 | 497 | | |
490 | | - | |
491 | | - | |
492 | | - | |
493 | | - | |
| 498 | + | |
| 499 | + | |
| 500 | + | |
| 501 | + | |
| 502 | + | |
| 503 | + | |
| 504 | + | |
| 505 | + | |
| 506 | + | |
| 507 | + | |
| 508 | + | |
| 509 | + | |
| 510 | + | |
| 511 | + | |
| 512 | + | |
| 513 | + | |
| 514 | + | |
| 515 | + | |
| 516 | + | |
| 517 | + | |
| 518 | + | |
| 519 | + | |
| 520 | + | |
| 521 | + | |
| 522 | + | |
| 523 | + | |
| 524 | + | |
| 525 | + | |
| 526 | + | |
| 527 | + | |
| 528 | + | |
| 529 | + | |
| 530 | + | |
| 531 | + | |
| 532 | + | |
| 533 | + | |
494 | 534 | | |
495 | 535 | | |
496 | 536 | | |
| |||
540 | 580 | | |
541 | 581 | | |
542 | 582 | | |
| 583 | + | |
| 584 | + | |
| 585 | + | |
| 586 | + | |
| 587 | + | |
| 588 | + | |
| 589 | + | |
| 590 | + | |
| 591 | + | |
| 592 | + | |
| 593 | + | |
| 594 | + | |
| 595 | + | |
| 596 | + | |
| 597 | + | |
543 | 598 | | |
544 | | - | |
| 599 | + | |
| 600 | + | |
| 601 | + | |
545 | 602 | | |
546 | 603 | | |
547 | 604 | | |
548 | 605 | | |
549 | 606 | | |
550 | 607 | | |
| 608 | + | |
551 | 609 | | |
552 | 610 | | |
553 | 611 | | |
| |||
Lines changed: 0 additions & 8 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
20 | | - | |
21 | 20 | | |
22 | 21 | | |
23 | 22 | | |
| |||
237 | 236 | | |
238 | 237 | | |
239 | 238 | | |
240 | | - | |
241 | | - | |
242 | | - | |
243 | | - | |
244 | 239 | | |
245 | 240 | | |
246 | 241 | | |
| |||
286 | 281 | | |
287 | 282 | | |
288 | 283 | | |
289 | | - | |
290 | 284 | | |
291 | 285 | | |
292 | 286 | | |
| |||
296 | 290 | | |
297 | 291 | | |
298 | 292 | | |
299 | | - | |
300 | 293 | | |
301 | 294 | | |
302 | 295 | | |
| |||
307 | 300 | | |
308 | 301 | | |
309 | 302 | | |
310 | | - | |
311 | 303 | | |
312 | 304 | | |
313 | 305 | | |
| |||
Lines changed: 2 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
92 | 92 | | |
93 | 93 | | |
94 | 94 | | |
95 | | - | |
| 95 | + | |
| 96 | + | |
96 | 97 | | |
97 | 98 | | |
98 | 99 | | |
| |||
0 commit comments