Immediate To-Dos:
Improve the Multilora PEFT class extension code (@sumo already has an implementation and will push it shortly).
Standardize gating to enable flexible switching of expert adapters from a larger database of adapters (likely through centroid/similarity measures).
Build a UI to run MoE inference and base model inference side by side (with streaming and a display of the selected experts during inference).
Simplify the process of fine-tuning new experts and adding them to the MoE architecture.
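
To make the gating item more concrete, here is a minimal sketch of what centroid/similarity-based expert selection could look like. It is not the project's implementation. It assumes each adapter in the database is summarized by the centroid of embeddings of its training examples, and that the incoming prompt is embedded by the same (unspecified) encoder. All names (`centroid`, `select_experts`, `adapter_db`) and the toy vectors are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def centroid(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def select_experts(query_embedding, centroids, top_k=2):
    """Return the top_k (adapter_name, score) pairs, ranked by cosine similarity."""
    scores = {name: cosine(query_embedding, c) for name, c in centroids.items()}
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return ranked[:top_k]


# Hypothetical adapter database: one centroid per expert, built from toy
# 3-dimensional "embeddings" of that expert's training examples.
adapter_db = {
    "code": centroid([[1.0, 0.0, 0.0], [0.9, 0.1, 0.0]]),
    "math": centroid([[0.0, 1.0, 0.0], [0.1, 0.9, 0.0]]),
    "chat": centroid([[0.0, 0.0, 1.0], [0.0, 0.2, 0.8]]),
}

# Toy query embedding; in practice this would come from the same encoder
# that produced the centroids.
selected = select_experts([0.8, 0.3, 0.0], adapter_db, top_k=2)
for name, score in selected:
    print(f"{name}: {score:.3f}")
```

With this shape, adding a new expert only requires computing and storing one more centroid, and the gate itself stays independent of how many adapters are in the database.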