[Light Training Camp] F2LLM: add support for more decoder-only models by luanzhiwow · Pull Request #18 · codefuse-ai/CodeFuse-Embeddings

luanzhiwow · 2025-11-25T12:37:33Z

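#10
1.
model.py: refactored the core model-loading logic
tokenize_data.py: new general-purpose data preprocessing script
MULTI_MODEL_GUIDE.md: detailed usage documentation
2.
Generic tokenization - the new tokenize_data.py supports any transformers model
Configuration extensions - added options for model type and attention implementation
Compatibility fixes - handle differences in tokenizers and output formats across models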
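
One common tokenizer difference among decoder-only models is that some (GPT-2 style models, for example) ship without a padding token, which breaks batched tokenization. The following is a minimal sketch of how such a difference might be handled. It is not taken from this PR's diff: the `ModelOptions` dataclass, its field names, and the `ensure_pad_token` helper are hypothetical names introduced here for illustration. The only assumptions about the tokenizer are that it exposes `pad_token` and `eos_token` attributes, as Hugging Face tokenizers do.

```python
from dataclasses import dataclass
from types import SimpleNamespace


@dataclass
class ModelOptions:
    """Hypothetical config fields; the PR's actual option names are not shown here."""
    model_path: str
    model_type: str = "auto"           # illustrative value, not a documented choice
    attn_implementation: str = "sdpa"  # e.g. "eager", "sdpa", "flash_attention_2"


def ensure_pad_token(tokenizer):
    """Fall back to the EOS token when the tokenizer has no pad token."""
    if getattr(tokenizer, "pad_token", None) is None:
        eos = getattr(tokenizer, "eos_token", None)
        if eos is None:
            raise ValueError("tokenizer has neither a pad token nor an EOS token")
        tokenizer.pad_token = eos
    return tokenizer


# Stand-in object mimicking a GPT-2 style tokenizer that lacks a pad token.
gpt_like = SimpleNamespace(pad_token=None, eos_token="<|endoftext|>")
ensure_pad_token(gpt_like)

# Stand-in for a tokenizer that already defines its own pad token.
padded = SimpleNamespace(pad_token="<pad>", eos_token="</s>")
ensure_pad_token(padded)
```

With a real tokenizer, the same helper would be called once right after loading, before any batched encoding. Reusing EOS as the pad token is a common convention rather than the only option; padding positions should still be masked out via the attention mask.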

Support more models

5f96b33

luanzhiwow changed the title ~~[Light Training Camp] Add support for more decoder-only models~~ [Light Training Camp] F2LLM: add support for more decoder-only models Nov 25, 2025

luanzhiwow wants to merge 1 commit into codefuse-ai:main from luanzhiwow:cfe_for_gpt
