Skip to content
This repository was archived by the owner on May 1, 2025. It is now read-only.

Add RWKV models#16

Open
guangyusong wants to merge 2 commits into
salesforce:mainfrom
guangyusong:main
Open

Add RWKV models#16
guangyusong wants to merge 2 commits into
salesforce:mainfrom
guangyusong:main

Conversation

@guangyusong

@guangyusong guangyusong commented Jun 5, 2023

Copy link
Copy Markdown

Changes:

Added support for RWKV model family.

Related links:

Paper: https://arxiv.org/abs/2305.13048
Github: https://github.com/BlinkDL/RWKV-LM

Screenshots:

Model list:
Screenshot 2023-06-05 at 2 10 45 AM

Sample generation:
Screenshot 2023-06-05 at 2 09 48 AM

@salesforce-cla

salesforce-cla Bot commented Jun 5, 2023

Copy link
Copy Markdown

Thanks for the contribution! Before we can merge this, we need @guangyusong to sign the Salesforce Inc. Contributor License Agreement.

@bdqnghi

bdqnghi commented Jun 6, 2023

Copy link
Copy Markdown
Contributor

thank you. The RWKV model family looks very nice. However, they are not really code learning models, do you have the checkpoints related to coding tasks?

@guangyusong

Copy link
Copy Markdown
Author

The RWKV model family should have a similar data split as GPT-J. We anticipate releasing a model that's more adept at coding tasks in the near future.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants