Docs: use relative path for preprocess_data_for_megatron.py #15221

Closed
Saibabu7770 wants to merge 1 commit into NVIDIA-NeMo:main from Saibabu7770:clean-my-first-pr

@Saibabu7770
Contributor

This updates the GPT training docs to use a relative path:
python scripts/nlp_language_modeling/preprocess_data_for_megatron.py

Fixes #15130

Signed-off-by: Saibabu7770 <saibabupat7@gmail.com>
@pzelasko
Collaborator

Closing this PR as "won't merge".
The following collections have been moved to separate repos in the https://github.com/NVIDIA-NeMo organization: avlm, llm, multimodal, multimodal-autoregressive, vlm, speechlm, diffusion.
If you still wish to proceed with this contribution, please re-open it in the relevant repo.

Development

Successfully merging this pull request may close these issues.

[BUG] Megatron Tokenization script missing / Docs outdated.

3 participants