This lesson teaches core workflows for building, training, and tuning ML/AI models using Google Cloud's Vertex AI platform. Participants learn to set up data storage, configure Vertex AI Workbench notebooks as lightweight controllers, launch training and hyperparameter tuning jobs, and optimize resource costs effectively within GCP. The workshop also includes a section on building retrieval-augmented generation (RAG) pipelines using Gemini models.
- Overview of Google Cloud for Machine Learning
- Data Storage: Setting up GCS
- Notebooks as Controllers
- Accessing and Managing Data in GCS
- Using GitHub PAT in Vertex AI Notebooks
- Training Models in Vertex AI: XGBoost (CPU)
- Training Models in Vertex AI: PyTorch (GPU)
- Hyperparameter Tuning in Vertex AI
- Resource Management & Cleanup
- Retrieval-Augmented Generation (RAG)
See the Setup page for instructions on preparing for this workshop.
This lesson uses The Carpentries Workbench and is part of The Carpentries Incubator.
For questions or issues, contact endemann@wisc.edu or open an issue on this repository.