Skip to content

Commit f47faf6

Browse files
committed
add datathon documentation
1 parent 4fb4bb9 commit f47faf6

16 files changed

Lines changed: 348 additions & 1 deletion

File tree

.devcontainer/devcontainer.json

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,8 @@
1+
{
2+
"name": "SQL + AI Contest",
3+
"image": "mcr.microsoft.com/devcontainers/python:3.11",
4+
"features": {
5+
"ghcr.io/devcontainers/features/docker-in-docker:2": {}
6+
},
7+
"forwardPorts": [8000, 5000]
8+
}
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
# Mission 1 workflow placeholder
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
# Mission 2 workflow placeholder
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
# Mission 3 workflow placeholder
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
# Final submission workflow placeholder

README.md

Lines changed: 38 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1,38 @@
1-
# sql-ai-contest-starter
1+
# SQL AI Datathon Starter Repository
2+
3+
Welcome to the SQL AI Datathon! You'll complete guided missions that will prepare you for building AI-powered applications with Microsoft SQL. You’ll use AI features to build embeddings, semantic/hybrid search, DAB endpoints, and an agent with RAG.
4+
5+
Complete all four missions to earn a badge and unlock the Open Hack challenge.
6+
You'll add your own creative twist in the Open Hack and be eligible for judging for a ticket to SQLCon 2026!
7+
8+
9+
## Quick Start
10+
1. Fork this repo.
11+
2. Open in Codespaces or local devcontainer.
12+
3. Run `scripts/...` to populate sample data.
13+
4. Commit a change to trigger **Mission 1** CI; fix any failing tests.
14+
5. Repeat for Missions 2 & 3. Use Week 4 to improve the app.
15+
16+
## Mission Briefs
17+
1. [Embeddings & Search](missions/mission1/README.md)
18+
2. [Retrieval Augmented Generation (RAG)](missions/mission2/README.md)
19+
3. [Orchestrate SQL + AI workflows in C# or Python](missions/mission3/README.md)
20+
4. [Building a Full-Stack AI Application in C# or Python](missions/mission4/README.md)
21+
7. [Open Hack]()
22+
23+
## What you’ll build
24+
- Embeddings & search
25+
- REST/GraphQL endpoints with Data API Builder
26+
- An agent that answers questions grounded in your SQL data
27+
28+
## How you’re evaluated for the Open Hack
29+
- CI must pass for each mission to receive a badge and your Open Hack submission to be eligible for judging
30+
- README clarity & reproducibility
31+
- Creativity, impact, and alignment with SQL + AI objectives
32+
33+
# How to Submit
34+
1. Ensure all mission workflows passed on your repo.
35+
2. Open an Issue on [this repo](https://github.com/your-repo-link) using the “Final Submission” template.
36+
3. Paste links to:
37+
- Latest successful workflow runs
38+
- Repo README and any demo instructions

missions/mission1/README.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
# Mission 1: Creating Embeddings and Performing Search
2+
3+
You will be guided through implementing semantic search capabilities using embedding models and Azure SQL Database. In this mission, you will:
4+
5+
## Learning Objectives
6+
- **Convert Text to Vectors**: Use an embedding model to convert text into high-dimensional vector representations
7+
- **Store Embeddings**: Store embeddings efficiently in Azure SQL Database
8+
- **Query with Vector Similarity**: Query the database using vector similarity to find semantically related content
9+
- **Maintain Embeddings**: Keep embeddings updated as data changes
10+
11+
## Prerequisites
12+
1. Azure SQL Database with populated data
13+
1. Embedding model access (you can use the free AI Proxy to generate embeddings for free)
14+
15+
## Next Steps
16+
After completing this mission, you will have implemented a semantic search solution that can find relevant information based on meaning rather than just keywords.
17+
Proceed to [Mission 2: Retrieval Augmented Generation (RAG)](../mission2/README.md) to learn how to build an end-to-end RAG pipeline that combines embeddings with language model generation.

missions/mission2/README.md

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,25 @@
1+
# Mission 2: Retrieval Augmented Generation (RAG)
2+
3+
You will be guided through implementing retrieval augmented generation (RAG) capabilities using embedding models and Azure SQL Database. In this mission, you will:
4+
5+
## Learning Objectives
6+
- **Implement RAG Pipeline**: Build an end-to-end retrieval-augmented generation workflow
7+
- **Query with Context**: Use vector similarity search to find relevant documents based on user questions
8+
- **Generate Informed Responses**: Feed retrieved context to a language model to produce accurate, grounded answers
9+
- **Prevent Hallucinations**: Learn techniques to ensure LLM responses are based on actual data rather than fabricated information
10+
- **Optimize Context Windows**: Understand how to select and rank the most relevant information for the language model
11+
12+
13+
## Prerequisites
14+
1. Mission 1 completed
15+
1. Embedding model access (you can use the free AI Proxy to generate embeddings for free)
16+
1. Language model access (you can use the free AI Proxy to generate completions for free)
17+
18+
# Key Concepts
19+
This mission demonstrates how to build production-ready AI applications that combine the semantic understanding of embeddings with the generative capabilities of large language models. RAG is a fundamental pattern for creating trustworthy AI systems that answer questions based on your specific domain knowledge.
20+
21+
The techniques learned here enable you to build chatbots, knowledge assistants, and intelligent search systems that provide accurate, source-backed responses.
22+
23+
## Next Steps
24+
After completing this mission, you will have implemented a robust RAG pipeline that can answer questions based on your SQL data.
25+
Proceed to [Mission 3: Orchestrate SQL + AI workflows](../mission3/README.md) to learn how to build complex, multi-step AI workflows that integrate RAG with other SQL and AI capabilities.

missions/mission3/README.md

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
# Mission 3: Orchestrate SQL + AI workflows
2+
3+
This mission advances your skills by teaching you how to orchestrate complex workflows that combine SQL database operations with AI capabilities. You'll learn to build intelligent, multi-step processes that seamlessly integrate data retrieval, transformation, and AI-powered decision making.
4+
5+
## Overview
6+
Learn how to design and implement sophisticated AI workflows that coordinate multiple database queries, embedding operations, and language model calls to solve complex business problems requiring multiple steps and decision points.
7+
8+
## Learning Objectives
9+
- **Build Multi-Step Workflows**: Create orchestrated processes that chain together SQL queries, vector searches, and AI model calls
10+
- **Implement Conditional Logic**: Use AI responses to determine next steps in your workflow
11+
- **Handle Data Transformations**: Process and prepare data between workflow stages
12+
- **Manage State and Context**: Maintain information across multiple operations in a workflow
13+
14+
## Prerequisites
15+
1. Mission 1 completed (embeddings and vector search)
16+
1. Mission 2 completed (RAG implementation)
17+
1. Understanding of async/await patterns
18+
1. Embedding and language model access
19+
20+
21+
## Next Steps
22+
After completing this mission, you will have built a robust AI orchestration layer that can handle complex scenarios by integrating SQL data operations with AI capabilities.
23+
24+
Proceed to [Mission 4: Building a Full-Stack AI Application](../mission4/README.md) to learn how to create a complete application that leverages the workflows you've built in this mission.
Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
{
2+
"cells": [
3+
{
4+
"cell_type": "code",
5+
"execution_count": null,
6+
"id": "ce33247c",
7+
"metadata": {
8+
"vscode": {
9+
"languageId": "plaintext"
10+
}
11+
},
12+
"outputs": [],
13+
"source": [
14+
"# Mission 3: Orchestrate SQL + AI workflows\n",
15+
"\n",
16+
"This mission advances your skills by teaching you how to orchestrate complex workflows that combine SQL database operations with AI capabilities. You'll learn to build intelligent, multi-step processes that seamlessly integrate data retrieval, transformation, and AI-powered decision making.\n",
17+
"\n",
18+
"## Overview\n",
19+
"Learn how to design and implement sophisticated AI workflows that coordinate multiple database queries, embedding operations, and language model calls to solve complex business problems requiring multiple steps and decision points.\n",
20+
"\n",
21+
"## Learning Objectives\n",
22+
"- **Build Multi-Step Workflows**: Create orchestrated processes that chain together SQL queries, vector searches, and AI model calls\n",
23+
"- **Implement Conditional Logic**: Use AI responses to determine next steps in your workflow\n",
24+
"- **Handle Data Transformations**: Process and prepare data between wor\n",
25+
"- **Manage State and Context**: Maintain information across multiple operations in a workflow\n",
26+
"kflow stages\n",
27+
"\n",
28+
"## Prerequisites\n",
29+
"1. Mission 1 completed (embeddings and vector search)\n",
30+
"1. Mission 2 completed (RAG implementation)\n",
31+
"1. Understanding of async/await patterns\n",
32+
"1. Embedding and language model access"
33+
]
34+
}
35+
],
36+
"metadata": {
37+
"language_info": {
38+
"name": "python"
39+
}
40+
},
41+
"nbformat": 4,
42+
"nbformat_minor": 5
43+
}

0 commit comments

Comments
 (0)