RAG (FAISS, BM25) • LangChain • LangSmith • Ragas • OpenAI • Whisper • LLM Evaluation • Prompt Engineering
🔹 Build scalable Playwright automation frameworks
🔹 Design end-to-end testing for AI & LLM systems
🔹 Validate RAG pipelines and reduce hallucinations
🔹 Automate voice & chat AI workflows (STT + LLM + TTS)
🔹 Create AI evaluation frameworks using LLM-as-a-Judge

