Basic Agent With Tool AI Engineer Intern

Basic Agent With Tool AI Engineer Intern — a thoughtfully built LLM-powered conversational application for intelligent, context-aware dialogue.

Topics: agentic-ai · ai-agents · api · autonomous-agents · deep-learning · function-execution · generative-ai · large-language-models · llm · tool-calling-llm

Overview

Basic Agent With Tool AI Engineer Intern is an LLM-powered application that provides a clean, functional interface for interacting with frontier language models. It goes beyond a simple prompt-response wrapper: it maintains conversation memory, supports configurable system prompts for persona and task specialisation, and provides streaming output for a responsive chat experience.

The application supports multiple LLM backends — OpenAI GPT-4o, Google Gemini, Anthropic Claude, and locally-running Ollama models — configurable via environment variables without code changes. This makes it a flexible foundation for a wide range of conversational AI use cases: customer support bots, coding assistants, educational tutors, and domain-specific advisors.

The conversation history is managed with configurable memory: either a simple sliding window buffer (last N messages) or a vector-store-backed semantic memory that retrieves relevant earlier context based on the current query. Both modes are implemented and selectable via the sidebar.

Motivation

Building a production-ready LLM chat application requires more than an API call. Memory, streaming, error handling, rate limit management, and a clean UI each add meaningful complexity. This project implements all of these correctly so that it can serve as a solid, reusable foundation for any conversational AI product built on top of it.

Architecture

User message → Streamlit chat UI
        │
  Memory retrieval (buffer or vector)
        │
  LLM API call (streamed)
        │
  Response rendering + history update

Features

Multi-Turn Memory

Conversation history maintained across turns — either as a sliding window buffer or semantic vector retrieval for long-context conversations.

Streaming Token Output

LLM responses are streamed character by character to the UI, providing low-latency perceived response time.

System Prompt Customisation

Sidebar text area for runtime system prompt editing — change the AI's persona, task focus, or response format without restarting the app.

Multi-Model Backend

Switch between OpenAI, Gemini, Claude, and Ollama backends via environment variable or sidebar selector.

Conversation Export

Download full conversation history as Markdown or JSON.

Token Usage Display

Real-time per-message and session-total token counters in the sidebar.

Error Handling and Retry

Graceful handling of API rate limits, timeouts, and model errors with automatic retry and user notification.

Session Management

Create, name, switch between, and delete conversation sessions without page reload.

Tech Stack

Library / Tool	Role	Why This Choice
Streamlit	Chat UI	st.chat_message, st.chat_input, sidebar
OpenAI SDK	Primary LLM backend	GPT-4o function calling and streaming
google-generativeai	Gemini backend	Gemini Pro/Flash streaming API
python-dotenv	Config	Environment variable management
FAISS (optional)	Vector memory	Semantic conversation history retrieval
LangChain (optional)	Memory abstraction	ConversationBufferMemory wrappers

Getting Started

Prerequisites

Python 3.9+ (or Node.js 18+ for TypeScript/JS projects)
pip or npm package manager
Relevant API keys (see Configuration section)

Installation

git clone https://github.com/Devanik21/Basic-Agent-with-Tool---AI-Engineer-Intern-.git
cd Basic-Agent-with-Tool---AI-Engineer-Intern-
python -m venv venv && source venv/bin/activate
pip install streamlit openai google-generativeai python-dotenv
echo 'OPENAI_API_KEY=sk-...' > .env
streamlit run app.py

Usage

streamlit run app.py

# Use Gemini backend
BACKEND=gemini streamlit run app.py

# Use local Ollama
BACKEND=ollama MODEL=llama3 streamlit run app.py

Configuration

Variable	Default	Description
`OPENAI_API_KEY`	`(required for OpenAI)`	OpenAI API key
`GOOGLE_API_KEY`	`(required for Gemini)`	Google API key
`BACKEND`	`openai`	LLM backend: openai, gemini, ollama
`MEMORY_WINDOW`	`10`	Number of messages in sliding window memory

Copy .env.example to .env and populate all required values before running.

Project Structure

Basic-Agent-with-Tool---AI-Engineer-Intern-/
├── README.md
├── financial-research-assistant.py
└── ...

Roadmap

RAG mode: chat over uploaded documents with citation
Voice I/O with Whisper and TTS
Multi-agent mode with specialist sub-agents
Docker deployment with Traefik reverse proxy
Persistent storage with SQLite conversation database

Contributing

Contributions, issues, and feature requests are welcome. Please:

Fork the repository
Create a feature branch (git checkout -b feature/your-feature)
Commit your changes (git commit -m 'feat: add your feature')
Push to your branch (git push origin feature/your-feature)
Open a Pull Request

Please follow conventional commit messages and ensure any new code is documented.

Notes

API keys for the configured LLM backend are required. Streaming availability depends on the model and backend. Ollama must be running locally for the Ollama backend to work.

Author

Devanik Debnath
B.Tech, Electronics & Communication Engineering
National Institute of Technology Agartala

License

This project is open source and available under the MIT License.

Crafted with curiosity, precision, and a belief that good software is worth building well.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
financial-assistant-readme.md		financial-assistant-readme.md
financial-research-assistant.py		financial-research-assistant.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Basic Agent With Tool AI Engineer Intern

Overview

Motivation

Architecture

Features

Multi-Turn Memory

Streaming Token Output

System Prompt Customisation

Multi-Model Backend

Conversation Export

Token Usage Display

Error Handling and Retry

Session Management

Tech Stack

Getting Started

Prerequisites

Installation

Usage

Configuration

Project Structure

Roadmap

Contributing

Notes

Author

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Basic Agent With Tool AI Engineer Intern

Overview

Motivation

Architecture

Features

Multi-Turn Memory

Streaming Token Output

System Prompt Customisation

Multi-Model Backend

Conversation Export

Token Usage Display

Error Handling and Retry

Session Management

Tech Stack

Getting Started

Prerequisites

Installation

Usage

Configuration

Project Structure

Roadmap

Contributing

Notes

Author

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages