Skip to content
This repository was archived by the owner on Mar 6, 2026. It is now read-only.

Latest commit

Β 

History

History
141 lines (104 loc) Β· 3.93 KB

File metadata and controls

141 lines (104 loc) Β· 3.93 KB

VisualTreeSearch

A powerful web agent visualization tool that helps you understand and analyze web automation processes through visual tree search.

Disclaimer: Please note that VisualTreeSearch is not affiliated with any for-profit company. This is a collaborative project from PathOnAI.org, an open-source AI research community, where Danqing Zhang (danqing.zhang.personal@gmail.com) is the main contributor and lead author of the ECML-PKDD paper. If anyone claims VisualTreeSearch is affiliated with any for-profit company, please contact Danqing Zhang (danqing.zhang.personal@gmail.com) for verification.

News

  • 06/16/2025 - "VisualTreeSearch: Understanding Web Agent Test-time Scaling" was accepted by ECML-PKDD 2025.
  • 04/28/2025 - Released this Open Source Repo: visual tree search.

🌐 Live Demo

Watch the video

Visit our live demo at: visual-tree-search.pathonai.org

🌟 Features

  • Interactive visualization of web agent actions
  • Real-time tree search visualization
  • Modern and responsive UI
  • Comprehensive web automation analysis

πŸ› οΈ Tech Stack

Frontend

  • Framework: NextJS 14
  • Styling: TailwindCSS
  • UI Components: Shadcn UI
  • Deployment: Vercel

Backend

  • Framework: FastAPI
  • Deployment: AWS ECS

Browser Service

  • Framework: FastAPI
  • Deployment: AWS ECS
  • Browser Engine: Chromium (via Playwright)

State Reset

  • Framework: FastAPI
  • Deployment: AWS EC2
  • Database Access: SQLAlchemy ORM connecting to MariaDB (using MySQL-compatible interface)

πŸš€ Getting Started

Prerequisites

  • Node.js (Latest LTS version)
  • Python 3.8+
  • npm or yarn
  • Git

Installation

  1. Clone the repository
git clone https://github.com/PathOnAI/VisualTreeSearch-Demo.git
cd VisualTreeSearch-Demo
  1. Backend Setup
# Navigate to backend directory
cd visual-tree-search-backend

# Create and activate virtual environment (recommended)
python -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt
pip install uvicorn[standard]  # Install uvicorn with standard extras
  1. Frontend Setup
# Navigate to frontend directory
cd ../visual-tree-search-app

# Install dependencies
npm install

# Create .env file
echo "NEXT_PUBLIC_BACKEND_URL=http://127.0.0.1:3000" > .env

Local Development

Backend

  1. Navigate to backend directory:
cd visual-tree-search-backend
  1. Activate virtual environment (if not already activated):
source venv/bin/activate  # On Windows use: venv\Scripts\activate
  1. Run the FastAPI server:
uvicorn app.main:app --host 0.0.0.0 --port 3000 --reload

Note: The --reload flag enables auto-reload when code changes are detected. Remove it in production.

Frontend

  1. Open a new terminal and navigate to frontend directory:
cd visual-tree-search-app
  1. Start the development server:
npm run dev -- -p 3001

The application should now be running at:

πŸ“ Project Structure

VisualTreeSearch-Demo/
β”œβ”€β”€ visual-tree-search-app/        # Frontend application
β”‚   β”œβ”€β”€ src/                      # Source code
β”‚   β”œβ”€β”€ public/                   # Static files
β”‚   └── package.json             # Frontend dependencies
β”œβ”€β”€ visual-tree-search-backend/    # Backend API service
β”‚   β”œβ”€β”€ app/                     # Backend source code
β”‚   β”œβ”€β”€ requirements.txt         # Backend dependencies
β”‚   └── test/                    # Test files
β”œβ”€β”€ visual-tree-search-browser-service/  # Browser automation service
└── visual-tree-search-state-reset/     # State management service