Virtual Tool Calling (VTC) Architecture

This document describes the Virtual Tool Calling (VTC) subsystem, which provides transparent support for Cline-like clients that embed tool calls as XML within message content rather than using native structured tool calls.

Overview

Virtual Tool Calling (VTC) is a mode used by popular coding agents like Cline, KiloCode, and RooCode. These clients embed tool invocations as XML within the regular message content:

<function_calls>
<invoke name="execute_command">
<parameter name="command">ls -la</parameter>
</invoke>
</function_calls>

The VTC subsystem enables the proxy to:

Detect VTC clients based on User-Agent patterns
Extract XML tool calls from message content into structured format
Process tool calls uniformly through the core pipeline (reactors, filters, loop detection)
Serialize back to XML before sending responses to VTC clients

This architecture ensures that advanced proxy features work consistently regardless of whether clients use native or virtual tool calling.

Architecture Diagram

graph TD
    subgraph "Backend Response"
        BR[Raw Response with XML Tool Calls]
    end

    subgraph "VTC Processing Pipeline"
        direction TB
        
        PRE[VTC Pre-Processor]
        PRE_DESC["- Buffers streaming content<br/>- Detects complete XML patterns<br/>- Parses XML to internal format<br/>- Strips XML from content"]
        
        CORE[Core Pipeline]
        CORE_DESC["- Loop Detection<br/>- Tool Call Reactors<br/>- Content Filters<br/>- Steering Messages"]
        
        POST[VTC Post-Processor]
        POST_DESC["- Converts tool_calls to XML<br/>- Appends XML to content<br/>- Clears tool_calls metadata"]
        
        PRE --> CORE
        CORE --> POST
    end

    subgraph "Client Response"
        CR[Response with XML Tool Calls]
    end

    BR --> PRE
    POST --> CR

    style PRE fill:#e1f5fe
    style CORE fill:#fff3e0
    style POST fill:#e8f5e9

Key Design Principles

1. Session-Aware Processing

VTC processing is only enabled for sessions flagged as vtc_enabled=True. This flag is set based on User-Agent detection during session initialization.

# Detection happens in RequestProcessorService
if detect_vtc_client(agent, vtc_client_patterns):
    session.state = session.state.with_vtc_enabled(True)

2. Transparent for Non-VTC Sessions

When vtc_enabled=False, both VTC processors pass content through unchanged:

async def process(self, content: StreamingContent) -> StreamingContent:
    vtc_enabled = content.metadata.get("vtc_enabled", False)
    if not vtc_enabled:
        return content  # Pass through unchanged
    # ... VTC processing

3. Unified Internal Format

Internally, all tool calls use the OpenAI-compatible format:

{
    "id": "vtc_abc123def456",
    "type": "function",
    "function": {
        "name": "execute_command",
        "arguments": "{\"command\": \"ls -la\"}"
    }
}

This allows the core pipeline to process tool calls uniformly regardless of their origin.

4. Streaming-Safe Buffering

The pre-processor buffers streaming content until complete XML patterns are detected, preventing partial XML from being emitted prematurely.

Components

VTC Detection (`src/core/services/vtc_detection.py`)

Detects VTC clients based on User-Agent string matching:

def detect_vtc_client(agent: str | None, patterns: list[str]) -> bool:
    """Detect if agent matches any VTC client pattern (case-insensitive)."""

Configuration (app_config.py):

vtc_client_patterns: list[str] = Field(
    default_factory=lambda: ["cline", "kilo", "roo"]
)

VTC XML Parser (`src/core/services/vtc_xml_parser.py`)

Provides utilities for parsing and serializing XML tool calls:

Function	Description
`parse_vtc_xml(content, allowed_tools)`	Extract tool calls from XML content
`serialize_tool_calls_to_xml(tool_calls)`	Convert tool calls to XML format
`has_partial_xml_pattern(text)`	Check for incomplete XML patterns
`detect_complete_tool_call(text)`	Check for complete tool call patterns

Supported XML Formats:

Cline format (with wrapper):

<function_calls>
<invoke name="tool_name">
<parameter name="param1">value1</parameter>
</invoke>
</function_calls>

Bare invoke format:

<invoke name="tool_name">
<parameter name="param1">value1</parameter>
</invoke>

Namespaced format (namespace prefix is stripped):

<invoke name="antml:tool:read_file">
<parameter name="path">/tmp/file.txt</parameter>
</invoke>

VTC Pre-Processor (`src/core/services/streaming/vtc_preprocessor.py`)

Converts XML tool calls to internal format at the start of the streaming pipeline.

Responsibilities:

Buffer streaming content until complete XML patterns are detected
Parse XML using parse_vtc_xml()
Add extracted tool calls to metadata["tool_calls"]
Strip XML from content
Handle buffer overflow (configurable max size)

Configuration:

@dataclass
class VTCPreProcessorConfig:
    max_buffer_bytes: int = 64 * 1024  # 64KB max buffer
    min_buffer_check: int = 10         # Minimum bytes before pattern check

VTC Post-Processor (`src/core/services/streaming/vtc_postprocessor.py`)

Converts internal tool calls back to XML format at the end of the streaming pipeline.

Responsibilities:

Check for tool_calls in metadata
Serialize to XML using serialize_tool_calls_to_xml()
Append XML to content
Remove tool_calls from metadata (prevents duplicate delivery)

Configuration:

@dataclass
class VTCPostProcessorConfig:
    prepend_newlines: bool = True  # Add newlines before XML
    newline_count: int = 2         # Number of newlines

VTC Buffer State (`src/core/services/streaming/stream_context_registry.py`)

Per-stream state for VTC processing:

@dataclass
class VTCBufferState:
    pending_text: str = ""                              # Buffered content
    extracted_tool_calls: list[dict[str, Any]] = field(default_factory=list)
    allowed_tools: list[str] | None = None              # Tool whitelist
    vtc_enabled: bool = False
    last_accessed: float = field(default_factory=time.time)

Session State (`src/core/domain/session.py`)

VTC flag in session state:

@dataclass
class SessionState:
    vtc_enabled: bool = False
    # ... other fields

    def with_vtc_enabled(self, enabled: bool) -> SessionState:
        """Create new state with vtc_enabled flag updated."""

Pipeline Integration

VTC processors are integrated into the streaming pipeline in streaming_integration.py:

async def integrate_streaming_pipeline(
    raw_stream: AsyncIterator[Any],
    provider: str,
    vtc_enabled: bool = False,  # VTC flag from session
    # ... other parameters
) -> StreamingResponseEnvelope:
    
    processors: list[IStreamProcessor] = []
    
    # VTC Pre-processor: FIRST in pipeline
    if vtc_enabled:
        processors.append(VTCPreProcessor(registry=registry))
    
    # Core processors (loop detection, think tags, etc.)
    if enable_loop_detection:
        processors.append(LoopDetectionProcessor())
    # ... other processors
    
    # VTC Post-processor: LAST in pipeline
    if vtc_enabled:
        processors.append(VTCPostProcessor(registry=registry))

Data Flow Example

Input (VTC Client to Proxy)

I will check the files now.

<function_calls>
<invoke name="list_files">
<parameter name="path">/project</parameter>
</invoke>
</function_calls>

After VTC Pre-Processor

Content: "I will check the files now."

Metadata:

{
    "vtc_enabled": True,
    "tool_calls": [
        {
            "id": "vtc_abc123def456",
            "type": "function",
            "function": {
                "name": "list_files",
                "arguments": "{\"path\": \"/project\"}"
            }
        }
    ]
}

After Core Pipeline (unchanged)

Same as above (core processors work with normalized format)

After VTC Post-Processor

Content:

I will check the files now.

<function_calls>
<invoke name="list_files">
<parameter name="path">/project</parameter>
</invoke>
</function_calls>

Metadata: {"vtc_enabled": True} (tool_calls removed)

Configuration

Application Configuration

In app_config.yaml or environment:

vtc_client_patterns:
  - cline
  - kilo
  - roo
  - mycustomclient  # Add custom patterns

Disabling VTC

To disable VTC detection entirely:

vtc_client_patterns: []

Testing

Unit Tests

Test File	Coverage
`tests/unit/core/services/test_vtc_xml_parser.py`	XML parsing and serialization
`tests/unit/core/services/test_vtc_detection.py`	Client detection logic
`tests/unit/core/services/streaming/test_vtc_preprocessor.py`	Pre-processor behavior
`tests/unit/core/services/streaming/test_vtc_postprocessor.py`	Post-processor behavior

Integration Tests

Test File	Coverage
`tests/integration/test_vtc_roundtrip.py`	End-to-end VTC processing

Running VTC Tests

# Run all VTC tests
./.venv/Scripts/python.exe -m pytest tests/unit/core/services/test_vtc_*.py tests/unit/core/services/streaming/test_vtc_*.py tests/integration/test_vtc_*.py -v

# Run with coverage
./.venv/Scripts/python.exe -m pytest tests/unit/core/services/test_vtc_*.py --cov=src/core/services/vtc --cov-report=term-missing

Troubleshooting

VTC Not Detected

Symptoms: XML tool calls pass through unchanged

Checks:

Verify User-Agent header contains a matching pattern
Check vtc_client_patterns configuration
Enable debug logging to see detection results

logger.debug("VTC client detected: agent=%r matches pattern=%r", agent, pattern)

Partial XML Being Emitted

Symptoms: Incomplete XML tags appear in client output

Checks:

Verify buffer size is sufficient (max_buffer_bytes)
Check for malformed XML in backend response
Inspect VTC buffer state for the stream

Tool Calls Not Extracted

Symptoms: XML remains in content, no tool_calls in metadata

Checks:

Verify XML format matches supported patterns
Check allowed_tools whitelist if set
Ensure vtc_enabled=True in metadata

Source Files

File	Purpose
`src/core/services/vtc_detection.py`	VTC client detection
`src/core/services/vtc_xml_parser.py`	XML parsing/serialization
`src/core/services/streaming/vtc_preprocessor.py`	VTC pre-processor
`src/core/services/streaming/vtc_postprocessor.py`	VTC post-processor
`src/core/services/streaming/stream_context_registry.py`	VTC buffer state
`src/core/domain/session.py`	Session state with VTC flag
`src/core/ports/streaming_integration.py`	Pipeline integration
`src/core/config/app_config.py`	VTC configuration

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Virtual Tool Calling (VTC) Architecture

Overview

Architecture Diagram

Key Design Principles

1. Session-Aware Processing

2. Transparent for Non-VTC Sessions

3. Unified Internal Format

4. Streaming-Safe Buffering

Components

VTC Detection (`src/core/services/vtc_detection.py`)

VTC XML Parser (`src/core/services/vtc_xml_parser.py`)

VTC Pre-Processor (`src/core/services/streaming/vtc_preprocessor.py`)

VTC Post-Processor (`src/core/services/streaming/vtc_postprocessor.py`)

VTC Buffer State (`src/core/services/streaming/stream_context_registry.py`)

Session State (`src/core/domain/session.py`)

Pipeline Integration

Data Flow Example

Input (VTC Client to Proxy)

After VTC Pre-Processor

After Core Pipeline (unchanged)

After VTC Post-Processor

Configuration

Application Configuration

Disabling VTC

Testing

Unit Tests

Integration Tests

Running VTC Tests

Troubleshooting

VTC Not Detected

Partial XML Being Emitted

Tool Calls Not Extracted

Related Documentation

Source Files

FilesExpand file tree

vtc-architecture.md

Latest commit

History

vtc-architecture.md

File metadata and controls

Virtual Tool Calling (VTC) Architecture

Overview

Architecture Diagram

Key Design Principles

1. Session-Aware Processing

2. Transparent for Non-VTC Sessions

3. Unified Internal Format

4. Streaming-Safe Buffering

Components

VTC Detection (src/core/services/vtc_detection.py)

VTC XML Parser (src/core/services/vtc_xml_parser.py)

VTC Pre-Processor (src/core/services/streaming/vtc_preprocessor.py)

VTC Post-Processor (src/core/services/streaming/vtc_postprocessor.py)

VTC Buffer State (src/core/services/streaming/stream_context_registry.py)

Session State (src/core/domain/session.py)

Pipeline Integration

Data Flow Example

Input (VTC Client to Proxy)

After VTC Pre-Processor

After Core Pipeline (unchanged)

After VTC Post-Processor

Configuration

Application Configuration

Disabling VTC

Testing

Unit Tests

Integration Tests

Running VTC Tests

Troubleshooting

VTC Not Detected

Partial XML Being Emitted

Tool Calls Not Extracted

Related Documentation

Source Files

VTC Detection (`src/core/services/vtc_detection.py`)

VTC XML Parser (`src/core/services/vtc_xml_parser.py`)

VTC Pre-Processor (`src/core/services/streaming/vtc_preprocessor.py`)

VTC Post-Processor (`src/core/services/streaming/vtc_postprocessor.py`)

VTC Buffer State (`src/core/services/streaming/stream_context_registry.py`)

Session State (`src/core/domain/session.py`)