Capabilities¶

Web Search¶

Web search capability using Brave Search API (privacy-focused, free tier available).

How it works: - Research Agent has access to web search tools - Search results include titles, snippets, and URLs - Results are cited for factuality

Web Browsing¶

Interactive web browsing using Playwright with an accessibility-tree-based page understanding.

Why accessibility trees? - More reliable than XPath/CSS selectors which break on site changes - Works on JavaScript-heavy sites via Playwright's accessibility snapshot API - Captures semantic structure (roles, names, values) for precise interaction - Click targets use role + name references rather than fragile selectors

Browser Agent capabilities: - Navigate to URLs and extract content - Click, type, scroll interactions - Form filling and multi-step workflows - Accessibility tree extraction (interactive, forms, or full modes) - Screenshot capture for vision fallback

Cron Job Scheduling¶

Recurring task scheduling with APScheduler running within the main process.

Job types: - Tool Jobs: Execute a specific tool with fixed parameters - Prompt Jobs: Send a prompt to the coordinator for complex reasoning

Features: - Cron expressions for flexible scheduling - Enable/disable jobs without deletion - Execution history tracking - Run immediately option

Future Task Scheduling¶

One-time task scheduling for future execution.

Use cases: - Reminders ("Remind me tomorrow at 3pm") - Scheduled actions ("Send this email Monday morning") - Deferred tasks ("Check my calendar tomorrow at 8am")

Monitoring UI¶

Web-based monitoring dashboard with:

System tab: - Health checks (database, LLM API, disk space) - Performance metrics - System logs viewer - Service connection status

Jobs tab: - Cron job management with enable/disable - Future task management - Execution history with status and duration

Unified Search¶

Single search interface across all integrated data sources.

Concept: - Keyword search using each service's native API - Semantic search using local embeddings (cosine similarity threshold: 0.3) - Results merged with Reciprocal Rank Fusion (k=60) - Automatic index updates (opportunistic + on-demand rebuild)

Tools: - unified_search - Search across all sources with hybrid keyword + semantic matching - reindex_search - Rebuild the semantic index from connected sources

Parameters: - query: Search query string (required) - sources: Filter by source (optional) — notion, gmail, outlook_email, outlook_files, onenote, nextcloud, memory - search_type: hybrid (default), keyword (API-only), or semantic (embedding-only) - limit: Maximum results per source (default: 10, max: 50)

Sources: - Notion — Pages and databases via Notion API - Gmail — Emails via Google Gmail API - Outlook Email — Emails via Microsoft Graph API - Outlook Files — OneDrive files via Microsoft Graph API (text extraction for .txt, .md, .docx, .pdf, etc.) - OneNote — Pages via Microsoft Graph API - Nextcloud — Files via WebDAV (text extraction for common formats) - Memory — Assistant's long-term facts indexed via system_index_memory_facts

Index Behavior: - Hybrid/semantic searches trigger automatic background reindex if index is empty - Keyword search results are opportunistically embedded in the background after each search - Semantic search requires LLM provider with embedding support

Agent Architecture - Multi-agent system design
Software Architecture - System design
Database Schema - Data models