LangGraph Autonomous Task Agent (Sidekick)

Stateful task agent that combines browsing, tool use, memory, and evaluation loops for multi-step execution.

Autonomous task execution agent built with LangGraph that performs web browsing, information extraction, and iterative reasoning using tool-based agents with persistent memory and evaluation loops.

Key Outcomes

Stateful LangGraph orchestration

Playwright tool use for real-world interaction

Persistent memory across multi-step tasks

View code Live demo

Context

Problem and Context

Static-answer assistants break down on tasks that require browsing, extraction, intermediate memory, and stopping criteria.

The goal here was to build a more execution-oriented assistant that can make progress through a sequence of tool-backed steps.

Approach

Approach and Architecture

The architecture uses a LangGraph StateGraph so agent state, tool usage, and evaluation decisions remain explicit rather than hidden in prompt history.

This makes retry behavior, memory updates, and completion checks much easier to reason about.

Diagrams

System Diagrams

Static diagrams included with the project to show architecture, workflow, and data movement at a glance.

LangGraph agent architecture diagram — Core agent architecture for orchestration, tooling, and memory.

Implementation

Implementation Details

Playwright handles browser automation, while SQLite-backed checkpoints give the agent stable memory between iterations.

A tool layer wraps browsing, retrieval, and notifications so the orchestrator can route the right capability at the right step.

Worker -> ToolNode -> Evaluator style execution loop

Persistent memory and retrieval across steps

Tool-driven completion rather than answer-only generation

Results

Results and Tradeoffs

The project demonstrates a practical pattern for stateful assistants that need to act on the web instead of only reasoning over text.

Persistent task memory stored in SQLite

RAG-enabled context retrieval

Iterative evaluation for completion quality

Lessons

Lessons and Next Steps

Task agents benefit from explicit graph structure because debugging hidden control flow inside prompts quickly becomes unmanageable.

Explore More

Related Projects

Browse adjacent work from the same archive group or jump back to the project archive.

Back to archive

Agentic Systems

BHSI Risk Classification System

Production-facing underwriting workflow that compresses manual D&O research into a structured AI-assisted risk review.

Open case study

Agentic Systems

Deep Research Workflow

Multi-agent research system that routes work by query complexity and adds evaluator loops to improve quality while reducing LLM cost.

Open case study

Agentic Systems

Autonomous Trading System

Concurrent trading-floor simulation where specialist agents research markets, generate signals, and stress-test portfolio decisions.

Open case study