feat: 14 vendor-derived agent optimizations by Mingye-Lu · Pull Request #39 · Mingye-Lu/AgenticCrawler

Mingye-Lu · 2026-06-11T04:10:54Z

Summary

Implements 14 optimizations derived from analysis of 10 competitor products (browser-use, Stagehand, crawl4ai, Skyvern, Spider, nanobrowser, ZeroClaw, Gottem). All features are disabled by default and opt-in via settings.optimization.*.

Optimizations

Token Savings

HTML Diff Mode (html_diff_mode) — On repeated visits to the same URL, returns only changed content sections with [unchanged: N sections] markers. 50–70% token reduction on multi-turn sessions.
Content-Aware Cleaning Profiles (content_aware_profiles) — Auto-selects ReadingMode / Minimal / Aggressive cleaning profile based on task keyword and content size.

Loop Prevention

Action Loop Detection (loop_detection) — Rolling-window action hash with escalating nudges (soft at 5, medium at 8, strong at 12 repeats) + page stagnation detection.
Page Fingerprinting (page_fingerprinting) — Lightweight FNV-1a fingerprint used by loop detection and action caching.
Planning Interval (planning_interval) — Every N steps injects a planning-checkpoint or execution-mode guidance into the dynamic prompt.

Reliability

Failure Classification (ailure_classification) — 16-category keyword-based error taxonomy (SelectorNotFound, CaptchaDetected, RateLimited, …). Zero LLM cost.
Self-Healing Selectors (self_healing) — On SelectorNotFound/SelectorAmbiguous, text-matches a replacement element ref from a fresh page_map. Logs [healed: @Eold → @Enew]. Zero LLM calls.

Performance

Action Caching (�ction_caching) — In-memory cache for read-only tools keyed by tool + input + page fingerprint. TTL-based expiry, invalidated on page change.
Confidence Tracking (confidence_tracking) — Parses [confidence: HIGH/MEDIUM/LOW] from assistant responses; 2 consecutive LOWs trigger a stagnation alert.
Compound Component Enrichment (compound_enrichment) — Adds enrichment metadata to complex form controls (date/range/select/file/textarea). Max 200 bytes/element.

Budget

Budget Enforcement (�udget_max_session_cost_usd + �udget_enforcement) — Warn or block when session cost limit is reached.
Per-Agent Cost Attribution (per_agent_cost_tracking) — /cost shows a per-child-agent cost breakdown.

Foundation

Dynamic System Prompt Infrastructure — DynamicPromptContext struct with four optional nudge fields injected as section 9 of the system prompt via a shared Arc<Mutex<>> slot.
Optimization Settings Schema — 18 fields in nested settings.optimization block, all Option, all default OFF.

Verification

\cargo test --workspace\ — 1,097 tests pass, 0 failures
\cargo clippy --workspace --all-targets -- -D warnings\ — clean
\cargo fmt --check\ — clean
\cargo build --release\ — clean

Commits

Commit	Description
\8370d6e\	feat(agent): dynamic system prompt infrastructure + settings schema
\d5936f\	feat(agent): HTML diff, loop detection, page fingerprinting, planning interval
\2689427\	feat(agent): failure classification, self-healing, action caching, confidence tracking
\8f94f78\	feat(browser,runtime): component enrichment, cleaning profiles, budget enforcement, cost attribution
\699f7f\	chore: CHANGELOG update
\7e4b9a5\	fix: clippy --all-targets warnings
\231b2f9\	docs: optimization settings + performance features documentation

…n settings schema - Add DynamicPromptContext struct with stagnation_alert, planning_guidance, budget_warning, loop_nudge fields (all Option<String>) - Add build_dynamic_section() that renders non-None fields into section 9 - Extend build_system_prompt() to accept Option<&DynamicPromptContext>; None preserves byte-identical output to current behavior - Add Arc<Mutex<Option<Vec<String>>>> prompt_override slot to ConversationRuntime; prepare_iteration() checks and applies the override before each API call - Add Arc<Mutex<Option<String>>> last_assistant_text slot to ConversationRuntime; stream_assistant_message() writes the latest response text for agent to read - CrawlerAgent holds both slots and wires them to ConversationRuntime at construction - Add OptimizationSettings struct with 18 fields (all Option<T>, all default false/0) - Add optimization: Option<OptimizationSettings> to Settings struct with serde(default) - Add 18 settings_get_* getter functions all defaulting to safe off-by-default values - All existing tests pass; new tests verify backward compat and dynamic section behavior

… planning interval - HtmlDiffTracker: sections-based diff on repeated URL visits; returns only changed sections with [unchanged: N sections] markers; 50%+ token savings on revisits - LoopDetector: rolling window action hash with escalating nudges at 5/8/12 repeats; stagnation detection after 5 consecutive identical page fingerprints; no hard blocks - PageFingerprint: lightweight FNV-1a hash of url+element_count+first-1000-chars text; deterministic, no timestamps, used by LoopDetector and ActionCache (Task 7) - Planning interval: every N steps inject planning-checkpoint or execution-mode guidance into DynamicPromptContext via prompt_override_slot; disabled by default (interval=0) - All features gated by settings.optimization.* flags (all default OFF) - CrawlState gains html_diff_tracker, loop_detector, page_fingerprints fields (ephemeral)

@Eold

…on caching, confidence tracking - FailureClassifier: 16-category keyword-based error taxonomy (zero LLM cost); classify() maps error messages to SelectorNotFound/CaptchaDetected/RateLimited etc; retry_strategy() returns RetryWithHealing/RetryWithDelay/NoRetry per category - SelfHealingSelectors: on SelectorNotFound/SelectorAmbiguous, fetches fresh page_map and text-match heals to correct element ref; logs [healed: @Eold -> @Enew]; max retries from settings (default 2); zero LLM calls - ActionCache: FNV-1a keyed in-memory cache for read-only tools (page_map, read_content, list_resources, execute_js); invalidated on page fingerprint change; TTL 30s default; interaction tools never cached; in-memory only (no cross-session persistence) - ConfidenceTracking: parses [confidence: HIGH/MEDIUM/LOW] from assistant text; 2+ consecutive LOWs triggers stagnation_alert via DynamicPromptContext; advisory only - All features gated by settings.optimization.* flags (all default OFF) - CrawlState gains action_cache field; CrawlerAgent gains confidence_tracker field

…udget enforcement, cost attribution - CompoundComponentEnrichment: bridge_script.rs getEnrichment() adds optional enrichment JSON to form controls (date/range/number/color/file/select/textarea); max 200 bytes/elem, 20 select options + overflow; guarded by compound_enrichment flag - ContentAwareCleaningProfiles: CleaningProfile enum (Default/Minimal/Aggressive/ReadingMode) with per-profile thresholds and tag-weight multipliers; select_profile() auto-selects by task keyword and content size; prune_html() unchanged, new prune_html_with_profile() - BudgetEnforcement: BudgetEnforcer (Warn/Block/RouteDown modes), SharedCostCounter Arc<AtomicU64> millicents slot shared runtime-to-agent; Warn injects budget_warning into DynamicPromptContext; Block terminates loop via ToolError; RouteDown acts as Block - PerAgentCostAttribution: AgentCostReport + build_cost_breakdown() walks flat child_sessions and reconstructs per-child cost; /cost command shows per-agent breakdown when per_agent_cost_tracking flag is ON - Fix: mcp-server execute_async call updated to pass CrawlState parameter

…ype_complexity)

- README: add optimization settings table to settings.json section - README: add Performance Optimizations section with feature table + example config - AGENTS.md: add Optimization layer section describing shared infrastructure, per-module table, and execution order in CrawlerAgent::execute() - AGENTS.md: update test count from ~770 to ~1,100

Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

…child attribution Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

…om action cache Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

…ntext additive Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

…ion in tool handlers Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

…gning-clones lint Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

…diff_mode Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

Mingye-Lu and others added 15 commits June 10, 2026 16:59

chore: update CHANGELOG.md for optimization features

f699f7f

fix: resolve clippy --all-targets warnings (raw strings, float_cmp, t…

7e4b9a5

…ype_complexity)

fix(runtime): thread session model through struct and re-export

ec90e16

Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

fix(runtime): use model-specific pricing for cumulative cost and per-…

8e35b8a

…child attribution Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

fix(agent): wire failure_classification flag and remove execute_js fr…

169b34b

…om action cache Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

fix(agent): correct execute() pipeline order and make DynamicPromptCo…

de85a99

…ntext additive Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

fix(agent): wire page fingerprinting and content-aware profile select…

cd1b101

…ion in tool handlers Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

fix(ui,tui,mcp): propagate session model to child agents and fix assi…

c28a9dc

…gning-clones lint Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

fix(mcp): persist CrawlState across direct-tool calls to enable html_…

cfebe8d

…diff_mode Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

style: cargo fmt mcp-server execute_browser_tool call site

2151b91

Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

Mingye-Lu merged commit f02e8aa into main Jun 11, 2026
4 checks passed

Mingye-Lu deleted the feat/vendor-optimizations branch June 11, 2026 13:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: 14 vendor-derived agent optimizations#39

feat: 14 vendor-derived agent optimizations#39
Mingye-Lu merged 15 commits into
mainfrom
feat/vendor-optimizations

Mingye-Lu commented Jun 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant