
screen-monitor

// Dual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL).

stars: 370
forks: 70
updated: February 19, 2026
SKILL.md Frontmatter
name: screen-monitor
description: Dual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL).

Screen Monitor

This skill gives the agent two ways to see and interact with your screen: a lightweight WebRTC share (Path A) and a browser relay with full tab control (Path B).

🟢 Path A: Fast Share (WebRTC)

Best for: Quick visual checks, restricted browsers, or non-technical environments.

Tools

  • screen_share_link: Generates a local WebRTC portal URL.
  • screen_analyze: Captures the current frame from the portal and analyzes it with vision.

Usage:

# Get the link
bash command:"{baseDir}/references/get-share-url.sh"

# Analyze
bash command:"{baseDir}/references/screen-analyze.sh"
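
The two calls above can be chained into one helper. What follows is a minimal sketch, not part of the skill itself: BASE_DIR stands in for {baseDir}, the function names run_helper and share_then_analyze are hypothetical, and it assumes that get-share-url.sh prints the portal URL to stdout.

```shell
#!/usr/bin/env bash
# Sketch only: chains the two Path A helper scripts.
# Assumption: BASE_DIR plays the role of the skill's {baseDir}.
BASE_DIR="${BASE_DIR:-.}"

# Run one script from references/, failing clearly if it is missing.
run_helper() {
  local script="$BASE_DIR/references/$1"
  if [ ! -x "$script" ]; then
    echo "missing or not executable: $script" >&2
    return 1
  fi
  "$script"
}

# Hypothetical wrapper: print the share link, wait, then analyze a frame.
share_then_analyze() {
  local url
  # Assumption: get-share-url.sh writes the portal URL to stdout.
  url="$(run_helper get-share-url.sh)" || return 1
  echo "Share link: $url"
  # Give the user time to open the link and start sharing.
  printf 'Press Enter once sharing is active: '
  read -r _ || true
  run_helper screen-analyze.sh
}
```

Calling share_then_analyze from the skill directory would print the link, pause until Enter is pressed, and then run the analysis script. The pause is a design choice of this sketch: analyzing before a frame has been shared would have nothing to capture.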

🔵 Path B: Full Control (Browser Relay)

Best for: Deep debugging, UI automation, and clicking/typing in tabs.

Setup

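  1. Install the extension: clawdbot browser extension install
  2. Get the install directory from clawdbot browser extension path, then load that directory in Chrome as an unpacked extension.
  3. Click the Clawdbot icon in the Chrome toolbar to attach the relay to the current tab.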
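
The command-line part of the setup can be scripted. The sketch below is illustrative only: the function name setup_relay_extension and the CLAWDBOT_BIN variable are hypothetical, and it assumes that the path subcommand prints the extension directory to stdout. Loading the unpacked extension and clicking the toolbar icon remain manual steps in Chrome.

```shell
#!/usr/bin/env bash
# Sketch only: runs the two CLI commands from the setup steps.
# Assumption: CLAWDBOT_BIN lets you point at a non-default binary.
CLAWDBOT_BIN="${CLAWDBOT_BIN:-clawdbot}"

setup_relay_extension() {
  # Stop early if the CLI is not available.
  if ! command -v "$CLAWDBOT_BIN" >/dev/null 2>&1; then
    echo "clawdbot CLI not found: $CLAWDBOT_BIN" >&2
    return 1
  fi

  # Step 1: install the extension files.
  "$CLAWDBOT_BIN" browser extension install || return 1

  # Step 2: look up where they were installed.
  # Assumption: the directory is printed to stdout.
  local ext_dir
  ext_dir="$("$CLAWDBOT_BIN" browser extension path)" || return 1

  echo "Load this directory as an unpacked extension in Chrome: $ext_dir"
}
```

Running setup_relay_extension leaves you with the directory to select in Chrome's extensions page (Developer mode, "Load unpacked").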

Tools

  • browser action:snapshot: Take a precise screenshot of the attached tab.
  • browser action:click: Interact with elements (requires profile="chrome").

Technical Details

  • Port: 18795 (WebRTC Backend)
  • Files:
    • web/screen-share.html: The sharing portal.
    • references/backend-endpoint.js: Frame storage server.
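
If screen_analyze returns nothing, a quick first check is whether anything is listening on the backend port. The sketch below is an assumption-laden example rather than part of the skill: it assumes the backend binds to 127.0.0.1, the function name backend_listening is hypothetical, and it relies on bash's /dev/tcp redirection, so it needs bash rather than a plain POSIX sh.

```shell
#!/usr/bin/env bash
# Sketch only: checks whether a TCP listener exists on the backend port.
BACKEND_PORT="${BACKEND_PORT:-18795}"

backend_listening() {
  # Assumption: the backend is reachable on the local loopback address.
  local host="${1:-127.0.0.1}"
  local port="${2:-$BACKEND_PORT}"
  # Opening /dev/tcp/HOST/PORT makes bash attempt a TCP connection.
  # The subshell closes the descriptor again when it exits.
  (exec 3<>"/dev/tcp/$host/$port") 2>/dev/null
}

if backend_listening; then
  echo "backend reachable on port $BACKEND_PORT"
else
  echo "nothing listening on port $BACKEND_PORT"
fi
```

A successful connection only shows that some process holds the port; it does not confirm that the process is the frame storage server.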