Desktop + browser GUI agent MCP server. VLM-powered automation: screenshot → vision model → mouse/keyboard/Playwright actions. 9 MCP tools. Fits RTX 4090.
python typescript webapp desktop-automation browser-automation vlm playwright vision-language-model computer-use mcp-server gui-agent fastmcp ui-tars uitars
-
Updated
Jul 1, 2026 - Python