Windows desktop automation MCP server for LLM agents using entity-based actions instead of coordinate clicking. Provides comprehensive control via UIA, CDP, keyboard, mouse, clipboard, and terminal with verified delivery and causal context.
Harusame64/desktop-touch-mcp is a Model Context Protocol (MCP) server that enables LLM agents to automate Windows desktop interactions. Unlike traditional coordinate-based clicking, it uses entity-based actions to identify and interact with UI elements, making interactions more reliable and context-aware. The tool integrates multiple automation methods including UI Automation (UIA), Chrome DevTools Protocol (CDP), screenshot analysis, keyboard/mouse/clipboard control, and terminal command execution.
This is an MCP server designed for integration with compatible AI platforms. Installation typically involves: 1) Clone or download the repository from GitHub, 2) Configure the MCP server settings in your AI platform (Claude, ChatGPT with MCP support, or AI Studio), 3) Establish connection between the MCP server and your LLM client, 4) Grant necessary Windows permissions for desktop automation and terminal access. Detailed setup instructions are available in the project's README documentation.
Monday.com MCP Server streamlines board management, item operations, and workflow automation for teams. I…
by NotionFlow
Sentry MCP Server provides comprehensive error tracking and performance monitoring, helping developers id…
by AnalyticsPro
Cloudflare MCP Server simplifies Cloudflare management by providing tools for DNS management, Workers dep…
by PricingBot