We have hosted the application askui vision agent in order to run this application in our online workstations with Wine or directly.
Quick description about askui vision agent:
AskUI�s Vision Agent is an automation framework that allows you�and AI agents�to control real desktops, mobile devices, and HMI systems by perceiving the UI and performing actions like clicking, typing, scrolling, and drag-and-drop. It is designed for multi-platform compatibility and supports multiple AI models so you can tailor perception and decision-making to your workload. The repository presents a feature overview, sample media, and frequent release notes, which show ongoing improvements such as CORS checks and other operational tweaks. The broader AskUI documentation covers the Python Vision Agent along with suite services and inference APIs, indicating a productized ecosystem rather than a single library. Community-curated lists also recognize Vision Agent as part of the broader �GUI agents� landscape, placing it among other computer-use agents.Features:
- Multimodal UI perception for windows, widgets, and web pages
- Cross-platform automation for desktop, mobile, and HMI
- Action primitives for click, type, scroll, and drag-and-drop
- Pluggable model backends for perception and control
- Frequent releases with operational and security improvements
- Documentation spanning agent SDKs, suite services, and APIs
Programming Language: Python.
Categories:
©2024. Winfy. All Rights Reserved.
By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.