OpenSourceProjects logo
skyvern logo

skyvernAutomate browser based workflows with AI

Automate browser based workflows with AI

21,553 stars
1,988 forks
Python
AGPL-3.0
skyvern screenshot

skyvern

Skyvern automates browser-based workflows using Vision LLMs and computer vision, eliminating the need for brittle XPath-based selectors. It uses AI agents to understand websites visually and execute complex automation tasks across any website without custom code.

Key Features

  • Vision-Based Interaction: Uses Vision LLMs to understand and interact with websites based on visual elements rather than DOM parsing or XPaths
  • No-Code Workflow Builder: Enables both technical and non-technical users to automate manual workflows without writing custom scripts
  • Layout-Resistant: Automatically adapts to website changes since it relies on visual understanding rather than predefined selectors
  • Multi-Agent System: Employs a swarm of agents to comprehend websites, plan actions, and execute workflows efficiently
  • Cloud & Self-Hosted Options: Available as managed Skyvern Cloud or runnable locally with Docker or pip installation

Use Cases

  • Data Entry Automation: Automatically fill forms and submit data across multiple websites without manual intervention
  • Web Scraping & Information Collection: Extract data from websites that require human-like interactions and navigation
  • Cross-Website Workflows: Apply the same automation workflow to multiple different websites without reconfiguration
  • Legacy System Integration: Automate interactions with older websites that lack modern APIs

Who Is It For

Skyvern is designed for developers, RPA professionals, and business users who need to automate repetitive browser-based tasks. It's ideal for organizations seeking a more resilient alternative to traditional automation tools that break with website updates.