SDET-GENIE

Revolutionizing Quality Assurance with AI-powered solutions

Stars: 51

Visit

SDET-GENIE is a cutting-edge, AI-powered Quality Assurance (QA) automation framework that revolutionizes the software testing process. Leveraging a suite of specialized AI agents, SDET-GENIE transforms rough user stories into comprehensive, executable test automation code through a seamless end-to-end process. The framework integrates five powerful AI agents working in sequence: User Story Enhancement Agent, Manual Test Case Agent, Gherkin Scenario Agent, Browser Agent, and Code Generation Agent. It supports multiple testing frameworks and provides advanced browser automation capabilities with AI features.

README:

SDET-GENIE: AI-AGENTS in QA Automation --> @AGNO ^ @ Browser-use

🚀 Project Overview

The framework integrates five powerful AI agents working in sequence:

User Story Enhancement Agent - Transforms rough ideas into detailed JIRA-style user stories
Manual Test Case Agent - Converts enhanced user stories into comprehensive test cases
Gherkin Scenario Agent - Transforms test cases into structured Gherkin feature files
Browser Agent - Executes Gherkin scenarios in real browsers and captures interaction data
Code Generation Agent - Produces ready-to-run automation code in multiple frameworks

🌟 Key Features

1. User Story Enhancement Agent

Transforms rough, incomplete user stories into detailed, valuable JIRA-style user stories
Ensures proper WHO, WHAT, and WHY structure
Adds comprehensive acceptance criteria and implementation notes
Creates appropriately sized stories that can be completed in a single sprint

2. Manual Test Case Agent

Converts user stories and acceptance criteria into comprehensive manual test cases
Generates positive, negative, edge, and boundary test scenarios
Creates detailed test steps with expected results
Produces industry-standard test documentation

3. Gherkin Scenario Agent

Transforms manual test cases into well-structured Gherkin feature files
Creates human-readable feature files with proper Given/When/Then syntax
Supports scenario outlines for data-driven testing
Adds appropriate tags for test organization and filtering

4. Intelligent Browser Agent

Automated browser interaction and test execution
Dynamic element identification and mapping
Comprehensive DOM detail capture
Robust element selector generation
Enhanced with Browser-Use Features:
- 🎥 GIF Generation - Automatic creation of animated GIFs showing test execution
- 📸 Video Recording - WebM recordings of entire browser sessions
- 🌐 Network Tracing - HAR files capturing all HTTP activity
- 👁️ AI Vision Integration - Computer vision for improved element identification
- ✨ Element Highlighting - Visual highlighting of interactive elements
- 📜 Comprehensive Agent History - Complete record of actions, decisions, and outcomes
- 🔬 Advanced Debugging - Detailed execution traces for troubleshooting

5. Code Generation Agent

Produces production-ready automation code from Gherkin scenarios
Supports multiple testing frameworks (Selenium, Playwright, Cypress, etc.)
Generates clean, well-structured, and maintainable code
Includes all necessary imports, dependencies, and helper functions

🔧 Technology Stack

Python
AI Models (Google Gemini 2.0 Flash)
Selenium/Playwright
Gherkin/Cucumber
Browser Automation Technologies
Browser-Use Library - Advanced browser automation with AI capabilities

📦 Installation - Quick start

install playwright:

playwright install

git clone https://github.com/WaiGenie/SDET-GENIE.git

cd SDET-GENIE

python -m venv .venv

.venv\Scripts\activate

pip install-requirements.txt

Create .env file
Place your GOOGLE_API_KEY=AIzaXXXXXXXXXXXXXX

streamlit run app.py

🖥️ Quick Start

Prepare your user story
Run the AI agents
Generate and execute automated tests

🤝 Contributing

We're excited to welcome contributors to SDET-GENIE! Whether you're fixing bugs, improving documentation, or adding new features, your contributions are highly valued.

💡 Why Contribute?

Gain experience with cutting-edge AI and test automation technologies
Join a growing community of QA automation enthusiasts
Help shape the future of AI-powered testing
Get your name featured in our contributors list
Learn best practices in test automation

🚀 Getting Started with Contributions

2. Create a new branch (`git checkout -b feature/your-feature-name`)
3. Make your changes
4. Run tests
5. Commit your changes
6. Push to your fork
7. Open a Pull Request

🔍 Areas for Contribution

Bug fixes
Documentation improvements
New test automation framework support
Performance optimizations
Cloud browser provider integrations
UI/UX improvements
Test coverage enhancements

📄 License

This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0)

License Highlights

✔️ Personal and educational use allowed

✔️ Code modification permitted

✔️ Copyright and license notices must be preserved

✔️ Source code must be disclosed when distributing

✔️ Changes must be released under the same license

❌ No commercial use without explicit permission

❌ No warranty provided

For full license details, see the LICENSE file or visit GNU AGPL-3.0

🤔 Questions or Need Help?

Open a GitHub Discussion
Check existing issues
Join our community Discord - https://discord.gg/QqF68r39

📚 Blog Post

Read our in-depth article: From User Stories to Automated Tests: The Future of QA Automation using AI Agents

Demo - https://youtu.be/z0fSNoUZTzw?si=xrfbDsGWlnTJzcYK

For detailed information about all browser-use features implemented in SDET-GENIE, see BROWSER_USE_FEATURES.md

🌈 Acknowledgments

Inspired by the challenges in modern software quality assurance
Powered by cutting-edge AI technologies
Enhanced with the browser-use library for advanced browser automation capabilities

How it works:

1. Entrypoint: User provides a rough user story about what to test in the website.

2. User Story Enhancement:
   - The User Story Enhancement Agent transforms the rough user story into a detailed, JIRA-style user story
   - Adds proper structure (WHO, WHAT, WHY), acceptance criteria, and implementation notes

3. Manual Test Case Generation:
   - The Manual Test Case Agent converts the enhanced user story into comprehensive test cases
   - Generates positive, negative, edge, and boundary test scenarios with detailed steps

4. Gherkin Scenario Generation:
   - The Gherkin Agent transforms manual test cases into well-structured Gherkin scenarios
   - Creates feature files with proper Given/When/Then syntax and scenario outlines

5. Browser Automation:
   - The Browser Agent executes each Gherkin scenario in a real browser
   - Custom actions registered:
     -> "Get XPath of element using index"
     -> "Get element property"
     -> "Perform element action"
   - Executes and collects results:
     history = await browser_agent.run()

6. Data Collection from Browser:
   - Collects XPaths, actions, and extracted content from browser interactions
   - Saves combined history to session state:
     st.session_state.history = {
         "urls": history.urls(),
         "action_names": history.action_names(),
         "detailed_actions": all_actions,
         "element_xpaths": element_xpath_map,
         "extracted_content": all_extracted_content,
         "errors": history.errors(),
         "model_actions": history.model_actions(),
         "execution_date": st.session_state.get("execution_date", "Unknown")
     }

7. Test Automation Code Generation:
   - The Code Generation Agent produces ready-to-execute test automation code
   - Uses Gherkin scenarios and browser interaction data to generate code
   - Supports multiple frameworks (Selenium, Playwright, Cypress, etc.)
   - automation_code = generator_function(
         generated_steps,  # Generated Gherkin scenarios
         history           # Browser interaction data
     )

Made with ❤️ by the WaiGenie Team

For Tasks:

Click tags to check more tools for each tasks

enhance user stories generate test cases create gherkin scenarios execute tests in browsers generate automation code

For Jobs:

quality assurance engineer automation test engineer software developer in test qa automation analyst test automation architect

Alternative AI tools for SDET-GENIE

Similar Open Source Tools

SDET-GENIE

github

: 51

samples

Strands Agents Samples is a repository showcasing easy-to-use examples for building AI agents using a model-driven approach. The examples provided are for demonstration and educational purposes only, not intended for direct production use. Users can explore various samples to understand concepts and techniques, ensuring proper security and testing procedures before implementation.

github

: 410

arbigent

Arbigent (Arbiter-Agent) is an AI agent testing framework designed to make AI agent testing practical for modern applications. It addresses challenges faced by traditional UI testing frameworks and AI agents by breaking down complex tasks into smaller, dependent scenarios. The framework is customizable for various AI providers, operating systems, and form factors, empowering users with extensive customization capabilities. Arbigent offers an intuitive UI for scenario creation and a powerful code interface for seamless test execution. It supports multiple form factors, optimizes UI for AI interaction, and is cost-effective by utilizing models like GPT-4o mini. With a flexible code interface and open-source nature, Arbigent aims to revolutionize AI agent testing in modern applications.

github

: 221

autobe

AutoBE is an AI-powered no-code agent that builds backend applications, enhanced by compiler feedback. It automatically generates backend applications using TypeScript, NestJS, and Prisma following a waterfall development model. The generated code is validated by review agents and OpenAPI/TypeScript/Prisma compilers, ensuring 100% working code. The tool aims to enable anyone to build backend servers, AI chatbots, and frontend applications without coding knowledge by conversing with AI.

github

: 642

repromodel

ReproModel is an open-source toolbox designed to boost AI research efficiency by enabling researchers to reproduce, compare, train, and test AI models faster. It provides standardized models, dataloaders, and processing procedures, allowing researchers to focus on new datasets and model development. With a no-code solution, users can access benchmark and SOTA models and datasets, utilize training visualizations, extract code for publication, and leverage an LLM-powered automated methodology description writer. The toolbox helps researchers modularize development, compare pipeline performance reproducibly, and reduce time for model development, computation, and writing. Future versions aim to facilitate building upon state-of-the-art research by loading previously published study IDs with verified code, experiments, and results stored in the system.

github

: 151

software-dev-prompt-library

A collection of AI-powered prompts designed to streamline software development workflows. The library contains prompts at various stages of development, with structured sequences of connected prompts, project initialization support, development assistance, and documentation generation. It aims to provide consistent guidance across different development phases, promote systematic development processes, and enable progress tracking and validation.

github

: 80

vts

VTS (Vector Transport Service) is an open-source tool developed by Zilliz based on Apache Seatunnel for moving vectors and unstructured data. It addresses data migration needs, supports real-time data streaming and offline import, simplifies unstructured data transformation, and ensures end-to-end data quality. Core capabilities include rich connectors, stream and batch processing, distributed snapshot support, high performance, and real-time monitoring. Future developments include incremental synchronization, advanced data transformation, and enhanced monitoring. VTS supports various connectors for data migration and offers advanced features like Transformers, cluster mode deployment, RESTful API, Docker deployment, and more.

github

: 71

refly

Refly.AI is an open-source AI-native creation engine that empowers users to transform ideas into production-ready content. It features a free-form canvas interface with multi-threaded conversations, knowledge base integration, contextual memory, intelligent search, WYSIWYG AI editor, and more. Users can leverage AI-powered capabilities, context memory, knowledge base integration, quotes, and AI document editing to enhance their content creation process. Refly offers both cloud and self-hosting options, making it suitable for individuals, enterprises, and organizations. The tool is designed to facilitate human-AI collaboration and streamline content creation workflows.

github

: 4.6k

raga-llm-hub

Raga LLM Hub is a comprehensive evaluation toolkit for Language and Learning Models (LLMs) with over 100 meticulously designed metrics. It allows developers and organizations to evaluate and compare LLMs effectively, establishing guardrails for LLMs and Retrieval Augmented Generation (RAG) applications. The platform assesses aspects like Relevance & Understanding, Content Quality, Hallucination, Safety & Bias, Context Relevance, Guardrails, and Vulnerability scanning, along with Metric-Based Tests for quantitative analysis. It helps teams identify and fix issues throughout the LLM lifecycle, revolutionizing reliability and trustworthiness.

github

: 92

ScholarCopilot

Scholar Copilot is an intelligent academic writing assistant that enhances the research writing process through AI-powered text completion and citation suggestions. It aims to streamline academic writing while maintaining high scholarly standards. The tool provides features such as smart text generation with next-3-sentence suggestions, full section auto-completion, and context-aware writing. It also offers intelligent citation management with real-time citation suggestions, one-click citation insertion, and citation Bibtex generation. Scholar Copilot employs a unified model architecture that integrates retrieval and generation through a dynamic switching mechanism, ensuring coherent text generation with appropriate citation points.

github

: 86

adk-java

Agent Development Kit (ADK) for Java is an open-source toolkit designed for developers to build, evaluate, and deploy sophisticated AI agents with flexibility and control. It allows defining agent behavior, orchestration, and tool use directly in code, enabling robust debugging, versioning, and deployment anywhere. The toolkit offers a rich tool ecosystem, code-first development approach, and support for modular multi-agent systems, making it ideal for creating advanced AI agents integrated with Google Cloud services.

github

: 694

LinguaHaru

Next-generation AI translation tool that provides high-quality, precise translations for various common file formats with a single click. It is based on cutting-edge large language models, offering exceptional translation quality with minimal operation, supporting multiple document formats and languages. Features include multi-format compatibility, global language translation, one-click rapid translation, flexible translation engines, and LAN sharing for efficient collaborative work.

github

: 93

FinRobot

FinRobot is an open-source AI agent platform designed for financial applications using large language models. It transcends the scope of FinGPT, offering a comprehensive solution that integrates a diverse array of AI technologies. The platform's versatility and adaptability cater to the multifaceted needs of the financial industry. FinRobot's ecosystem is organized into four layers, including Financial AI Agents Layer, Financial LLMs Algorithms Layer, LLMOps and DataOps Layers, and Multi-source LLM Foundation Models Layer. The platform's agent workflow involves Perception, Brain, and Action modules to capture, process, and execute financial data and insights. The Smart Scheduler optimizes model diversity and selection for tasks, managed by components like Director Agent, Agent Registration, Agent Adaptor, and Task Manager. The tool provides a structured file organization with subfolders for agents, data sources, and functional modules, along with installation instructions and hands-on tutorials.

github

: 1.0k

ai-platform-engineering

The AI Platform Engineering repository provides a collection of tools and resources for building and deploying AI models. It includes libraries for data preprocessing, model training, and model serving. The repository also contains example code and tutorials to help users get started with AI development. Whether you are a beginner or an experienced AI engineer, this repository offers valuable insights and best practices to streamline your AI projects.

github

: 123

A-mem

A-MEM is a novel agentic memory system designed for Large Language Model (LLM) agents to dynamically organize memories in an agentic way. It introduces advanced memory organization capabilities, intelligent indexing, and linking of memories, comprehensive note generation, interconnected knowledge networks, continuous memory evolution, and agent-driven decision making for adaptive memory management. The system facilitates agent construction and enables dynamic memory operations and flexible agent-memory interactions.

github

: 144

eShopSupport

eShopSupport is a sample .NET application showcasing common use cases and development practices for building AI solutions in .NET, specifically Generative AI. It demonstrates a customer support application for an e-commerce website using a services-based architecture with .NET Aspire. The application includes support for text classification, sentiment analysis, text summarization, synthetic data generation, and chat bot interactions. It also showcases development practices such as developing solutions locally, evaluating AI responses, leveraging Python projects, and deploying applications to the Cloud.

github

: 228

For similar tasks

SDET-GENIE

github

: 51

EvoMaster

EvoMaster is an open-source AI-driven tool that automatically generates system-level test cases for web/enterprise applications. It uses Evolutionary Algorithm and Dynamic Program Analysis to evolve test cases, maximizing code coverage and fault detection. It supports REST, GraphQL, and RPC APIs, with whitebox testing for JVM-compiled APIs. The tool generates JUnit tests in Java or Kotlin, focusing on fault detection, self-contained tests, SQL handling, and authentication. Known limitations include manual driver creation for whitebox testing and longer execution times for better results. EvoMaster has been funded by ERC and RCN grants.

github

: 443

repopack

Repopack is a powerful tool that packs your entire repository into a single, AI-friendly file. It optimizes your codebase for AI comprehension, is simple to use with customizable options, and respects Gitignore files for security. The tool generates a packed file with clear separators and AI-oriented explanations, making it ideal for use with Generative AI tools like Claude or ChatGPT. Repopack offers command line options, configuration settings, and multiple methods for setting ignore patterns to exclude specific files or directories during the packing process. It includes features like comment removal for supported file types and a security check using Secretlint to detect sensitive information in files.

github

: 1.7k

EvoMaster

EvoMaster is an open-source AI-driven tool that automatically generates system-level test cases for web/enterprise applications. It uses an Evolutionary Algorithm and Dynamic Program Analysis to evolve test cases, maximizing code coverage and fault detection. The tool supports REST, GraphQL, and RPC APIs, with whitebox testing for JVM-compiled languages. It generates JUnit tests, detects faults, handles SQL databases, and supports authentication. EvoMaster has been funded by the European Research Council and the Research Council of Norway.

github

: 554

ianvs

Ianvs is a distributed synergy AI benchmarking project incubated in KubeEdge SIG AI. It aims to test the performance of distributed synergy AI solutions following recognized standards, providing end-to-end benchmark toolkits, test environment management tools, test case control tools, and benchmark presentation tools. It also collaborates with other organizations to establish comprehensive benchmarks and related applications. The architecture includes critical components like Test Environment Manager, Test Case Controller, Generation Assistant, Simulation Controller, and Story Manager. Ianvs documentation covers quick start, guides, dataset descriptions, algorithms, user interfaces, stories, and roadmap.

github

: 111

NotHotDog

NotHotDog is an open-source platform for testing, evaluating, and simulating AI agents. It offers a robust framework for generating test cases, running conversational scenarios, and analyzing agent performance.

github

: 55

rhesis

Rhesis is a comprehensive test management platform designed for Gen AI teams, offering tools to create, manage, and execute test cases for generative AI applications. It ensures the robustness, reliability, and compliance of AI systems through features like test set management, automated test generation, edge case discovery, compliance validation, integration capabilities, and performance tracking. The platform is open source, emphasizing community-driven development, transparency, extensible architecture, and democratizing AI safety. It includes components such as backend services, frontend applications, SDK for developers, worker services, chatbot applications, and Polyphemus for uncensored LLM service. Rhesis enables users to address challenges unique to testing generative AI applications, such as non-deterministic outputs, hallucinations, edge cases, ethical concerns, and compliance requirements.

github

: 115

For similar jobs

testzeus-hercules

Hercules is the world’s first open-source testing agent designed to handle the toughest testing tasks for modern web applications. It turns simple Gherkin steps into fully automated end-to-end tests, making testing simple, reliable, and efficient. Hercules adapts to various platforms like Salesforce and is suitable for CI/CD pipelines. It aims to democratize and disrupt test automation, making top-tier testing accessible to everyone. The tool is transparent, reliable, and community-driven, empowering teams to deliver better software. Hercules offers multiple ways to get started, including using PyPI package, Docker, or building and running from source code. It supports various AI models, provides detailed installation and usage instructions, and integrates with Nuclei for security testing and WCAG for accessibility testing. The tool is production-ready, open core, and open source, with plans for enhanced LLM support, advanced tooling, improved DOM distillation, community contributions, extensive documentation, and a bounty program.

github

: 457

SDET-GENIE

github

: 51

cli

TestDriver is an innovative test framework that automates and scales QA using computer-use agents. It leverages AI vision, mouse, and keyboard emulation to control the entire desktop, making it more like a QA employee than a traditional test framework. With TestDriver, users can easily set up tests without complex selectors, reduce maintenance efforts as tests don't break with code changes, and gain more power to test any application and control any OS setting.

github

: 191

PromptFuzz

**Description:** PromptFuzz is an automated tool that generates high-quality fuzz drivers for libraries via a fuzz loop constructed on mutating LLMs' prompts. The fuzz loop of PromptFuzz aims to guide the mutation of LLMs' prompts to generate programs that cover more reachable code and explore complex API interrelationships, which are effective for fuzzing. **Features:** * **Multiply LLM support** : Supports the general LLMs: Codex, Inocder, ChatGPT, and GPT4 (Currently tested on ChatGPT). * **Context-based Prompt** : Construct LLM prompts with the automatically extracted library context. * **Powerful Sanitization** : The program's syntax, semantics, behavior, and coverage are thoroughly analyzed to sanitize the problematic programs. * **Prioritized Mutation** : Prioritizes mutating the library API combinations within LLM's prompts to explore complex interrelationships, guided by code coverage. * **Fuzz Driver Exploitation** : Infers API constraints using statistics and extends fixed API arguments to receive random bytes from fuzzers. * **Fuzz engine integration** : Integrates with grey-box fuzz engine: LibFuzzer. **Benefits:** * **High branch coverage:** The fuzz drivers generated by PromptFuzz achieved a branch coverage of 40.12% on the tested libraries, which is 1.61x greater than _OSS-Fuzz_ and 1.67x greater than _Hopper_. * **Bug detection:** PromptFuzz detected 33 valid security bugs from 49 unique crashes. * **Wide range of bugs:** The fuzz drivers generated by PromptFuzz can detect a wide range of bugs, most of which are security bugs. * **Unique bugs:** PromptFuzz detects uniquely interesting bugs that other fuzzers may miss. **Usage:** 1. Build the library using the provided build scripts. 2. Export the LLM API KEY if using ChatGPT or GPT4. 3. Generate fuzz drivers using the `fuzzer` command. 4. Run the fuzz drivers using the `harness` command. 5. Deduplicate and analyze the reported crashes. **Future Works:** * **Custom LLMs suport:** Support custom LLMs. * **Close-source libraries:** Apply PromptFuzz to close-source libraries by fine tuning LLMs on private code corpus. * **Performance** : Reduce the huge time cost required in erroneous program elimination.

github

: 230

code-review-gpt

Code Review GPT uses Large Language Models to review code in your CI/CD pipeline. It helps streamline the code review process by providing feedback on code that may have issues or areas for improvement. It should pick up on common issues such as exposed secrets, slow or inefficient code, and unreadable code. It can also be run locally in your command line to review staged files. Code Review GPT is in alpha and should be used for fun only. It may provide useful feedback but please check any suggestions thoroughly.

github

: 1.8k

aiverify

AI Verify is an AI governance testing framework and software toolkit that validates the performance of AI systems against a set of internationally recognised principles through standardised tests. AI Verify is consistent with international AI governance frameworks such as those from European Union, OECD and Singapore. It is a single integrated toolkit that operates within an enterprise environment. It can perform technical tests on common supervised learning classification and regression models for most tabular and image datasets. It however does not define AI ethical standards and does not guarantee that any AI system tested will be free from risks or biases or is completely safe.

github

: 75

cover-agent

CodiumAI Cover Agent is a tool designed to help increase code coverage by automatically generating qualified tests to enhance existing test suites. It utilizes Generative AI to streamline development workflows and is part of a suite of utilities aimed at automating the creation of unit tests for software projects. The system includes components like Test Runner, Coverage Parser, Prompt Builder, and AI Caller to simplify and expedite the testing process, ensuring high-quality software development. Cover Agent can be run via a terminal and is planned to be integrated into popular CI platforms. The tool outputs debug files locally, such as generated_prompt.md, run.log, and test_results.html, providing detailed information on generated tests and their status. It supports multiple LLMs and allows users to specify the model to use for test generation.

github

: 4.2k

auto-playwright

Auto Playwright is a tool that allows users to run Playwright tests using AI. It eliminates the need for selectors by determining actions at runtime based on plain-text instructions. Users can automate complex scenarios, write tests concurrently with or before functionality development, and benefit from rapid test creation. The tool supports various Playwright actions and offers additional options for debugging and customization. It uses HTML sanitization to reduce costs and improve text quality when interacting with the OpenAI API.

github

: 298