cursor-tools

Give Cursor Agent an AI Team and Advanced Skills

Stars: 3548

Visit

cursor-tools is a CLI tool designed to enhance AI agents with advanced skills, such as web search, repository context, documentation generation, GitHub integration, Xcode tools, and browser automation. It provides features like Perplexity for web search, Gemini 2.0 for codebase context, and Stagehand for browser operations. The tool requires API keys for Perplexity AI and Google Gemini, and supports global installation for system-wide access. It offers various commands for different tasks and integrates with Cursor Composer for AI agent usage.

README:

Give Cursor Agent an AI team and advanced skills

The AI Team
New Skills
How to Use
- Example: Using Perplexity
- Example: Using Gemini
What is cursor-tools
Installation
Requirements
Tips
Additional Examples
- GitHub Skills
- Gemini Code Review
Detailed Cursor Usage
- Tool Recommendations
- Command Nicknames
- Web Search
- Repository Search
- Documentation Generation
- GitHub Integration
- Browser Automation
- Direct Model Queries
Authentication and API Keys
AI Team Features
- Perplexity: Web Search & Research
- Gemini 2.0: Repository Context & Planning
- Stagehand: Browser Automation
- YouTube Video Analysis
Skills
- GitHub Integration
- Xcode Tools
- Documentation Generation
Configuration
- cursor-tools.config.json
- GitHub Authentication
- Repomix Configuration
- Model Selection
- Cursor Configuration
  - Cursor Agent Configuration
cursor-tools cli
- Command Options
- Execution Methods
Troubleshooting
Examples
- Web Search Examples
- Repository Context Examples
- Documentation Examples
- GitHub Integration Examples
- Xcode Command Examples
- Browser Command Examples
  - open subcommand examples
  - act, extract, observe subcommands examples
- YouTube Command Examples
Node Package Manager
Contributing
Sponsors
License

The AI Team

Perplexity to search the web and perform deep research
Gemini 2.0 for huge whole-codebase context window, search grounding and reasoning
Stagehand for browser operation to test and debug web apps (uses Anthropic or OpenAI models)
OpenRouter for access to a variety of models through a unified API (for MCP commands)

New Skills for your existing Agent

Work with GitHub Issues and Pull Requests
Generate local agent-accessible documentation for external dependencies
Analyze YouTube videos to extract insights, summaries, and implementation plans

cursor-tools is optimized for Cursor Composer Agent but it can be used by any coding agent that can execute commands

How do I use it?

After installation, to see AI teamwork in action just ask Cursor Composer to use Perplexity or Gemini. Here are two examples:

Asking Perplexity to carry out web research

see what happens next...

see the spec composer and perplexity produced together: pac-man-spec.md (link out to the example repo)

Asking Gemini for a plan

see what happens next...

see the spec composer and perplexity produced together: pac-man-plan.md (link out to the example repo)

What is cursor-tools

cursor-tools provides a CLI that your AI agent can use to expand its capabilities. cursor-tools is designed to be installed globally, providing system-wide access to its powerful features. When you run cursor-tools install we automatically add a prompt section to your Cursor project rules. During installation, you can choose between:

The new .cursor/rules/cursor-tools.mdc file (recommended)
The legacy .cursorrules file (for backward compatibility)

You can also control this using the USE_LEGACY_CURSORRULES environment variable:

USE_LEGACY_CURSORRULES=true - Use legacy .cursorrules file
USE_LEGACY_CURSORRULES=false - Use new .cursor/rules/cursor-tools.mdc file
If not set, defaults to legacy mode for backward compatibility

cursor-tools requires a Perplexity API key and a Google AI API key.

cursor-tools is a node package that should be installed globally.

Installation

Install cursor-tools globally:

npm install -g cursor-tools

Then run the interactive setup:

cursor-tools install .

This command will:

Guide you through API key configuration
Update your Cursor project rules for Cursor integration (using .cursor/rules/cursor-tools.mdc or existing .cursorrules)

Requirements

Node.js 18 or later
Perplexity API key
Google Gemini API key
For browser commands:
- Playwright (npm install --global playwright)
- OpenAI API key or Anthropic API key (for act, extract, and observe commands)

cursor-tools uses Gemini-2.0 because it is the only good LLM with a context window that goes up to 2 million tokens - enough to handle and entire codebase in one shot. Gemini 2.0 experimental models that we use by default are currently free to use on Google and you need a Google Cloud project to create an API key.

cursor-tools uses Perplexity because Perplexity has the best web search api and indexes and it does not hallucinate. Perplexity Pro users can get an API key with their pro account and recieve $5/month of free credits (at time of writing). Support for Google search grounding is coming soon but so far testing has shown it still frequently hallucinates things like APIs and libraries that don't exist.

Tips:

Ask Cursor Agent to have Gemini review its work
Ask Cursor Agent to generate documentation for external dependencies and write it to a local-docs/ folder

If you do something cool with cursor-tools please let me know on twitter or make a PR to add to this section!

Additional Examples

GitHub Skills

To see cursor-tools GitHub and Perplexity skills: Check out this example issue that was solved using Cursor agent and cursor-tools

Gemini code review

See cursor get approximately 5x more work done per-prompt with Gemini code review:

Detailed Cursor Usage

Use Cursor Composer in agent mode with command execution (not sure what this means, see section below on Cursor Agent configuration). If you have installed the cursor-tools prompt to your .cursorrules (or equivalent) just ask your AI coding agent/assistant to use "cursor-tools" to do things.

Tool Recommendations

cursor-tools ask allows direct querying of any model from any provider. It's best for simple questions where you want to use a specific model or compare responses from different models.
cursor-tools web uses an AI teammate with web search capability to answer questions. web is best for finding up-to-date information from the web that is not specific to the repository such as how to use a library to search for known issues and error messages or to get suggestions on how to do something. Web is a teammate who knows tons of stuff and is always up to date.
cursor-tools repo uses an AI teammate with large context window capability to answer questions. repo sends the entire repo as context so it is ideal for questions about how things work or where to find something, it is also great for code review, debugging and planning. is a teammate who knows the entire codebase inside out and understands how everything works together.
cursor-tools plan uses an AI teammate with reasoning capability to plan complex tasks. Plan uses a two step process. First it does a whole repo search with a large context window model to find relevant files. Then it sends only those files as context to a thinking model to generate a plan it is great for planning complex tasks and for debugging and refactoring. Plan is a teammate who is really smart on a well defined problem, although doesn't consider the bigger picture.
cursor-tools doc uses an AI teammate with large context window capability to generate documentation for local or github hosted repositories by sending the entire repo as context. doc can be given precise documentation tasks or can be asked to generate complete docs from scratch it is great for generating docs updates or for generating local documentation for a libary or API that you use! Doc is a teammate who is great at summarising and explaining code, in this repo or in any other repo!
cursor-tools browser uses an AI teammate with browser control (aka operator) capability to operate web browsers. browser can operate in a hidden (headless) mode to invisibly test and debug web apps or it can be used to connect to an existing browser session to interactively share your browser with Cursor agent it is great for testing and debugging web apps and for carrying out any task that can be done in a browser such as reading information from a bug ticket or even filling out a form. Browser is a teammate who can help you test and debug web apps, and can share control of your browser to perform small browser-based tasks.
cursor-tools youtube uses an AI teammate with video analysis capability to understand YouTube content. youtube can generate summaries, extract transcripts, create implementation plans from tutorials, and answer specific questions about video content. It's great for extracting value from technical talks, tutorials, and presentations without spending time watching the entire video. YouTube is a teammate who can watch and analyze videos for you, distilling the key information.

Note: For repo, doc and plan commands the repository content that is sent as context can be reduced by filtering out files in a .repomixignore file.

Command Nicknames

When using cursor-tools with Cursor Composer, you can use these nicknames:

"Gemini" is a nickname for cursor-tools repo
"Perplexity" is a nickname for cursor-tools web
"Stagehand" is a nickname for cursor-tools browser

Use web search

"Please implement country specific stripe payment pages for the USA, UK, France and Germany. Use cursor-tools web to check the available stripe payment methods in each country."

Note: in most cases you can say "ask Perplexity" instead of "use cursor-tools web" and it will work the same.

Use repo search

"Let's refactor our User class to allow multiple email aliases per user. Use cursor-tools repo to ask for a plan including a list of all files that need to be changed."

"Use cursor-tools repo to analyze how authentication is implemented in the Next.js repository. Use --from-github=vercel/next.js."

Note: in most cases you can say "ask Gemini" instead of "use cursor-tools repo" and it will work the same.

Use doc generation

"Use cursor-tools to generate documentation for the Github repo https://github.com/kait-http/kaito" and write it to docs/kaito.md"

Note: in most cases you can say "generate documentation" instead of "use cursor-tools doc" and it will work the same.

Use github integration

"Use cursor-tools github to fetch issue 123 and suggest a solution to the user's problem"

"Use cursor-tools github to fetch PR 321 and see if you can fix Andy's latest comment"

Note: in most cases you can say "fetch issue 123" or "fetch PR 321" instead of "use cursor-tools github" and it will work the same.

Use browser automation

"Use cursor-tools to open the users page and check the error in the console logs, fix it"

"Use cursor-tools to test the form field validation logic. Take screenshots of each state"

"Use cursor-tools to open https://example.com/foo the and check the error in the network logs, what could be causing it?"

Note: in most cases you can say "Use Stagehand" instead of "use cursor-tools" and it will work the same.

Use direct model queries

"Use cursor-tools ask to compare how different models answer this question: 'What are the key differences between REST and GraphQL?'"

"Ask OpenAI's o3-mini model to explain the concept of dependency injection."

"Use cursor-tools ask to analyze this complex algorithm with high reasoning effort: 'Explain the time and space complexity of the Boyer-Moore string search algorithm' --provider openai --model o3-mini --reasoning-effort high"

Note: The ask command requires both --provider and --model parameters to be specified. This command is generally less useful than other commands like repo or plan because it does not include any context from your codebase or repository.

Ask Command Options:

--provider=<provider>: AI provider to use (required)
--model=<model>: Model to use (required)
--max-tokens=<number>: Maximum tokens for response
--reasoning-effort=<low|medium|high>: Control the depth of reasoning for supported models (OpenAI o1/o3-mini models and Claude 3.7 Sonnet). Higher values produce more thorough responses for complex questions.

Authentication and API Keys

cursor-tools requires API keys for Perplexity AI, Google Gemini, and optionally for OpenAI, Anthropic and OpenRouter. These can be configured in two ways:

Interactive Setup: Run cursor-tools install and follow the prompts

Manual Setup: Create ~/.cursor-tools/.env in your home directory or .cursor-tools.env in your project root:

PERPLEXITY_API_KEY="your-perplexity-api-key"
GEMINI_API_KEY="your-gemini-api-key"
OPENAI_API_KEY="your-openai-api-key"  # Optional, for Stagehand
ANTHROPIC_API_KEY="your-anthropic-api-key" # Optional, for Stagehand and MCP
OPENROUTER_API_KEY="your-openrouter-api-key" # Optional, for MCP
GITHUB_TOKEN="your-github-token"  # Optional, for enhanced GitHub access

At least one of ANTHROPIC_API_KEY and OPENROUTER_API_KEY must be provided to use the mcp commands.

Google Gemini API Authentication

cursor-tools supports multiple authentication methods for accessing the Google Gemini API, providing flexibility for different environments and security requirements. You can choose from the following methods:

API Key (Default)
- This is the simplest method and continues to be supported for backward compatibility.
- Set the GEMINI_API_KEY environment variable to your API key string obtained from Google AI Studio.
- Example:
```
GEMINI_API_KEY="your-api-key-here"
```
Service Account JSON Key File
- For enhanced security, especially in production environments, use a service account JSON key file.
- Set the GEMINI_API_KEY environment variable to the path of your downloaded service account JSON key file.
- Example:
```
GEMINI_API_KEY="./path/to/service-account.json"
```
- This method enables access to the latest Gemini models available through Vertex AI, such as gemini-2.0-flash.
Application Default Credentials (ADC) (Recommended for Google Cloud Environments)
- ADC is ideal when running cursor-tools within Google Cloud environments (e.g., Compute Engine, Kubernetes Engine) or for local development using gcloud.
- Set the GEMINI_API_KEY environment variable to adc.
- Example:
```
GEMINI_API_KEY="adc"
```
- Setup Instructions: First, authenticate locally using gcloud:
```
gcloud auth application-default login
```

AI Team Features

Perplexity: Web Search & Research

Use Perplexity AI to get up-to-date information directly within Cursor:

cursor-tools web "What's new in TypeScript 5.7?"

Gemini 2.0: Repository Context & Planning

Leverage Google Gemini 2.0 models with 1M+ token context windows for codebase-aware assistance and implementation planning:

# Get context-aware assistance
cursor-tools repo "Explain the authentication flow in this project, which files are involved?"

# Generate implementation plans
cursor-tools plan "Add user authentication to the login page"

The plan command uses multiple AI models to:

Identify relevant files in your codebase (using Gemini by default)
Extract content from those files
Generate a detailed implementation plan (using o3-mini by default)

Plan Command Options:

--fileProvider=<provider>: Provider for file identification (gemini, openai, anthropic, perplexity, modelbox, or openrouter)
--thinkingProvider=<provider>: Provider for plan generation (gemini, openai, anthropic, perplexity, modelbox, or openrouter)
--fileModel=<model>: Model to use for file identification
--thinkingModel=<model>: Model to use for plan generation
--fileMaxTokens=<number>: Maximum tokens for file identification
--thinkingMaxTokens=<number>: Maximum tokens for plan generation
--debug: Show detailed error information

Repository context is created using Repomix. See repomix configuration section below for details on how to change repomix behaviour.

Above 1M tokens cursor-tools will always send requests to Gemini 2.0 Pro as it is the only model that supports 1M+ tokens.

The Gemini 2.0 Pro context limit is 2M tokens, you can add filters to .repomixignore if your repomix context is above this limit.

Stagehand: Browser Automation

Automate browser interactions for web scraping, testing, and debugging:

Important: The browser command requires the Playwright package to be installed separately in your project:

npm install playwright
# or
yarn add playwright
# or
pnpm add playwright

open - Open a URL and capture page content:

# Open and capture HTML content, console logs and network activity (enabled by default)
cursor-tools browser open "https://example.com" --html

# Take a screenshot
cursor-tools browser open "https://example.com" --screenshot=page.png

# Debug in an interactive browser session
cursor-tools browser open "https://example.com" --connect-to=9222

act - Execute actions using natural language - Agent tells the browser-use agent what to do:

# Single action
cursor-tools browser act "Login as '[email protected]'" --url "https://example.com/login"

# Multi-step workflow using pipe separator
cursor-tools browser act "Click Login | Type '[email protected]' into email | Click Submit" --url "https://example.com"

# Record interaction video
cursor-tools browser act "Fill out registration form" --url "https://example.com/signup" --video="./recordings"

observe - Analyze interactive elements:

# Get overview of interactive elements
cursor-tools browser observe "What can I interact with?" --url "https://example.com"

# Find specific elements
cursor-tools browser observe "Find the login form" --url "https://example.com"

extract - Extract data using natural language:

# Extract specific content
cursor-tools browser extract "Get all product prices" --url "https://example.com/products"

# Save extracted content
cursor-tools browser extract "Get article text" --url "https://example.com/blog" --html > article.html

# Extract with network monitoring
cursor-tools browser extract "Get API responses" --url "https://example.com/api-test" --network

Browser Command Options

All browser commands (open, act, observe, extract) support these options:

--console: Capture browser console logs (enabled by default, use --no-console to disable)
--html: Capture page HTML content (disabled by default)
--network: Capture network activity (enabled by default, use --no-network to disable)
--screenshot=<file path>: Save a screenshot of the page
--timeout=<milliseconds>: Set navigation timeout (default: 120000ms for Stagehand operations, 30000ms for navigation)
--viewport=<width>x<height>: Set viewport size (e.g., 1280x720)
--headless: Run browser in headless mode (default: true)
--no-headless: Show browser UI (non-headless mode) for debugging
--connect-to=<port>: Connect to existing Chrome instance. Special values: 'current' (use existing page), 'reload-current' (refresh existing page)
--wait=<time:duration or selector:css-selector>: Wait after page load (e.g., 'time:5s', 'selector:#element-id')
--video=<directory>: Save a video recording (1280x720 resolution, timestamped subdirectory). Not available when using --connect-to
--url=<url>: Required for act, observe, and extract commands
--evaluate=<string>: JavaScript code to execute in the browser before the main command

Notes on Connecting to an existing browser session with --connect-to

DO NOT ask browser act to "wait" for anything, the wait command is currently disabled in Stagehand.
When using --connect-to, viewport is only changed if --viewport is explicitly provided
Video recording is not available when using --connect-to
Special --connect-to values:
- current: Use the existing page without reloading
- reload-current: Use the existing page and refresh it (useful in development)

Video Recording

All browser commands support video recording of the browser interaction in headless mode (not supported with --connect-to):

Use --video=<directory> to enable recording
Videos are saved at 1280x720 resolution in timestamped subdirectories
Recording starts when the browser opens and ends when it closes
Videos are saved as .webm files

Example:

# Record a video of filling out a form
cursor-tools browser act "Fill out registration form with name John Doe" --url "http://localhost:3000/signup" --video="./recordings"

Console and Network Logging

Console logs and network activity are captured by default:

Use --no-console to disable console logging
Use --no-network to disable network logging
Logs are displayed in the command output

Complex Actions

The act command supports chaining multiple actions using the pipe (|) separator:

# Login sequence with console/network logging (enabled by default)
cursor-tools browser act "Click Login | Type '[email protected]' into email | Click Submit" --url "http://localhost:3000/login"

# Form filling with multiple fields
cursor-tools browser act "Select 'Mr' from title | Type 'John' into first name | Type 'Doe' into last name | Click Next" --url "http://localhost:3000/register"

# Record complex interaction
cursor-tools browser act "Fill form | Submit | Verify success" --url "http://localhost:3000/signup" --video="./recordings"

Troubleshooting Browser Commands

Common issues and solutions:

Element Not Found Errors
- Use --no-headless to visually debug the page
- Use browser observe to see what elements Stagehand can identify
- Check if the element is in an iframe or shadow DOM
- Ensure the page has fully loaded (try increasing --timeout)
Stagehand API Errors
- Verify your OpenAI or Anthropic API key is set correctly
- Check if you have sufficient API credits
- Try switching models using --model
Network Errors
- Check your internet connection
- Verify the target website is accessible
- Try increasing the timeout with --timeout
- Check if the site blocks automated access
Video Recording Issues
- Ensure the target directory exists and is writable
- Check disk space
- Video recording is not available with --connect-to
Performance Issues
- Use --headless mode for better performance (default)
- Reduce the viewport size with --viewport
- Consider using --connect-to for development

YouTube Video Analysis

Use Gemini-powered YouTube video analysis to extract insights, summaries, and implementation plans:

# Generate a video summary
cursor-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" --type=summary

# Get a detailed transcript
cursor-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" --type=transcript

# Create an implementation plan based on tutorial content
cursor-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" --type=plan

# Ask specific questions about the video
cursor-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" "How does the authentication flow work?"

# Save summary to a file
cursor-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" --type=summary --save-to=video-summary.md

The YouTube command leverages Gemini models' native ability to understand video content, enabling you to:

Extract key insights and summaries from technical talks, tutorials, and presentations
Generate complete transcripts of video content
Create implementation plans based on tutorial videos
Perform quality reviews of educational content
Get answers to specific questions about the video content

YouTube Command Options:

--type=<summary|transcript|plan|custom>: Type of analysis to perform (default: summary)

Note: The YouTube command requires a GEMINI_API_KEY to be set in your environment or .cursor-tools.env file as the Gemini API is currently the only interface that reliably supports YouTube video analysis.

Skills

GitHub Integration

Access GitHub issues and pull requests directly from the command line with rich formatting and full context:

# List recent PRs or issues
cursor-tools github pr
cursor-tools github issue

# View specific PR or issue with full discussion
cursor-tools github pr 123
cursor-tools github issue 456

The GitHub commands provide:

View of 10 most recent open PRs or issues when no number specified
Detailed view of specific PR/issue including:
- PR/Issue description and metadata
- Code review comments grouped by file (PRs only)
- Full discussion thread
- Labels, assignees, milestones and reviewers
Support for both local repositories and remote GitHub repositories
Markdown-formatted output for readability

Authentication Methods: The commands support multiple authentication methods:

GitHub token via environment variable: GITHUB_TOKEN=your_token_here
GitHub CLI integration (if gh is installed and logged in)
Git credentials (stored tokens or Basic Auth)

Without authentication:

Public repositories: Limited to 60 requests per hour
Private repositories: Not accessible

With authentication:

Public repositories: 5,000 requests per hour
Private repositories: Full access (with appropriate token scopes)

Xcode Tools

Automate iOS app building, testing, and running in the simulator:

# Available subcommands
cursor-tools xcode build  # Build Xcode project and report errors
cursor-tools xcode run    # Build and run app in simulator
cursor-tools xcode lint   # Analyze code and offer to fix warnings

Build Command Options:

# Specify custom build path (derived data)
cursor-tools xcode build buildPath=/custom/build/path

# Specify target device
cursor-tools xcode build destination="platform=iOS Simulator,name=iPhone 15"

Run Command Options:

# Run on iPhone simulator (default)
cursor-tools xcode run iphone

# Run on iPad simulator
cursor-tools xcode run ipad

# Run on specific device with custom build path
cursor-tools xcode run device="iPhone 16 Pro" buildPath=/custom/build/path

The Xcode commands provide:

Automatic project/workspace detection
Dynamic app bundle identification
Build output streaming with error parsing
Simulator device management
Support for both iPhone and iPad simulators
Custom build path specification to control derived data location

Documentation Generation (uses Gemini 2.0)

Generate comprehensive documentation for your repository or any GitHub repository:

# Document local repository and save to file
cursor-tools doc --save-to=docs.md

# Document remote GitHub repository (both formats supported)
cursor-tools doc --from-github=username/repo-name@branch
cursor-tools doc --from-github=https://github.com/username/repo-name@branch

# Save documentation to file (with and without a hint)
# This is really useful to generate local documentation for libraries and dependencies
cursor-tools doc --from-github=eastlondoner/cursor-tools --save-to=docs/CURSOR-TOOLS.md
cursor-tools doc --from-github=eastlondoner/cursor-tools --save-to=docs/CURSOR-TOOLS.md --hint="only information about the doc command"

Configuration

cursor-tools.config.json

Customize cursor-tools behavior by creating a cursor-tools.config.json file. This file can be created either globally in ~/.cursor-tools/cursor-tools.config.json or locally in your project root.

The cursor-tools.config file configures the local default behaviour for each command and provider.

Here is an example of a typical cursor-tools.config.json file, showing some of the most common configuration options:

{
  // Commands
  "repo": {
    "provider": "openrouter",
    "model": "google/gemini-2.5-pro-exp-03-25:free"
  },
  "doc": {
    "provider": "openrouter",
    "model": "anthropic/claude-3.7-sonnet",
    "maxTokens": 4096
  },
  "web": {
    "provider": "gemini",
    "model": "gemini-2.5-pro-exp"
  },
  "plan": {
    "fileProvider": "gemini",
    "thinkingProvider": "perplexity",
    "thinkingModel": "r1-1776"
  },
  "browser": {
    "headless": false
  },
  //...

  // Providers
  "stagehand": {
    "model": "claude-3-7-sonnet-latest", // For Anthropic provider
    "provider": "anthropic", // or "openai"
    "timeout": 90000
  },
  "openai": {
    "model": "gpt-4o"
  }
  //...
}

For details of all configuration options, see CONFIGURATION.md. This includes details of all the configuration options and how to use them.

GitHub Authentication

The GitHub commands support several authentication methods:

Environment Variable: Set GITHUB_TOKEN in your environment:
```
GITHUB_TOKEN=your_token_here
```
GitHub CLI: If you have the GitHub CLI (gh) installed and are logged in, cursor-tools will automatically use it to generate tokens with the necessary scopes.
Git Credentials: If you have authenticated git with GitHub (via HTTPS), cursor-tools will automatically:
- Use your stored GitHub token if available (credentials starting with ghp_ or gho_)
- Fall back to using Basic Auth with your git credentials

To set up git credentials:

Configure git to use HTTPS instead of SSH:

git config --global url."https://github.com/".insteadOf [email protected]:

Store your credentials:

git config --global credential.helper store  # Permanent storage
# Or for macOS keychain:
git config --global credential.helper osxkeychain

The next time you perform a git operation requiring authentication, your credentials will be stored

Authentication Status:

Without authentication:
- Public repositories: Limited to 60 requests per hour
- Private repositories: Not accessible
- Some features may be restricted
With authentication (any method):
- Public repositories: 5,000 requests per hour
- Private repositories: Full access (if token has required scopes)

cursor-tools will automatically try these authentication methods in order:

GITHUB_TOKEN environment variable
GitHub CLI token (if gh is installed and logged in)
Git credentials (stored token or Basic Auth)

If no authentication is available, it will fall back to unauthenticated access with rate limits.

Repomix Configuration

When generating documentation, cursor-tools uses Repomix to analyze your repository. By default, it excludes certain files and directories that are typically not relevant for documentation:

Node modules and package directories (node_modules/, packages/, etc.)
Build output directories (dist/, build/, etc.)
Version control directories (.git/)
Test files and directories (test/, tests/, __tests__/, etc.)
Configuration files (.env, .config, etc.)
Log files and temporary files
Binary files and media files

You can customize the files and folders to exclude using two methods, both can be combined together:

Create a .repomixignore file in your project root to specify files to exclude.

Example .repomixignore file for a Laravel project:

vendor/
public/
database/
storage/
.idea
.env

Create a repomix.config.json file in your project root for more advanced configuration options:

Example repomix.config.json to enable compression and specify what to include:

{
  "include": ["src/**/*", "README.md", "package.json"],
  "output": {
    "compress": true
  }
}

This configuration will be detected and used automatically by the repo, plan, and doc commands, allowing for precise control over which files are included in the repository analysis.

If both a .repomixignore and an ignore section in repomix.config.json are present then the ignore patterns from both are combined.

Model Selection

The browser commands support different AI models for processing. You can select the model using the --model option:

# Use gpt-4o
cursor-tools browser act "Click Login" --url "https://example.com" --model=gpt-4o

# Use Claude 3.7 Sonnet
cursor-tools browser act "Click Login" --url "https://example.com" --model=claude-3-7-sonnet-latest

You can set a default provider in your cursor-tools.config.json file under the stagehand section:

{
  "stagehand": {
    "model": "claude-3-7-sonnet-latest", // For Anthropic provider
    "provider": "anthropic", // or "openai"
    "timeout": 90000
  }
}

You can also set a default model in your cursor-tools.config.json file under the stagehand section:

{
  "stagehand": {
    "provider": "openai", // or "anthropic"
    "model": "gpt-4o"
  }
}

If no model is specified (either on the command line or in the config), a default model will be used based on your configured provider:

OpenAI: o3-mini
Anthropic: claude-3-7-sonnet-latest

Available models depend on your configured provider (OpenAI or Anthropic) in cursor-tools.config.json and your API key.

Cursor Configuration

cursor-tools automatically configures Cursor by updating your project rules during installation. This provides:

Command suggestions
Usage examples
Context-aware assistance

For new installations, we use the recommended .cursor/rules/cursor-tools.mdc path. For existing installations, we maintain compatibility with the legacy .cursorrules file. If both files exist, we prefer the new path and show a warning.

Cursor Agent Configuration:

To get the benefits of cursor-tools you should use Cursor agent in "yolo mode". Ideal settings:

cursor-tools cli

In general you do not need to use the cli directly, your AI coding agent will call the CLI but it is useful to know it exists and this is how it works.

Command Options

All commands support these general options:

--model: Specify an alternative model
--max-tokens: Control response length
--reasoning-effort=<low|medium|high>: Control the depth of reasoning for supported models (OpenAI o1/o3-mini and Claude 3.7 Sonnet). Higher values produce more thorough responses at the cost of increased token usage.
--save-to: Save command output to a file (in addition to displaying it, like tee)
--quiet: Suppress stdout output (only useful with --save-to)
--debug: Show detailed error information
--help: View all available options
--provider: AI provider to use. Valid values: openai, anthropic, perplexity, gemini, openrouter

Documentation command specific options:

--from-github: Generate documentation for a remote GitHub repository (supports @branch syntax)
--hint: Provide additional context or focus for documentation generation

Repository command specific options:

--from-github=<GitHub username>/<repository name>[@<branch>]: Analyze a remote GitHub repository without cloning it locally
--subdir=<path>: Analyze a specific subdirectory instead of the entire repository

Plan command specific options:

--fileProvider: Provider for file identification (gemini, openai, anthropic, perplexity, modelbox, or openrouter)
--thinkingProvider: Provider for plan generation (gemini, openai, anthropic, perplexity, modelbox, or openrouter)
--fileModel: Model to use for file identification
--thinkingModel: Model to use for plan generation
--fileMaxTokens: Maximum tokens for file identification
--thinkingMaxTokens: Maximum tokens for plan generation
--debug: Show detailed error information

GitHub command specific options:

--from-github=<GitHub username>/<repository name>[@<branch>]: Access PRs/issues from a specific GitHub repository. --repo is an older, still supported synonym for this option.

Xcode command specific options:

For the build subcommand:
- buildPath=<path>: Set a custom derived data path
- destination=<destination string>: Set a custom simulator destination
For the run subcommand:
- iphone or ipad: Select device type
- device=<device name>: Specify a custom device
- buildPath=<path>: Set a custom derived data path

Browser command specific options:

--console: Capture browser console logs (enabled by default, use --no-console to disable)
--html: Capture page HTML content (disabled by default)
--network: Capture network activity (enabled by default, use --no-network to disable)
--screenshot: Save a screenshot of the page
--timeout: Set navigation timeout (default: 120000ms for Stagehand operations, 30000ms for navigation)
--viewport: Set viewport size (e.g., 1280x720)
--headless: Run browser in headless mode (default: true)
--no-headless: Show browser UI (non-headless mode) for debugging
--connect-to: Connect to existing Chrome instance
--wait: Wait after page load (e.g., 'time:5s', 'selector:#element-id')
--video: Save a video recording (1280x720 resolution, timestamped subdirectory)
--url: Required for act, observe, and extract commands. Url to navigate to on connection or one of the special values: 'current' (use existing page), 'reload-current' (refresh existing page).
--evaluate: JavaScript code to execute in the browser before the main command

Execution Methods

Execute commands using:

cursor-tools <command> [options]

For example:

cursor-tools web "What's new in TypeScript 5.7?"

Troubleshooting

Command Not Found
- Ensure cursor-tools is installed globally using npm install -g cursor-tools
- Check your system's PATH environment variable to ensure it includes npm's global bin directory
- On Unix-like systems, the global bin directory is typically /usr/local/bin or ~/.npm-global/bin
- On Windows, it's typically %AppData%\npm
API Key Errors
- Verify .cursor-tools.env exists and contains valid API keys
- Run cursor-tools install to reconfigure API keys
- Check that your API keys have the necessary permissions
- For GitHub operations, ensure your token has the required scopes (repo, read:user)
- For Google Vertex AI authentication:
  - If using a JSON key file, verify the file path is correct and the file is readable
  - If using ADC, ensure you've run gcloud auth application-default login and the account has appropriate permissions
  - Verify your service account has the necessary roles in Google Cloud Console (typically "Vertex AI User")
  - For troubleshooting ADC: Run gcloud auth application-default print-access-token to check if ADC is working
- For MCP commands ensure that either the ANTHROPIC_API_KEY or the OPENROUTER_API_KEY are set.
- If using OpenRouter for MCP, ensure OPENROUTER_API_KEY is set.
- If a provider is not specified for an MCP command, Anthropic will be used by default.
Model Errors
- Check your internet connection
- Verify API key permissions
- Ensure the specified model is available for your API tier
GitHub API Rate Limits
- GitHub API has rate limits for unauthenticated requests. For higher limits you must be authenticated.
- If you have the gh cli installed and logged in cursor-tools will use that to obtain a short lived auth token. Otherwise you can add a GitHub token to your environment:
```
GITHUB_TOKEN=your_token_here
```
- Private repositories always require authentication
Documentation Generation Issues
- Repository too large: Try using --hint to focus on specific parts
- Token limit exceeded: The tool will automatically switch to a larger model
- Network timeouts: The tool includes automatic retries
- For very large repositories, consider documenting specific directories or files
Cursor Integration
- If .cursorrules is outdated, run cursor-tools install . to update
- Ensure Cursor is configured to allow command execution
- Check that your Cursor version supports AI commands

Examples

Web Search Examples

# Get information about new technologies
cursor-tools web "What are the key features of Bun.js?"

# Check API documentation
cursor-tools web "How to implement OAuth2 in Express.js?"

# Compare technologies
cursor-tools web "Compare Vite vs Webpack for modern web development"

Repository Context Examples

# Architecture understanding
cursor-tools repo "Explain the overall architecture of this project"

# Find usage examples
cursor-tools repo "Show me examples of error handling in this codebase"

# Debugging help
cursor-tools repo "Why might the authentication be failing in the login flow?"

# Analyze specific subdirectory
cursor-tools repo "Explain the code structure" --subdir=src/components

# Analyze remote GitHub repository
cursor-tools repo "Explain the architecture" --from-github=username/repo-name

# Deep analysis with enhanced reasoning
cursor-tools repo "Analyze the security implications of our authentication implementation" --reasoning-effort high

Direct Model Query Examples

# Basic question
cursor-tools ask "What is the capital of France?" --provider openai --model o3-mini

# Complex algorithm explanation with high reasoning effort
cursor-tools ask "Explain the quicksort algorithm and analyze its time complexity in different scenarios" --provider openai --model o3-mini --reasoning-effort high

# Comparative analysis with Claude model and enhanced reasoning
cursor-tools ask "Compare and contrast microservices vs monolithic architecture" --provider anthropic --model claude-3-7-sonnet --reasoning-effort medium

Documentation Examples

# Document specific aspects and save to file without stdout output
cursor-tools doc --save-to=docs/api.md --quiet --hint="Focus on the API endpoints and their usage"

# Document with hint to customize the docs output
cursor-tools doc --save-to=docs/architecture.md --quiet --hint="Focus on system architecture"

# Document dependencies
cursor-tools doc --from-github=expressjs/express --save-to=docs/EXPRESS.md --quiet

GitHub Integration Examples

# List PRs with specific labels
cursor-tools github pr --from-github facebook/react

# Check recent issues in a specific repository
cursor-tools github issue --from-github vercel/next.js

# View PR with code review comments
cursor-tools github pr 123 --from-github microsoft/typescript

# Track issue discussions
cursor-tools github issue 456 --from-github golang/go

Xcode Command Examples

# Build an iOS app with default settings
cursor-tools xcode build

# Build with custom derived data path
cursor-tools xcode build buildPath=~/custom/derived/data

# Run in iPhone simulator
cursor-tools xcode run iphone

# Run on specific iPad model
cursor-tools xcode run device="iPad Pro (12.9-inch) (6th generation)"

# Analyze code quality
cursor-tools xcode lint

Browser Command Examples

`open` subcommand examples:

# Open a URL and get HTML
cursor-tools browser open "https://example.com" --html

# Open and capture console logs and network activity
cursor-tools browser open "https://example.com" --console --network

# Take a screenshot
cursor-tools browser open "https://example.com" --screenshot=page.png

# Run in non-headless mode for debugging
cursor-tools browser open "https://example.com" --no-headless

`act`, `extract`, `observe` subcommands examples:

# AI-powered action
cursor-tools browser act "Click on 'Sign Up'" --url "https://example.com"

# AI-powered extraction
cursor-tools browser extract "Get the main content" --url "https://example.com/blog"

# AI-powered observation
cursor-tools browser observe "What can I do on this page?" --url "https://example.com"

YouTube Command Examples

# Generate a comprehensive summary of a technical talk
cursor-tools youtube "https://www.youtube.com/watch?v=dQw4w9WgXcQ" --type=summary

# Get a complete transcript with speaker annotations
cursor-tools youtube "https://www.youtube.com/watch?v=dQw4w9WgXcQ" --type=transcript --save-to=transcript.md

# Create an implementation plan from a coding tutorial
cursor-tools youtube "https://www.youtube.com/watch?v=dQw4w9WgXcQ" --type=plan

# Generate a critical review of a tutorial's accuracy and quality
cursor-tools youtube "https://www.youtube.com/watch?v=dQw4w9WgXcQ" --type=review

# Ask specific questions about video content
cursor-tools youtube "https://www.youtube.com/watch?v=dQw4w9WgXcQ" "What libraries does the tutorial use for authentication?"

# Use a specific model for analysis
cursor-tools youtube "https://www.youtube.com/watch?v=dQw4w9WgXcQ" --model=gemini-2.5-pro-exp

# Use custom analysis type for specialized insights
cursor-tools youtube "https://www.youtube.com/watch?v=dQw4w9WgXcQ" --type=custom "Extract all code examples and explain them in detail"

Node Package Manager (npm)

cursor-tools is available on npm here

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. If you used cursor-tools to make your contribution please include screenshots or videos of cursor-tools in action.

License

MIT License - see LICENSE for details.

For Tasks:

Click tags to check more tools for each tasks

search web generate documentation integrate with github automate browser tasks build and run ios apps

For Jobs:

ai engineer software developer data scientist web developer quality assurance tester

Alternative AI tools for cursor-tools

Similar Open Source Tools

cursor-tools

github

: 3.5k

llm-vscode

llm-vscode is an extension designed for all things LLM, utilizing llm-ls as its backend. It offers features such as code completion with 'ghost-text' suggestions, the ability to choose models for code generation via HTTP requests, ensuring prompt size fits within the context window, and code attribution checks. Users can configure the backend, suggestion behavior, keybindings, llm-ls settings, and tokenization options. Additionally, the extension supports testing models like Code Llama 13B, Phind/Phind-CodeLlama-34B-v2, and WizardLM/WizardCoder-Python-34B-V1.0. Development involves cloning llm-ls, building it, and setting up the llm-vscode extension for use.

github

: 1.1k

RA.Aid

RA.Aid is an AI software development agent powered by `aider` and advanced reasoning models like `o1`. It combines `aider`'s code editing capabilities with LangChain's agent-based task execution framework to provide an intelligent assistant for research, planning, and implementation of multi-step development tasks. It handles complex programming tasks by breaking them down into manageable steps, running shell commands automatically, and leveraging expert reasoning models like OpenAI's o1. RA.Aid is designed for everyday software development, offering features such as multi-step task planning, automated command execution, and the ability to handle complex programming tasks beyond single-shot code edits.

github

: 1.6k

tiledesk-dashboard

Tiledesk is an open-source live chat platform with integrated chatbots written in Node.js and Express. It is designed to be a multi-channel platform for web, Android, and iOS, and it can be used to increase sales or provide post-sales customer service. Tiledesk's chatbot technology allows for automation of conversations, and it also provides APIs and webhooks for connecting external applications. Additionally, it offers a marketplace for apps and features such as CRM, ticketing, and data export.

github

: 258

hayhooks

Hayhooks is a tool that simplifies the deployment and serving of Haystack pipelines as REST APIs. It allows users to wrap their pipelines with custom logic and expose them via HTTP endpoints, including OpenAI-compatible chat completion endpoints. With Hayhooks, users can easily convert their Haystack pipelines into API services with minimal boilerplate code.

github

: 51

bedrock-claude-chat

This repository is a sample chatbot using the Anthropic company's LLM Claude, one of the foundational models provided by Amazon Bedrock for generative AI. It allows users to have basic conversations with the chatbot, personalize it with their own instructions and external knowledge, and analyze usage for each user/bot on the administrator dashboard. The chatbot supports various languages, including English, Japanese, Korean, Chinese, French, German, and Spanish. Deployment is straightforward and can be done via the command line or by using AWS CDK. The architecture is built on AWS managed services, eliminating the need for infrastructure management and ensuring scalability, reliability, and security.

github

: 1.1k

aidermacs

Aidermacs is an AI pair programming tool for Emacs that integrates Aider, a powerful open-source AI pair programming tool. It provides top performance on the SWE Bench, support for multi-file edits, real-time file synchronization, and broad language support. Aidermacs delivers an Emacs-centric experience with features like intelligent model selection, flexible terminal backend support, smarter syntax highlighting, enhanced file management, and streamlined transient menus. It thrives on community involvement, encouraging contributions, issue reporting, idea sharing, and documentation improvement.

github

: 376

llm-functions

LLM Functions is a project that enables the enhancement of large language models (LLMs) with custom tools and agents developed in bash, javascript, and python. Users can create tools for their LLM to execute system commands, access web APIs, or perform other complex tasks triggered by natural language prompts. The project provides a framework for building tools and agents, with tools being functions written in the user's preferred language and automatically generating JSON declarations based on comments. Agents combine prompts, function callings, and knowledge (RAG) to create conversational AI agents. The project is designed to be user-friendly and allows users to easily extend the capabilities of their language models.

github

: 263

yek

Yek is a fast Rust-based tool designed to read text-based files in a repository or directory, chunk them, and serialize them for Large Language Models (LLM) consumption. It utilizes .gitignore rules to skip unwanted files, Git history to infer important files, and additional ignore patterns. Yek splits content into chunks based on token count or byte size, supports processing multiple directories, and can stream content when output is piped. It is configurable via a 'yek.toml' file and prioritizes important files at the end of the output.

github

: 1.6k

aio-theme

github

: 71

clickclickclick

ClickClickClick is a framework designed to enable autonomous Android and computer use using various LLM models, both locally and remotely. It supports tasks such as drafting emails, opening browsers, and starting games, with current support for local models via Ollama, Gemini, and GPT 4o. The tool is highly experimental and evolving, with the best results achieved using specific model combinations. Users need prerequisites like `adb` installation and USB debugging enabled on Android phones. The tool can be installed via cloning the repository, setting up a virtual environment, and installing dependencies. It can be used as a CLI tool or script, allowing users to configure planner and finder models for different tasks. Additionally, it can be used as an API to execute tasks based on provided prompts, platform, and models.

github

: 314

ppt2desc

ppt2desc is a command-line tool that converts PowerPoint presentations into detailed textual descriptions using vision language models. It interprets and describes visual elements, capturing the full semantic meaning of each slide in a machine-readable format. The tool supports various model providers and offers features like converting PPT/PPTX files to semantic descriptions, processing individual files or directories, visual elements interpretation, rate limiting for API calls, customizable prompts, and JSON output format for easy integration.

github

: 84

mcpdoc

The MCP LLMS-TXT Documentation Server is an open-source server that provides developers full control over tools used by applications like Cursor, Windsurf, and Claude Code/Desktop. It allows users to create a user-defined list of `llms.txt` files and use a `fetch_docs` tool to read URLs within these files, enabling auditing of tool calls and context returned. The server supports various applications and provides a way to connect to them, configure rules, and test tool calls for tasks related to documentation retrieval and processing.

github

: 148

raycast_api_proxy

The Raycast AI Proxy is a tool that acts as a proxy for the Raycast AI application, allowing users to utilize the application without subscribing. It intercepts and forwards Raycast requests to various AI APIs, then reformats the responses for Raycast. The tool supports multiple AI providers and allows for custom model configurations. Users can generate self-signed certificates, add them to the system keychain, and modify DNS settings to redirect requests to the proxy. The tool is designed to work with providers like OpenAI, Azure OpenAI, Google, and more, enabling tasks such as AI chat completions, translations, and image generation.

github

: 317

askrepo

askrepo is a tool that reads the content of Git-managed text files in a specified directory, sends it to the Google Gemini API, and provides answers to questions based on a specified prompt. It acts as a question-answering tool for source code by using a Google AI model to analyze and provide answers based on the provided source code files. The tool leverages modules for file processing, interaction with the Google AI API, and orchestrating the entire process of extracting information from source code files.

github

: 206

code2prompt

Code2Prompt is a powerful command-line tool that generates comprehensive prompts from codebases, designed to streamline interactions between developers and Large Language Models (LLMs) for code analysis, documentation, and improvement tasks. It bridges the gap between codebases and LLMs by converting projects into AI-friendly prompts, enabling users to leverage AI for various software development tasks. The tool offers features like holistic codebase representation, intelligent source tree generation, customizable prompt templates, smart token management, Gitignore integration, flexible file handling, clipboard-ready output, multiple output options, and enhanced code readability.

github

: 734

For similar tasks

Botright

Botright is a tool designed for browser automation that focuses on stealth and captcha solving. It uses a real Chromium-based browser for enhanced stealth and offers features like browser fingerprinting and AI-powered captcha solving. The tool is suitable for developers looking to automate browser tasks while maintaining anonymity and bypassing captchas. Botright is available in async mode and can be easily integrated with existing Playwright code. It provides solutions for various captchas such as hCaptcha, reCaptcha, and GeeTest, with high success rates. Additionally, Botright offers browser stealth techniques and supports different browser functionalities for seamless automation.

github

: 396

CoolCline

CoolCline is a proactive programming assistant that combines the best features of Cline, Roo Code, and Bao Cline. It seamlessly collaborates with your command line interface and editor, providing the most powerful AI development experience. It optimizes queries, allows quick switching of LLM Providers, and offers auto-approve options for actions. Users can configure LLM Providers, select different chat modes, perform file and editor operations, integrate with the command line, automate browser tasks, and extend capabilities through the Model Context Protocol (MCP). Context mentions help provide explicit context, and installation is easy through the editor's extension panel or by dragging and dropping the `.vsix` file. Local setup and development instructions are available for contributors.

github

: 132

cursor-tools

github

: 3.5k

LLM-Navigation

LLM-Navigation is a repository dedicated to documenting learning records related to large models, including basic knowledge, prompt engineering, building effective agents, model expansion capabilities, security measures against prompt injection, and applications in various fields such as AI agent control, browser automation, financial analysis, 3D modeling, and tool navigation using MCP servers. The repository aims to organize and collect information for personal learning and self-improvement through AI exploration.

github

: 110

pr-agent

PR-Agent is a tool that helps to efficiently review and handle pull requests by providing AI feedbacks and suggestions. It supports various commands such as generating PR descriptions, providing code suggestions, answering questions about the PR, and updating the CHANGELOG.md file. PR-Agent can be used via CLI, GitHub Action, GitHub App, Docker, and supports multiple git providers and models. It emphasizes real-life practical usage, with each tool having a single GPT-4 call for quick and affordable responses. The PR Compression strategy enables effective handling of both short and long PRs, while the JSON prompting strategy allows for modular and customizable tools. PR-Agent Pro, the hosted version by CodiumAI, provides additional benefits such as full management, improved privacy, priority support, and extra features.

github

: 6.5k

shell_gpt

ShellGPT is a command-line productivity tool powered by AI large language models (LLMs). This command-line tool offers streamlined generation of shell commands, code snippets, documentation, eliminating the need for external resources (like Google search). Supports Linux, macOS, Windows and compatible with all major Shells like PowerShell, CMD, Bash, Zsh, etc.

github

: 9.0k

gpt-pilot

GPT Pilot is a core technology for the Pythagora VS Code extension, aiming to provide the first real AI developer companion. It goes beyond autocomplete, helping with writing full features, debugging, issue discussions, and reviews. The tool utilizes LLMs to generate production-ready apps, with developers overseeing the implementation. GPT Pilot works step by step like a developer, debugging issues as they arise. It can work at any scale, filtering out code to show only relevant parts to the AI during tasks. Contributions are welcome, with debugging and telemetry being key areas of focus for improvement.

github

: 32.2k

sirji

Sirji is an agentic AI framework for software development where various AI agents collaborate via a messaging protocol to solve software problems. It uses standard or user-generated recipes to list tasks and tips for problem-solving. Agents in Sirji are modular AI components that perform specific tasks based on custom pseudo code. The framework is currently implemented as a Visual Studio Code extension, providing an interactive chat interface for problem submission and feedback. Sirji sets up local or remote development environments by installing dependencies and executing generated code.

github

: 71

For similar jobs

sweep

Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.

github

: 7.1k

teams-ai

The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.

github

: 502

ai-guide

This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

github

: 159

classifai

Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

github

: 620

chatbot-ui

Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.

github

: 27.7k

BricksLLM

BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students

github

: 953

uAgents

uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.

github

: 1.3k

griptape

Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.

github

: 2.2k

cursor-tools

README:

Give Cursor Agent an AI team and advanced skills

Table of Contents

The AI Team

New Skills for your existing Agent

How do I use it?

Asking Perplexity to carry out web research

Asking Gemini for a plan

What is cursor-tools

Installation

Requirements

Tips:

Additional Examples

GitHub Skills

Gemini code review

Detailed Cursor Usage

Tool Recommendations

Command Nicknames

Use web search

Use repo search

Use doc generation

Use github integration

Use browser automation

Use direct model queries

Authentication and API Keys

Google Gemini API Authentication

AI Team Features

Perplexity: Web Search & Research

Gemini 2.0: Repository Context & Planning

Stagehand: Browser Automation

Browser Command Options

Video Recording

Console and Network Logging

Complex Actions

Troubleshooting Browser Commands

YouTube Video Analysis

Skills

GitHub Integration

Xcode Tools

Documentation Generation (uses Gemini 2.0)

Configuration

cursor-tools.config.json

GitHub Authentication

Repomix Configuration

Model Selection

Cursor Configuration

Cursor Agent Configuration:

cursor-tools cli

Command Options

Execution Methods

Troubleshooting

Examples

Web Search Examples

Repository Context Examples

Direct Model Query Examples

Documentation Examples

GitHub Integration Examples

Xcode Command Examples

Browser Command Examples

open subcommand examples:

act, extract, observe subcommands examples:

YouTube Command Examples

Node Package Manager (npm)

Contributing

Sponsors

Vinta.app

Resoled.it

iterate.com

License

For Tasks:

For Jobs:

Alternative AI tools for cursor-tools

Similar Open Source Tools

cursor-tools

llm-vscode

RA.Aid

tiledesk-dashboard

hayhooks

bedrock-claude-chat

`open` subcommand examples:

`act`, `extract`, `observe` subcommands examples: