
well-architected-iac-analyzer
Generative AI tool for evaluating Infrastructure as Code and architecture diagrams against AWS Well-Architected best practices.
Stars: 196

Well-Architected Infrastructure as Code (IaC) Analyzer is a project that demonstrates how generative AI can be used to evaluate infrastructure code for alignment with best practices.
It features a modern web application built with React and AWS Cloudscape Design System, allowing users to upload IaC documents (e.g., AWS CloudFormation, Terraform, or AWS CDK templates), complete IaC projects (multiple files or zip archives), or architecture diagrams for assessment. The application leverages Amazon Bedrock to analyze the infrastructure against AWS Well-Architected best practices. These best practices are sourced from AWS Well-Architected whitepapers and synchronized with the Amazon Bedrock knowledge base.
This tool provides users with insights into how well their infrastructure code aligns with or deviates from established AWS best practices, offering suggestions for improving cloud architecture designs. Users can also upload supporting documents to provide additional context for more accurate analysis results. For architecture diagrams, it can even generate corresponding IaC templates following AWS best practices.
- Upload and analyze Infrastructure as Code templates:
- CloudFormation (YAML/JSON)
- Terraform (.tf)
- AWS CDK (in any supported language)
- Upload and analyze architecture diagrams:
- PNG format
- JPEG/JPG format
- Analyze complete IaC projects:
- Multiple files at once
- ZIP archives containing infrastructure code
- Add supporting documents (PDF, TXT, PNG, JPEG) to provide additional context for analysis
- Generate IaC templates from architecture diagrams
- Real-time analysis against Well-Architected best practices
- Integration with AWS Well-Architected Tool
- Export analysis results and recommendations
You have three options for deploying this solution:
- Option 1: Using a CloudFormation Deployment Stack (Recommended)
- Option 2: Using a Deployment Script
- Option 3: Manual Deployment
This option uses AWS CloudFormation to create a temporary deployment environment to deploy the Well-Architected IaC Analyzer solution. This approach doesn't require any tools to be installed on your local machine.
You must enable Amazon Bedrock model access to the following LLM models in your AWS region:
- Titan Text Embeddings V2
- Claude 3.5 Sonnet v2
To enable these models, follow the instructions in the Amazon Bedrock model access documentation.
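As a quick sanity check before deploying, you can confirm the model IDs from the command line. A minimal sketch: the Claude ID below is the stack default documented later in this README, while the Titan embeddings ID is our assumption of the standard Bedrock identifier; the `aws bedrock` call is shown commented because it requires AWS credentials.

```shell
# Model IDs the analyzer relies on (Claude ID is the stack default;
# the Titan ID is assumed, verify it in the Bedrock console)
embeddings_model="amazon.titan-embed-text-v2:0"
analysis_model="anthropic.claude-3-5-sonnet-20241022-v2:0"
echo "$embeddings_model"
echo "$analysis_model"

# To confirm availability in your region (requires AWS credentials):
# aws bedrock list-foundation-models --region us-west-2 \
#   --query "modelSummaries[].modelId" --output text
```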
- Download the CloudFormation template: iac-analyzer-deployment-stack.yaml
- Open the AWS CloudFormation console:
  - Make sure you are in the same AWS region where you enabled access to the LLM models
- On the "Create stack" page:
  - Select "Upload a template file" and upload the iac-analyzer-deployment-stack.yaml template
  - Choose "Next"
- On the "Specify stack details" page, enter or change the stack name, then:
  - Change the stack parameters as needed. Check the CloudFormation Configuration Parameters section below for details
Note: By default, the stack deploys with a Public Application Load Balancer (internet-facing) without authentication enabled. It's strongly recommended to enable authentication to properly secure your internet-facing application.
- Choose "Next" until reaching the "Review" page and choose "Submit".
The deployment process typically takes 15-20 minutes.
Once complete, you'll find a new CloudFormation stack named WA-IaC-Analyzer-{region}-GenAIStack containing all the deployed resources for this solution. Find the application URL in the stack outputs:
- In the CloudFormation console, navigate to the Outputs tab of the stack named WA-IaC-Analyzer-{region}-GenAIStack
- Look for the FrontendURL value
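The same output can also be read from the AWS CLI; a minimal sketch, assuming the stack name pattern described above (the region value is illustrative, and the `describe-stacks` call is commented because it needs credentials for the target account):

```shell
region="us-west-2"                                 # illustrative region
stack_name="WA-IaC-Analyzer-${region}-GenAIStack"  # name pattern from this README
echo "$stack_name"

# Requires AWS credentials:
# aws cloudformation describe-stacks --stack-name "$stack_name" --region "$region" \
#   --query "Stacks[0].Outputs[?OutputKey=='FrontendURL'].OutputValue" --output text
```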
- If you enabled authentication with a custom domain:
  - Create a DNS record (CNAME or Alias) pointing to the ALB domain name
- If you created a new Cognito user pool:
  - Navigate to the Amazon Cognito console
  - Find the user pool created by the stack (named "WAAnalyzerUserPool")
  - Add users who should have access to the application
- Access your deployed application using the URL from the CloudFormation outputs (or your CNAME or Alias pointing to the ALB)
Expand this section for instructions using the deployment script:
You must enable Amazon Bedrock model access to the following LLM models in your AWS region:
- Titan Text Embeddings V2
- Claude 3.5 Sonnet v2
To enable these models, follow the instructions in the Amazon Bedrock model access documentation.
Apart from enabling access to the models listed above, the following tools must be installed on your local machine:
- Node.js (v18 or later) and npm
- Python (v3.11 or later) and pip
- AWS CDK CLI
- Either one of these container tools: Docker or Finch
- AWS CLI configured with appropriate credentials
Note: If you would like to change the default Load Balancer scheme, AI model or authentication options, check the Configuration Options For Manual Deployments section first before deploying.
- Clone the Repository
git clone https://github.com/aws-samples/well-architected-iac-analyzer.git
cd well-architected-iac-analyzer
- Make the deployment script executable:
chmod +x deploy-wa-analyzer.sh
- Deploy with required parameters:
# Deploy using Docker
./deploy-wa-analyzer.sh -r us-west-2 -c docker
# Or deploy using Finch
./deploy-wa-analyzer.sh -r us-west-2 -c finch
The script will automatically:
- Check for prerequisites
- Set up the Python virtual environment
- Install all dependencies
- Deploy the CDK stack
- Provide post-deployment information
After successful deployment, you can find the Application Load Balancer (ALB) DNS name in:
- The outputs of the deploy-wa-analyzer.sh script
- The outputs section of the CloudFormation stack named WA-IaC-Analyzer-{region}-GenAIStack in the AWS Console
If you prefer to manually deploy step by step, expand this section for more instructions:
You must enable Amazon Bedrock model access to the following LLM models in your AWS region:
- Titan Text Embeddings V2
- Claude 3.5 Sonnet v2
To enable these models, follow the instructions in the Amazon Bedrock model access documentation.
Apart from enabling access to the models listed above, the following tools must be installed on your local machine:
- Node.js (v18 or later) and npm
- Python (v3.11 or later) and pip
- AWS CDK CLI
- Either one of these container tools: Docker or Finch
- AWS CLI configured with appropriate credentials
Note: If you would like to change the default Load Balancer scheme, AI model or authentication options, check the Configuration Options For Manual Deployments section first before deploying.
- Clone the Repository
git clone https://github.com/aws-samples/well-architected-iac-analyzer.git
cd well-architected-iac-analyzer
- Set Up Python Virtual Environment
# Create virtual environment
python -m venv .venv
# Activate virtual environment
# On Linux/macOS:
source .venv/bin/activate
# On Windows:
.venv\Scripts\activate
# Verify you're in the virtual environment
# You should see (.venv) at the beginning of your prompt
- Install Dependencies
Install Python dependencies:
pip3 install -r requirements.txt
Install CDK dependencies:
npm install
Install Frontend dependencies:
cd ecs_fargate_app/frontend
npm install
cd ../..
Install Backend dependencies:
cd ecs_fargate_app/backend
npm install
cd ../..
- Deploy the Stack
Set the AWS region and ignore ECR credentials storage during CDK deployment:
export CDK_DEPLOY_REGION=us-west-2
export AWS_ECR_IGNORE_CREDS_STORAGE=true
Set the container runtime:
export CDK_DOCKER=finch # For Finch
# OR
export CDK_DOCKER=docker # For Docker
Bootstrap CDK (if not already done):
cdk bootstrap
Deploy the stack:
cdk deploy
After successful deployment, you can find the Application Load Balancer (ALB) DNS name in:
- The outputs of the cdk deploy command
- The outputs section of the CloudFormation stack named WA-IaC-Analyzer-{region}-GenAIStack in the AWS Console
The deployment stack template provides a comprehensive set of configuration options organized into parameter groups. Expand this to see more details:
- Deploy with internet-facing Application Load Balancer? (PublicLoadBalancer)
  - True (Default): Deploys an internet-facing load balancer accessible from the internet
  - False: Deploys an internal load balancer accessible only within your VPC network
  - Access considerations for internal load balancers: To access an internal load balancer, you'll need network connectivity to the deployed VPC via:
    - VPC peering
    - VPN connection
    - AWS Direct Connect
    - Other network connectivity solutions
  - ⚠️ Security Warning: If you select True with authentication disabled, your application will be publicly accessible without authentication. For public-facing deployments, we strongly recommend enabling authentication.
- Amazon Bedrock Model ID (ModelId)
  - Default: anthropic.claude-3-5-sonnet-20241022-v2:0 (Claude 3.5 Sonnet v2)
  - You can specify an alternative Bedrock model ID if needed
  - Note: This application has been primarily tested with Claude 3.5 Sonnet v2. While other Bedrock models may work, using different models might lead to unexpected results.
- Enable Authentication (Authentication)
  - False (Default): No authentication required to access the application
  - True: Enables authentication via the selected method
- Authentication Method (AuthType)
  - none (Default): No authentication
  - new-cognito: Creates a new Amazon Cognito user pool
  - existing-cognito: Uses an existing Amazon Cognito user pool
  - oidc: Uses an OpenID Connect provider (Auth0, Okta, etc.)
- SSL Certificate ARN (CertificateArn)
  - Required when Authentication is set to True
  - Format: arn:aws:acm:region:account:certificate/certificate-id
  - Important: Before enabling authentication, you must have a valid AWS Certificate Manager (ACM) certificate covering the DNS domain name that you plan to use for accessing the application
These parameters are required when AuthType is set to new-cognito:
- Cognito Domain Prefix (CognitoDomainPrefix)
  - A unique prefix for your Cognito domain (e.g., wa-analyzer)
  - The resulting domain will be: your-prefix.auth.region.amazoncognito.com
- Allowed Callback URLs (CallbackUrls)
  - Comma-separated list of URLs where users will be redirected after signing in
  - Must include the URL you'll use to access the application, followed by /oauth2/idpresponse
  - Example: https://wa-analyzer.example.com/oauth2/idpresponse
- Logout Redirect URL (LogoutUrl)
  - URL where users will be redirected after signing out
  - Example: https://wa-analyzer.example.com
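How these new-cognito parameters relate to each other can be sketched in shell; every value below is illustrative, not something the stack provides:

```shell
prefix="wa-analyzer"                  # CognitoDomainPrefix
region="us-west-2"                    # deployment region
app_domain="wa-analyzer.example.com"  # DNS alias you point at the ALB

cognito_domain="${prefix}.auth.${region}.amazoncognito.com"  # resulting Cognito domain
callback_url="https://${app_domain}/oauth2/idpresponse"      # CallbackUrls entry
logout_url="https://${app_domain}"                           # LogoutUrl
echo "$cognito_domain"
echo "$callback_url"
echo "$logout_url"
```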
These parameters are required when AuthType is set to existing-cognito:
- Existing Cognito User Pool ARN (ExistingUserPoolArn)
  - ARN of your existing Cognito user pool
  - Format: arn:aws:cognito-idp:region:account:userpool/pool-id
- Existing Cognito Client ID (ExistingUserPoolClientId)
  - The client ID from your existing Cognito user pool
- Existing Cognito Domain (ExistingUserPoolDomain)
  - The domain of your existing Cognito user pool
  - Can be a Cognito prefix domain: your-prefix.auth.region.amazoncognito.com
  - Or a custom domain: auth.your-domain.com
- Existing Cognito Logout URL (ExistingCognitoLogoutUrl)
  - The URL users are redirected to after signing out
  - Example: https://wa-analyzer.example.com
These parameters are required when AuthType is set to oidc:
- OIDC Issuer URL (OidcIssuer)
  - The issuer URL for your OIDC provider
  - Example for Auth0: https://your-tenant.us.auth0.com/
- OIDC Client ID (OidcClientId)
  - The client ID from your OIDC provider
- OIDC Authorization Endpoint URL (OidcAuthorizationEndpoint)
  - The authorization endpoint of your OIDC provider
  - Example for Auth0: https://your-tenant.us.auth0.com/authorize
- OIDC Token Endpoint URL (OidcTokenEndpoint)
  - The token endpoint of your OIDC provider
  - Example for Auth0: https://your-tenant.us.auth0.com/oauth/token
- OIDC User Info Endpoint URL (OidcUserInfoEndpoint)
  - The user info endpoint of your OIDC provider
  - Example for Auth0: https://your-tenant.us.auth0.com/userinfo
- OIDC Logout URL (OidcLogoutUrl)
  - The URL for logging out users
  - Example for Auth0: https://your-tenant.us.auth0.com/v2/logout?client_id=your-client-id&returnTo=https://wa-analyzer.example.com
Important OIDC Note: Before deploying with OIDC authentication, you must create a secret in AWS Secrets Manager named WAIaCAnalyzerOIDCSecret containing your OIDC client secret, in the same region where you'll deploy the stack:
# Using AWS CLI
aws secretsmanager create-secret \
  --name WAIaCAnalyzerOIDCSecret \
  --secret-string "your-oidc-client-secret" \
  --region <aws-region>
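Before launching the stack, it can be worth confirming the secret really exists in the target region; a small sketch (the `describe-secret` call is commented because it requires AWS credentials, and the region value is illustrative):

```shell
secret_name="WAIaCAnalyzerOIDCSecret"  # must match this name exactly
echo "$secret_name"

# Requires AWS credentials; fails if the secret is missing in that region:
# aws secretsmanager describe-secret --secret-id "$secret_name" --region us-west-2
```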
If you are following either Option 2 or 3 for deploying the solution, configuration parameters are managed from the `config.ini` file. Expand this section to learn more:
If you want to use a different model than "Claude 3.5 Sonnet v2", update the config.ini with the correct model ID:
[settings]
model_id = anthropic.claude-3-5-sonnet-20241022-v2:0
Note: This application has been primarily tested with "Claude 3.5 Sonnet v2". While other Bedrock models may work, using different models might lead to unexpected results. The default model ID is set to anthropic.claude-3-5-sonnet-20241022-v2:0.
By default, this project will deploy the Load Balancer scheme as internet-facing (Public load balancer), making it accessible from the internet.
If you need to change the load balancer scheme to internal, you can modify the public_load_balancer parameter in the config.ini file:
[settings]
public_load_balancer = False
To access an internal load balancer, you will need to be in the private network connected to the deployed VPC, either via:
- VPC peering
- VPN
- AWS Direct Connect
- Other network connectivity solutions
Note: Before enabling authentication, make sure you have a valid AWS Certificate Manager (ACM) certificate covering the DNS domain name (CNAME or Alias) that you plan to use to point to this application's ALB.
For example:
- If you own the domain *.example.com
- And you plan to access the application via wa-analyzer.example.com (with a CNAME or Alias pointing to the ALB deployed by this stack)
- You must first create or import a certificate in ACM that covers either *.example.com or wa-analyzer.example.com
- Then, you can add the certificate's ARN in the certificate_arn parameter when deploying the stack
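If you do not yet have a matching certificate, one option is to request a DNS-validated certificate in ACM. A hedged sketch with illustrative domain and region values; the `aws acm` call is commented because it needs credentials and creates a real resource, and its output (the CertificateArn) is what goes into certificate_arn:

```shell
cert_domain="wa-analyzer.example.com"  # or a wildcard such as *.example.com
echo "$cert_domain"

# Requires AWS credentials; prints the CertificateArn on success:
# aws acm request-certificate --domain-name "$cert_domain" \
#   --validation-method DNS --region us-west-2
```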
The application can be deployed with different authentication configurations managed via the config.ini file.
A. Default Deployment (No Authentication)
- HTTP listener only
- Can be deployed as public or private ALB
- Example settings in config.ini:
authentication = False
auth_type = none
B. New Cognito User Pool
- HTTPS listener with AWS Cognito authentication
- Creates a new Cognito user pool
- Self-signup disabled by default
- Example settings in config.ini:
# In below example, "wa-analyzer.example.com" is the DNS alias that you would create pointing to the ALB deployed by this CDK stack
authentication = True
auth_type = new-cognito
certificate_arn = arn:aws:acm:region:account:certificate/certificate-id
cognito_domain_prefix = your-domain-prefix
callback_urls = https://wa-analyzer.example.com/oauth2/idpresponse
logout_url = https://wa-analyzer.example.com
C. Existing Cognito User Pool
- HTTPS listener with existing AWS Cognito authentication
- Uses an existing Cognito user pool
- Example settings in config.ini:
# In below example, "wa-analyzer.example.com" is the DNS alias that you would create pointing to the ALB deployed by this CDK stack
authentication = True
auth_type = existing-cognito
certificate_arn = arn:aws:acm:region:account:certificate/certificate-id
existing_user_pool_arn = arn:aws:cognito-idp:<aws-region>:<aws-account-id>:userpool/<user-pool-id>
existing_user_pool_client_id = <user-pool-client-id>
# Example using a Cognito prefix domain (https://docs.aws.amazon.com/cognito/latest/developerguide/cognito-user-pools-assign-domain-prefix.html):
existing_user_pool_domain = <your-existing-cognito-domain-prefix>.auth.<aws-region>.amazoncognito.com
# Or a custom domain, e.g. wa-analyzer-auth.example.com (https://docs.aws.amazon.com/cognito/latest/developerguide/cognito-user-pools-add-custom-domain.html):
# existing_user_pool_domain = wa-analyzer-auth.example.com
existing_cognito_logout_url = https://wa-analyzer.example.com
D. OpenID Connect (OIDC)
- HTTPS listener with OIDC authentication
- Compatible with any OIDC-compliant identity provider
If you plan to use OIDC authentication (auth_type = oidc), follow these steps:
- Before deployment, create a secret in AWS Secrets Manager named WAIaCAnalyzerOIDCSecret containing your OIDC client secret. The secret must be created in the same AWS region where you plan to deploy the stack:
# Using AWS CLI
aws secretsmanager create-secret \
  --name WAIaCAnalyzerOIDCSecret \
  --secret-string "your-oidc-client-secret" \
  --region <aws-region>
# Or you can create it via the AWS Console:
# 1. Go to AWS Secrets Manager console
# 2. Choose "Store a new secret"
# 3. Choose "Other type of secret"
# 4. Enter your OIDC client secret as a plaintext value
# 5. Set the secret name exactly as: WAIaCAnalyzerOIDCSecret
# 6. Do not add any automatic rotation
# 7. Complete the secret creation
- Configure OIDC settings in config.ini:
# Below is an example when using Auth0 as your OIDC IdP; refer to the config.ini file for more examples.
# In below example, "wa-analyzer.example.com" is the DNS alias that you would create pointing to the ALB deployed by this CDK stack
authentication = True
auth_type = oidc
certificate_arn = arn:aws:acm:region:account:certificate/certificate-id
oidc_issuer = https://<tenant-id>.us.auth0.com/
oidc_client_id = <client-id>
oidc_authorization_endpoint = https://<tenant-id>.us.auth0.com/authorize
oidc_token_endpoint = https://<tenant-id>.us.auth0.com/oauth/token
oidc_user_info_endpoint = https://<tenant-id>.us.auth0.com/userinfo
# Refer to https://auth0.com/docs/authenticate/login/logout
oidc_logout_url = https://<tenant-id>.us.auth0.com/v2/logout?client_id=<client-id>&returnTo=https://wa-analyzer.example.com
You have two options to remove all resources created by this solution:
- Open the CloudFormation console
- Find and delete the stack named
WA-IaC-Analyzer-{region}-GenAIStack
Only use this cleanup option if you followed either Option 2 or 3 when deploying the solution. Expand for more details:
- Make the destroy script executable:
chmod +x destroy-wa-analyzer.sh
- Run the script with required parameters. Make sure to use the same region where you deployed the stack:
# Clean up using Docker
./destroy-wa-analyzer.sh -r us-west-2 -c docker
# Or clean up using Finch
./destroy-wa-analyzer.sh -r us-west-2 -c finch
The script will automatically:
- Verify prerequisites
- Set up the necessary environment
- Destroy all resources in the stack
For development purposes, you can run the application locally using either Finch or Docker containers. This allows you to make changes to the code and see them reflected immediately without having to deploy code changes into your AWS stack.
In addition to the main prerequisites, ensure you have:
- Either Finch or Docker installed and running
- AWS credentials configured with access to required services
- Access to Amazon Bedrock service and the required models (as described in the main Prerequisites section)
Note for Existing Stack Users: If you have already deployed this CDK stack in your AWS account, you can skip the manual resource creation steps below. Instead:
- Go to the CloudFormation console and find your stack (it starts with "WA-IaC-Analyzer-")
- In the "Outputs" tab of the CDK CloudFormation stack, find:
  - KnowledgeBaseID: Use this value for KNOWLEDGE_BASE_ID in your .env file (for the "Setting up Local Development Environment" section below)
  - WellArchitectedDocsS3Bucket: Use this value for WA_DOCS_S3_BUCKET in your .env file (for the "Setting up Local Development Environment" section below)
If you haven't deployed the stack yet, follow the steps below to create the required resources manually:
- Create an S3 bucket:
aws s3 mb s3://your-knowledgebase-source-bucket-name --region your-aws-region
- Upload Well-Architected documents:
aws s3 cp ecs_fargate_app/well_architected_docs/ s3://your-knowledgebase-source-bucket-name/ --recursive
- Create a Bedrock Knowledge Base:
- Go to the Amazon Bedrock console
- Navigate to Knowledge bases
- Choose "Create knowledge base with vector store"
- Enter a name for your knowledge base
- Select "Amazon S3" as the data source
- Choose "Next"
- Add your S3 bucket as a data source:
- Choose the bucket you created
- Leave all other settings as default
- Choose "Next"
- Select "Titan Text Embeddings v2" as the embedding model and use the default Vector database settings
- Choose "Next" and Complete the knowledge base creation
- Note the Knowledge Base ID from the details page
- Create a .env file in the root directory with the following variables:
# AWS Authentication
AWS_REGION=your-aws-region
AWS_ACCESS_KEY_ID=your-access-key
AWS_SECRET_ACCESS_KEY=your-secret-key
AWS_SESSION_TOKEN=your-session-token
# Well-Architected Framework Resources
WA_DOCS_S3_BUCKET=your-knowledgebase-source-bucket-name
KNOWLEDGE_BASE_ID=your-kb-id
MODEL_ID=anthropic.claude-3-5-sonnet-20241022-v2:0
# Storage Configuration
STORAGE_ENABLED=true
ANALYSIS_STORAGE_BUCKET=your-analysis-storage-bucket-name
ANALYSIS_METADATA_TABLE=your-analysis-metadata-table-name
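If you want these .env values available in your own shell (for example when running one-off AWS CLI checks outside the dev containers), they can be exported with a small helper. A minimal sketch assuming simple KEY=value lines without spaces or quotes; the `load_env` function name is ours, not part of the project:

```shell
# Export every KEY=value line of a .env-style file into the environment
load_env() {
  set -a   # auto-export all variables assigned while sourcing
  . "$1"
  set +a
}

# Usage: load_env .env && echo "$AWS_REGION"
```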
Security Note: We encourage using temporary credentials (including AWS_SESSION_TOKEN) when running the application locally; see Temporary security credentials in IAM for more details. Temporary credentials have a limited lifetime and automatically expire, providing an additional layer of security.
Storage Configuration Notes:
- If you have already deployed this CDK stack in your AWS account:
- Go to the CloudFormation console and find your stack (it starts with "WA-IaC-Analyzer-")
- In the "Outputs" tab, find:
  - AnalysisStorageBucketName: Use this value for ANALYSIS_STORAGE_BUCKET
  - AnalysisMetadataTableName: Use this value for ANALYSIS_METADATA_TABLE
- Make the development script executable:
chmod +x dev.sh
- Start the development environment using either Docker or Finch:
# Using Docker
./dev.sh -c docker -up
# Or using Finch
./dev.sh -c finch -up
This will:
- Build and start the frontend container (available at http://localhost:8080)
- Build and start the backend container (available at http://localhost:3000)
- Enable hot reloading for both frontend and backend changes
- Mount source code directories as volumes for immediate updates
- To stop the development environment:
# Using Docker
./dev.sh -c docker -down
# Or using Finch
./dev.sh -c finch -down
# Start development environment
./dev.sh -c <container_tool> -up
# Stop development environment
./dev.sh -c <container_tool> -down
# Show help and usage information
./dev.sh -h
Where <container_tool> is either docker or finch.
- Local development uses finch-compose.dev.yaml for container configuration
- Production deployment continues to use CDK as described in the Installation and Deployment section
See CONTRIBUTING for more information.
This library is licensed under the MIT-0 License. See the LICENSE file.
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for well-architected-iac-analyzer
Similar Open Source Tools

well-architected-iac-analyzer
Well-Architected Infrastructure as Code (IaC) Analyzer is a project demonstrating how generative AI can evaluate infrastructure code for alignment with best practices. It features a modern web application allowing users to upload IaC documents, complete IaC projects, or architecture diagrams for assessment. The tool provides insights into infrastructure code alignment with AWS best practices, offers suggestions for improving cloud architecture designs, and can generate IaC templates from architecture diagrams. Users can analyze CloudFormation, Terraform, or AWS CDK templates, architecture diagrams in PNG or JPEG format, and complete IaC projects with supporting documents. Real-time analysis against Well-Architected best practices, integration with AWS Well-Architected Tool, and export of analysis results and recommendations are included.

RA.Aid
RA.Aid is an AI software development agent powered by `aider` and advanced reasoning models like `o1`. It combines `aider`'s code editing capabilities with LangChain's agent-based task execution framework to provide an intelligent assistant for research, planning, and implementation of multi-step development tasks. It handles complex programming tasks by breaking them down into manageable steps, running shell commands automatically, and leveraging expert reasoning models like OpenAI's o1. RA.Aid is designed for everyday software development, offering features such as multi-step task planning, automated command execution, and the ability to handle complex programming tasks beyond single-shot code edits.

backend.ai
Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs. It allocates and isolates the underlying computing resources for multi-tenant computation sessions on-demand or in batches with customizable job schedulers with its own orchestrator. All its functions are exposed as REST/GraphQL/WebSocket APIs.

web-ui
WebUI is a user-friendly tool built on Gradio that enhances website accessibility for AI agents. It supports various Large Language Models (LLMs) and allows custom browser integration for seamless interaction. The tool eliminates the need for re-login and authentication challenges, offering high-definition screen recording capabilities.

code2prompt
Code2Prompt is a powerful command-line tool that generates comprehensive prompts from codebases, designed to streamline interactions between developers and Large Language Models (LLMs) for code analysis, documentation, and improvement tasks. It bridges the gap between codebases and LLMs by converting projects into AI-friendly prompts, enabling users to leverage AI for various software development tasks. The tool offers features like holistic codebase representation, intelligent source tree generation, customizable prompt templates, smart token management, Gitignore integration, flexible file handling, clipboard-ready output, multiple output options, and enhanced code readability.

AutoAgent
AutoAgent is a fully-automated and zero-code framework that enables users to create and deploy LLM agents through natural language alone. It is a top performer on the GAIA Benchmark, equipped with a native self-managing vector database, and allows for easy creation of tools, agents, and workflows without any coding. AutoAgent seamlessly integrates with a wide range of LLMs and supports both function-calling and ReAct interaction modes. It is designed to be dynamic, extensible, customized, and lightweight, serving as a personal AI assistant.

openai-chat-api-workflow
**OpenAI Chat API Workflow for Alfred** An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT-3.5/GPT-4 π€π¬ It also allows image generation πΌοΈ, image understanding π, speech-to-text conversion π€, and text-to-speech synthesis π **Features:** * Execute all features using Alfred UI, selected text, or a dedicated web UI * Web UI is constructed by the workflow and runs locally on your Mac π» * API call is made directly between the workflow and OpenAI, ensuring your chat messages are not shared online with anyone other than OpenAI π * OpenAI does not use the data from the API Platform for training π« * Export chat data to a simple JSON format external file π * Continue the chat by importing the exported data later π

airbadge
Airbadge is a Stripe addon for Auth.js that simplifies the process of creating a SaaS site by integrating payment, authentication, gating, self-service account management, webhook handling, trials & free plans, session data, and more. It allows users to launch a SaaS app without writing any authentication or payment code. The project is open source and free to use with optional paid features under the BSL License.

openai-edge-tts
This project provides a local, OpenAI-compatible text-to-speech (TTS) API using `edge-tts`. It emulates the OpenAI TTS endpoint (`/v1/audio/speech`), enabling users to generate speech from text with various voice options and playback speeds, just like the OpenAI API. `edge-tts` uses Microsoft Edge's online text-to-speech service, making it completely free. The project supports multiple audio formats, adjustable playback speed, and voice selection options, providing a flexible and customizable TTS solution for users.

py-llm-core
PyLLMCore is a light-weighted interface with Large Language Models with native support for llama.cpp, OpenAI API, and Azure deployments. It offers a Pythonic API that is simple to use, with structures provided by the standard library dataclasses module. The high-level API includes the assistants module for easy swapping between models. PyLLMCore supports various models including those compatible with llama.cpp, OpenAI, and Azure APIs. It covers use cases such as parsing, summarizing, question answering, hallucinations reduction, context size management, and tokenizing. The tool allows users to interact with language models for tasks like parsing text, summarizing content, answering questions, reducing hallucinations, managing context size, and tokenizing text.

extension-gen-ai
The Looker GenAI Extension provides code examples and resources for building a Looker Extension that integrates with Vertex AI Large Language Models (LLMs). Users can leverage the power of LLMs to enhance data exploration and analysis within Looker. The extension offers generative explore functionality to ask natural language questions about data and generative insights on dashboards to analyze data by asking questions. It leverages components like BQML Remote Models, BQML Remote UDF with Vertex AI, and Custom Fine Tune Model for different integration options. Deployment involves setting up infrastructure with Terraform and deploying the Looker Extension by creating a Looker project, copying extension files, configuring BigQuery connection, connecting to Git, and testing the extension. Users can save example prompts and configure user settings for the extension. Development of the Looker Extension environment includes installing dependencies, starting the development server, and building for production.

AGiXT
AGiXT is a dynamic Artificial Intelligence Automation Platform engineered to orchestrate efficient AI instruction management and task execution across a multitude of providers. Our solution infuses adaptive memory handling with a broad spectrum of commands to enhance AI's understanding and responsiveness, leading to improved task completion. The platform's smart features, like Smart Instruct and Smart Chat, seamlessly integrate web search, planning strategies, and conversation continuity, transforming the interaction between users and AI. By leveraging a powerful plugin system that includes web browsing and command execution, AGiXT stands as a versatile bridge between AI models and users. With an expanding roster of AI providers, code evaluation capabilities, comprehensive chain management, and platform interoperability, AGiXT is consistently evolving to drive a multitude of applications, affirming its place at the forefront of AI technology.

openai-kotlin
OpenAI Kotlin API client is a Kotlin client for OpenAI's API with multiplatform and coroutines capabilities. It allows users to interact with OpenAI's API using Kotlin programming language. The client supports various features such as models, chat, images, embeddings, files, fine-tuning, moderations, audio, assistants, threads, messages, and runs. It also provides guides on getting started, chat & function call, file source guide, and assistants. Sample apps are available for reference, and troubleshooting guides are provided for common issues. The project is open-source and licensed under the MIT license, allowing contributions from the community.

steel-browser
Steel is an open-source browser API designed for AI agents and applications, simplifying the process of building live web agents and browser automation tools. It serves as a core building block for a production-ready, containerized browser sandbox with features like stealth capabilities, text-to-markdown conversion, session management, a UI for session viewing and debugging, and full browser control through popular automation frameworks. Steel allows users to control, run, and manage a production-ready browser environment via a REST API, offering features such as full browser control, session management, proxy support, extension support, debugging tools, anti-detection mechanisms, resource management, and various browser tools. It aims to streamline complex browsing tasks programmatically, enabling users to focus on their AI applications while Steel handles the underlying complexity.

text-extract-api
The text-extract-api is a powerful tool that allows users to convert images, PDFs, or Office documents to Markdown text or JSON structured documents with high accuracy. It is built using FastAPI and utilizes Celery for asynchronous task processing, with Redis for caching OCR results. The tool provides features such as PDF/Office to Markdown and JSON conversion, improving OCR results with LLaMA, removing Personally Identifiable Information from documents, distributed queue processing, caching using Redis, switchable storage strategies, and a CLI tool for task management. Users can run the tool locally or on cloud services, with support for GPU processing. The tool also offers an online demo for testing purposes.
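Because OCR jobs run asynchronously through Celery, a client typically submits a document and then polls until the task reports a terminal state. A minimal Python sketch of that polling pattern, assuming a local deployment; the `/result/<id>` route and the `state` field are illustrative assumptions, not the project's documented API:

```python
# Hypothetical polling sketch for an async OCR task submitted to a
# local text-extract-api instance. Endpoint path and JSON field names
# are assumptions; check the project's API docs for the real routes.
import json
import urllib.request

API = "http://localhost:8000"  # assumed local deployment


def task_finished(status_payload: str) -> bool:
    # A Celery task is done once it reports SUCCESS or FAILURE.
    state = json.loads(status_payload).get("state")
    return state in ("SUCCESS", "FAILURE")


def poll_task(task_id: str) -> bool:
    # One polling round against the (assumed) task-status endpoint.
    with urllib.request.urlopen(f"{API}/result/{task_id}") as resp:
        return task_finished(resp.read().decode())
```

A real client would loop with a short delay until `task_finished` returns True, then fetch the resulting Markdown or JSON document.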

Flowise
Flowise is a tool that allows users to build customized LLM flows with a drag-and-drop UI. It is open-source and self-hostable, and it supports various deployments, including AWS, Azure, DigitalOcean, GCP, Railway, Render, HuggingFace Spaces, Elestio, Sealos, and RepoCloud. Flowise has three different modules in a single mono repository: server, ui, and components. The server module is a Node backend that serves the API logic, the ui module is a React frontend, and the components module contains third-party node integrations. Flowise supports different environment variables to configure your instance, and you can specify these variables in the .env file inside the packages/server folder.
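For example, a `.env` file in `packages/server` might look like the following. The variable names shown here are illustrative of the kind of settings Flowise reads; consult the Flowise documentation for the currently supported set:

```
# packages/server/.env -- example instance configuration (names illustrative)
PORT=3000
FLOWISE_USERNAME=admin
FLOWISE_PASSWORD=change-me
DATABASE_PATH=/root/.flowise
```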
For similar jobs

minio
MinIO is a High Performance Object Storage released under GNU Affero General Public License v3.0. It is API compatible with Amazon S3 cloud storage service. Use MinIO to build high performance infrastructure for machine learning, analytics and application data workloads.
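Because MinIO is S3 API-compatible, standard S3 tooling works against it unchanged, including AWS Signature Version 4 request signing. As a sketch of what that compatibility means in practice, this is the SigV4 signing-key derivation that an S3 or MinIO client performs under the hood (credentials and scope values here are illustrative):

```python
# AWS Signature Version 4 signing-key derivation, the same scheme
# S3-compatible clients (boto3, mc, MinIO SDKs) use when signing
# requests to a MinIO endpoint. Credentials below are illustrative.
import hashlib
import hmac


def sigv4_signing_key(secret_key: str, date: str, region: str,
                      service: str = "s3") -> bytes:
    # Chain of HMAC-SHA256 operations defined by the SigV4 spec:
    # kDate = HMAC("AWS4"+secret, date), then region, service, "aws4_request".
    key = ("AWS4" + secret_key).encode()
    for part in (date, region, service, "aws4_request"):
        key = hmac.new(key, part.encode(), hashlib.sha256).digest()
    return key


# Deterministic: the same credentials and scope always yield the same key.
key = sigv4_signing_key("minioadmin", "20240101", "us-east-1")
```

In practice you would let an S3 SDK handle this, simply pointing it at the MinIO server (for example via boto3's `endpoint_url` parameter).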

ai-on-gke
This repository contains assets related to AI/ML workloads on Google Kubernetes Engine (GKE). Run optimized AI/ML workloads with Google Kubernetes Engine (GKE) platform orchestration capabilities. A robust AI/ML platform considers the following layers: infrastructure orchestration that supports GPUs and TPUs for training and serving workloads at scale; flexible integration with distributed computing and data processing frameworks; and support for multiple teams on the same infrastructure to maximize utilization of resources.

kong
Kong, or Kong API Gateway, is a cloud-native, platform-agnostic, scalable API Gateway distinguished for its high performance and extensibility via plugins. It also provides advanced AI capabilities with multi-LLM support. By providing functionality for proxying, routing, load balancing, health checking, authentication (and more), Kong serves as the central layer for orchestrating microservices or conventional API traffic with ease. Kong runs natively on Kubernetes thanks to its official Kubernetes Ingress Controller.

AI-in-a-Box
AI-in-a-Box is a curated collection of solution accelerators that can help engineers establish their AI/ML environments and solutions rapidly and with minimal friction, while maintaining the highest standards of quality and efficiency. It provides essential guidance on the responsible use of AI and LLM technologies, specific security guidance for Generative AI (GenAI) applications, and best practices for scaling OpenAI applications within Azure. The available accelerators include: Azure ML Operationalization in-a-box, Edge AI in-a-box, Doc Intelligence in-a-box, Image and Video Analysis in-a-box, Cognitive Services Landing Zone in-a-box, Semantic Kernel Bot in-a-box, NLP to SQL in-a-box, Assistants API in-a-box, and Assistants API Bot in-a-box.

awsome-distributed-training
This repository contains reference architectures and test cases for distributed model training with Amazon SageMaker HyperPod, AWS ParallelCluster, AWS Batch, and Amazon EKS. The test cases cover different types and sizes of models as well as different frameworks and parallel optimizations (PyTorch DDP/FSDP, MegatronLM, NemoMegatron...).

generative-ai-cdk-constructs
The AWS Generative AI Constructs Library is an open-source extension of the AWS Cloud Development Kit (AWS CDK) that provides multi-service, well-architected patterns, called constructs, for quickly defining solutions in code and creating predictable, repeatable infrastructure. The goal of AWS Generative AI CDK Constructs is to help developers build generative AI solutions using pattern-based definitions for their architecture. The patterns defined in AWS Generative AI CDK Constructs are high-level, multi-service abstractions of AWS CDK constructs with default configurations based on well-architected best practices. The library is organized into logical modules using object-oriented techniques to model each architectural pattern.

model_server
OpenVINO™ Model Server (OVMS) is a high-performance system for serving models. Implemented in C++ for scalability and optimized for deployment on Intel architectures, the model server uses the same architecture and API as TensorFlow Serving and KServe while applying OpenVINO for inference execution. Inference service is provided via gRPC or REST API, making deploying new algorithms and AI experiments easy.

dify-helm
Deploy langgenius/dify, an LLM-based chatbot app, on Kubernetes with a Helm chart.