search2ai

Help your LLMs online

Stars: 1102

Visit

S2A allows your large model API to support networking, searching, news, and web page summarization. It currently supports OpenAI, Gemini, and Moonshot (non-streaming). The large model will determine whether to connect to the network based on your input, and it will not connect to the network for searching every time. You don't need to install any plugins or replace keys. You can directly replace the custom address in your commonly used third-party client. You can also deploy it yourself, which will not affect other functions you use, such as drawing and voice.

README:

简体中文 · English

用户交流

telegram频道

友情赞助

版本更新

V0.2.6，20240425，支持 SearXNG 免费搜索服务，有损支持 Moonshot 流式模式
V0.2.5，20240425，为了解决隐私担忧，开源搜索接口部分的代码
V0.2.4，20240424，支持 Groq 的llama-3、mistral等模型，速度起飞
V0.2.3，20240423，Cloudflare Worker版本支持Azure OpenAI；支持授权码，可自定义用户的请求key
V0.2.2，20240420，支持 Moonshot 的非流式模式
V0.2.1，20240310，支持Google、Bing、Duckduckgo、Search1API新闻类搜索；支持通过环境变量MAX_RESULTS调整搜索结果数量；支持通过环境变量CRAWL_RESULTS调整希望深度搜索的数量
V0.2.0，20240310，优化openai.js，cloudflare worker版本，这次速度真的更快了！

更多历史更新，请参见版本记录

S2A

让你的大模型 API 支持联网，搜索、新闻、网页总结，已支持OpenAI、Gemini、Moonshot(非流式)，大模型会根据你的输入判断是否联网，不是每次都联网搜索，不需要安装任何插件，也不需要更换key，直接在你常用的三方客户端替换自定义地址即可，也支持自行部署，不会影响使用的其他功能，如画图、语音等

功能

模型	功能	流式输出	部署方式
`OpenAI`	联网、新闻、内容爬取	流式、非流式	Zeabur、本地部署、Cloudflare Worker、Vercel
`Azure OpenAI`	联网、新闻、内容爬取	流式、非流式	Cloudflare Worker
`Groq`	联网、新闻、内容爬取	流式、非流式	Cloudflare Worker
`Gemini`	联网	流式、非流式	Cloudflare Worker
`Moonshot`	联网、新闻、内容爬取	部分流式、非流式	Zeabur、本地部署、Cloudflare Worker（流式）、Vercel

使用

替换客户端自定义域名为你部署后的地址

部署

Zeabur一键部署

点击按钮即可一键部署，修改环境变量

如需保持项目更新，建议先fork本仓库，再通过Zeabur部署你的分支

本地部署

克隆仓库到本地

git clone https://github.com/fatwang2/search2ai

复制.env.template为.env，配置环境变量
进入api目录，运行程序，实时显示日志

cd api && nohup node index.js > output.log 2>&1 & tail -f output.log

端口3014，拼接后的完整地址如下，可根据客户端的需求配置apibase地址使用（如需https，需用nginx进行反代，网上教程很多）

http://localhost:3014/v1/chat/completions

Cloudflare Worker部署

复制search2openai.js或者search2gemini.js或者search2groq.js的代码，不需要任何修改！在cloudflare的worker里部署，上线后的worker的地址可作为你接口调用时的自定义域名地址，注意拼接，worker地址仅代表v1前的部分
在worker中配置环境变量
worker里配置触发器-自定义域名，国内直接访问worker的地址可能会出问题，需要替换为自定义域名

Vercel部署

特别说明：vercel项目暂不支持流式输出，且有10s响应限制，实际使用体验不佳，放出来主要是想等大神给我pull request

一键部署

为保证更新，也可以先fork本项目后自己在vercel上部署

环境变量

该项目提供了一些额外的配置项，通过环境变量设置：

环境变量	是否必须	描述	例子
`SEARCH_SERVICE`	Yes	你的搜索服务，选择什么服务，就需要配置什么服务的key	`search1api, google, bing, serpapi, serper, duckduckgo, searxng`
`APIBASE`	No	三方代理地址	`https://api.openai.com, https://api.moonshot.cn, https://api.groq.com/openai`
`MAX_RESULTS`	Yes	搜索结果条数	`10`
`CRAWL_RESULTS`	No	要进行深度搜索（搜索后获取网页正文）的数量，目前仅支持 search1api，深度速度会慢	`1`
`SEARCH1API_KEY`	No	如选search1api必填，我自己搭建的搜索服务，又快又便宜，点击链接注册	`xxx`
`BING_KEY`	No	如选bing搜索必填，请自行搜索教程，点击链接创建	`xxx`
`GOOGLE_CX`	No	如选Google搜索必填，Search engine ID，请自行搜索教程，点击链接创建	`xxx`
`GOOGLE_KEY`	No	如选Google搜索必填，API key，点击链接创建	`xxx`
`SERPAPI_KEY`	No	如选serpapi必填，免费100次/月，点击链接注册	`xxx`
`SERPER_KEY`	No	如选serper必填，6个月免费额度2500次，点击链接注册	`xxx`
`SEARXNG_BASE_URL`	No	如选searxng必填，填写自建searXNG服务域名，需打开 json 模式，教程参考链接	`https://search.xxx.xxx`
`OPENAI_TYPE`	No	openai供给来源，默认为openai	`openai, azure`
`RESOURCE_NAME`	No	如选azure必填	`xxxx`
`DEPLOY_NAME`	No	如选azure必填	`gpt-35-turbo`
`API_VERSION`	No	如选azure必填	`2024-02-15-preview`
`AZURE_API_KEY`	No	如选azure必填	`xxxx`
`AUTH_KEYS`	No	如果希望用户请求的时候单独定义授权码作为key，则需要填写，如选azure则必填	`000,1111,2222`
`OPENAI_API_KEY`	No	如果希望用户请求openai的时候也单独定义授权码作为key，则需要填写	`sk-xxx`

后续迭代

修复Vercel项目流式输出问题
提升流式输出的速度
支持更多垂类搜索

For Tasks:

Click tags to check more tools for each tasks

search the web get news summarize a web page write a story answer a question

For Jobs:

content writer researcher journalist student teacher

Alternative AI tools for search2ai

Similar Open Source Tools

search2ai

github

: 1.1k

devops-gpt

DevOpsGPT is a revolutionary tool designed to streamline your workflow and empower you to build systems and automate tasks with ease. Tired of spending hours on repetitive DevOps tasks? DevOpsGPT is here to help! Whether you're setting up infrastructure, speeding up deployments, or tackling any other DevOps challenge, our app can make your life easier and more productive. With DevOpsGPT, you can expect faster task completion, simplified workflows, and increased efficiency. Ready to experience the DevOpsGPT difference? Visit our website, sign in or create an account, start exploring the features, and share your feedback to help us improve. DevOpsGPT will become an essential tool in your DevOps toolkit.

github

: 52

OneClickLLAMA

OneClickLLAMA is a tool designed to run local LLM models such as Qwen2.5 and SakuraLLM with ease. It can be used in conjunction with various OpenAI format translators and analyzers, including LinguaGacha and KeywordGacha. By following the setup guides provided on the page, users can optimize performance and achieve a 3-5 times speed improvement compared to default settings. The tool requires a minimum of 8GB dedicated graphics memory, preferably NVIDIA, and the latest version of graphics drivers installed. Users can download the tool from the release page, choose the appropriate model based on usage and memory size, and start the tool by selecting the corresponding launch script.

github

: 175

dbhub

github

: 76

xiaogpt

xiaogpt is a tool that allows you to play ChatGPT and other LLMs with Xiaomi AI Speaker. It supports ChatGPT, New Bing, ChatGLM, Gemini, Doubao, and Tongyi Qianwen. You can use it to ask questions, get answers, and have conversations with AI assistants. xiaogpt is easy to use and can be set up in a few minutes. It is a great way to experience the power of AI and have fun with your Xiaomi AI Speaker.

github

: 6.5k

petercat

Peter Cat is an intelligent Q&A chatbot solution designed for community maintainers and developers. It provides a conversational Q&A agent configuration system, self-hosting deployment solutions, and a convenient integrated application SDK. Users can easily create intelligent Q&A chatbots for their GitHub repositories and quickly integrate them into various official websites or projects to provide more efficient technical support for the community.

github

: 1.3k

XiaoXinAir14IML_2019_hackintosh

XiaoXinAir14IML_2019_hackintosh is a repository dedicated to enabling macOS installation on Lenovo XiaoXin Air-14 IML 2019 laptops. The repository provides detailed information on the hardware specifications, supported systems, BIOS versions, related models, installation methods, updates, patches, and recommended settings. It also includes tools and guides for BIOS modifications, enabling high-resolution display settings, Bluetooth synchronization between macOS and Windows 10, voltage adjustments for efficiency, and experimental support for YogaSMC. The repository offers solutions for various issues like sleep support, sound card emulation, and battery information. It acknowledges the contributions of developers and tools like OpenCore, itlwm, VoodooI2C, and ALCPlugFix.

github

: 140

hume-api-examples

This repository contains examples of how to use the Hume API with different frameworks and languages. It includes examples for Empathic Voice Interface (EVI) and Expression Measurement API. The EVI examples cover custom language models, modal, Next.js integration, Vue integration, Hume Python SDK, and React integration. The Expression Measurement API examples include models for face, language, burst, and speech, with implementations in Python and Typescript using frameworks like Next.js.

github

: 164

airport

The 'airport' repository provides free Clash Meta nodes sourced from the internet, with testing every 6 hours to ensure quality and low latency. It includes features such as node deduplication, regional renaming, and geographical grouping.

github

: 78

DownEdit

DownEdit is a powerful program that allows you to download videos from various social media platforms such as TikTok, Douyin, Kuaishou, and more. With DownEdit, you can easily download videos from user profiles and edit them in bulk. You have the option to flip the videos horizontally or vertically throughout the entire directory with just a single click. Stay tuned for more exciting features coming soon!

github

: 301

hcaptcha-challenger

github

: 1.6k

DownEdit

DownEdit is a fast and powerful program for downloading and editing videos from platforms like TikTok, Douyin, and Kuaishou. It allows users to effortlessly grab videos, make bulk edits, and utilize advanced AI features for generating videos, images, and sounds in bulk. The tool offers features like video, photo, and sound editing, downloading videos without watermarks, bulk AI generation, and AI editing for content enhancement.

github

: 312

VoiceBench

VoiceBench is a repository containing code and data for benchmarking LLM-Based Voice Assistants. It includes a leaderboard with rankings of various voice assistant models based on different evaluation metrics. The repository provides setup instructions, datasets, evaluation procedures, and a curated list of awesome voice assistants. Users can submit new voice assistant results through the issue tracker for updates on the ranking list.

github

: 119

RVC_CLI

**RVC_CLI: Retrieval-based Voice Conversion Command Line Interface** This command-line interface (CLI) provides a comprehensive set of tools for voice conversion, enabling you to modify the pitch, timbre, and other characteristics of audio recordings. It leverages advanced machine learning models to achieve realistic and high-quality voice conversions. **Key Features:** * **Inference:** Convert the pitch and timbre of audio in real-time or process audio files in batch mode. * **TTS Inference:** Synthesize speech from text using a variety of voices and apply voice conversion techniques. * **Training:** Train custom voice conversion models to meet specific requirements. * **Model Management:** Extract, blend, and analyze models to fine-tune and optimize performance. * **Audio Analysis:** Inspect audio files to gain insights into their characteristics. * **API:** Integrate the CLI's functionality into your own applications or workflows. **Applications:** The RVC_CLI finds applications in various domains, including: * **Music Production:** Create unique vocal effects, harmonies, and backing vocals. * **Voiceovers:** Generate voiceovers with different accents, emotions, and styles. * **Audio Editing:** Enhance or modify audio recordings for podcasts, audiobooks, and other content. * **Research and Development:** Explore and advance the field of voice conversion technology. **For Jobs:** * Audio Engineer * Music Producer * Voiceover Artist * Audio Editor * Machine Learning Engineer **AI Keywords:** * Voice Conversion * Pitch Shifting * Timbre Modification * Machine Learning * Audio Processing **For Tasks:** * Convert Pitch * Change Timbre * Synthesize Speech * Train Model * Analyze Audio

github

: 71

RVC_CLI

RVC_CLI is a command line interface tool for retrieval-based voice conversion. It provides functionalities for installation, getting started, inference, training, UVR, additional features, and API integration. Users can perform tasks like single inference, batch inference, TTS inference, preprocess dataset, extract features, start training, generate index file, model extract, model information, model blender, launch TensorBoard, download models, audio analyzer, and prerequisites download. The tool is built on various projects like ContentVec, HIFIGAN, audio-slicer, python-audio-separator, RMVPE, FCPE, VITS, So-Vits-SVC, Harmonify, and others.

github

: 88

llm-export

llm-export is a tool for exporting llm models to onnx and mnn formats. It has features such as passing onnxruntime correctness tests, optimizing the original code to support dynamic shapes, reducing constant parts, optimizing onnx models using OnnxSlim for performance improvement, and exporting lora weights to onnx and mnn formats. Users can clone the project locally, clone the desired LLM project locally, and use LLMExporter to export the model. The tool supports various export options like exporting the entire model as one onnx model, exporting model segments as multiple models, exporting model vocabulary to a text file, exporting specific model layers like Embedding and lm_head, testing the model with queries, validating onnx model consistency with onnxruntime, converting onnx models to mnn models, and more. Users can specify export paths, skip optimization steps, and merge lora weights before exporting.

github

: 255

For similar tasks

blog

这是一个程序员关于 ChatGPT 学习过程的记录，其中包括了 ChatGPT 的使用技巧、相关工具和资源的整理，以及一些个人见解和思考。 **使用技巧** * **充值 OpenAI API**：可以通过 https://beta.openai.com/account/api-keys 进行充值，支持信用卡和 PayPal。 * **使用专梯**：推荐使用稳定的专梯，可以有效提高 ChatGPT 的访问速度和稳定性。 * **使用魔法**：可以通过 https://my.x-air.app:666/#/register?aff=32853 访问 ChatGPT，无需魔法即可访问。 * **下载各种 apk**：可以通过 https://apkcombo.com 下载各种安卓应用的 apk 文件。 * **ChatGPT 官网**：ChatGPT 的官方网站是 https://ai.com。 * **Midjourney**：Midjourney 是一个生成式 AI 图像平台，可以通过 https://midjourney.com 访问。 * **文本转视频**：可以通过 https://www.d-id.com 将文本转换为视频。 * **国内大模型**：国内也有很多大模型，如阿里巴巴的通义千问、百度文心一言、讯飞星火、阿里巴巴通义听悟等。 * **查看 OpenAI 状态**：可以通过 https://status.openai.com/ 查看 OpenAI 的服务状态。 * **Canva 画图**：Canva 是一个在线平面设计平台，可以通过 https://www.canva.cn 进行画图。 **相关工具和资源** * **文字转语音**：可以通过 https://modelscope.cn/models?page=1&tasks=text-to-speech&type=audio 找到文字转语音的模型。 * **可好好玩玩的项目**： * https://github.com/sunner/ChatALL * https://github.com/labring/FastGPT * https://github.com/songquanpeng/one-api * **个人博客**： * https://baoyu.io/ * https://gorden-sun.notion.site/527689cd2b294e60912f040095e803c5?v=4f6cc12006c94f47aee4dc909511aeb5 * **srt 2 lrc 歌词**：可以通过 https://gotranscript.com/subtitle-converter 将 srt 格式的字幕转换为 lrc 格式的歌词。 * **5 种速率限制**：OpenAI API 有 5 种速率限制：RPM（每分钟请求数）、RPD（每天请求数）、TPM（每分钟 tokens 数量）、TPD（每天 tokens 数量）、IPM（每分钟图像数量）。 * **扣子平台**：coze.cn 是一个扣子平台，可以提供各种扣子。 * **通过云函数免费使用 GPT-3.5**：可以通过 https://juejin.cn/post/7353849549540589587 免费使用 GPT-3.5。 * **不蒜子统计网页基数**：可以通过 https://busuanzi.ibruce.info/ 统计网页的基数。 * **视频总结和翻译网页**：可以通过 https://glarity.app/zh-CN 总结和翻译视频。 * **视频翻译和配音工具**：可以通过 https://github.com/jianchang512/pyvideotrans 翻译和配音视频。 * **文字生成音频**：可以通过 https://www.cnblogs.com/jijunjian/p/18118366 将文字生成音频。 * **memo ai**：memo.ac 是一个多模态 AI 平台，可以将视频链接、播客链接、本地音视频转换为文字，支持多语言转录后翻译，还可以将文字转换为新的音频。 * **视频总结工具**：可以通过 https://summarize.ing/ 总结视频。 * **可每天免费玩玩**：可以通过 https://www.perplexity.ai/ 每天免费玩玩。 * **Suno.ai**：Suno.ai 是一个 AI 语言模型，可以通过 https://bibigpt.co/ 访问。 * **CapCut**：CapCut 是一个视频编辑软件，可以通过 https://www.capcut.cn/ 下载。 * **Valla.ai**：Valla.ai 是一个多模态 AI 模型，可以通过 https://www.valla.ai/ 访问。 * **Viggle.ai**：Viggle.ai 是一个 AI 视频生成平台，可以通过 https://viggle.ai 访问。 * **使用免费的 GPU 部署文生图大模型**：可以通过 https://www.cnblogs.com/xuxiaona/p/18088404 部署文生图大模型。 * **语音转文字**：可以通过 https://speech.microsoft.com/portal 将语音转换为文字。 * **投资界的 ai**：可以通过 https://reportify.cc/ 了解投资界的 ai。 * **抓取小视频 app 的各种信息**：可以通过 https://github.com/NanmiCoder/MediaCrawler 抓取小视频 app 的各种信息。 * **马斯克 Grok1 开源**：马斯克的 Grok1 模型已经开源，可以通过 https://github.com/xai-org/grok-1 访问。 * **ChatALL**：ChatALL 是一个跨端支持的聊天机器人，可以通过 https://github.com/sunner/ChatALL 访问。 * **零一万物**：零一万物是一个 AI 平台，可以通过 https://www.01.ai/cn 访问。 * **智普**：智普是一个 AI 语言模型，可以通过 https://chatglm.cn/ 访问。 * **memo ai 下载**：可以通过 https://memo.ac/ 下载 memo ai。 * **ffmpeg 学习**：可以通过 https://www.ruanyifeng.com/blog/2020/01/ffmpeg.html 学习 ffmpeg。 * **自动生成文章小工具**：可以通过 https://www.cognition-labs.com/blog 生成文章。 * **简易商城**：可以通过 https://www.cnblogs.com/whuanle/p/18086537 搭建简易商城。 * **物联网**：可以通过 https://www.cnblogs.com/xuxiaona/p/18088404 学习物联网。 * **自定义表单、自定义列表、自定义上传和下载、自定义流程、自定义报表**：可以通过 https://www.cnblogs.com/whuanle/p/18086537 实现自定义表单、自定义列表、自定义上传和下载、自定义流程、自定义报表。 **个人见解和思考** * ChatGPT 是一个强大的工具，可以用来提高工作效率和创造力。 * ChatGPT 的使用门槛较低，即使是非技术人员也可以轻松上手。 * ChatGPT 的发展速度非常快，未来可能会对各个行业产生深远的影响。 * 我们应该理性看待 ChatGPT，既要看到它的优点，也要意识到它的局限性。 * 我们应该积极探索 ChatGPT 的应用场景，为社会创造价值。

github

: 81

chat-ui

A chat interface using open source models, eg OpenAssistant or Llama. It is a SvelteKit app and it powers the HuggingChat app on hf.co/chat.

github

: 8.5k

ChatterUI

ChatterUI is a mobile app that allows users to manage chat files and character cards, and to interact with Large Language Models (LLMs). It supports multiple backends, including local, koboldcpp, text-generation-webui, Generic Text Completions, AI Horde, Mancer, Open Router, and OpenAI. ChatterUI provides a mobile-friendly interface for interacting with LLMs, making it easy to use them for a variety of tasks, such as generating text, translating languages, writing code, and answering questions.

github

: 1.1k

99AI

99AI is a commercializable AI web application based on NineAI 2.4.2 (no authorization, no backdoors, no piracy, integrated front-end and back-end integration packages, supports Docker rapid deployment). The uncompiled source code is temporarily closed. Compared with the stable version, the development version is faster.

github

: 736

chatnio

Chat Nio is a next-generation AI one-stop solution that provides a rich and user-friendly interface for interacting with various AI models. It offers features such as AI chat conversation, rich format compatibility, markdown support, message menu support, multi-platform adaptation, dialogue memory, full-model file parsing, full-model DuckDuckGo online search, full-screen large text editing, model marketplace, preset support, site announcements, preference settings, internationalization support, and a rich admin system. Chat Nio also boasts a powerful channel management system that utilizes a self-developed channel distribution algorithm, supports multi-channel management, is compatible with multiple formats, allows for custom models, supports channel retries, enables balanced load within the same channel, and provides channel model mapping and user grouping. Additionally, Chat Nio offers forwarding API services that are compatible with multiple formats in the OpenAI universal format and support multiple model compatible layers. It also provides a custom build and install option for highly customizable deployments. Chat Nio is an open-source project licensed under the Apache License 2.0 and welcomes contributions from the community.

github

: 2.8k

Awesome-LLM-Reasoning

**Curated collection of papers and resources on how to unlock the reasoning ability of LLMs and MLLMs.** **Description in less than 400 words, no line breaks and quotation marks.** Large Language Models (LLMs) have revolutionized the NLP landscape, showing improved performance and sample efficiency over smaller models. However, increasing model size alone has not proved sufficient for high performance on challenging reasoning tasks, such as solving arithmetic or commonsense problems. This curated collection of papers and resources presents the latest advancements in unlocking the reasoning abilities of LLMs and Multimodal LLMs (MLLMs). It covers various techniques, benchmarks, and applications, providing a comprehensive overview of the field. **5 jobs suitable for this tool, in lowercase letters.** - content writer - researcher - data analyst - software engineer - product manager **Keywords of the tool, in lowercase letters.** - llm - reasoning - multimodal - chain-of-thought - prompt engineering **5 specific tasks user can use this tool to do, in less than 3 words, Verb + noun form, in daily spoken language.** - write a story - answer a question - translate a language - generate code - summarize a document

github

: 2.3k

Chinese-LLaMA-Alpaca-2

Chinese-LLaMA-Alpaca-2 is a large Chinese language model developed by Meta AI. It is based on the Llama-2 model and has been further trained on a large dataset of Chinese text. Chinese-LLaMA-Alpaca-2 can be used for a variety of natural language processing tasks, including text generation, question answering, and machine translation. Here are some of the key features of Chinese-LLaMA-Alpaca-2: * It is the largest Chinese language model ever trained, with 13 billion parameters. * It is trained on a massive dataset of Chinese text, including books, news articles, and social media posts. * It can be used for a variety of natural language processing tasks, including text generation, question answering, and machine translation. * It is open-source and available for anyone to use. Chinese-LLaMA-Alpaca-2 is a powerful tool that can be used to improve the performance of a wide range of natural language processing tasks. It is a valuable resource for researchers and developers working in the field of artificial intelligence.

github

: 6.8k

Linly-Talker

Linly-Talker is an innovative digital human conversation system that integrates the latest artificial intelligence technologies, including Large Language Models (LLM) 🤖, Automatic Speech Recognition (ASR) 🎙️, Text-to-Speech (TTS) 🗣️, and voice cloning technology 🎤. This system offers an interactive web interface through the Gradio platform 🌐, allowing users to upload images 📷 and engage in personalized dialogues with AI 💬.

github

: 2.2k

For similar jobs

ChatFAQ

ChatFAQ is an open-source comprehensive platform for creating a wide variety of chatbots: generic ones, business-trained, or even capable of redirecting requests to human operators. It includes a specialized NLP/NLG engine based on a RAG architecture and customized chat widgets, ensuring a tailored experience for users and avoiding vendor lock-in.

github

: 128

anything-llm

AnythingLLM is a full-stack application that enables you to turn any document, resource, or piece of content into context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions.

github

: 42.1k

ai-guide

This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

github

: 159

classifai

Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

github

: 620

mikupad

mikupad is a lightweight and efficient language model front-end powered by ReactJS, all packed into a single HTML file. Inspired by the likes of NovelAI, it provides a simple yet powerful interface for generating text with the help of various backends.

github

: 300

glide

Glide is a cloud-native LLM gateway that provides a unified REST API for accessing various large language models (LLMs) from different providers. It handles LLMOps tasks such as model failover, caching, key management, and more, making it easy to integrate LLMs into applications. Glide supports popular LLM providers like OpenAI, Anthropic, Azure OpenAI, AWS Bedrock (Titan), Cohere, Google Gemini, OctoML, and Ollama. It offers high availability, performance, and observability, and provides SDKs for Python and NodeJS to simplify integration.

github

: 110

onnxruntime-genai

ONNX Runtime Generative AI is a library that provides the generative AI loop for ONNX models, including inference with ONNX Runtime, logits processing, search and sampling, and KV cache management. Users can call a high level `generate()` method, or run each iteration of the model in a loop. It supports greedy/beam search and TopP, TopK sampling to generate token sequences, has built in logits processing like repetition penalties, and allows for easy custom scoring.

github

: 442

firecrawl

Firecrawl is an API service that takes a URL, crawls it, and converts it into clean markdown. It crawls all accessible subpages and provides clean markdown for each, without requiring a sitemap. The API is easy to use and can be self-hosted. It also integrates with Langchain and Llama Index. The Python SDK makes it easy to crawl and scrape websites in Python code.

github

: 34.1k