Best AI tools for< Split Text >
20 - AI tool Sites
Recap
Recap is an open-source browser extension that allows users to easily summarize any portion of a webpage using ChatGPT. It provides a convenient way to extract key information from online content, making it easier for users to digest and understand lengthy articles or documents. With Recap, users can efficiently split and summarize text with the help of AI technology, enhancing their browsing experience and productivity.
GenerateSongAI
GenerateSongAI is an advanced AI music production tool that revolutionizes the way music creators generate high-quality songs effortlessly. With GenerateSongAI, users can create fully produced songs with just a simple text description or lyrics, customize music styles, extract vocals and instruments, remix existing audio files, and share their music across social platforms. The platform offers a seamless and intuitive experience for music production, catering to a wide range of users from indie filmmakers to content creators, without the need for technical expertise. GenerateSongAI unlocks endless possibilities for creating songs, limited only by the user's imagination.
promptoMANIA
promptoMANIA is an AI art community and prompt generator that allows users to create AI images using various diffusion models like CF Spark, Midjourney, and Stable Diffusion. Users can generate high-quality and detailed AI art by providing prompts and selecting different styles and references. The platform offers a user-friendly prompt builder and tools like Grid Splitter to enhance the AI art creation experience. promptoMANIA is a free online tool with no subscription or sign-up required.
**万兴科技**
**万兴科技** is an AI-powered tool that helps users create and edit PDF documents. It offers a wide range of features, including the ability to convert PDFs to other formats, edit text and images, and add annotations. **万兴科技** is a valuable tool for anyone who needs to work with PDFs on a regular basis.
AI Song Maker
AI Song Maker is an AI music generator platform that allows users to effortlessly create high-quality songs. Users can convert text to song, lyrics to song, and more using cutting-edge AI technology. The tool offers a range of features such as Text to Song, Lyrics to Song, Vocal Remover, Split Music, and Remix Music, designed to streamline the music creation process and inspire creativity. AI Song Maker is user-friendly and accessible to creators of all skill levels, providing a platform to generate unique and engaging music compositions for personal or commercial use.
LightPDF
LightPDF is an AI-powered, free online PDF editor, converter, and reader. It offers a wide range of PDF tools, including the ability to convert PDFs to and from other formats, edit PDFs, add watermarks, split and merge PDFs, rotate PDFs, annotate PDFs, optimize PDFs, compress PDFs, perform OCR on PDFs, and protect PDFs. LightPDF also offers a variety of AI-powered features, such as an AI chatbot that can answer questions about documents and an AI-powered OCR engine that can convert scanned PDFs and images to text.
PDF Translator & Editor
PDF Translator & Editor is a powerful AI-driven tool that offers multilingual document translation with format and layout preservation. It supports translation of native PDFs, scanned PDFs, Word, Excel, PowerPoint, and image files to 136 languages. The tool also provides versatile PDF conversion and editing capabilities, allowing users to convert PDFs to images and vice versa, edit PDF text, scan to PDF, and split PDF files. With AI technology from Google and Microsoft's Neural Machine Translation models, the tool ensures accurate translations with automatic language detection. Trusted by users worldwide, PDF Translator & Editor offers unlimited access without file size or page limits.
Supertranslate
Supertranslate is an AI-powered tool that allows users to automatically add English subtitles to videos in any language. It is powered by OpenAI's Whisper, which is known for being the world's most accurate speech-to-text engine. With Supertranslate, users can upload videos, generate subtitles, and download them in .srt/.vtt formats. The tool also features a fluid subtitle editor that enables users to split, merge, and adjust timecodes of the subtitles with ease. Supertranslate offers a fast and efficient way to subtitle videos, without the need for a credit card, and is free to use forever.
MakeBestMusic
MakeBestMusic is an AI-powered music production suite that allows users to create unique music with just one click using advanced AI technology. The platform offers features such as creating instrumental and vocal music by providing simple descriptive words or lyrics, remixing music by uploading audio files and providing remix descriptions, and splitting music files to extract drums and vocals. MakeBestMusic provides a versatile approach to music creation, catering to both instrumental and vocal preferences with ease. Users can generate dynamic music, remix original audio files, and extract sounds from background music using the powerful AI music tool.
Luma Dream Machine
Luma Dream Machine is a cutting-edge AI application that empowers users to ideate, visualize, and create stunning images and videos with ease. By leveraging powerful image and video AI models, users can bring their creative visions to life in a fluid and intuitive manner. The platform offers a range of features to facilitate fast iteration, creative exploration, and seamless editing, making it a go-to tool for artists, designers, and content creators seeking to push the boundaries of visual storytelling.
Luma AI
Luma AI is a 3D capture platform that allows users to create interactive 3D scenes from videos. With Luma AI, users can capture 3D models of people, objects, and environments, and then use those models to create interactive experiences such as virtual tours, product demonstrations, and training simulations.
MJSplitter
MJSplitter is a free online tool that allows users to split their Midjourney Grid images into single images. Users can either paste or upload their image grid, and the tool will automatically split the images and save them as JPEGs. The tool is not affiliated with Midjourney, and images are deleted from the server after 24 hours.
SplitSong
SplitSong.com is an AI-powered tool that allows users to split songs into individual instrument tracks. Created by @markdoppler_, the website offers a simple and efficient way to extract specific elements from music files, such as drums, keyboards, guitars, bass, and vocals. Users can upload their songs or extract tracks directly from YouTube videos. SplitSong leverages Artificial Intelligence technology to provide high-quality results and enhance the music editing experience.
Doclingo
Doclingo is an AI-powered document translation tool that supports translating documents in various formats such as PDF, Word, Excel, PowerPoint, SRT subtitles, ePub ebooks, AR&ZIP packages, and more. It utilizes large language models to provide accurate and professional translations, preserving the original layout of the documents. Users can enjoy a limited-time free trial upon registration, with the option to subscribe for more features. Doclingo aims to offer high-quality translation services through continuous algorithm improvements.
VocalRemover
Vocalremover.org is a website that offers a tool to remove vocals from music tracks. Users can upload their audio files and the tool processes them to create a version with vocals removed. The site aims to provide a simple and efficient solution for music enthusiasts, DJs, and content creators who want to create karaoke tracks or remixes.
LALAL.AI
LALAL.AI is a next-generation vocal remover and music source separation service that offers fast, easy, and precise stem extraction. It allows users to remove vocals, instrumental tracks, drums, bass, piano, electric guitar, acoustic guitar, and synthesizer tracks without compromising quality. The platform utilizes AI-powered technology to provide high-quality stem splitting based on transformer-based audio separation approach. Users can upload audio and video files to split into stems, choose from various packages for different processing limits, and enjoy features like voice cleaning, echo & reverb removal, and lead/back vocals separation.
Gaudio Studio
Gaudio Studio is an AI music separation tool designed for creators to unleash their creativity with ease. It allows users to extract background music, separate instruments, and remove vocals from any music content. Powered by GSEP (Gaudio source SEParation), a high-quality and easy-to-use AI stem separation model, Gaudio Studio offers a seamless experience for audio separation. Users can upload their songs in various formats, access the tool from desktop or mobile devices, and enjoy Studio Plans for advanced processing. Additionally, Gaudio Studio can be integrated with cloud APIs and On-device SDKs for business applications, offering a versatile solution for music professionals and enthusiasts.
Music Demixer
Music Demixer is an AI-powered online tool that allows users to separate and extract vocals from music tracks, split stems, transcribe music into MIDI files, and generate sheet music. It offers a private and offline experience, enabling users to isolate vocals, drums, bass, melody, guitar, and piano effortlessly. The tool is perfect for musicians, DJs, producers, and creators, providing advanced stem separation technology and automatic music transcription features.
Jamorphosia
Jamorphosia is an AI-powered application that allows users to remove instruments from a song. With advanced technology and audio separation capabilities, users can easily extract vocals, isolate specific instruments, and create custom backing tracks. The tool transforms audio files into personalized songs, suitable for practice or performance. Jamorphosia enhances the music experience by providing a platform for musicians to engage with original tracks in a more immersive way.
AdCopy
AdCopy is an AI-powered advertising platform that helps businesses create high-quality ads and optimize their ad campaigns. The platform uses AI to generate ad copy, create ad creatives, and provide insights into ad performance. AdCopy is designed to help businesses save time and money on their advertising campaigns, while also improving their results.
20 - Open Source AI Tools
chonkie
Chonkie is a lightweight and fast RAG chunking library designed to efficiently split text for RAG (Retrieval-Augmented Generation) applications. It offers various chunking methods like TokenChunker, WordChunker, SentenceChunker, SemanticChunker, SDPMChunker, and an experimental LateChunker. Chonkie is feature-rich, easy to use, fast, supports multiple tokenizers, and comes with a cute pygmy hippo mascot. It aims to provide a no-nonsense solution for chunking text without the need to worry about dependencies or bloat.
SimplerLLM
SimplerLLM is an open-source Python library that simplifies interactions with Large Language Models (LLMs) for researchers and beginners. It provides a unified interface for different LLM providers, tools for enhancing language model capabilities, and easy development of AI-powered tools and apps. The library offers features like unified LLM interface, generic text loader, RapidAPI connector, SERP integration, prompt template builder, and more. Users can easily set up environment variables, create LLM instances, use tools like SERP, generic text loader, calling RapidAPI APIs, and prompt template builder. Additionally, the library includes chunking functions to split texts into manageable chunks based on different criteria. Future updates will bring more tools, interactions with local LLMs, prompt optimization, response evaluation, GPT Trainer, document chunker, advanced document loader, integration with more providers, Simple RAG with SimplerVectors, integration with vector databases, agent builder, and LLM server.
modelfusion
ModelFusion is an abstraction layer for integrating AI models into JavaScript and TypeScript applications, unifying the API for common operations such as text streaming, object generation, and tool usage. It provides features to support production environments, including observability hooks, logging, and automatic retries. You can use ModelFusion to build AI applications, chatbots, and agents. ModelFusion is a non-commercial open source project that is community-driven. You can use it with any supported provider. ModelFusion supports a wide range of models including text generation, image generation, vision, text-to-speech, speech-to-text, and embedding models. ModelFusion infers TypeScript types wherever possible and validates model responses. ModelFusion provides an observer framework and logging support. ModelFusion ensures seamless operation through automatic retries, throttling, and error handling mechanisms. ModelFusion is fully tree-shakeable, can be used in serverless environments, and only uses a minimal set of dependencies.
yt-fts
yt-fts is a command line program that uses yt-dlp to scrape all of a YouTube channels subtitles and load them into a sqlite database for full text search. It allows users to query a channel for specific keywords or phrases and generates time stamped YouTube URLs to the videos containing the keyword. Additionally, it supports semantic search via the OpenAI embeddings API using chromadb.
godot-llm
Godot LLM is a plugin that enables the utilization of large language models (LLM) for generating content in games. It provides functionality for text generation, text embedding, multimodal text generation, and vector database management within the Godot game engine. The plugin supports features like Retrieval Augmented Generation (RAG) and integrates llama.cpp-based functionalities for text generation, embedding, and multimodal capabilities. It offers support for various platforms and allows users to experiment with LLM models in their game development projects.
warc-gpt
WARC-GPT is an experimental retrieval augmented generation pipeline for web archive collections. It allows users to interact with WARC files, extract text, generate text embeddings, visualize embeddings, and interact with a web UI and API. The tool is highly customizable, supporting various LLMs, providers, and embedding models. Users can configure the application using environment variables, ingest WARC files, start the server, and interact with the web UI and API to search for content and generate text completions. WARC-GPT is designed for exploration and experimentation in exploring web archives using AI.
WordLlama
WordLlama is a fast, lightweight NLP toolkit optimized for CPU hardware. It recycles components from large language models to create efficient word representations. It offers features like Matryoshka Representations, low resource requirements, binarization, and numpy-only inference. The tool is suitable for tasks like semantic matching, fuzzy deduplication, ranking, and clustering, making it a good option for NLP-lite tasks and exploratory analysis.
rag-web-ui
RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology. It helps enterprises and individuals build intelligent Q&A systems based on their own knowledge bases. By combining document retrieval and large language models, it delivers accurate and reliable knowledge-based question-answering services. The system is designed with features like intelligent document management, advanced dialogue engine, and a robust architecture. It supports multiple document formats, async document processing, multi-turn contextual dialogue, and reference citations in conversations. The architecture includes a backend stack with Python FastAPI, MySQL + ChromaDB, MinIO, Langchain, JWT + OAuth2 for authentication, and a frontend stack with Next.js, TypeScript, Tailwind CSS, Shadcn/UI, and Vercel AI SDK for AI integration. Performance optimization includes incremental document processing, streaming responses, vector database performance tuning, and distributed task processing. The project is licensed under the Apache-2.0 License and is intended for learning and sharing RAG knowledge only, not for commercial purposes.
langchain-swift
LangChain for Swift. Optimized for iOS, macOS, watchOS (part) and visionOS.(beta) This is a pure client library, no server required
giskard
Giskard is an open-source Python library that automatically detects performance, bias & security issues in AI applications. The library covers LLM-based applications such as RAG agents, all the way to traditional ML models for tabular data.
LLMLingua
LLMLingua is a tool that utilizes a compact, well-trained language model to identify and remove non-essential tokens in prompts. This approach enables efficient inference with large language models, achieving up to 20x compression with minimal performance loss. The tool includes LLMLingua, LongLLMLingua, and LLMLingua-2, each offering different levels of prompt compression and performance improvements for tasks involving large language models.
code2prompt
Code2Prompt is a powerful command-line tool that generates comprehensive prompts from codebases, designed to streamline interactions between developers and Large Language Models (LLMs) for code analysis, documentation, and improvement tasks. It bridges the gap between codebases and LLMs by converting projects into AI-friendly prompts, enabling users to leverage AI for various software development tasks. The tool offers features like holistic codebase representation, intelligent source tree generation, customizable prompt templates, smart token management, Gitignore integration, flexible file handling, clipboard-ready output, multiple output options, and enhanced code readability.
call-gpt
Call GPT is a voice application that utilizes Deepgram for Speech to Text, elevenlabs for Text to Speech, and OpenAI for GPT prompt completion. It allows users to chat with ChatGPT on the phone, providing better transcription, understanding, and speaking capabilities than traditional IVR systems. The app returns responses with low latency, allows user interruptions, maintains chat history, and enables GPT to call external tools. It coordinates data flow between Deepgram, OpenAI, ElevenLabs, and Twilio Media Streams, enhancing voice interactions.
minbpe
This repository contains a minimal, clean code implementation of the Byte Pair Encoding (BPE) algorithm, commonly used in LLM tokenization. The BPE algorithm is "byte-level" because it runs on UTF-8 encoded strings. This algorithm was popularized for LLMs by the GPT-2 paper and the associated GPT-2 code release from OpenAI. Sennrich et al. 2015 is cited as the original reference for the use of BPE in NLP applications. Today, all modern LLMs (e.g. GPT, Llama, Mistral) use this algorithm to train their tokenizers. There are two Tokenizers in this repository, both of which can perform the 3 primary functions of a Tokenizer: 1) train the tokenizer vocabulary and merges on a given text, 2) encode from text to tokens, 3) decode from tokens to text. The files of the repo are as follows: 1. minbpe/base.py: Implements the `Tokenizer` class, which is the base class. It contains the `train`, `encode`, and `decode` stubs, save/load functionality, and there are also a few common utility functions. This class is not meant to be used directly, but rather to be inherited from. 2. minbpe/basic.py: Implements the `BasicTokenizer`, the simplest implementation of the BPE algorithm that runs directly on text. 3. minbpe/regex.py: Implements the `RegexTokenizer` that further splits the input text by a regex pattern, which is a preprocessing stage that splits up the input text by categories (think: letters, numbers, punctuation) before tokenization. This ensures that no merges will happen across category boundaries. This was introduced in the GPT-2 paper and continues to be in use as of GPT-4. This class also handles special tokens, if any. 4. minbpe/gpt4.py: Implements the `GPT4Tokenizer`. This class is a light wrapper around the `RegexTokenizer` (2, above) that exactly reproduces the tokenization of GPT-4 in the tiktoken library. The wrapping handles some details around recovering the exact merges in the tokenizer, and the handling of some unfortunate (and likely historical?) 1-byte token permutations. Finally, the script train.py trains the two major tokenizers on the input text tests/taylorswift.txt (this is the Wikipedia entry for her kek) and saves the vocab to disk for visualization. This script runs in about 25 seconds on my (M1) MacBook. All of the files above are very short and thoroughly commented, and also contain a usage example on the bottom of the file.
multimodal-chat
Yet Another Chatbot is a sophisticated multimodal chat interface powered by advanced AI models and equipped with a variety of tools. This chatbot can search and browse the web in real-time, query Wikipedia for information, perform news and map searches, execute Python code, compose long-form articles mixing text and images, generate, search, and compare images, analyze documents and images, search and download arXiv papers, save conversations as text and audio files, manage checklists, and track personal improvements. It offers tools for web interaction, Wikipedia search, Python scripting, content management, image handling, arXiv integration, conversation generation, file management, personal improvement, and checklist management.
promptic
Promptic is a tool designed for LLM app development, providing a productive and pythonic way to build LLM applications. It leverages LiteLLM, allowing flexibility to switch LLM providers easily. Promptic focuses on building features by providing type-safe structured outputs, easy-to-build agents, streaming support, automatic prompt caching, and built-in conversation memory.
distilabel
Distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency. It helps you synthesize data and provide AI feedback to improve the quality of your AI models. With Distilabel, you can: * **Synthesize data:** Generate synthetic data to train your AI models. This can help you to overcome the challenges of data scarcity and bias. * **Provide AI feedback:** Get feedback from AI models on your data. This can help you to identify errors and improve the quality of your data. * **Improve your AI output quality:** By using Distilabel to synthesize data and provide AI feedback, you can improve the quality of your AI models and get better results.
ChatGPT-Telegram-Bot
The ChatGPT Telegram Bot is a powerful Telegram bot that utilizes various GPT models, including GPT3.5, GPT4, GPT4 Turbo, GPT4 Vision, DALL·E 3, Groq Mixtral-8x7b/LLaMA2-70b, and Claude2.1/Claude3 opus/sonnet API. It enables users to engage in efficient conversations and information searches on Telegram. The bot supports multiple AI models, online search with DuckDuckGo and Google, user-friendly interface, efficient message processing, document interaction, Markdown rendering, and convenient deployment options like Zeabur, Replit, and Docker. Users can set environment variables for configuration and deployment. The bot also provides Q&A functionality, supports model switching, and can be deployed in group chats with whitelisting. The project is open source under GPLv3 license.
WDoc
WDoc is a powerful Retrieval-Augmented Generation (RAG) system designed to summarize, search, and query documents across various file types. It supports querying tens of thousands of documents simultaneously, offers tailored summaries to efficiently manage large amounts of information, and includes features like supporting multiple file types, various LLMs, local and private LLMs, advanced RAG capabilities, advanced summaries, trust verification, markdown formatted answers, sophisticated embeddings, extensive documentation, scriptability, type checking, lazy imports, caching, fast processing, shell autocompletion, notification callbacks, and more. WDoc is ideal for researchers, students, and professionals dealing with extensive information sources.
7 - OpenAI Gpts
Split Screen Ad Engine
Simply Enter your Niche and we'll create your Split Screen Ads for you.
RFP Proposal Pro (IT / Software Sales assistant)
Step 1: Upload RFP Step 2: Prompt: I need a comprehensive summary of the RFP. Split the summary in multiple blocks / section. After giving me one section wait for my command to move to the next section. Step 3: Prompt: Move to the next section, please :)
Pace Assistant
Provides running splits for Strava Routes, accounting for distance and elevation changes