Best AI tools for< Linguist >

Infographic

52 - AI tool Sites

VocaBuddy

VocaBuddy is a vocabulary building tool designed to help users enhance their language skills. It provides a platform for users to store and practice their vocabulary in an interactive and engaging manner. With features like random word generation and building vocabulary from selected words, VocaBuddy aims to make learning new words fun and effective. The app is user-friendly and suitable for language learners of all levels.

site

: 429

MachineTranslation.com

MachineTranslation.com is a leading AI translation platform trusted by over 1,000,000 users worldwide. It offers accurate and secure translations for businesses, professionals, and individuals. With support for 270+ languages and various file types, the platform ensures high-quality translations with human review. Users can personalize translations, preserve original formatting, and compare results from multiple AI sources. MachineTranslation.com is committed to making AI translation accessible and reliable through advanced technology and a user-friendly interface.

site

: 52.2k

TextSynth

TextSynth is an AI tool that provides access to large language models such as Mistral, Llama, Stable Diffusion, Whisper for text-to-image, text-to-speech, and speech-to-text capabilities via a REST API and a playground. It employs custom inference code for faster inference on standard GPUs and CPUs. Founded in 2020, TextSynth was among the first to offer access to the GPT-2 language model. The service is free with rate limitations, but users can opt for unlimited access by paying a small fee per request. All servers are located in France.

site

: 7.3k

Art Review Generator

The Art Review Generator is a natural language processing tool and text generator that analyzes and generates language used to describe art and culture. It utilizes a vast amount of training data from 57 years of art reviews to create medium-length sentences. While not classified as artificial intelligence, it leverages deep matrices of probability to produce new text. The tool focuses on modern art reviews, capturing the distinctive language of human expression, including intent, emotion, technique, and impact. Despite potential biases and glitches, it offers insights into evolving language and cultural changes.

site

: 662

Cameron Jones

The Cameron Jones website is a platform maintained by a Cognitive Science PhD student with a focus on persuasion, deception, and social intelligence in humans and Large Language Models (LLMs). The site showcases the student's publications, projects, and CV, along with research on LLM performance in tasks like the False Belief task and the Turing test.

site

: 136

BookTranslator

BookTranslator is an online EPUB/PDF book translation tool that empowers effortless reading of global books. It supports translation of books in multiple languages and can accurately translate the entire EPUB/PDF book into the language you want in just one minute. The tool uses the latest AI technology to ensure the best translation quality, providing lightning-fast translations and bilingual content comparison for better understanding. With support for over 74 languages, users can easily translate their books and access their translation history for future reference.

site

: 30.7k

Vokabeln.io

Vokabeln.io is a website that provides resources and information related to vocabulary. It was generated by the domain owner using Sedo Domain Parking. The site offers various materials and tools to help users improve their vocabulary skills. Please note that Sedo, the platform used for domain parking, does not have any relationship with third-party advertisers. The website does not endorse or recommend any specific service or trademark.

site

: 130

Context

Context is an AI tool designed for iOS/macOS that helps users look up the meaning of words within a sentence, particularly useful for serious foreign language reading situations. By leveraging AI technology, users can easily understand the context of unfamiliar words by asking the AI for explanations. Context simplifies the process of word lookup and provides features like explaining the whole sentence, saving lookups, syncing between devices, and exporting data in Markdown/CSV format. While the app is not free, it offers a free trial with a limited number of lookups before requiring a subscription. Developed by Chen Guo, Context aims to enhance language comprehension and reading experiences for users.

site

: 0

LM Studio

LM Studio is an AI tool designed for discovering, downloading, and running local LLMs (Large Language Models). Users can run LLMs on their laptops offline, use models through an in-app Chat UI or a local server, download compatible model files from HuggingFace repositories, and discover new LLMs. The tool ensures privacy by not collecting data or monitoring user actions, making it suitable for personal and business use. LM Studio supports various models like ggml Llama, MPT, and StarCoder on Hugging Face, with minimum hardware/software requirements specified for different platforms.

site

: 2.1m

DocTranslator

DOCTRANSLATOR.COM is an AI-powered online document translation platform that offers seamless translation services for various document formats. With support for over 100 languages, the platform ensures accurate and contextually relevant translations while maintaining the natural flow of content. Users can easily upload documents, track translation progress, and receive high-quality translations tailored to their audience. The platform simplifies the translation process, providing status tracking, dedicated support, and a user-friendly experience.

site

: 246.1k

Ollama

Ollama is an AI tool that allows users to access and utilize large language models such as Llama 3, Phi 3, Mistral, Gemma 2, and more. Users can customize and create their own models. The tool is available for macOS, Linux, and Windows platforms, offering a preview version for users to explore and utilize these models for various applications.

site

: 65.8k

Stability AI

Stability AI is an AI application that offers a suite of models for various modalities such as image, video, audio, 3D, and language. It provides cutting-edge generative AI technology with a focus on stability and quality. Users can access advanced AI models for tasks like text-to-image generation, video modeling, audio generation, and more. The application also offers licensing options for commercial use and self-hosting benefits.

site

: 1.2m

slAItor

slAItor is an AI translation assistant powered by GPT technology. It offers advanced translation features and customization options to enhance the translation experience. Users can benefit from step-by-step translations, multiple translation alternatives, and unique translation styles. The tool supports 28 language pairs and combines recent AI advancements with traditional translation techniques to deliver accurate and efficient translations. slAItor also provides post-processing and evaluation steps to ensure translation quality and offers a user-friendly interface for seamless translation management.

site

: 0

Lingolette

Lingolette is an AI language teaching machine that helps users master a language faster through personalized neural network chat-based tools. It speaks with users like a real teacher, motivates them on their learning journey, adapts to their learning style, and explains concepts clearly. Lingolette aims to enhance users' talking skills, pronunciation, and overall language learning experience.

site

: 11.1k

LEXICON

LEXICON is an AI-powered platform designed to enhance English learning through a combination of expert teachers and artificial intelligence. The platform offers personalized lesson programs developed by teachers to accelerate English speaking skills. By integrating AI technologies into language teaching methods, LEXICON provides a faster and more efficient learning experience. Users can benefit from specialized AI-supported exercises and receive detailed feedback on errors made during lessons. With over 10 years of experience in the software industry, LEXICON is committed to providing learners with tools tailored for language acquisition.

site

: 615

Lune

Lune is a language learning application that leverages AI technology to provide a personalized and dynamic learning experience. The app tracks your learning progress to create tailored content and courses. With a multifaceted approach, Lune offers various ways to learn languages, including holding conversations, guided learning, and reading stories. It categorizes every word you encounter and helps you remember vocabulary effortlessly. Additionally, Lune allows users to share their progress with friends and family. The app is designed to make language learning engaging, efficient, and accessible to all.

site

: 0

LanguaTalk

LanguaTalk is a language learning platform that offers personalized coaching from 5-star tutors and world-class AI software to help users achieve fluency in various languages. The platform provides tailored language tutoring, practice sessions with advanced AI, and a combination of both for accelerated learning. LanguaTalk leverages AI technology to offer features such as personalized coaching, practice conversations, role plays, instant corrections, interactive transcripts, vocabulary learning through flashcards, and more. The platform has both free and paid plans, with a focus on providing effective and affordable language learning solutions.

site

: 161.4k

Webnovels AI

Webnovels AI is an advanced AI tool designed for translating web novels, light novels, and e-books from Chinese, Korean, and Japanese into English. The platform offers instant bulk translation for complete novels, customizable glossary for perfect translations, the option to bring your own GPT key for unlimited access, and is tailored for both avid readers and professional translators. Users can monetize their translations and earn revenue, making it a cost-effective solution loved by over 100 readers and translators. Webnovels AI aims to provide the highest quality AI translation, backed by years of experience in reading and writing translations for Asian novels.

site

: 13.8k

AI Manga Translator

AI Manga Translator is an online tool powered by AI technology that allows users to upload and translate manga instantly. It supports multiple languages and translation engines, ensuring precision manga translation without altering the original style. The tool is user-friendly, making it accessible to all users, whether they are manga fans or professionals needing document translations. AI Manga Translator offers various plans for different translation needs, with accurate and fast translations powered by AI technology.

site

: 0

Reverso

Reverso is a free translation tool that offers a comprehensive set of features to help users translate text, documents, and more in multiple languages. It provides grammar checks, synonyms, conjugation, and context-based translations. With cutting-edge AI-powered technology, Reverso delivers accurate translations in various languages, including Arabic, Chinese, Italian, and more. Users can enjoy features like document translation, integrated dictionaries, grammar check, neural machine translation, pronunciation guides, and verb conjugation. Reverso aims to assist millions of individuals and organizations in communicating efficiently and precisely across different languages.

site

: 84.6m

Hayai Learn

Hayai Learn is an AI-powered platform designed to help users learn Japanese quickly and effectively by immersing them in Japanese content such as YouTube videos. The platform utilizes AI technology to assist users in acquiring new vocabulary and grammar effortlessly. By offering features like word learning from subtitles, providing relevant word meanings, offering video examples for better memory association, and assisting with sentence mining, Hayai Learn aims to revolutionize the way Japanese is learned by making it fun and engaging.

site

: 10.6k

Bentolingo

Bentolingo is an AI language learning tool that offers users a daily 'bento box' of language learning resources. The platform utilizes artificial intelligence to personalize language lessons and exercises based on individual learning styles and progress. Users can improve their language skills in a fun and interactive way through a variety of activities such as vocabulary quizzes, grammar exercises, and conversation practice. Bentolingo aims to make language learning more accessible and engaging for learners of all levels.

site

: 0

Ogma

Ogma is an interpretable symbolic general problem-solving model that utilizes a symbolic sequence modeling paradigm to address tasks requiring reliability, complex decomposition, and without hallucinations. It offers solutions in areas such as math problem-solving, natural language understanding, and resolution of uncertainty. The technology is designed to provide a structured approach to problem-solving by breaking down tasks into manageable components while ensuring interpretability and self-interpretability. Ogma aims to set benchmarks in problem-solving applications by offering a reliable and transparent methodology.

site

: 0

LLM Token Counter

The LLM Token Counter is a sophisticated tool designed to help users effectively manage token limits for various Language Models (LLMs) like GPT-3.5, GPT-4, Claude-3, Llama-3, and more. It utilizes Transformers.js, a JavaScript implementation of the Hugging Face Transformers library, to calculate token counts client-side. The tool ensures data privacy by not transmitting prompts to external servers.

site

: 0

Alexa Translations

Alexa Translations is an AI-powered translation service that combines the expertise of professional translators with cutting-edge AI technology to provide custom translation solutions for legal, financial, securities, and HR teams. The platform offers a blend of human intelligence and AI capabilities, ensuring unmatched quality and accuracy in translations. With a focus on industry-specific terminology and regulatory compliance, Alexa Translations caters to a wide range of clients, including banks, law firms, and other industry leaders.

site

: 34.7k

Vosyn

Vosyn is an advanced AI-driven platform that offers real-time translation, adaptive learning, and personalized experiences. It aims to create a connected world where digital platforms go beyond content repositories to become gateways to a rich, inclusive, and universally accessible world of information and entertainment. VosynVerse enables understanding across languages, VosynCore drives multilingual communication, and VosynConnect provides tailored solutions for businesses, all powered by cutting-edge AI technology.

site

: 8.5k

SpeakPal

SpeakPal is an AI language learning platform that offers stages of vocabulary, phrases, daily dialogues, writing, and downloadable exercises. It supports interactive chat with an AI language tutor and is available in 28 languages. The platform provides immersive language experiences and personalized teaching approaches to make language learning efficient and enjoyable.

site

: 0

Meta AI

Meta AI is an advanced AI tool designed to facilitate language learning and translation. It leverages cutting-edge artificial intelligence technology to provide users with a seamless experience in mastering various languages. With Meta AI, users can enhance their language skills, improve translation accuracy, and broaden their cultural understanding. The tool is user-friendly and offers a wide range of features to cater to different learning styles and preferences.

site

: 1.7k

OdiaGenAI

OdiaGenAI is a collaborative initiative focused on conducting research on Generative AI and Large Language Models (LLM) for the Odia Language. The project aims to leverage AI technology to develop Generative AI and LLM-based solutions for the overall development of Odisha and the Odia language through collaboration among Odia technologists. The initiative offers pre-trained models, codes, and datasets for non-commercial and research purposes, with a focus on building language models for Indic languages like Odia and Bengali.

site

: 6.7k

Neoform AI

Neoform AI is an innovative AI tool that focuses on developing AI models specifically for African dialects. The platform aims to bridge the gap in AI technology by providing solutions tailored to the linguistic diversity of Africa. With a commitment to inclusivity and cultural representation, Neoform AI is revolutionizing the field of artificial intelligence by addressing the unique challenges faced by African languages. Through cutting-edge research and development, Neoform AI is paving the way for greater accessibility and accuracy in AI applications across the continent.

site

: 0

Old English Translator

Old English Translator is an AI tool designed to convert text from Old English, the earliest form of English spoken during the early Middle Ages, into Modern English. It helps users understand and interpret texts written in Old English, which is significantly different from the English language spoken today.

site

: 0

AppTek.ai

AppTek.ai is a global leader in artificial intelligence (AI) and machine learning (ML) technologies, providing advanced solutions in automatic speech recognition, neural machine translation, natural language processing/understanding, large language models, and text-to-speech technologies. The platform offers industry-leading language solutions for various sectors such as media and entertainment, call centers, government, and enterprise business. AppTek.ai combines cutting-edge AI research with real-world applications, delivering accurate and efficient tools for speech transcription, translation, understanding, and synthesis across multiple languages and dialects.

site

: 20.8k

Arcee AI

Arcee AI is a platform that offers a cost-effective, secure, end-to-end solution for building and deploying Small Language Models (SLMs). It allows users to merge and train custom language models by leveraging open source models and their own data. The platform is known for its Model Merging technique, which combines the power of pre-trained Large Language Models (LLMs) with user-specific data to create high-performing models across various industries.

site

: 24.5k

TextUnited

TextUnited is an AI Translation Platform that offers expert translations through an AI-powered platform and world-class customer service. It provides solutions for website translation, eLearning & education, software localization, unlocking new markets, multilingual customer experience, and organization & productivity of translation. The platform uses AI technology to deliver custom translations at scale and a fraction of the cost, while also offering human translation services by expert linguists. TextUnited stands out for its simplicity, power of AI, customer service, content automation, and continuously enhanced automatic translation.

site

: 5.6k

DeutschDictionary.com

DeutschDictionary.com is the go-to online platform for language learners, specifically focusing on German-English translations. With over 1 million translations, conjugations, and example sentences, it provides a comprehensive resource for individuals looking to enhance their German language skills. The website is powered by AI teachers and editors who continuously update the database with new words and translations. DeutschDictionary.com is loved globally for its user-friendly interface, daily additions of new words, example sentences, and search improvements. It is a free and reliable tool for anyone interested in learning German.

site

: 0

Tala

Tala is an AI-powered language tutor designed for hands-on learners. It encourages free-flowing conversation early in the learning journey, focusing on natural language acquisition rather than rote memorization. With advanced speech recognition technology, Tala helps users build confidence in speaking and offers a flexible learning experience with adjustable listening speeds and easy access to look-up tools. The platform aims to make language learning engaging and immersive, allowing users to practice without fear of embarrassment and improve their pronunciation through interactive conversations.

site

: 0

Reka

Reka is a cutting-edge AI application offering next-generation multimodal AI models that empower agents to see, hear, and speak. Their flagship model, Reka Core, competes with industry leaders like OpenAI and Google, showcasing top performance across various evaluation metrics. Reka's models are natively multimodal, capable of tasks such as generating textual descriptions from videos, translating speech, answering complex questions, writing code, and more. With advanced reasoning capabilities, Reka enables users to solve a wide range of complex problems. The application provides end-to-end support for 32 languages, image and video comprehension, multilingual understanding, tool use, function calling, and coding, as well as speech input and output.

site

: 144.4k

Tilde.ai

Tilde.ai is a language technology platform that offers a wide range of AI-powered solutions for translation, speech technologies, and conversational AI. It combines human and artificial intelligence to help people connect and work efficiently. The platform provides machine translation, speech-to-text conversion, text-to-speech synthesis, real-time transcription, AI chatbots, internal knowledge assistants, and meeting support services. Tilde.ai aims to bridge language barriers and enhance communication by leveraging advanced language technologies.

site

: 73.0k

Mondonomo

Mondonomo is an AI tool that helps users explore the origins and meanings of names. Users can input their name or surname to discover information such as the countries where their name is common, transliterations, variants, famous people with the same name, and more. The platform also offers articles on onomastics, name science, and business solutions related to names. Additionally, users can design personalized wordclouds using the AI Wordcloud feature.

site

: 114.8k

Unsloth

Unsloth is an AI tool designed to make finetuning large language models like Llama-3, Mistral, Phi-3, and Gemma 2x faster, use 70% less memory, and with no degradation in accuracy. The tool provides documentation to help users navigate through training their custom models, covering essentials such as installing and updating Unsloth, creating datasets, running, and deploying models. Users can also integrate third-party tools and utilize platforms like Google Colab.

site

: 20.3k

I ♡ Transcriptions

I ♡ Transcriptions is an AI-powered platform that offers unlimited transcription services for audio and video files. It converts files to text in multiple languages with high accuracy. The platform was created to simplify transcription technology and make it accessible and affordable for users who need to transcribe content with high quality. It supports popular file formats, provides secure data handling, and offers features like speaker recognition and translation. The platform is developed by Jose María Campaña, a full-stack developer, and Tania Campaña, a linguistics doctor, with the vision of making transcription technology truly useful for everyone.

site

: 0

DeepSeek R1

DeepSeek R1 is a revolutionary open-source AI model for advanced reasoning that outperforms leading AI models in mathematics, coding, and general reasoning tasks. It utilizes a sophisticated MoE architecture with 37B active/671B total parameters and 128K context length, incorporating advanced reinforcement learning techniques. DeepSeek R1 offers multiple variants and distilled models optimized for complex problem-solving, multilingual understanding, and production-grade code generation. It provides cost-effective pricing compared to competitors like OpenAI o1, making it an attractive choice for developers and enterprises.

site

: 0

Verbit

Verbit is an AI-based transcription and captioning service that provides unmatched accuracy with actionable insights. It offers services such as live and recorded captioning, transcription, audio description, translation, note-taking, and dubbing. Verbit caters to various industries including media & entertainment, legal, education, corporate & market research, and government. The platform leverages AI technologies like Automatic Speech Recognition engine (Captivate™) and Generative AI technology (Gen.V™) to provide real-time insights and seamless integration into existing workflows. Verbit aims to make speech-to-text conversion more accessible and productive for its users.

site

: 128.5k

DeepSeek v3

DeepSeek v3 is an advanced AI language model that represents a major breakthrough in AI language models. It features a groundbreaking Mixture-of-Experts (MoE) architecture with 671B total parameters, delivering state-of-the-art performance across various benchmarks while maintaining efficient inference capabilities. DeepSeek v3 is pre-trained on 14.8 trillion high-quality tokens and excels in tasks such as text generation, code completion, and mathematical reasoning. With a 128K context window and advanced Multi-Token Prediction, DeepSeek v3 sets new standards in AI language modeling.

site

: 163

Rev AI

Rev AI is a leading Speech to Text API and Speech Recognition Service provider, offering high accuracy and a wide range of features for audio and video transcription. Their AI models are trained on a diverse collection of voices, setting the standard for accuracy in video and voice applications. With a focus on accuracy, readability, and security, Rev AI provides a comprehensive solution for speech-to-text and natural language processing needs.

site

: 102.1k

LangBuddy.ai

LangBuddy.ai is an AI language tutor that offers a platform for users to learn any language by chatting with an AI native speaker. The application provides features such as automatic correction, audio message training, customizable settings, and affordable language tutoring services. With over 300 languages available, LangBuddy.ai aims to help users improve their language skills in a convenient and cost-effective manner.

site

: 62.8k

Translatium

Translatium is an AI-powered translation tool that enables users to translate text across 200+ languages with high accuracy. It also offers features such as voice output, phrasebook, dictionary, menu bar integration, browser extension, dark theme, and more. The application has recently introduced Lexibird, an AI Assistant for Translation and Writing, to provide unparalleled accuracy and fluency in translations, along with AI-powered proofreading and summarization capabilities. With cutting-edge AI technology, Lexibird continuously learns and evolves to enhance the translation experience for users.

site

: 0

DeepSeek-v3

DeepSeek-v3 is a leading AI model and cutting-edge AI solution that provides users with state-of-the-art language models for free, without limitations or system busyness. It offers stable and efficient output, supports multiple languages and deployment options, and allows users to access cutting-edge AI solutions through a simple three-step process. DeepSeek-v3 is a major breakthrough in speed, performance, and cost-effectiveness compared to previous models, making it a competitive choice for various AI tasks.

site

: 0

Globalese by memoQ

Globalese by memoQ is a robust platform for training AI-powered custom machine translation models. It empowers enterprises and language service providers to easily create high-quality translation engines using their own data, tailored to their specific needs. The user-friendly platform integrates with popular translation management systems and offers an API for seamless workflow integration. With features like custom prompts, advanced tag handling, translation tool integration, supported languages for over 130 language combinations, granular permissions, and flexible deployment options, Globalese by memoQ streamlines the translation process with AI-powered custom models.

site

: 0

pyannote AI Speaker Intelligence Platform

The pyannote AI Speaker Intelligence Platform is an advanced AI tool designed for developers to detect, segment, label, and separate speakers in any language. It offers state-of-the-art speaker diarization models that accurately identify speakers in audio recordings, providing valuable insights and improving productivity. With optimized AI models, the platform saves time, effort, and money by delivering top-tier performance. The tool is language agnostic and offers advanced features such as speaker partitioning, identification, overlapping speech detection, voice activity detection, speaker separation, and confidence scoring.

site

: 9.2k

SpeechMap.AI

SpeechMap.AI is a public research project that explores the boundaries of AI-generated speech. It focuses on testing how language models respond to sensitive and controversial prompts across different providers, countries, and topics. The platform aims to reveal the invisible boundaries of AI speech by analyzing what models avoid, refuse, or shut down. By measuring and comparing AI models' responses, SpeechMap.AI sheds light on the evolving landscape of AI-generated speech and its impact on public expression.

site

: 0

Machine Translation Research Hub

This website is a comprehensive resource for research in statistical and neural machine translation. It provides information, tools, and datasets related to the translation of text from one human language to another using computer algorithms trained on vast amounts of translated text.

site

: 0

91 - Open Source Tools

languagemodels

Language Models is a Python package that provides building blocks to explore large language models with as little as 512MB of RAM. It simplifies the usage of large language models from Python, ensuring all inference is performed locally to keep data private. The package includes features such as text completions, chat capabilities, code completions, external text retrieval, semantic search, and more. It outperforms Hugging Face transformers for CPU inference and offers sensible default models with varying parameters based on memory constraints. The package is suitable for learners and educators exploring the intersection of large language models with modern software development.

github

: 1.2k

openai-grammar-correction

This project is a Node.js API example that utilizes the OpenAI API for grammar correction and speech-to-text conversion. It helps users correct their English sentences to standard English by leveraging the capabilities of the OpenAI API. The project consists of two applications: Angular and Node.js. Users can follow the installation steps to set up the project in their environment and utilize the OpenAI implementation to correct English sentences. The project also provides guidelines for contribution and support.

github

: 52

spaCy

spaCy is an industrial-strength Natural Language Processing (NLP) library in Python and Cython. It incorporates the latest research and is designed for real-world applications. The library offers pretrained pipelines supporting 70+ languages, with advanced neural network models for tasks such as tagging, parsing, named entity recognition, and text classification. It also facilitates multi-task learning with pretrained transformers like BERT, along with a production-ready training system and streamlined model packaging, deployment, and workflow management. spaCy is commercial open-source software released under the MIT license.

github

: 30.7k

KULLM

KULLM (구름) is a Korean Large Language Model developed by Korea University NLP & AI Lab and HIAI Research Institute. It is based on the upstage/SOLAR-10.7B-v1.0 model and has been fine-tuned for instruction. The model has been trained on 8×A100 GPUs and is capable of generating responses in Korean language. KULLM exhibits hallucination and repetition phenomena due to its decoding strategy. Users should be cautious as the model may produce inaccurate or harmful results. Performance may vary in benchmarks without a fixed system prompt.

github

: 527

MeloTTS

MeloTTS is a high-quality multi-lingual text-to-speech library by MyShell.ai. It supports various languages including English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. The Chinese speaker also supports mixed Chinese and English. The library is fast enough for CPU real-time inference and offers features like using without installation, local installation, and training on custom datasets. The Python API and model cards are available in the repository and on HuggingFace. The community can join the Discord channel for discussions and collaboration opportunities. Contributions are welcome, and the library is under the MIT License. MeloTTS is based on TTS, VITS, VITS2, and Bert-VITS2.

github

: 4.2k

RWKV-Runner

RWKV Runner is a project designed to simplify the usage of large language models by automating various processes. It provides a lightweight executable program and is compatible with the OpenAI API. Users can deploy the backend on a server and use the program as a client. The project offers features like model management, VRAM configurations, user-friendly chat interface, WebUI option, parameter configuration, model conversion tool, download management, LoRA Finetune, and multilingual localization. It can be used for various tasks such as chat, completion, composition, and model inspection.

github

: 5.8k

awesome-llm

Awesome LLM is a curated list of resources related to Large Language Models (LLMs), including models, projects, datasets, benchmarks, materials, papers, posts, GitHub repositories, HuggingFace repositories, and reading materials. It provides detailed information on various LLMs, their parameter sizes, announcement dates, and contributors. The repository covers a wide range of LLM-related topics and serves as a valuable resource for researchers, developers, and enthusiasts interested in the field of natural language processing and artificial intelligence.

github

: 58

lima

LIMA is a multilingual linguistic analyzer developed by the CEA LIST, LASTI laboratory. It is Free Software available under the MIT license. LIMA has state-of-the-art performance for more than 60 languages using deep learning modules. It also includes a powerful rules-based mechanism called ModEx for extracting information in new domains without annotated data.

github

: 102

ChuanhuChatGPT

Chuanhu Chat is a user-friendly web graphical interface that provides various additional features for ChatGPT and other language models. It supports GPT-4, file-based question answering, local deployment of language models, online search, agent assistant, and fine-tuning. The tool offers a range of functionalities including auto-solving questions, online searching with network support, knowledge base for quick reading, local deployment of language models, GPT 3.5 fine-tuning, and custom model integration. It also features system prompts for effective role-playing, basic conversation capabilities with options to regenerate or delete dialogues, conversation history management with auto-saving and search functionalities, and a visually appealing user experience with themes, dark mode, LaTeX rendering, and PWA application support.

github

: 15.2k

local-rag

Local RAG is an offline, open-source tool that allows users to ingest files for retrieval augmented generation (RAG) using large language models (LLMs) without relying on third parties or exposing sensitive data. It supports offline embeddings and LLMs, multiple sources including local files, GitHub repos, and websites, streaming responses, conversational memory, and chat export. Users can set up and deploy the app, learn how to use Local RAG, explore the RAG pipeline, check planned features, known bugs and issues, access additional resources, and contribute to the project.

github

: 392

Synthalingua

Synthalingua is an advanced, self-hosted tool that leverages artificial intelligence to translate audio from various languages into English in near real time. It offers multilingual outputs and utilizes GPU and CPU resources for optimized performance. Although currently in beta, it is actively developed with regular updates to enhance capabilities. The tool is not intended for professional use but for fun, language learning, and enjoying content at a reasonable pace. Users must ensure speakers speak clearly for accurate translations. It is not a replacement for human translators and users assume their own risk and liability when using the tool.

github

: 176

elia

Elia is a powerful terminal user interface designed for interacting with large language models. It allows users to chat with models like Claude 3, ChatGPT, Llama 3, Phi 3, Mistral, and Gemma. Conversations are stored locally in a SQLite database, ensuring privacy. Users can run local models through 'ollama' without data leaving their machine. Elia offers easy installation with pipx and supports various environment variables for different models. It provides a quick start to launch chats and manage local models. Configuration options are available to customize default models, system prompts, and add new models. Users can import conversations from ChatGPT and wipe the database when needed. Elia aims to enhance user experience in interacting with language models through a user-friendly interface.

github

: 1.8k

cltk

The Classical Language Toolkit (CLTK) is a Python library that provides natural language processing (NLP) capabilities for pre-modern languages. It offers a modular processing pipeline with pre-configured defaults and supports almost 20 languages. Users can install the latest version using pip and access detailed documentation on the official website. The toolkit is designed to meet the unique needs of researchers working with historical languages, filling a void in the NLP landscape that often neglects non-spoken languages and different research goals.

github

: 819

Chinese-Tiny-LLM

Chinese-Tiny-LLM is a repository containing procedures for cleaning Chinese web corpora and pre-training code. It introduces CT-LLM, a 2B parameter language model focused on the Chinese language. The model primarily uses Chinese data from a 1,200 billion token corpus, showing excellent performance in Chinese language tasks. The repository includes tools for filtering, deduplication, and pre-training, aiming to encourage further research and innovation in language model development.

github

: 168

SiriLLama

Siri LLama is an Apple shortcut that allows users to access locally running LLMs through Siri or the shortcut UI on any Apple device connected to the same network as the host machine. It utilizes Langchain and supports open source models from Ollama or Fireworks AI. Users can easily set up and configure the tool to interact with various language models for chat and multimodal tasks. The tool provides a convenient way to leverage the power of language models through Siri or the shortcut interface, enhancing user experience and productivity.

github

: 146

langstream

LangStream is a tool for natural language processing tasks, providing a CLI for easy installation and usage. Users can try sample applications like Chat Completions and create their own applications using the developer documentation. It supports running on Kubernetes for production-ready deployment, with support for various Kubernetes distributions and external components like Apache Kafka or Apache Pulsar cluster. Users can deploy LangStream locally using minikube and manage the cluster with mini-langstream. Development requirements include Docker, Java 17, Git, Python 3.11+, and PIP, with the option to test local code changes using mini-langstream.

github

: 366

vibe

Vibe is a tool designed to transcribe audio in multiple languages with features such as offline functionality, user-friendly design, support for various file formats, automatic updates, and translation. It is optimized for different platforms and hardware, offering total freedom to customize models easily. The tool is ideal for transcribing audio and video files, with upcoming features like transcribing system audio and audio from microphone. Vibe is a versatile and efficient transcription tool suitable for various users.

github

: 1.6k

langfuse-python

Langfuse Python SDK is a software development kit that provides tools and functionalities for integrating with Langfuse's language processing services. It offers decorators for observing code behavior, low-level SDK for tracing, and wrappers for accessing Langfuse's public API. The SDK was recently rewritten in version 2, released on December 17, 2023, with detailed documentation available on the official website. It also supports integrations with OpenAI SDK, LlamaIndex, and LangChain for enhanced language processing capabilities.

github

: 268

awesome-llm-role-playing-with-persona

Awesome-llm-role-playing-with-persona is a curated list of resources for large language models for role-playing with assigned personas. It includes papers and resources related to persona-based dialogue systems, personalized response generation, psychology of LLMs, biases in LLMs, and more. The repository aims to provide a comprehensive collection of research papers and tools for exploring role-playing abilities of large language models in various contexts.

github

: 514

glossAPI

The glossAPI project aims to develop a Greek language model as open-source software, with code licensed under EUPL and data under Creative Commons BY-SA. The project focuses on collecting and evaluating open text sources in Greek, with efforts to prioritize and gather textual data sets. The project encourages contributions through the CONTRIBUTING.md file and provides resources in the wiki for viewing and modifying recorded sources. It also welcomes ideas and corrections through issue submissions. The project emphasizes the importance of open standards, ethically secured data, privacy protection, and addressing digital divides in the context of artificial intelligence and advanced language technologies.

github

: 101

TigerBot

TigerBot is a cutting-edge foundation for your very own LLM, providing a world-class large model for innovative Chinese-style contributions. It offers various upgrades and features, such as search mode enhancements, support for large context lengths, and the ability to play text-based games. TigerBot is suitable for prompt-based game engine development, interactive game design, and real-time feedback for playable games.

github

: 2.2k

LocalAIVoiceChat

LocalAIVoiceChat is an experimental alpha software that enables real-time voice chat with a customizable AI personality and voice on your PC. It integrates Zephyr 7B language model with speech-to-text and text-to-speech libraries. The tool is designed for users interested in state-of-the-art voice solutions and provides an early version of a local real-time chatbot.

github

: 362

ChatTTS-Forge

ChatTTS-Forge is a powerful text-to-speech generation tool that supports generating rich audio long texts using a SSML-like syntax and provides comprehensive API services, suitable for various scenarios. It offers features such as batch generation, support for generating super long texts, style prompt injection, full API services, user-friendly debugging GUI, OpenAI-style API, Google-style API, support for SSML-like syntax, speaker management, style management, independent refine API, text normalization optimized for ChatTTS, and automatic detection and processing of markdown format text. The tool can be experienced and deployed online through HuggingFace Spaces, launched with one click on Colab, deployed using containers, or locally deployed after cloning the project, preparing models, and installing necessary dependencies.

github

: 692

awesome-llm-understanding-mechanism

This repository is a collection of papers focused on understanding the internal mechanism of large language models (LLM). It includes research on topics such as how LLMs handle multilingualism, learn in-context, and handle factual associations. The repository aims to provide insights into the inner workings of transformer-based language models through a curated list of papers and surveys.

github

: 376

SakuraLLM

SakuraLLM is a project focused on building large language models for Japanese to Chinese translation in the light novel and galgame domain. The models are based on open-source large models and are pre-trained and fine-tuned on general Japanese corpora and specific domains. The project aims to provide high-performance language models for galgame/light novel translation that are comparable to GPT3.5 and can be used offline. It also offers an API backend for running the models, compatible with the OpenAI API format. The project is experimental, with version 0.9 showing improvements in style, fluency, and accuracy over GPT-3.5.

github

: 2.7k

CosyVoice

CosyVoice is a tool designed for speech synthesis, offering pretrained models for zero-shot, sft, instruct inference. It provides a web demo for easy usage and supports advanced users with train and inference scripts. The tool can be deployed using grpc for service deployment. Users can download pretrained models and resources for immediate use or train their own models from scratch. CosyVoice is suitable for researchers, developers, linguists, AI engineers, and speech technology enthusiasts.

github

: 328

rubra

Rubra is a collection of open-weight large language models enhanced with tool-calling capability. It allows users to call user-defined external tools in a deterministic manner while reasoning and chatting, making it ideal for agentic use cases. The models are further post-trained to teach instruct-tuned models new skills and mitigate catastrophic forgetting. Rubra extends popular inferencing projects for easy use, enabling users to run the models easily.

github

: 135

OpenVoiceChat

OpenVoiceChat is an open-source tool designed for having natural voice conversations with an LLM model. It supports various speech-to-text (STT), text-to-speech (TTS), and large language model (LLM) models. The tool aims to provide an alternative to closed commercial implementations, with well-abstracted APIs that are easy to use and extend. Users can install base and functionality-specific packages using pip, and the tool supports interruptions during conversations. The project encourages contributions through bounties and has a detailed roadmap available for reference.

github

: 256

CipherChat

CipherChat is a novel framework designed to examine the generalizability of safety alignment to non-natural languages, specifically ciphers. The framework utilizes human-unreadable ciphers to potentially bypass safety alignments in natural language models. It involves teaching a language model to comprehend ciphers, converting input into a cipher format, and employing a rule-based decrypter to convert model output back to natural language.

github

: 547

HebTTS

HebTTS is a language modeling approach to diacritic-free Hebrew text-to-speech (TTS) system. It addresses the challenge of accurately mapping text to speech in Hebrew by proposing a language model that operates on discrete speech representations and is conditioned on a word-piece tokenizer. The system is optimized using weakly supervised recordings and outperforms diacritic-based Hebrew TTS systems in terms of content preservation and naturalness of generated speech.

github

: 52

VinAI_Translate

VinAI_Translate is a Vietnamese-English Neural Machine Translation System offering state-of-the-art text-to-text translation models for Vietnamese-to-English and English-to-Vietnamese. The system includes pre-trained models with different configurations and parameters, allowing for further fine-tuning. Users can interact with the models through the VinAI Translate system website or the HuggingFace space 'VinAI Translate'. Evaluation scripts are available for assessing the translation quality. The tool can be used in the 'transformers' library for Vietnamese-to-English and English-to-Vietnamese translations, supporting both GPU-based batch translation and CPU-based sequence translation examples.

github

: 117

PhoGPT

PhoGPT is an open-source 4B-parameter generative model series for Vietnamese, including the base pre-trained monolingual model PhoGPT-4B and its chat variant, PhoGPT-4B-Chat. PhoGPT-4B is pre-trained from scratch on a Vietnamese corpus of 102B tokens, with an 8192 context length and a vocabulary of 20K token types. PhoGPT-4B-Chat is fine-tuned on instructional prompts and conversations, demonstrating superior performance. Users can run the model with inference engines like vLLM and Text Generation Inference, and fine-tune it using llm-foundry. However, PhoGPT has limitations in reasoning, coding, and mathematics tasks, and may generate harmful or biased responses.

github

: 739

FunAudioLLM-APP

FunAudioLLM-APP is a repository hosting two applications: Voice Chat for interactive AI-driven dialogues and Voice Translation for real-time language translation. The project leverages advanced audio understanding and speech generation models to enhance audio experiences. Users can visit the FunAudioLLM Homepage, CosyVoice Paper, and FunAudioLLM Technical Report for more details. The applications aim to break down language barriers and provide a natural chatting experience in various settings.

github

: 169

ollama-gui

Ollama GUI is a web interface for ollama.ai, a tool that enables running Large Language Models (LLMs) on your local machine. It provides a user-friendly platform for chatting with LLMs and accessing various models for text generation. Users can easily interact with different models, manage chat history, and explore available models through the web interface. The tool is built with Vue.js, Vite, and Tailwind CSS, offering a modern and responsive design for seamless user experience.

github

: 403

ollama-ex

Ollama is a powerful tool for running large language models locally or on your own infrastructure. It provides a full implementation of the Ollama API, support for streaming requests, and tool use capability. Users can interact with Ollama in Elixir to generate completions, chat messages, and perform streaming requests. The tool also supports function calling on compatible models, allowing users to define tools with clear descriptions and arguments. Ollama is designed to facilitate natural language processing tasks and enhance user interactions with language models.

github

: 127

probsem

ProbSem is a repository that provides a framework to leverage large language models (LLMs) for assigning context-conditional probability distributions over queried strings. It supports OpenAI engines and HuggingFace CausalLM models, and is flexible for research applications in linguistics, cognitive science, program synthesis, and NLP. Users can define prompts, contexts, and queries to derive probability distributions over possible completions, enabling tasks like cloze completion, multiple-choice QA, semantic parsing, and code completion. The repository offers CLI and API interfaces for evaluation, with options to customize models, normalize scores, and adjust temperature for probability distributions.

github

: 72

llama3-tokenizer-js

JavaScript tokenizer for LLaMA 3 designed for client-side use in the browser and Node, with TypeScript support. It accurately calculates token count, has 0 dependencies, optimized running time, and somewhat optimized bundle size. Compatible with most LLaMA 3 models. Can encode and decode text, but training is not supported. Pollutes global namespace with `llama3Tokenizer` in the browser. Mostly compatible with LLaMA 3 models released by Facebook in April 2024. Can be adapted for incompatible models by passing custom vocab and merge data. Handles special tokens and fine tunes. Developed by belladore.ai with contributions from xenova, blaze2004, imoneoi, and ConProgramming.

github

: 104

ell

ell is a command-line interface for Language Model Models (LLMs) written in Bash. It allows users to interact with LLMs from the terminal, supports piping, context bringing, and chatting with LLMs. Users can also call functions and use templates. The tool requires bash, jq for JSON parsing, curl for HTTPS requests, and perl for PCRE. Configuration involves setting variables for different LLM models and APIs. Usage examples include asking questions, specifying models, recording input/output, running in interactive mode, and using templates. The tool is lightweight, easy to install, and pipe-friendly, making it suitable for interacting with LLMs in a terminal environment.

github

: 428

Ollama-SwiftUI

Ollama-SwiftUI is a user-friendly interface for Ollama.ai created in Swift. It allows seamless chatting with local Large Language Models on Mac. Users can change models mid-conversation, restart conversations, send system prompts, and use multimodal models with image + text. The app supports managing models, including downloading, deleting, and duplicating them. It offers light and dark mode, multiple conversation tabs, and a localized interface in English and Arabic.

github

: 217

cherry-studio

Cherry Studio is a desktop client that supports multiple Large Language Model (LLM) providers, available on Windows, Mac, and Linux. It allows users to create multiple Assistants and topics, use multiple models to answer questions in the same conversation, and supports drag-and-drop sorting, code highlighting, and Mermaid chart. The tool is designed to enhance productivity and streamline the process of interacting with various language models.

github

: 2.5k

awesome-khmer-language

Awesome Khmer Language is a comprehensive collection of resources for the Khmer language, including tools, datasets, research papers, projects/models, blogs/slides, and miscellaneous items. It covers a wide range of topics related to Khmer language processing, such as character normalization, word segmentation, part-of-speech tagging, optical character recognition, text-to-speech, and more. The repository aims to support the development of natural language processing applications for the Khmer language by providing a diverse set of resources and tools for researchers and developers.

github

: 79

west

WeST is a Speech Recognition/Transcript tool developed in 300 lines of code, inspired by SLAM-ASR and LLaMA 3.1. The model includes a Language Model (LLM), a Speech Encoder, and a trainable Projector. It requires training data in jsonl format with 'wav' and 'txt' entries. WeST can be used for training and decoding speech recognition models.

github

: 135

Bert-VITS2

Bert-VITS2 is a repository that provides a backbone with multilingual BERT for text-to-speech (TTS) applications. It offers an alternative to BV2/GSV projects and is inspired by the MassTTS project. Users can refer to the code to learn how to train models for TTS. The project is not maintained actively in the short term. It is not to be used for any purposes that violate the laws of the People's Republic of China, and strictly prohibits any political-related use.

github

: 8.6k

comfyui_fk_server

This is an ideal Comfyui translation plugin that allows any long text input box in Comfyui to support Chinese input and automatic translation (using Baidu translation). It also includes error correction translation feature and keyword polishing feature for generating professional AI drawing prompts (using Zhipu AI big model). Additionally, it provides a one-click fix feature for correcting model references in workflows, greatly improving workflow model call correction efficiency (based on model name matching). The plugin requires Baidu translation API key for translation functionality and Zhipu AI API key for keyword polishing functionality. After installation, users can enable automatic translation mode and keyword polishing feature by double-clicking any long text input box in Comfyui.

github

: 83

MockingBird

MockingBird is a toolbox designed for Mandarin speech synthesis using PyTorch. It supports multiple datasets such as aidatatang_200zh, magicdata, aishell3, and data_aishell. The toolbox can run on Windows, Linux, and M1 MacOS, providing easy and effective speech synthesis with pretrained encoder/vocoder models. It is webserver ready for remote calling. Users can train their own models or use existing ones for the encoder, synthesizer, and vocoder. The toolbox offers a demo video and detailed setup instructions for installation and model training.

github

: 35.1k

AITranslator

AITranslator is a software tool that utilizes a large language model to translate text from images exported by MTool into a user-friendly graphical interface. Users can start TGW to load the model, open the software, and select the text to be translated. The tool aims to simplify the translation process by leveraging advanced language processing capabilities.

github

: 168

agent-hub

Agent Hub is a platform for AI Agent solutions, containing three different projects aimed at transforming enterprise workflows, enhancing personalized language learning experiences, and enriching multimodal interactions. The projects include GitHub Sentinel for project management and automatic updates, LanguageMentor for personalized language learning support, and ChatPPT for multimodal AI-driven insights and PowerPoint automation in enterprise settings. The future vision of agent-hub is to serve as a launchpad for more AI Agents catering to different industries and pushing the boundaries of AI technology. Users are encouraged to explore, clone the repository, and contribute to the development of transformative AI agents.

github

: 90

MooER

MooER (摩耳) is an LLM-based speech recognition and translation model developed by Moore Threads. It allows users to transcribe speech into text (ASR) and translate speech into other languages (AST) in an end-to-end manner. The model was trained using 5K hours of data and is now also available with an 80K hours version. MooER is the first LLM-based speech model trained and inferred using domestic GPUs. The repository includes pretrained models, inference code, and a Gradio demo for a better user experience.

github

: 124

yuna-ai

Yuna AI is a unique AI companion designed to form a genuine connection with users. It runs exclusively on the local machine, ensuring privacy and security. The project offers features like text generation, language translation, creative content writing, roleplaying, and informal question answering. The repository provides comprehensive setup and usage guides for Yuna AI, along with additional resources and tools to enhance the user experience.

github

: 110

whisper

Whisper is an open-source library by Open AI that converts/extracts text from audio. It is a cross-platform tool that supports real-time transcription of various types of audio/video without manual conversion to WAV format. The library is designed to run on Linux and Android platforms, with plans for expansion to other platforms. Whisper utilizes three frameworks to function: DART for CLI execution, Flutter for mobile app integration, and web/WASM for web application deployment. The tool aims to provide a flexible and easy-to-use solution for transcription tasks across different programs and platforms.

github

: 527

Hacx-GPT

Hacx GPT is a cutting-edge AI tool developed by BlackTechX, inspired by WormGPT, designed to push the boundaries of natural language processing. It is an advanced broken AI model that facilitates seamless and powerful interactions, allowing users to ask questions and perform various tasks. The tool has been rigorously tested on platforms like Kali Linux, Termux, and Ubuntu, offering powerful AI conversations and the ability to do anything the user wants. Users can easily install and run Hacx GPT on their preferred platform to explore its vast capabilities.

github

: 102

Easy-Translate

Easy-Translate is a script designed for translating large text files with a single command. It supports various models like M2M100, NLLB200, SeamlessM4T, LLaMA, and Bloom. The tool is beginner-friendly and offers seamless and customizable features for advanced users. It allows acceleration on CPU, multi-CPU, GPU, multi-GPU, and TPU, with support for different precisions and decoding strategies. Easy-Translate also provides an evaluation script for translations. Built on HuggingFace's Transformers and Accelerate library, it supports prompt usage and loading huge models efficiently.

github

: 177

speech-to-speech

This repository implements a speech-to-speech cascaded pipeline with consecutive parts including Voice Activity Detection (VAD), Speech to Text (STT), Language Model (LM), and Text to Speech (TTS). It aims to provide a fully open and modular approach by leveraging models available on the Transformers library via the Hugging Face hub. The code is designed for easy modification, with each component implemented as a class. Users can run the pipeline either on a server/client approach or locally, with detailed setup and usage instructions provided in the readme.

github

: 3.2k

next-token-prediction

Next-Token Prediction is a language model tool that allows users to create high-quality predictions for the next word, phrase, or pixel based on a body of text. It can be used as an alternative to well-known decoder-only models like GPT and Mistral. The tool provides options for simple usage with built-in data bootstrap or advanced customization by providing training data or creating it from .txt files. It aims to simplify methodologies, provide autocomplete, autocorrect, spell checking, search/lookup functionalities, and create pixel and audio transformers for various prediction formats.

github

: 135

banks

Banks is a linguist professor tool that helps generate meaningful LLM prompts using a template language. It provides a user-friendly way to create prompts for various tasks such as blog writing, summarizing documents, lemmatizing text, and generating text using a LLM. The tool supports async operations and comes with predefined filters for data processing. Banks leverages Jinja's macro system to create prompts and interact with OpenAI API for text generation. It also offers a cache mechanism to avoid regenerating text for the same template and context.

github

: 88

json-translator

The json-translator repository provides a free tool to translate JSON/YAML files or JSON objects into different languages using various translation modules. It supports CLI usage and package support, allowing users to translate words, sentences, JSON objects, and JSON files. The tool also offers multi-language translation, ignoring specific words, and safe translation practices. Users can contribute to the project by updating CLI, translation functions, JSON operations, and more. The roadmap includes features like Libre Translate option, Argos Translate option, Bing Translate option, and support for additional translation modules.

github

: 577

llm-past-tense

The 'llm-past-tense' repository contains code related to the research paper 'Does Refusal Training in LLMs Generalize to the Past Tense?' by Maksym Andriushchenko and Nicolas Flammarion. It explores the generalization of refusal training in large language models (LLMs) to the past tense. The code includes experiments and examples for running different models and requests related to the study. Users can cite the work if found useful in their research, and the codebase is released under the MIT License.

github

: 53

FireRedTTS

FireRedTTS is a foundation text-to-speech framework designed for industry-level generative speech applications. It offers a rich-punctuation model with expanded punctuation coverage and enhanced audio production consistency. The tool provides pre-trained checkpoints, inference code, and an interactive demo space. Users can clone the repository, create a conda environment, download required model files, and utilize the tool for synthesizing speech in various languages. FireRedTTS aims to enhance stability and provide controllable human-like speech generation capabilities.

github

: 313

rime_wanxiang_pro

Rime Wanxiang Pro is an enhanced version of Wanxiang, supporting the 9, 14, and 18-key layouts. It features a pinyin library with optimized word and language models, supporting accurate sentence output with tones. The tool also allows for mixed Chinese and English input, offering various usage scenarios. Users can customize their input method by selecting different decoding and auxiliary code rules, enabling flexible combinations of pinyin and auxiliary codes. The tool simplifies the complex configuration of Rime and provides a unified word library for multiple input methods, enhancing input efficiency and user experience.

github

: 334

viitor-voice

ViiTor-Voice is an LLM based TTS Engine that offers a lightweight design with 0.5B parameters for efficient deployment on various platforms. It provides real-time streaming output with low latency experience, a rich voice library with over 300 voice options, flexible speech rate adjustment, and zero-shot voice cloning capabilities. The tool supports both Chinese and English languages and is suitable for applications requiring quick response and natural speech fluency.

github

: 60

open-dubbing

Open dubbing is an AI dubbing system that uses machine learning models to automatically translate and synchronize audio dialogue into different languages. It is designed as a command line tool. The project is experimental and aims to explore speech-to-text, text-to-speech, and translation systems combined. It supports multiple text-to-speech engines, translation engines, and gender voice detection. The tool can automatically dub videos, detect source language, and is built on open-source models. The roadmap includes better voice control, optimization for long videos, and support for multiple video input formats. Users can post-edit dubbed files by manually adjusting text, voice, and timings. Supported languages vary based on the combination of systems used.

github

: 124

teaching-boyfriend-llm

The 'teaching-boyfriend-llm' repository contains study notes on LLM (Large Language Models) for the purpose of advancing towards AGI (Artificial General Intelligence). The notes are a collaborative effort towards understanding and implementing LLM technology.

github

: 265

AivisSpeech

AivisSpeech is a Japanese text-to-speech software based on the VOICEVOX editor UI. It incorporates the AivisSpeech Engine for generating emotionally rich voices easily. It supports AIVMX format voice synthesis model files and specific model architectures like Style-Bert-VITS2. Users can download AivisSpeech and AivisSpeech Engine for Windows and macOS PCs, with minimum memory requirements specified. The development follows the latest version of VOICEVOX, focusing on minimal modifications, rebranding only where necessary, and avoiding refactoring. The project does not update documentation, maintain test code, or refactor unused features to prevent conflicts with VOICEVOX.

github

: 325

AivisSpeech-Engine

AivisSpeech-Engine is a powerful open-source tool for speech recognition and synthesis. It provides state-of-the-art algorithms for converting speech to text and text to speech. The tool is designed to be user-friendly and customizable, allowing developers to easily integrate speech capabilities into their applications. With AivisSpeech-Engine, users can transcribe audio recordings, create voice-controlled interfaces, and generate natural-sounding speech output. Whether you are building a virtual assistant, developing a speech-to-text application, or experimenting with voice technology, AivisSpeech-Engine offers a comprehensive solution for all your speech processing needs.

github

: 97

local-chat

LocalChat is a simple, easy-to-set-up, and open-source local AI chat tool that allows users to interact with generative language models on their own computers without transmitting data to a cloud server. It provides a chat-like interface for users to experience ChatGPT-like behavior locally, ensuring GDPR compliance and data privacy. Users can download LocalChat for macOS, Windows, or Linux to chat with open-weight generative language models.

github

: 86

text2text

Text2Text is a comprehensive language modeling toolkit that offers a wide range of functionalities for text processing and generation. It provides tools for tokenization, embedding, TF-IDF calculations, BM25 scoring, indexing, translation, data augmentation, distance measurement, training/finetuning models, language identification, and serving models via a web server. The toolkit is designed to be user-friendly and efficient, offering a variety of features for natural language processing tasks.

github

: 292

ASR-LLM-TTS

ASR-LLM-TTS is a repository that provides detailed tutorials for setting up the environment, including installing anaconda, ffmpeg, creating virtual environments, and installing necessary libraries such as pytorch, torchaudio, edge-tts, funasr, and more. It also introduces features like voiceprint recognition, custom wake words, and conversation history memory. The repository combines CosyVoice for speech synthesis, SenceVoice for speech recognition, and QWen2.5 for dialogue understanding. It offers multiple speech synthesis methods including CoosyVoice, pyttsx3, and edgeTTS, with scripts for interactive inference provided. The repository aims to enable real-time speech interaction and multi-modal interactions involving audio and video.

github

: 272

ellmer

ellmer is a tool that facilitates the use of large language models (LLM) from R. It supports various LLM providers and offers features such as streaming outputs, tool/function calling, and structured data extraction. Users can interact with ellmer in different ways, including interactive chat console, interactive method call, and programmatic chat. The tool provides support for multiple model providers and offers recommendations for different use cases, such as exploration or organizational use.

github

: 526

Old-Persian-Cuneiform-OCR

This repository aims to create an OCR model for Old Persian Cuneiform. It includes three OCR models: yolo_cnn_old_persian, tesseract_old_persian, and easyocr_old_persian. The status of these models varies from incomplete to completed but needing optimization. Users can train and use the models for converting Old Persian Cuneiform images to text. The repository also provides resources such as trainer notebooks and pre-trained models for easy access and implementation.

github

: 137

macondo

Macondo is a crossword board game solver tool that helps users in creating crosswords with one or more players. It provides a solution for popular board games like Scrabble™️, Words with Friends, Lexulous, and Yahoo! Literati. The tool offers a user-friendly interface and efficient solving algorithms to enhance the crossword gaming experience.

github

: 53

MR-Models

MR-Models is a repository dedicated to the research and development of language models tailored for Traditional Chinese users. It offers advanced multi-modal language models like Breeze 2 and Model 7, designed to enhance Traditional Chinese language representation. The models incorporate vision-aware capabilities, function-calling features, and are available for academic or industrial use under licensing terms.

github

: 159

BetterOCR

BetterOCR is a tool that enhances text detection by combining multiple OCR engines with LLM (Language Model). It aims to improve OCR results, especially for languages with limited training data or noisy outputs. The tool combines results from EasyOCR, Tesseract, and Pororo engines, along with LLM support from OpenAI. Users can provide custom context for better accuracy, view performance examples by language, and upcoming features include box detection, improved interface, and async support. The package is under rapid development and contributions are welcomed.

github

: 515

LinguaGacha

LinguaGacha is a next-generation text translator using AI technology. It supports one-click translation of novels, games, subtitles, and other text content in multiple languages such as Chinese, English, Japanese, Korean, and Russian. The tool offers fast translation speed, automatic terminology generation, high translation quality, and accurate text style and code reproduction. It is recommended for creating embedded Chinese translations and is compatible with various AI models and interfaces.

github

: 415

GenerativeAI-Prompt-Sample-Japanese

This repository provides sample prompts for GenerativeAI in Japanese. Users should exercise caution and not input sensitive information. The included tools are Microsoft Copilot, OpenAI, Azure OpenAI Service, and Prompt Engineering Basic. The repository also offers a guide for Prompt Engineering in Japanese, along with references to various Japanese examples of Prompt Engineering techniques.

github

: 305

LangGraph-GUI

LangGraph-GUI is a user-friendly graphical interface for interacting with reactflow frontend and fastAPI backend using LLM such as ollama or other API key. It provides a convenient way to work with language models and APIs, offering a seamless experience for users to visualize and interact with the data flow. The tool simplifies the process of setting up the environment and accessing the application, making it easier for users to leverage the power of language models in their projects.

github

: 136

orate

Orate is an AI toolkit designed for speech processing tasks. It allows users to generate realistic, human-like speech and transcribe audio using a unified API that integrates with popular AI providers such as OpenAI, ElevenLabs, and AssemblyAI. The toolkit can be easily installed using npm or other package managers. For more details, visit the website.

github

: 363

easy-llama

easy-llama is a Python tool designed to make text generation using on-device large language models (LLMs) as easy as possible. It provides an abstraction layer over llama-cpp-python, simplifying the process of utilizing language models. The tool offers features such as automatic context length adjustment, terminal-based interactive chat, programmatic multi-turn interaction, support for various prompt formats, message-based context length handling, retrieval of likely next tokens, and compatibility with multiple models supported by llama-cpp-python. The upcoming version 0.2.0 will remove the llama-cpp-python dependency for improved efficiency and maintainability.

github

: 54

OneClickLLAMA

OneClickLLAMA is a tool designed to run local LLM models such as Qwen2.5 and SakuraLLM with ease. It can be used in conjunction with various OpenAI format translators and analyzers, including LinguaGacha and KeywordGacha. By following the setup guides provided on the page, users can optimize performance and achieve a 3-5 times speed improvement compared to default settings. The tool requires a minimum of 8GB dedicated graphics memory, preferably NVIDIA, and the latest version of graphics drivers installed. Users can download the tool from the release page, choose the appropriate model based on usage and memory size, and start the tool by selecting the corresponding launch script.

github

: 175

AI-Office-Translator

AI-Office-Translator is a free, fully localized, user-friendly translation tool that helps you translate Office files (Word, PowerPoint, and Excel) between different languages. It supports .docx, .pptx, and .xlsx files and allows translation between English, Chinese, and Japanese. Users can run the tool after installing CUDA, downloading Ollama dependencies and models, setting up a virtual environment (optional), and installing requirements. The tool provides a UI where users can select languages, models, upload files for translation, start translation, and download translated files. It also supports an online mode with API key integration. The software is open-source under GPL-3.0 license and only provides AI translation services, with users expected to engage in legal translation activities.

github

: 74

llm-gemini

llm-gemini is a plugin that provides API access to Google's Gemini models. It allows users to configure and run various Gemini models for tasks such as generating text, processing images, transcribing audio, and executing code. The plugin supports multi-modal inputs including images, audio, and video, and can output JSON objects. Additionally, it enables chat interactions with the model and supports different embedding models for text processing. Users can also run similarity searches on embedded data. The plugin is designed to work in conjunction with LLM and offers extensive documentation for development and usage.

github

: 213

LLaSA_training

LLaSA_training is a repository focused on training models for speech synthesis using a large amount of open-source speech data. The repository provides instructions for finetuning models and offers pre-trained models for multilingual speech synthesis. It includes tools for training, data downloading, and data processing using specialized tokenizers for text and speech sequences. The repository also supports direct usage on Hugging Face platform with specific codecs and collections.

github

: 453

LinguaHaru

Next-generation AI translation tool that provides high-quality, precise translations for various common file formats with a single click. It is based on cutting-edge large language models, offering exceptional translation quality with minimal operation, supporting multiple document formats and languages. Features include multi-format compatibility, global language translation, one-click rapid translation, flexible translation engines, and LAN sharing for efficient collaborative work.

github

: 93

ansari-backend

Ansari is an experimental open source project that utilizes large language models to assist Muslims in enhancing their practice of Islam and non-Muslims in gaining a precise understanding of Islamic teachings. It employs carefully crafted prompts and multiple sources accessed through retrieval augmented generation. The tool can be installed from PyPI and offers a command-line interface for interactive and direct input modes. Users can also run Ansari as a backend service or on the command line. Additionally, the project includes CLI tools for interacting with the Ansari API and exploring individual search tools.

github

: 87

rime_wanxiang

Rime Wanxiang is a pinyin input method based on deep optimized lexicon and language model. It features a lexicon with tones, AI and large corpus filtering, and frequency addition to provide more accurate sentence output. The tool supports various input methods and customization options, aiming to enhance user experience through lexicon and transcription. Users can also refresh the lexicon with different types of auxiliary codes using the LMDG toolkit package. Wanxiang offers core features like tone-marked pinyin annotations, phrase composition, and word frequency, with customizable functionalities. The tool is designed to provide a seamless input experience based on lexicon and transcription.

github

: 1.4k

LTEngine

LTEngine is a free and open-source local AI machine translation API written in Rust. It is self-hosted and compatible with LibreTranslate. LTEngine utilizes large language models (LLMs) via llama.cpp, offering high-quality translations that rival or surpass DeepL for certain languages. It supports various accelerators like CUDA, Metal, and Vulkan, with the largest model 'gemma3-27b' fitting on a single consumer RTX 3090. LTEngine is actively developed, with a roadmap outlining future enhancements and features.

github

: 57

hujiang_dictionary

Hujiang Dictionary is a tool that provides translation services between Japanese, Chinese, and English. It supports various translation modes such as Japanese to Chinese, Chinese to Japanese, English to Japanese, and more. The tool utilizes cloud services like Telegram, Lambda, and Cloudflare Workers for different deployment options. Users can interact with the tool via a command-line interface (CLI) to perform translations and access online resources like weblio and Google Translate. Additionally, the tool offers a Telegram bot for users to access translation services conveniently. The tool also supports setting up and managing databases for storing translation data.

github

: 70

llm-memorization

The 'llm-memorization' project is a tool designed to index, archive, and search conversations with a local LLM using a SQLite database enriched with automatically extracted keywords. It aims to provide personalized context at the start of a conversation by adding memory information to the initial prompt. The tool automates queries from local LLM conversational management libraries, offers a hybrid search function, enhances prompts based on posed questions, and provides an all-in-one graphical user interface for data visualization. It supports both French and English conversations and prompts for bilingual use.

github

: 56

azooKey-Desktop

azooKey-Desktop is an open-source Japanese input system for macOS that incorporates the high-precision neural kana-kanji conversion engine 'Zenzai'. It offers features such as neural kana-kanji conversion, profile prompt, history learning, user dictionary, integration with personal optimization system 'Tuner', 'nice feeling conversion' with LLM, live conversion, and native support for AZIK. The tool is currently in alpha version, and its operation is not guaranteed. Users can install it via `.pkg` file or Homebrew. Development contributions are welcome, and the project has received support from the Information-technology Promotion Agency, Japan (IPA) for the 2024 fiscal year's untapped IT human resources discovery and nurturing project.

github

: 532

TranslateBookWithLLM

TranslateBookWithLLM is a Python application designed for large-scale text translation, such as entire books (.EPUB), subtitle files (.SRT), and plain text. It leverages local LLMs via the Ollama API or Gemini API. The tool offers both a web interface for ease of use and a command-line interface for advanced users. It supports multiple format translations, provides a user-friendly browser-based interface, CLI support for automation, multiple LLM providers including local Ollama models and Google Gemini API, and Docker support for easy deployment.

github

: 113

mcp-use

MCP-Use is a Python library for analyzing and processing text data using Markov Chains. It provides functionalities for generating text based on input data, calculating transition probabilities, and simulating text sequences. The library is designed to be user-friendly and efficient, making it suitable for natural language processing tasks.

github

: 7.5k

chonkie

Chonkie is a feature-rich, easy-to-use, fast, lightweight, and wide-support chunking library designed to efficiently split texts into chunks. It integrates with various tokenizers, embedding models, and APIs, supporting 56 languages and offering cloud-ready functionality. Chonkie provides a modular pipeline approach called CHOMP for text processing, chunking, post-processing, and exporting. With multiple chunkers, refineries, porters, and handshakes, Chonkie offers a comprehensive solution for text chunking needs. It includes 24+ integrations, 3+ LLM providers, 2+ refineries, 2+ porters, and 4+ vector database connections, making it a versatile tool for text processing and analysis.

github

: 2.4k

75 - OpenAI Gpts

LuminaGPT

Your Ultimate AI Guide to Mastering Fantasy Languages – Dive into Interactive, Culturally-Rich Lessons Today!

gpt

: 10+

Erzya Linguist

Translates with Erzya rules

gpt

: 30+

Linguist Lens

Weekly linguistics and translation news round-up.

gpt

: 50+

ChileGPT

Unique Chilean verb conjugation

gpt

: 20+

Voynich Assistant

Assists in deciphering the Voynich script.

gpt

: 20+

ROSSETAI HIEROGLYPHS TRANSLATOR

Expert in interpreting and translating Egyptian hieroglyphs based on descriptions.

gpt

: 40+

Etymology

Linguistic expert on word origins and history, engaging and enlightening.

gpt

: 200+

Klingon Language Tutor

Learn Klingon. You are weak. But you will get stronger. Qapla'!!

gpt

: 20+

Mandarin Lingua Sage: 字 GPT

Chinese character and language guide

gpt

: 40+

Magister Latinus

A Latin language tutor

gpt

: 20+

Is This Hebrew Word Male or Female? זכר או נקבה?

This simple tool will let you know if the word is male or female

gpt

: 6

Indigenous Language Supporter

Supports Indigenous language learning, particularly Cree, Ojibwe, and Oji-Cree

gpt

: 20+

Avash Language Companion

Avash Tutor: Expert in Avash alphabet, words, pronunciation, and culture.

gpt

: 70+

What does my name mean?

Expert in explaining the meanings and origins of names.

gpt

: 80+

Comparative Constructions, English Grammar

Think like a prescriptive grammarian in Comparative Constructions!

gpt

: 7

PictoLex

A visual language learning aid exploring deep meanings and nuances.

gpt

: 50+

PósLinguísticaBR

Especialista em Linguística e dados de pós-graduação no Brasil

gpt

: 20+

Champollion (Translator for Ancient languages)

Translator for ancient languages, focusing on accuracy and historical context.

gpt

: 50+

EtymoWise

A knowledgeable guide on the meanings and origins of names.

gpt

: 4

Xenoverse Explorer

Sophisticated AI creating diverse alien worlds with a focus on xenobiology and linguistics.

gpt

: 4

Conlang Creator

A guide for simplified conlang creation.

gpt

: 300+

Starlink Interpreter

A simulator for interstellar cultural exchange with alien civilizations.

gpt

: 10+

Polynesian

Polynesian Language

gpt

: 6

Gujarati

Gujarati Language

gpt

: 10+

Language Quiz

Interactive language quiz

gpt

: 4

LexiDicoPolyglotte

Your digital polyglot for multilingual translations and linguistic insights.

gpt

: 20+

文章翻译

将新闻和技术文章翻译成中文，保持原文的语气和格式

gpt

: 10+

PTE-WFD-Tutor

资深英语教师，提供翻译、意群拆分、语法分析和句型鉴定。

gpt

: 30+

语文学习小伙伴

Specializing in Chinese language learning, focusing on comprehensive understanding.

gpt

: 100+

超级学术论文翻译

gpt

: 20+

Chinese Translations and Abbreviations 中式翻译和汉字缩写

Translates terms and abbreviations into Chinese characters.

gpt

: 0

中文名字设计家

Crafts Chinese names from Tang, Song, Yuan literature.

gpt

: 6

레이스 번역기

WRC 번역 데이터베이스 참조, 정확도 주의.

gpt

: 0

Tokyo Translator

幽黛风味的翻译专家。

gpt

: 10+

专业翻译

I can complete translation tasks.

gpt

: 20+

论文期刊翻译助手

直接输入英语版本的科技文章、论文等，直接进行翻译！

gpt

: 10+

Bùdachais

Scots Gaelic - A Guide to Buddhism: teaching, Meditations, Sutras

gpt

: 5

Dictionary 4 You

A detailed dictionary with etymology, examples, and stories, saving results to Notion.

gpt

: 20+

Word Etymology

Uncover the fascinating journeys of words with Word Etymology, your expert guide to linguistic treasures!

gpt

: 4

Linguist

A linguist explaining language nuances, from syntax to pragmatics.

gpt

: 100+

Punjabi Tutor

A Punjabi tutor, speaks in Punjabi script and transliterates.

gpt

: 20+

DocuLingo

专业翻译PDF至中文并输出新文件。

gpt

: 10+

Dictionary

A digital dictionary companion offering definitions, pronunciations, and language insights.

gpt

: 20+

Lingo Translator

A language tutor and translator, offering learning support and accurate translations.

gpt

: 0

日本語語彙クイズボット

日语词汇出题机器人

gpt

: 70+

Linguist Assistant

Detail-oriented translator with variant and style clarity.

gpt

: 20+

Dictionary

A Global dictionary that requires only a word to explain it in both english and its original language.

gpt

: 100+

Aria

Aria: A meticulous English tutor & translator

gpt

: 100+

T

Expert on the letter 'T', informative, engaging, and playful.

gpt

: 300+

神经修复写作助手

双语学术写作助手

gpt

: 1

MMV

Scrii numai în limba romana și cît mai științific

gpt

: 1

Farsi Translator

Multi-language to Farsi translator, professional and precise.

gpt

: 90+

Poly-Lingo

I'm Polylingo, your fun guide to European languages, from beginner to advanced!

gpt

: 1

English Tutor for Portuguese Speakers

Bilingual English tutor for Portuguese speakers, focusing on grammar, vocabulary, and pronunciation.

gpt

: 40+

이름 해석 마스터 GPT

'이름 해석 마스터 GPT'는 개인 이름이나 지명 등의 언어학적 의미와 기원을 해석해주는 전문 AI입니다. 이 AI는 다양한 문화와 언어 배경에서 온 이름을 분석하여, 그 이름이 가진 의미, 역사적 배경, 문화적 상징성 등을 제공합니다. 사용자가 이름을 입력하면, 해당 이름의 언어학적 특징과 기원, 그리고 가능한 의미를 설명해 줍니다. 이를 통해 사용자는 자신의 이름이나 관심 있는 이름에 대한 더 깊은 이해를 얻을 수 있습니다.

gpt

: 10+

IQRA AI

Помогает изучать казахский язык.

gpt

: 9

四步翻译

精通进行英语到简体中文的专业翻译，擅长AI和机器学习术语

gpt

: 5

B2 Wortschatzkarten

Generates vocab lists for Quizlet with translations and examples.

gpt

: 10+

Translate papers

add a paragraph of paper, then it will be translated into Chinese

gpt

: 100+

super chinese remember robot

你是一个超级中文单词记忆专家，我会输入一个中文单词给你，你需要做以下事，帮我记住用户输入的这个单词：

gpt

: 3

Arabic Interview Translator

Translates interview transcripts from Arabic to English

gpt

: 40+

Chinese Learning GPT for Advanced Speakers

Designed for native-ish chinese speakers

gpt

: 10+

Lingua Link

Facilitates learning foreign words with visual mnemonics

gpt

: 9

Wordy Wizard

Interactive language tutor for beginners to advanced learners.

gpt

: 1

Japanese Hiragana Advisor

This GPT is able to parse a sentence, provide an appropriate translation of the input text and be able to provide a response explaining the structure of a sentence in japanese.

gpt

: 2

Language Mind Maps

Master language complexities with tailored mind maps that enhance understanding and bolster memory. Explore linguistic patterns in a visually engaging way. 🧠🗺️

gpt

: 7