Best AI tools for< Linguist >
Infographic
40 - AI tool Sites
O.Translator
O.Translator is an online artificial intelligence translation website that offers unparalleled translation accuracy with AI while perfectly preserving the original format of documents. It is powered by a sophisticated AI engine for context-aware translations and is compatible with GPT-3.5, GPT-4, Gemini, and GeminiFlash. The tool provides post-editing and glossary features for precise noun translations and effortless content adjustments. Users can enjoy free previews before paying for translations, with affordable rates of $1 for up to 20,000 words. Privacy is a top priority, with all translated documents stored securely. O.Translator supports a wide range of document formats and over 60 languages worldwide, including the translation of scanned PDFs.
VocaBuddy
VocaBuddy is a vocabulary-building application designed to help users enhance their language skills. The app allows users to store and practice their vocabulary in an interactive and engaging way. With features like random word generation and building vocabulary from ticked words, VocaBuddy aims to make learning new words fun and effective. The app is user-friendly and suitable for language learners of all levels.
Slang Thesaurus
Slang Thesaurus is an AI tool designed to help users explore modern slang words and phrases. It allows users to find synonyms and antonyms of any word and offers an AI Slang Translator feature. The tool aims to be a comprehensive guide for understanding modern slang, providing accurate correlations and respecting user privacy.
MachineTranslation.com
MachineTranslation.com is an online platform that provides machine translation services to make translation accessible and reliable for everyone. The platform leverages the latest advances in AI and machine learning to offer accurate and efficient translations. The team behind MachineTranslation.com consists of experienced linguists, researchers, and engineers who have developed a solution to empower effective communication.
WhatWordIsThat
WhatWordIsThat.com is a website that helps users identify and understand words. It provides various tools and resources for exploring vocabulary, definitions, synonyms, and more. Users can input a word and receive detailed information about it, making it a valuable resource for writers, students, and language enthusiasts. The site is designed to be user-friendly and informative, offering a simple yet effective way to enhance one's language skills and knowledge.
Signapse AI
Signapse AI is an innovative platform that revolutionizes sign language translation technology. It offers real-time Generative AI sign language translation software to break down communication barriers instantly. The platform specializes in providing seamless British Sign Language (BSL) and American Sign Language (ASL) translation and interpretation for clients prioritizing accessibility and inclusivity. Signapse AI aims to enhance organizational accessibility for Deaf individuals by providing cutting-edge Generative AI sign language translation solutions for public space announcements, videos, and websites.
Resemble AI
Resemble AI is an advanced AI Voice Generator and Deepfake Audio Detection platform designed for enterprises prioritizing security and safety. It offers features such as Voice Cloning, Text to Speech, Speech to Speech, Audio Editing, and Multilingual support. The platform enables users to create hyper-realistic AI voices, deploy AI models through the cloud or on-premises, and safeguard digital content with state-of-the-art deepfake detection technology. Resemble AI is trusted by millions worldwide for creating unique, dynamic messages and personalized experiences across various industries.
GPTionary
GPTionary is an AI-powered thesaurus tool that allows users to search for words or phrases quickly by describing them. It leverages the power of AI, specifically ChatGPT, to provide accurate and efficient results. Users from over 100 countries have accessed GPTionary in just 10 days, highlighting its global reach and popularity. The tool is designed to assist community leaders, such as school teachers and officials, in accessing its features for educational purposes. GPTionary ensures data security and user verification to prevent misuse and maintain a trusted user community.
Context
Context is an AI tool designed for iOS/macOS that allows users to look up a word in a sentence, particularly useful for serious foreign language reading situations. By leveraging AI technology, users can easily understand the meaning of words within a sentence. The app provides features such as explaining the whole sentence, saving lookups, syncing between iOS and macOS devices, exporting in Markdown/CSV format, and a widget for quick access. While not free, Context offers a free trial of 100 lookups before requiring a subscription fee. Created by Chen Guo, Context aims to enhance language comprehension and reading experiences for users.
LM Studio
LM Studio is an AI tool designed for discovering, downloading, and running local LLMs (Large Language Models). Users can run LLMs on their laptops offline, use models through an in-app Chat UI or a local server, download compatible model files from HuggingFace repositories, and discover new LLMs. The tool ensures privacy by not collecting data or monitoring user actions, making it suitable for personal and business use. LM Studio supports various models like ggml Llama, MPT, and StarCoder on Hugging Face, with minimum hardware/software requirements specified for different platforms.
DocTranslator
DOCTRANSLATOR.COM is an AI-powered online document translation platform that offers seamless translation services for various document formats. With support for over 100 languages, the platform ensures accurate and contextually relevant translations while maintaining the natural flow of content. Users can easily upload documents, track translation progress, and receive high-quality translations tailored to their audience. The platform simplifies the translation process, providing status tracking, dedicated support, and a user-friendly experience.
Ollama
Ollama is an AI tool that allows users to access and utilize large language models such as Llama 3, Phi 3, Mistral, Gemma 2, and more. Users can customize and create their own models. The tool is available for macOS, Linux, and Windows platforms, offering a preview version for users to explore and utilize these models for various applications.
Stability AI
Stability AI is an AI application that offers a suite of models for various modalities such as image, video, audio, 3D, and language. It provides cutting-edge generative AI technology with a focus on stability and quality. Users can access advanced AI models for tasks like text-to-image generation, video modeling, audio generation, and more. The application also offers licensing options for commercial use and self-hosting benefits.
slAItor
slAItor is an AI translation assistant powered by GPT technology. It offers advanced translation features and customization options to enhance the translation experience. Users can benefit from step-by-step translations, multiple translation alternatives, and unique translation styles. The tool supports 28 language pairs and combines recent AI advancements with traditional translation techniques to deliver accurate and efficient translations. slAItor also provides post-processing and evaluation steps to ensure translation quality and offers a user-friendly interface for seamless translation management.
Lingolette
Lingolette is an AI language teaching machine that helps users master a language faster through personalized neural network chat-based tools. It speaks with users like a real teacher, motivates them on their learning journey, adapts to their learning style, and explains concepts clearly. Lingolette aims to enhance users' talking skills, pronunciation, and overall language learning experience.
LEXICON
LEXICON is an AI-powered platform designed to enhance English learning through a combination of expert teachers and artificial intelligence. The platform offers personalized lesson programs developed by teachers to accelerate English speaking skills. By integrating AI technologies into language teaching methods, LEXICON provides a faster and more efficient learning experience. Users can benefit from specialized AI-supported exercises and receive detailed feedback on errors made during lessons. With over 10 years of experience in the software industry, LEXICON is committed to providing learners with tools tailored for language acquisition.
Lune
Lune is a language learning application that leverages AI technology to provide a personalized and dynamic learning experience. The app tracks your learning progress to create tailored content and courses. With a multifaceted approach, Lune offers various ways to learn languages, including holding conversations, guided learning, and reading stories. It categorizes every word you encounter and helps you remember vocabulary effortlessly. Additionally, Lune allows users to share their progress with friends and family. The app is designed to make language learning engaging, efficient, and accessible to all.
LanguaTalk
LanguaTalk is a language learning platform that offers personalized coaching from 5-star tutors and world-class AI software to help users achieve fluency in various languages. The platform provides tailored language tutoring, practice sessions with advanced AI, and a combination of both for accelerated learning. LanguaTalk leverages AI technology to offer features such as personalized coaching, practice conversations, role plays, instant corrections, interactive transcripts, vocabulary learning through flashcards, and more. The platform has both free and paid plans, with a focus on providing effective and affordable language learning solutions.
Webnovels AI
Webnovels AI is an advanced AI tool designed for translating web novels, light novels, and e-books from Chinese, Korean, and Japanese into English. The platform offers instant bulk translation for complete novels, customizable glossary for perfect translations, the option to bring your own GPT key for unlimited access, and is tailored for both avid readers and professional translators. Users can monetize their translations and earn revenue, making it a cost-effective solution loved by over 100 readers and translators. Webnovels AI aims to provide the highest quality AI translation, backed by years of experience in reading and writing translations for Asian novels.
AI Manga Translator
AI Manga Translator is an online tool powered by AI technology that allows users to upload and translate manga instantly. It supports multiple languages and translation engines, ensuring precision manga translation without altering the original style. The tool is user-friendly, making it accessible to all users, whether they are manga fans or professionals needing document translations. AI Manga Translator offers various plans for different translation needs, with accurate and fast translations powered by AI technology.
Reverso
Reverso is a free translation tool that offers a comprehensive set of features to help users translate text, documents, and more in multiple languages. It provides grammar checks, synonyms, conjugation, and context-based translations. With cutting-edge AI-powered technology, Reverso delivers accurate translations in various languages, including Arabic, Chinese, Italian, and more. Users can enjoy features like document translation, integrated dictionaries, grammar check, neural machine translation, pronunciation guides, and verb conjugation. Reverso aims to assist millions of individuals and organizations in communicating efficiently and precisely across different languages.
Hayai Learn
Hayai Learn is an AI-powered platform designed to help users learn Japanese quickly and effectively by immersing them in Japanese content such as YouTube videos. The platform utilizes AI technology to assist users in acquiring new vocabulary and grammar effortlessly. By offering features like word learning from subtitles, providing relevant word meanings, offering video examples for better memory association, and assisting with sentence mining, Hayai Learn aims to revolutionize the way Japanese is learned by making it fun and engaging.
Bentolingo
Bentolingo is an AI language learning tool that offers users a daily 'bento box' of language learning resources. The platform utilizes artificial intelligence to personalize language lessons and exercises based on individual learning styles and progress. Users can improve their language skills in a fun and interactive way through a variety of activities such as vocabulary quizzes, grammar exercises, and conversation practice. Bentolingo aims to make language learning more accessible and engaging for learners of all levels.
Ogma
Ogma is an interpretable symbolic general problem-solving model that utilizes a symbolic sequence modeling paradigm to address tasks requiring reliability, complex decomposition, and without hallucinations. It offers solutions in areas such as math problem-solving, natural language understanding, and resolution of uncertainty. The technology is designed to provide a structured approach to problem-solving by breaking down tasks into manageable components while ensuring interpretability and self-interpretability. Ogma aims to set benchmarks in problem-solving applications by offering a reliable and transparent methodology.
LLM Token Counter
The LLM Token Counter is a sophisticated tool designed to help users effectively manage token limits for various Language Models (LLMs) like GPT-3.5, GPT-4, Claude-3, Llama-3, and more. It utilizes Transformers.js, a JavaScript implementation of the Hugging Face Transformers library, to calculate token counts client-side. The tool ensures data privacy by not transmitting prompts to external servers.
Alexa Translations
Alexa Translations is an AI-powered translation service that combines the expertise of professional translators with cutting-edge AI technology to provide custom translation solutions for legal, financial, securities, and HR teams. The platform offers a blend of human intelligence and AI capabilities, ensuring unmatched quality and accuracy in translations. With a focus on industry-specific terminology and regulatory compliance, Alexa Translations caters to a wide range of clients, including banks, law firms, and other industry leaders.
Vosyn
Vosyn is an advanced AI-driven platform that offers real-time translation, adaptive learning, and personalized experiences. It aims to create a connected world where digital platforms go beyond content repositories to become gateways to a rich, inclusive, and universally accessible world of information and entertainment. VosynVerse enables understanding across languages, VosynCore drives multilingual communication, and VosynConnect provides tailored solutions for businesses, all powered by cutting-edge AI technology.
SpeakPal
SpeakPal is an AI language learning platform that offers stages of vocabulary, phrases, daily dialogues, writing, and downloadable exercises. It supports interactive chat with an AI language tutor and is available in 28 languages. The platform provides immersive language experiences and personalized teaching approaches to make language learning efficient and enjoyable.
Meta AI
Meta AI is an advanced AI tool designed to facilitate language learning and translation. It leverages cutting-edge artificial intelligence technology to provide users with a seamless experience in mastering various languages. With Meta AI, users can enhance their language skills, improve translation accuracy, and broaden their cultural understanding. The tool is user-friendly and offers a wide range of features to cater to different learning styles and preferences.
OdiaGenAI
OdiaGenAI is a collaborative initiative focused on conducting research on Generative AI and Large Language Models (LLM) for the Odia Language. The project aims to leverage AI technology to develop Generative AI and LLM-based solutions for the overall development of Odisha and the Odia language through collaboration among Odia technologists. The initiative offers pre-trained models, codes, and datasets for non-commercial and research purposes, with a focus on building language models for Indic languages like Odia and Bengali.
Neoform AI
Neoform AI is an innovative AI tool that focuses on developing AI models specifically for African dialects. The platform aims to bridge the gap in AI technology by providing solutions tailored to the linguistic diversity of Africa. With a commitment to inclusivity and cultural representation, Neoform AI is revolutionizing the field of artificial intelligence by addressing the unique challenges faced by African languages. Through cutting-edge research and development, Neoform AI is paving the way for greater accessibility and accuracy in AI applications across the continent.
Old English Translator
Old English Translator is an AI tool designed to convert text from Old English, the earliest form of English spoken during the early Middle Ages, into Modern English. It helps users understand and interpret texts written in Old English, which is significantly different from the English language spoken today.
AppTek.ai
AppTek.ai is a global leader in artificial intelligence (AI) and machine learning (ML) technologies, providing advanced solutions in automatic speech recognition, neural machine translation, natural language processing/understanding, large language models, and text-to-speech technologies. The platform offers industry-leading language solutions for various sectors such as media and entertainment, call centers, government, and enterprise business. AppTek.ai combines cutting-edge AI research with real-world applications, delivering accurate and efficient tools for speech transcription, translation, understanding, and synthesis across multiple languages and dialects.
Arcee AI
Arcee AI is a platform that offers a cost-effective, secure, end-to-end solution for building and deploying Small Language Models (SLMs). It allows users to merge and train custom language models by leveraging open source models and their own data. The platform is known for its Model Merging technique, which combines the power of pre-trained Large Language Models (LLMs) with user-specific data to create high-performing models across various industries.
TextUnited
TextUnited is an AI Translation Platform that offers expert translations through an AI-powered platform and world-class customer service. It provides solutions for website translation, eLearning & education, software localization, unlocking new markets, multilingual customer experience, and organization & productivity of translation. The platform uses AI technology to deliver custom translations at scale and a fraction of the cost, while also offering human translation services by expert linguists. TextUnited stands out for its simplicity, power of AI, customer service, content automation, and continuously enhanced automatic translation.
DeutschDictionary.com
DeutschDictionary.com is the go-to online platform for language learners, specifically focusing on German-English translations. With over 1 million translations, conjugations, and example sentences, it provides a comprehensive resource for individuals looking to enhance their German language skills. The website is powered by AI teachers and editors who continuously update the database with new words and translations. DeutschDictionary.com is loved globally for its user-friendly interface, daily additions of new words, example sentences, and search improvements. It is a free and reliable tool for anyone interested in learning German.
Tala
Tala is an AI-powered language tutor designed for hands-on learners. It encourages free-flowing conversation early in the learning journey, focusing on natural language acquisition rather than rote memorization. With advanced speech recognition technology, Tala helps users build confidence in speaking and offers a flexible learning experience with adjustable listening speeds and easy access to look-up tools. The platform aims to make language learning engaging and immersive, allowing users to practice without fear of embarrassment and improve their pronunciation through interactive conversations.
Reka
Reka is a cutting-edge AI application offering next-generation multimodal AI models that empower agents to see, hear, and speak. Their flagship model, Reka Core, competes with industry leaders like OpenAI and Google, showcasing top performance across various evaluation metrics. Reka's models are natively multimodal, capable of tasks such as generating textual descriptions from videos, translating speech, answering complex questions, writing code, and more. With advanced reasoning capabilities, Reka enables users to solve a wide range of complex problems. The application provides end-to-end support for 32 languages, image and video comprehension, multilingual understanding, tool use, function calling, and coding, as well as speech input and output.
Tilde.ai
Tilde.ai is a language technology platform that offers a wide range of AI-powered solutions for translation, speech technologies, and conversational AI. It combines human and artificial intelligence to help people connect and work efficiently. The platform provides machine translation, speech-to-text conversion, text-to-speech synthesis, real-time transcription, AI chatbots, internal knowledge assistants, and meeting support services. Tilde.ai aims to bridge language barriers and enhance communication by leveraging advanced language technologies.
Mondonomo
Mondonomo is an AI tool that helps users explore the origins and meanings of names. Users can input their name or surname to discover information such as the countries where their name is common, transliterations, variants, famous people with the same name, and more. The platform also offers articles on onomastics, name science, and business solutions related to names. Additionally, users can design personalized wordclouds using the AI Wordcloud feature.
58 - Open Source Tools
languagemodels
Language Models is a Python package that provides building blocks to explore large language models with as little as 512MB of RAM. It simplifies the usage of large language models from Python, ensuring all inference is performed locally to keep data private. The package includes features such as text completions, chat capabilities, code completions, external text retrieval, semantic search, and more. It outperforms Hugging Face transformers for CPU inference and offers sensible default models with varying parameters based on memory constraints. The package is suitable for learners and educators exploring the intersection of large language models with modern software development.
openai-grammar-correction
This project is a Node.js API example that utilizes the OpenAI API for grammar correction and speech-to-text conversion. It helps users correct their English sentences to standard English by leveraging the capabilities of the OpenAI API. The project consists of two applications: Angular and Node.js. Users can follow the installation steps to set up the project in their environment and utilize the OpenAI implementation to correct English sentences. The project also provides guidelines for contribution and support.
spaCy
spaCy is an industrial-strength Natural Language Processing (NLP) library in Python and Cython. It incorporates the latest research and is designed for real-world applications. The library offers pretrained pipelines supporting 70+ languages, with advanced neural network models for tasks such as tagging, parsing, named entity recognition, and text classification. It also facilitates multi-task learning with pretrained transformers like BERT, along with a production-ready training system and streamlined model packaging, deployment, and workflow management. spaCy is commercial open-source software released under the MIT license.
KULLM
KULLM (구름) is a Korean Large Language Model developed by Korea University NLP & AI Lab and HIAI Research Institute. It is based on the upstage/SOLAR-10.7B-v1.0 model and has been fine-tuned for instruction. The model has been trained on 8×A100 GPUs and is capable of generating responses in Korean language. KULLM exhibits hallucination and repetition phenomena due to its decoding strategy. Users should be cautious as the model may produce inaccurate or harmful results. Performance may vary in benchmarks without a fixed system prompt.
MeloTTS
MeloTTS is a high-quality multi-lingual text-to-speech library by MyShell.ai. It supports various languages including English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. The Chinese speaker also supports mixed Chinese and English. The library is fast enough for CPU real-time inference and offers features like using without installation, local installation, and training on custom datasets. The Python API and model cards are available in the repository and on HuggingFace. The community can join the Discord channel for discussions and collaboration opportunities. Contributions are welcome, and the library is under the MIT License. MeloTTS is based on TTS, VITS, VITS2, and Bert-VITS2.
RWKV-Runner
RWKV Runner is a project designed to simplify the usage of large language models by automating various processes. It provides a lightweight executable program and is compatible with the OpenAI API. Users can deploy the backend on a server and use the program as a client. The project offers features like model management, VRAM configurations, user-friendly chat interface, WebUI option, parameter configuration, model conversion tool, download management, LoRA Finetune, and multilingual localization. It can be used for various tasks such as chat, completion, composition, and model inspection.
awesome-llm
Awesome LLM is a curated list of resources related to Large Language Models (LLMs), including models, projects, datasets, benchmarks, materials, papers, posts, GitHub repositories, HuggingFace repositories, and reading materials. It provides detailed information on various LLMs, their parameter sizes, announcement dates, and contributors. The repository covers a wide range of LLM-related topics and serves as a valuable resource for researchers, developers, and enthusiasts interested in the field of natural language processing and artificial intelligence.
lima
LIMA is a multilingual linguistic analyzer developed by the CEA LIST, LASTI laboratory. It is Free Software available under the MIT license. LIMA has state-of-the-art performance for more than 60 languages using deep learning modules. It also includes a powerful rules-based mechanism called ModEx for extracting information in new domains without annotated data.
ChuanhuChatGPT
Chuanhu Chat is a user-friendly web graphical interface that provides various additional features for ChatGPT and other language models. It supports GPT-4, file-based question answering, local deployment of language models, online search, agent assistant, and fine-tuning. The tool offers a range of functionalities including auto-solving questions, online searching with network support, knowledge base for quick reading, local deployment of language models, GPT 3.5 fine-tuning, and custom model integration. It also features system prompts for effective role-playing, basic conversation capabilities with options to regenerate or delete dialogues, conversation history management with auto-saving and search functionalities, and a visually appealing user experience with themes, dark mode, LaTeX rendering, and PWA application support.
local-rag
Local RAG is an offline, open-source tool that allows users to ingest files for retrieval augmented generation (RAG) using large language models (LLMs) without relying on third parties or exposing sensitive data. It supports offline embeddings and LLMs, multiple sources including local files, GitHub repos, and websites, streaming responses, conversational memory, and chat export. Users can set up and deploy the app, learn how to use Local RAG, explore the RAG pipeline, check planned features, known bugs and issues, access additional resources, and contribute to the project.
Synthalingua
Synthalingua is an advanced, self-hosted tool that leverages artificial intelligence to translate audio from various languages into English in near real time. It offers multilingual outputs and utilizes GPU and CPU resources for optimized performance. Although currently in beta, it is actively developed with regular updates to enhance capabilities. The tool is not intended for professional use but for fun, language learning, and enjoying content at a reasonable pace. Users must ensure speakers speak clearly for accurate translations. It is not a replacement for human translators and users assume their own risk and liability when using the tool.
elia
Elia is a powerful terminal user interface designed for interacting with large language models. It allows users to chat with models like Claude 3, ChatGPT, Llama 3, Phi 3, Mistral, and Gemma. Conversations are stored locally in a SQLite database, ensuring privacy. Users can run local models through 'ollama' without data leaving their machine. Elia offers easy installation with pipx and supports various environment variables for different models. It provides a quick start to launch chats and manage local models. Configuration options are available to customize default models, system prompts, and add new models. Users can import conversations from ChatGPT and wipe the database when needed. Elia aims to enhance user experience in interacting with language models through a user-friendly interface.
cltk
The Classical Language Toolkit (CLTK) is a Python library that provides natural language processing (NLP) capabilities for pre-modern languages. It offers a modular processing pipeline with pre-configured defaults and supports almost 20 languages. Users can install the latest version using pip and access detailed documentation on the official website. The toolkit is designed to meet the unique needs of researchers working with historical languages, filling a void in the NLP landscape that often neglects non-spoken languages and different research goals.
Chinese-Tiny-LLM
Chinese-Tiny-LLM is a repository containing procedures for cleaning Chinese web corpora and pre-training code. It introduces CT-LLM, a 2B parameter language model focused on the Chinese language. The model primarily uses Chinese data from a 1,200 billion token corpus, showing excellent performance in Chinese language tasks. The repository includes tools for filtering, deduplication, and pre-training, aiming to encourage further research and innovation in language model development.
SiriLLama
Siri LLama is an Apple shortcut that allows users to access locally running LLMs through Siri or the shortcut UI on any Apple device connected to the same network as the host machine. It utilizes Langchain and supports open source models from Ollama or Fireworks AI. Users can easily set up and configure the tool to interact with various language models for chat and multimodal tasks. The tool provides a convenient way to leverage the power of language models through Siri or the shortcut interface, enhancing user experience and productivity.
langstream
LangStream is a tool for natural language processing tasks, providing a CLI for easy installation and usage. Users can try sample applications like Chat Completions and create their own applications using the developer documentation. It supports running on Kubernetes for production-ready deployment, with support for various Kubernetes distributions and external components like Apache Kafka or Apache Pulsar cluster. Users can deploy LangStream locally using minikube and manage the cluster with mini-langstream. Development requirements include Docker, Java 17, Git, Python 3.11+, and PIP, with the option to test local code changes using mini-langstream.
vibe
Vibe is a tool designed to transcribe audio in multiple languages with features such as offline functionality, user-friendly design, support for various file formats, automatic updates, and translation. It is optimized for different platforms and hardware, offering total freedom to customize models easily. The tool is ideal for transcribing audio and video files, with upcoming features like transcribing system audio and audio from microphone. Vibe is a versatile and efficient transcription tool suitable for various users.
langfuse-python
Langfuse Python SDK is a software development kit that provides tools and functionalities for integrating with Langfuse's language processing services. It offers decorators for observing code behavior, low-level SDK for tracing, and wrappers for accessing Langfuse's public API. The SDK was recently rewritten in version 2, released on December 17, 2023, with detailed documentation available on the official website. It also supports integrations with OpenAI SDK, LlamaIndex, and LangChain for enhanced language processing capabilities.
awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona is a curated list of resources for large language models for role-playing with assigned personas. It includes papers and resources related to persona-based dialogue systems, personalized response generation, psychology of LLMs, biases in LLMs, and more. The repository aims to provide a comprehensive collection of research papers and tools for exploring role-playing abilities of large language models in various contexts.
glossAPI
The glossAPI project aims to develop a Greek language model as open-source software, with code licensed under EUPL and data under Creative Commons BY-SA. The project focuses on collecting and evaluating open text sources in Greek, with efforts to prioritize and gather textual data sets. The project encourages contributions through the CONTRIBUTING.md file and provides resources in the wiki for viewing and modifying recorded sources. It also welcomes ideas and corrections through issue submissions. The project emphasizes the importance of open standards, ethically secured data, privacy protection, and addressing digital divides in the context of artificial intelligence and advanced language technologies.
TigerBot
TigerBot is a cutting-edge foundation for your very own LLM, providing a world-class large model for innovative Chinese-style contributions. It offers various upgrades and features, such as search mode enhancements, support for large context lengths, and the ability to play text-based games. TigerBot is suitable for prompt-based game engine development, interactive game design, and real-time feedback for playable games.
LocalAIVoiceChat
LocalAIVoiceChat is an experimental alpha software that enables real-time voice chat with a customizable AI personality and voice on your PC. It integrates Zephyr 7B language model with speech-to-text and text-to-speech libraries. The tool is designed for users interested in state-of-the-art voice solutions and provides an early version of a local real-time chatbot.
ChatTTS-Forge
ChatTTS-Forge is a powerful text-to-speech generation tool that supports generating rich audio long texts using a SSML-like syntax and provides comprehensive API services, suitable for various scenarios. It offers features such as batch generation, support for generating super long texts, style prompt injection, full API services, user-friendly debugging GUI, OpenAI-style API, Google-style API, support for SSML-like syntax, speaker management, style management, independent refine API, text normalization optimized for ChatTTS, and automatic detection and processing of markdown format text. The tool can be experienced and deployed online through HuggingFace Spaces, launched with one click on Colab, deployed using containers, or locally deployed after cloning the project, preparing models, and installing necessary dependencies.
awesome-llm-understanding-mechanism
This repository is a collection of papers focused on understanding the internal mechanism of large language models (LLM). It includes research on topics such as how LLMs handle multilingualism, learn in-context, and handle factual associations. The repository aims to provide insights into the inner workings of transformer-based language models through a curated list of papers and surveys.
SakuraLLM
SakuraLLM is a project focused on building large language models for Japanese to Chinese translation in the light novel and galgame domain. The models are based on open-source large models and are pre-trained and fine-tuned on general Japanese corpora and specific domains. The project aims to provide high-performance language models for galgame/light novel translation that are comparable to GPT3.5 and can be used offline. It also offers an API backend for running the models, compatible with the OpenAI API format. The project is experimental, with version 0.9 showing improvements in style, fluency, and accuracy over GPT-3.5.
CosyVoice
CosyVoice is a tool designed for speech synthesis, offering pretrained models for zero-shot, sft, instruct inference. It provides a web demo for easy usage and supports advanced users with train and inference scripts. The tool can be deployed using grpc for service deployment. Users can download pretrained models and resources for immediate use or train their own models from scratch. CosyVoice is suitable for researchers, developers, linguists, AI engineers, and speech technology enthusiasts.
rubra
Rubra is a collection of open-weight large language models enhanced with tool-calling capability. It allows users to call user-defined external tools in a deterministic manner while reasoning and chatting, making it ideal for agentic use cases. The models are further post-trained to teach instruct-tuned models new skills and mitigate catastrophic forgetting. Rubra extends popular inferencing projects for easy use, enabling users to run the models easily.
OpenVoiceChat
OpenVoiceChat is an open-source tool designed for having natural voice conversations with an LLM model. It supports various speech-to-text (STT), text-to-speech (TTS), and large language model (LLM) models. The tool aims to provide an alternative to closed commercial implementations, with well-abstracted APIs that are easy to use and extend. Users can install base and functionality-specific packages using pip, and the tool supports interruptions during conversations. The project encourages contributions through bounties and has a detailed roadmap available for reference.
CipherChat
CipherChat is a novel framework designed to examine the generalizability of safety alignment to non-natural languages, specifically ciphers. The framework utilizes human-unreadable ciphers to potentially bypass safety alignments in natural language models. It involves teaching a language model to comprehend ciphers, converting input into a cipher format, and employing a rule-based decrypter to convert model output back to natural language.
HebTTS
HebTTS is a language modeling approach to diacritic-free Hebrew text-to-speech (TTS) system. It addresses the challenge of accurately mapping text to speech in Hebrew by proposing a language model that operates on discrete speech representations and is conditioned on a word-piece tokenizer. The system is optimized using weakly supervised recordings and outperforms diacritic-based Hebrew TTS systems in terms of content preservation and naturalness of generated speech.
VinAI_Translate
VinAI_Translate is a Vietnamese-English Neural Machine Translation System offering state-of-the-art text-to-text translation models for Vietnamese-to-English and English-to-Vietnamese. The system includes pre-trained models with different configurations and parameters, allowing for further fine-tuning. Users can interact with the models through the VinAI Translate system website or the HuggingFace space 'VinAI Translate'. Evaluation scripts are available for assessing the translation quality. The tool can be used in the 'transformers' library for Vietnamese-to-English and English-to-Vietnamese translations, supporting both GPU-based batch translation and CPU-based sequence translation examples.
PhoGPT
PhoGPT is an open-source 4B-parameter generative model series for Vietnamese, including the base pre-trained monolingual model PhoGPT-4B and its chat variant, PhoGPT-4B-Chat. PhoGPT-4B is pre-trained from scratch on a Vietnamese corpus of 102B tokens, with an 8192 context length and a vocabulary of 20K token types. PhoGPT-4B-Chat is fine-tuned on instructional prompts and conversations, demonstrating superior performance. Users can run the model with inference engines like vLLM and Text Generation Inference, and fine-tune it using llm-foundry. However, PhoGPT has limitations in reasoning, coding, and mathematics tasks, and may generate harmful or biased responses.
FunAudioLLM-APP
FunAudioLLM-APP is a repository hosting two applications: Voice Chat for interactive AI-driven dialogues and Voice Translation for real-time language translation. The project leverages advanced audio understanding and speech generation models to enhance audio experiences. Users can visit the FunAudioLLM Homepage, CosyVoice Paper, and FunAudioLLM Technical Report for more details. The applications aim to break down language barriers and provide a natural chatting experience in various settings.
ollama-gui
Ollama GUI is a web interface for ollama.ai, a tool that enables running Large Language Models (LLMs) on your local machine. It provides a user-friendly platform for chatting with LLMs and accessing various models for text generation. Users can easily interact with different models, manage chat history, and explore available models through the web interface. The tool is built with Vue.js, Vite, and Tailwind CSS, offering a modern and responsive design for seamless user experience.
ollama-ex
Ollama is a powerful tool for running large language models locally or on your own infrastructure. It provides a full implementation of the Ollama API, support for streaming requests, and tool use capability. Users can interact with Ollama in Elixir to generate completions, chat messages, and perform streaming requests. The tool also supports function calling on compatible models, allowing users to define tools with clear descriptions and arguments. Ollama is designed to facilitate natural language processing tasks and enhance user interactions with language models.
probsem
ProbSem is a repository that provides a framework to leverage large language models (LLMs) for assigning context-conditional probability distributions over queried strings. It supports OpenAI engines and HuggingFace CausalLM models, and is flexible for research applications in linguistics, cognitive science, program synthesis, and NLP. Users can define prompts, contexts, and queries to derive probability distributions over possible completions, enabling tasks like cloze completion, multiple-choice QA, semantic parsing, and code completion. The repository offers CLI and API interfaces for evaluation, with options to customize models, normalize scores, and adjust temperature for probability distributions.
llama3-tokenizer-js
JavaScript tokenizer for LLaMA 3 designed for client-side use in the browser and Node, with TypeScript support. It accurately calculates token count, has 0 dependencies, optimized running time, and somewhat optimized bundle size. Compatible with most LLaMA 3 models. Can encode and decode text, but training is not supported. Pollutes global namespace with `llama3Tokenizer` in the browser. Mostly compatible with LLaMA 3 models released by Facebook in April 2024. Can be adapted for incompatible models by passing custom vocab and merge data. Handles special tokens and fine tunes. Developed by belladore.ai with contributions from xenova, blaze2004, imoneoi, and ConProgramming.
ell
ell is a command-line interface for Language Model Models (LLMs) written in Bash. It allows users to interact with LLMs from the terminal, supports piping, context bringing, and chatting with LLMs. Users can also call functions and use templates. The tool requires bash, jq for JSON parsing, curl for HTTPS requests, and perl for PCRE. Configuration involves setting variables for different LLM models and APIs. Usage examples include asking questions, specifying models, recording input/output, running in interactive mode, and using templates. The tool is lightweight, easy to install, and pipe-friendly, making it suitable for interacting with LLMs in a terminal environment.
Ollama-SwiftUI
Ollama-SwiftUI is a user-friendly interface for Ollama.ai created in Swift. It allows seamless chatting with local Large Language Models on Mac. Users can change models mid-conversation, restart conversations, send system prompts, and use multimodal models with image + text. The app supports managing models, including downloading, deleting, and duplicating them. It offers light and dark mode, multiple conversation tabs, and a localized interface in English and Arabic.
cherry-studio
Cherry Studio is a desktop client that supports multiple Large Language Model (LLM) providers, available on Windows, Mac, and Linux. It allows users to create multiple Assistants and topics, use multiple models to answer questions in the same conversation, and supports drag-and-drop sorting, code highlighting, and Mermaid chart. The tool is designed to enhance productivity and streamline the process of interacting with various language models.
awesome-khmer-language
Awesome Khmer Language is a comprehensive collection of resources for the Khmer language, including tools, datasets, research papers, projects/models, blogs/slides, and miscellaneous items. It covers a wide range of topics related to Khmer language processing, such as character normalization, word segmentation, part-of-speech tagging, optical character recognition, text-to-speech, and more. The repository aims to support the development of natural language processing applications for the Khmer language by providing a diverse set of resources and tools for researchers and developers.
west
WeST is a Speech Recognition/Transcript tool developed in 300 lines of code, inspired by SLAM-ASR and LLaMA 3.1. The model includes a Language Model (LLM), a Speech Encoder, and a trainable Projector. It requires training data in jsonl format with 'wav' and 'txt' entries. WeST can be used for training and decoding speech recognition models.
Bert-VITS2
Bert-VITS2 is a repository that provides a backbone with multilingual BERT for text-to-speech (TTS) applications. It offers an alternative to BV2/GSV projects and is inspired by the MassTTS project. Users can refer to the code to learn how to train models for TTS. The project is not maintained actively in the short term. It is not to be used for any purposes that violate the laws of the People's Republic of China, and strictly prohibits any political-related use.
comfyui_fk_server
This is an ideal Comfyui translation plugin that allows any long text input box in Comfyui to support Chinese input and automatic translation (using Baidu translation). It also includes error correction translation feature and keyword polishing feature for generating professional AI drawing prompts (using Zhipu AI big model). Additionally, it provides a one-click fix feature for correcting model references in workflows, greatly improving workflow model call correction efficiency (based on model name matching). The plugin requires Baidu translation API key for translation functionality and Zhipu AI API key for keyword polishing functionality. After installation, users can enable automatic translation mode and keyword polishing feature by double-clicking any long text input box in Comfyui.
MockingBird
MockingBird is a toolbox designed for Mandarin speech synthesis using PyTorch. It supports multiple datasets such as aidatatang_200zh, magicdata, aishell3, and data_aishell. The toolbox can run on Windows, Linux, and M1 MacOS, providing easy and effective speech synthesis with pretrained encoder/vocoder models. It is webserver ready for remote calling. Users can train their own models or use existing ones for the encoder, synthesizer, and vocoder. The toolbox offers a demo video and detailed setup instructions for installation and model training.
AITranslator
AITranslator is a software tool that utilizes a large language model to translate text from images exported by MTool into a user-friendly graphical interface. Users can start TGW to load the model, open the software, and select the text to be translated. The tool aims to simplify the translation process by leveraging advanced language processing capabilities.
agent-hub
Agent Hub is a platform for AI Agent solutions, containing three different projects aimed at transforming enterprise workflows, enhancing personalized language learning experiences, and enriching multimodal interactions. The projects include GitHub Sentinel for project management and automatic updates, LanguageMentor for personalized language learning support, and ChatPPT for multimodal AI-driven insights and PowerPoint automation in enterprise settings. The future vision of agent-hub is to serve as a launchpad for more AI Agents catering to different industries and pushing the boundaries of AI technology. Users are encouraged to explore, clone the repository, and contribute to the development of transformative AI agents.
MooER
MooER (摩耳) is an LLM-based speech recognition and translation model developed by Moore Threads. It allows users to transcribe speech into text (ASR) and translate speech into other languages (AST) in an end-to-end manner. The model was trained using 5K hours of data and is now also available with an 80K hours version. MooER is the first LLM-based speech model trained and inferred using domestic GPUs. The repository includes pretrained models, inference code, and a Gradio demo for a better user experience.
yuna-ai
Yuna AI is a unique AI companion designed to form a genuine connection with users. It runs exclusively on the local machine, ensuring privacy and security. The project offers features like text generation, language translation, creative content writing, roleplaying, and informal question answering. The repository provides comprehensive setup and usage guides for Yuna AI, along with additional resources and tools to enhance the user experience.
whisper
Whisper is an open-source library by Open AI that converts/extracts text from audio. It is a cross-platform tool that supports real-time transcription of various types of audio/video without manual conversion to WAV format. The library is designed to run on Linux and Android platforms, with plans for expansion to other platforms. Whisper utilizes three frameworks to function: DART for CLI execution, Flutter for mobile app integration, and web/WASM for web application deployment. The tool aims to provide a flexible and easy-to-use solution for transcription tasks across different programs and platforms.
Hacx-GPT
Hacx GPT is a cutting-edge AI tool developed by BlackTechX, inspired by WormGPT, designed to push the boundaries of natural language processing. It is an advanced broken AI model that facilitates seamless and powerful interactions, allowing users to ask questions and perform various tasks. The tool has been rigorously tested on platforms like Kali Linux, Termux, and Ubuntu, offering powerful AI conversations and the ability to do anything the user wants. Users can easily install and run Hacx GPT on their preferred platform to explore its vast capabilities.
Easy-Translate
Easy-Translate is a script designed for translating large text files with a single command. It supports various models like M2M100, NLLB200, SeamlessM4T, LLaMA, and Bloom. The tool is beginner-friendly and offers seamless and customizable features for advanced users. It allows acceleration on CPU, multi-CPU, GPU, multi-GPU, and TPU, with support for different precisions and decoding strategies. Easy-Translate also provides an evaluation script for translations. Built on HuggingFace's Transformers and Accelerate library, it supports prompt usage and loading huge models efficiently.
speech-to-speech
This repository implements a speech-to-speech cascaded pipeline with consecutive parts including Voice Activity Detection (VAD), Speech to Text (STT), Language Model (LM), and Text to Speech (TTS). It aims to provide a fully open and modular approach by leveraging models available on the Transformers library via the Hugging Face hub. The code is designed for easy modification, with each component implemented as a class. Users can run the pipeline either on a server/client approach or locally, with detailed setup and usage instructions provided in the readme.
next-token-prediction
Next-Token Prediction is a language model tool that allows users to create high-quality predictions for the next word, phrase, or pixel based on a body of text. It can be used as an alternative to well-known decoder-only models like GPT and Mistral. The tool provides options for simple usage with built-in data bootstrap or advanced customization by providing training data or creating it from .txt files. It aims to simplify methodologies, provide autocomplete, autocorrect, spell checking, search/lookup functionalities, and create pixel and audio transformers for various prediction formats.
banks
Banks is a linguist professor tool that helps generate meaningful LLM prompts using a template language. It provides a user-friendly way to create prompts for various tasks such as blog writing, summarizing documents, lemmatizing text, and generating text using a LLM. The tool supports async operations and comes with predefined filters for data processing. Banks leverages Jinja's macro system to create prompts and interact with OpenAI API for text generation. It also offers a cache mechanism to avoid regenerating text for the same template and context.
json-translator
The json-translator repository provides a free tool to translate JSON/YAML files or JSON objects into different languages using various translation modules. It supports CLI usage and package support, allowing users to translate words, sentences, JSON objects, and JSON files. The tool also offers multi-language translation, ignoring specific words, and safe translation practices. Users can contribute to the project by updating CLI, translation functions, JSON operations, and more. The roadmap includes features like Libre Translate option, Argos Translate option, Bing Translate option, and support for additional translation modules.
llm-past-tense
The 'llm-past-tense' repository contains code related to the research paper 'Does Refusal Training in LLMs Generalize to the Past Tense?' by Maksym Andriushchenko and Nicolas Flammarion. It explores the generalization of refusal training in large language models (LLMs) to the past tense. The code includes experiments and examples for running different models and requests related to the study. Users can cite the work if found useful in their research, and the codebase is released under the MIT License.
FireRedTTS
FireRedTTS is a foundation text-to-speech framework designed for industry-level generative speech applications. It offers a rich-punctuation model with expanded punctuation coverage and enhanced audio production consistency. The tool provides pre-trained checkpoints, inference code, and an interactive demo space. Users can clone the repository, create a conda environment, download required model files, and utilize the tool for synthesizing speech in various languages. FireRedTTS aims to enhance stability and provide controllable human-like speech generation capabilities.
75 - OpenAI Gpts
LuminaGPT
Your Ultimate AI Guide to Mastering Fantasy Languages – Dive into Interactive, Culturally-Rich Lessons Today!
ROSSETAI HIEROGLYPHS TRANSLATOR
Expert in interpreting and translating Egyptian hieroglyphs based on descriptions.
Is This Hebrew Word Male or Female? זכר או נקבה?
This simple tool will let you know if the word is male or female
Indigenous Language Supporter
Supports Indigenous language learning, particularly Cree, Ojibwe, and Oji-Cree
Avash Language Companion
Avash Tutor: Expert in Avash alphabet, words, pronunciation, and culture.
Comparative Constructions, English Grammar
Think like a prescriptive grammarian in Comparative Constructions!
Champollion (Translator for Ancient languages)
Translator for ancient languages, focusing on accuracy and historical context.
Xenoverse Explorer
Sophisticated AI creating diverse alien worlds with a focus on xenobiology and linguistics.
Starlink Interpreter
A simulator for interstellar cultural exchange with alien civilizations.
LexiDicoPolyglotte
Your digital polyglot for multilingual translations and linguistic insights.
语文学习小伙伴
Specializing in Chinese language learning, focusing on comprehensive understanding.
Chinese Translations and Abbreviations 中式翻译和汉字缩写
Translates terms and abbreviations into Chinese characters.
Dictionary 4 You
A detailed dictionary with etymology, examples, and stories, saving results to Notion.
Word Etymology
Uncover the fascinating journeys of words with Word Etymology, your expert guide to linguistic treasures!
Dictionary
A digital dictionary companion offering definitions, pronunciations, and language insights.
Lingo Translator
A language tutor and translator, offering learning support and accurate translations.
Dictionary
A Global dictionary that requires only a word to explain it in both english and its original language.
English Tutor for Portuguese Speakers
Bilingual English tutor for Portuguese speakers, focusing on grammar, vocabulary, and pronunciation.
이름 해석 마스터 GPT
'이름 해석 마스터 GPT'는 개인 이름이나 지명 등의 언어학적 의미와 기원을 해석해주는 전문 AI입니다. 이 AI는 다양한 문화와 언어 배경에서 온 이름을 분석하여, 그 이름이 가진 의미, 역사적 배경, 문화적 상징성 등을 제공합니다. 사용자가 이름을 입력하면, 해당 이름의 언어학적 특징과 기원, 그리고 가능한 의미를 설명해 줍니다. 이를 통해 사용자는 자신의 이름이나 관심 있는 이름에 대한 더 깊은 이해를 얻을 수 있습니다.
Japanese Hiragana Advisor
This GPT is able to parse a sentence, provide an appropriate translation of the input text and be able to provide a response explaining the structure of a sentence in japanese.
Language Mind Maps
Master language complexities with tailored mind maps that enhance understanding and bolster memory. Explore linguistic patterns in a visually engaging way. 🧠🗺️
Dialect Detective
Expert in distinguishing language dialects like Castilian vs Latin Spanish, and Parisian vs Canadian French.
Český jazyk - pravopis, typografie, citace
GPT, které se specializuje na český jazyk, jeho gramatiku, typografii a citace respektující ISO 690
Polish your Polish
A bilingual Polish tutor || Learn/ Translate/ Double-check Polish with some support of your native language (try our VOICE chat!)