data:image/s3,"s3://crabby-images/74c83/74c83df2ebf176f02fdd6a78b77f5efae33d2d47" alt="AIAS"
AIAS
免费,可商用,Java AI 人工智能一站式解决方案,为工作减负,为产品研发加速。项目类别包括:Java版 Pytorch 训练引擎,AI SDK,web应用等在内,合计超过100个项目组成的项目集。| Artificial Intelligence Accelerator Kit. It provides: a project collection consisting of over 100 projects, including AI SDK, web applications, desktop applications, image generation,
Stars: 812
data:image/s3,"s3://crabby-images/55cee/55ceecf0f8b219c17aed8a71fe4e1f1fdffa7171" alt="screenshot"
AIAS is a comprehensive AI training platform that offers courses and practical examples in various AI fields such as traditional image processing, deep learning algorithms, JavaAI applications, NLP, web development, image generation, and desktop application development. The platform also provides SDKs for tasks like image recognition, OCR, natural language processing, audio processing, video analysis, and big data analysis. Users can access training materials, source code, and tools for developing AI applications across different domains.
README:
- 相关源码
- 技术答疑
-
- JavaCV - java版的OpenCV实现传统图像处理(提供常用代码例子)
-
- NDArray - java版的numpy,用于高性能处理矩阵(提供常用代码例子)
-
- 深度学习算法基础
- 基础知识:前馈神经网络,卷积神经网络,循环神经网络
- 图像识别:图像分类,图像分割,目标检测
-
- java版的模型开发与训练
-
- pytorch 模型开发与训练
-
- 图像处理_SDK(培训常用图像处理,并提供可商用的源码)
- 人脸工具箱
- 人脸高清修复
- 图文高清_黑白上色
-
- NLP_SDK(培训常用自然语言处理,并提供可商用的源码)
- 代码特征向量提取
- 中文特征向量提取
- 多语言文本特征向量提取
- 机器翻译
-
- Web应用(培训如何开发web类应用,并提供可商用的源码)
- OCR,OCR自定义模版
- 人脸搜索
- 以图搜图
- 图像文本跨模态搜索
- 文本搜索
- 代码语义搜索
- 一键抠图
- 图像高清
- 机器翻译
-
- AIGC 图像生成(培训如何开发图像生成类应用,并提供可商用的源码)
- AIGC提示词如何撰写
- 图像生成预处理
- 图像生成SD工具箱
- 模型微调(LoRA)
-
- AI桌面应用开发(培训如何开发桌面应用,并提供可商用的源码)
- 大模型桌面应用
- OCR桌面应用
- 图像高清放大
-
- 大模型
- 大模型算法原理(transformer,训练,微调,推理优化)
- 知识库,RAG增强生成等
- 提示词工程
- 支持图像分类,目标检测
- 支持的能力清单
1). OCR文字识别
2). 机器翻译
3). 语音识别
...
OCR文字识别 - 自由文本识别(支持旋转、倾斜的图片)- 文本图片转正 (一般情况下不需要,因为ocr 原生支持旋转、倾斜的图片 ) |
|
语音识别 - 英文语音识别,- 中文语音识别。 |
|
202种语言互相翻译 - 支持202种语言互相翻译 |
- 1_image_sdks - [图像识别 SDK]
1). 工具箱系列:图像处理工具箱(静态图像)
2). 目标检测
3). 图像分割
4). GAN
5). 其它类别:OCR等
...
OCR工具箱 1:方向检测 - ocr_sdks/ocr_direction_det_sdk - OCR图像预处理。 |
|
OCR工具箱 2:OCR文字识别 1. ocr_sdks/ocr_v3_sdk1). V3 文本检测: - 中文文本检测 - 英文文本检测 - 多语言文本检测 2). V3 文本识别: - 中文简体 - 中文繁体 - 英文 - 韩语 - 日语 - 阿拉伯 - 梵文 - 泰米尔语 - 泰卢固语 - 卡纳达文 - 斯拉夫 2. ocr_sdks/ocr_v4_sdk - 原生支持倾斜文本文字识别。 - 更高的识别精度 - 支持中英文。 |
|
OCR工具箱 4:版面分析 - ocr_sdks/ocr_layout_sdk可以用于配合文字识别, 表格识别的流水线处理使用。 1). 中文版面分析 2). 英文版面分析 3). 中英文文档 - 表格区域检测 |
|
OCR工具箱 5: 表格识别 - ocr_sdks/ocr_table_sdk- 中英文表格识别。 |
|
动物分类识别 |
|
菜品分类识别 |
|
烟火检测 |
|
行人检测 |
|
智慧工地检测 |
|
车辆检测 |
- 2_nlp_sdks - [自然语言 SDK]
1). 工具箱系列:sentencepiece,fastText,npy/npz文件处理等。
2). 大模型
3). 词向量
4). 机器翻译
...
Sentencepiece分词 |
|
jieba分词 |
- 3_audio_sdks - [语音处理 SDK]
1). 工具箱系列:音素工具箱,librosa,java sound,javacv ffmpeg, fft, vad工具箱等。
2). 声音克隆
3). 语音合成
4). 声纹识别
5). 语音识别
...
中文语音识别(ASR) 1. 短语音- asr_whisper_sdk 2. 长语音 - asr_whisper_long_sdk |
|
TTS 文本转为语音 - tts_sdk- TTS 文本转为语音。 |
- 4_video_sdks - [视频解析SDK]
1). 摄像头口罩检测 - camera_facemask_sdk
2). MP4检测口罩 - mp4_facemask_sdk
3). rtsp取流检测口罩 - rtsp_facemask_sdk
视频流分析 1. 摄像头口罩检测- camera_facemask_sdk 2. MP4检测口罩 - mp4_facemask_sdk 3. rtsp取流检测口罩 - rtsp_facemask_sdk |
- 5_bigdata_sdks - [大数据SDK]
1). flink-情感倾向分析【英文】- flink_sentence_encoder_sdk
2). kafka-情感倾向分析【英文】- kafka_sentiment_analysis_sdk
...
大数据分析 flink-情感倾向分析flink_sentiment_analysis_sdk kafka-情感倾向分析 kafka_sentiment_analysis_sdk 针对带有主观描述的文本, 可自动判断该文本的情感极性类别并给出相应的置信度。 |
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for AIAS
Similar Open Source Tools
data:image/s3,"s3://crabby-images/55cee/55ceecf0f8b219c17aed8a71fe4e1f1fdffa7171" alt="AIAS Screenshot"
AIAS
AIAS is a comprehensive AI training platform that offers courses and practical examples in various AI fields such as traditional image processing, deep learning algorithms, JavaAI applications, NLP, web development, image generation, and desktop application development. The platform also provides SDKs for tasks like image recognition, OCR, natural language processing, audio processing, video analysis, and big data analysis. Users can access training materials, source code, and tools for developing AI applications across different domains.
data:image/s3,"s3://crabby-images/ffebf/ffebf7b8d506ca1fdcf00b3c44fc7b28425ec488" alt="chatgpt-auto-refresh Screenshot"
chatgpt-auto-refresh
ChatGPT Auto Refresh is a userscript that keeps ChatGPT sessions fresh by eliminating network errors and Cloudflare checks. It removes the 10-minute time limit from conversations when Chat History is disabled, ensuring a seamless experience. The tool is safe, lightweight, and a time-saver, allowing users to keep their sessions alive without constant copy/paste/refresh actions. It works even in background tabs, providing convenience and efficiency for users interacting with ChatGPT. The tool relies on the chatgpt.js library and is compatible with various browsers using Tampermonkey, making it accessible to a wide range of users.
data:image/s3,"s3://crabby-images/f4d9a/f4d9aec796f85136042cb4bf1a467617bb48d229" alt="aituber-kit Screenshot"
aituber-kit
AITuber-Kit is a tool that enables users to interact with AI characters, conduct AITuber live streams, and engage in external integration modes. Users can easily converse with AI characters using various LLM APIs, stream on YouTube with AI character reactions, and send messages to server apps via WebSocket. The tool provides settings for API keys, character configurations, voice synthesis engines, and more. It supports multiple languages and allows customization of VRM models and background images. AITuber-Kit follows the MIT license and offers guidelines for adding new languages to the project.
data:image/s3,"s3://crabby-images/00516/005161705878c36be7ea42873034a39add8018ff" alt="langchat Screenshot"
langchat
LangChat is an enterprise AIGC project solution in the Java ecosystem. It integrates AIGC large model functionality on top of the RBAC permission system to help enterprises quickly customize AI knowledge bases and enterprise AI robots. It supports integration with various large models such as OpenAI, Gemini, Ollama, Azure, Zhifu, Alibaba Tongyi, Baidu Qianfan, etc. The project is developed solely by TyCoding and is continuously evolving. It features multi-modality, dynamic configuration, knowledge base support, advanced RAG capabilities, function call customization, multi-channel deployment, workflows visualization, AIGC client application, and more.
data:image/s3,"s3://crabby-images/7c10b/7c10b3597710597fd80aed10c94fa904aefdf13b" alt="FastGPT Screenshot"
FastGPT
FastGPT is a knowledge base Q&A system based on the LLM large language model, providing out-of-the-box data processing, model calling and other capabilities. At the same time, you can use Flow to visually arrange workflows to achieve complex Q&A scenarios!
data:image/s3,"s3://crabby-images/72783/7278372ea82b546e8cdd610fc4852b8d8131d0fd" alt="lobe-icons Screenshot"
lobe-icons
Lobe Icons is a collection of popular AI / LLM Model Brand SVG logos and icons. It features lightweight and scalable icons designed with highly optimized scalable vector graphics (SVG) for optimal performance. The collection is tree-shakable, allowing users to import only the icons they need to reduce the overall bundle size of their projects. Lobe Icons has an active community of designers and developers who can contribute and seek support on platforms like GitHub and Discord. The repository supports a wide range of brands across different models, providers, and applications, with more brands continuously being added through contributions. Users can easily install Lobe UI with the provided commands and integrate it with NextJS for server-side rendering. Local development can be done using Github Codespaces or by cloning the repository. Contributions are welcome, and users can contribute code by checking out the GitHub Issues. The project is MIT licensed and maintained by LobeHub.
data:image/s3,"s3://crabby-images/65b82/65b8298406041b85b1d2471303ca869745d249b2" alt="googlegpt Screenshot"
googlegpt
GoogleGPT is a browser extension that brings the power of ChatGPT to Google Search. With GoogleGPT, you can ask ChatGPT questions and get answers directly in your search results. You can also use GoogleGPT to generate text, translate languages, and more. GoogleGPT is compatible with all major browsers, including Chrome, Firefox, Edge, and Safari.
data:image/s3,"s3://crabby-images/dc918/dc91806953a0e4b9018ec3f7d1b3edad35612dc1" alt="chatgpt.js Screenshot"
chatgpt.js
chatgpt.js is a powerful JavaScript library that allows for super easy interaction w/ the ChatGPT DOM. * Feature-rich * Object-oriented * Easy-to-use * Lightweight (yet optimally performant)
data:image/s3,"s3://crabby-images/de2ea/de2ea1dcc3c92df483c7ecd20f1cc866a2d4d5f1" alt="awesome-LLM-AIOps Screenshot"
awesome-LLM-AIOps
The 'awesome-LLM-AIOps' repository is a curated list of academic research and industrial materials related to Large Language Models (LLM) and Artificial Intelligence for IT Operations (AIOps). It covers various topics such as incident management, log analysis, root cause analysis, incident mitigation, and incident postmortem analysis. The repository provides a comprehensive collection of papers, projects, and tools related to the application of LLM and AI in IT operations, offering valuable insights and resources for researchers and practitioners in the field.
data:image/s3,"s3://crabby-images/55b1f/55b1f5494f779f0cabf236755bf9eb605639945a" alt="bravegpt Screenshot"
bravegpt
BraveGPT is a userscript that brings the power of ChatGPT to Brave Search. It allows users to engage with a conversational AI assistant directly within their search results, providing instant and personalized responses to their queries. BraveGPT is powered by GPT-4, the latest and most advanced language model from OpenAI, ensuring accurate and comprehensive answers. With BraveGPT, users can ask questions, get summaries, generate creative content, and more, all without leaving the Brave Search interface. The tool is easy to install and use, making it accessible to users of all levels. BraveGPT is a valuable addition to the Brave Search experience, enhancing its capabilities and providing users with a more efficient and informative search experience.
data:image/s3,"s3://crabby-images/ee883/ee883ae87cd35317b956c549278a8434874c9c96" alt="MING Screenshot"
MING
MING is an open-sourced Chinese medical consultation model fine-tuned based on medical instructions. The main functions of the model are as follows: Medical Q&A: answering medical questions and analyzing cases. Intelligent consultation: giving diagnosis results and suggestions after multiple rounds of consultation.
data:image/s3,"s3://crabby-images/fbacb/fbacbbc29f0153115561de688e976f3ec7570344" alt="Awesome-LM-SSP Screenshot"
Awesome-LM-SSP
The Awesome-LM-SSP repository is a collection of resources related to the trustworthiness of large models (LMs) across multiple dimensions, with a special focus on multi-modal LMs. It includes papers, surveys, toolkits, competitions, and leaderboards. The resources are categorized into three main dimensions: safety, security, and privacy. Within each dimension, there are several subcategories. For example, the safety dimension includes subcategories such as jailbreak, alignment, deepfake, ethics, fairness, hallucination, prompt injection, and toxicity. The security dimension includes subcategories such as adversarial examples, poisoning, and system security. The privacy dimension includes subcategories such as contamination, copyright, data reconstruction, membership inference attacks, model extraction, privacy-preserving computation, and unlearning.
data:image/s3,"s3://crabby-images/01fec/01feccc5630a1bc05c03b7aba81c5fff0d84ef59" alt="springboot-openai-chatgpt Screenshot"
springboot-openai-chatgpt
The springboot-openai-chatgpt repository is an open-source project for a super AI brain that utilizes GPT technology to quickly generate language content such as copies, love letters, and questions. Users can input keywords to enhance work efficiency and creativity. The AI brain combines powerful question-answering systems and knowledge graphs to provide comprehensive and accurate answers. It supports programming tasks, generates code using GPT, and continuously strengthens its capabilities with growing data to provide superior intelligent applications.
data:image/s3,"s3://crabby-images/7dc54/7dc54429873ef6ce9ea7d9ecac7d1cdac21c7c3e" alt="aiwechat-vercel Screenshot"
aiwechat-vercel
aiwechat-vercel is a tool that integrates AI capabilities into WeChat public accounts using Vercel functions. It requires minimal server setup, low entry barriers, and only needs a domain name that can be bound to Vercel, with almost zero cost. The tool supports various AI models, continuous Q&A sessions, chat functionality, system prompts, and custom commands. It aims to provide a platform for learning and experimentation with AI integration in WeChat public accounts.
For similar tasks
data:image/s3,"s3://crabby-images/55cee/55ceecf0f8b219c17aed8a71fe4e1f1fdffa7171" alt="AIAS Screenshot"
AIAS
AIAS is a comprehensive AI training platform that offers courses and practical examples in various AI fields such as traditional image processing, deep learning algorithms, JavaAI applications, NLP, web development, image generation, and desktop application development. The platform also provides SDKs for tasks like image recognition, OCR, natural language processing, audio processing, video analysis, and big data analysis. Users can access training materials, source code, and tools for developing AI applications across different domains.
data:image/s3,"s3://crabby-images/acc2c/acc2ceb45ec56809b50ee71874733caebbe4f4dc" alt="aws-ai-ml-workshop-kr Screenshot"
aws-ai-ml-workshop-kr
AWS AI/ML Workshop & example collection in Korean. The example codes in this repository are divided into 4 categories: AI services, Applied AI, SageMaker, Integration, Generative AI, and AWS Neuron. Each directory has its own Readme file. This repository also provides useful information for self-studying SageMaker.
data:image/s3,"s3://crabby-images/8939a/8939a85f58d0be9ee9a37deff975416070963012" alt="AiLearning-Theory-Applying Screenshot"
AiLearning-Theory-Applying
This repository provides a comprehensive guide to understanding and applying artificial intelligence (AI) theory, including basic knowledge, machine learning, deep learning, and natural language processing (BERT). It features detailed explanations, annotated code, and datasets to help users grasp the concepts and implement them in practice. The repository is continuously updated to ensure the latest information and best practices are covered.
data:image/s3,"s3://crabby-images/37991/379919f0235c43fbc45f343131226fba43222f95" alt="AI0x0.com Screenshot"
AI0x0.com
AI 0x0 is a versatile AI query generation desktop floating assistant application that supports MacOS and Windows. It allows users to utilize AI capabilities in any desktop software to query and generate text, images, audio, and video data, helping them work more efficiently. The application features a dynamic desktop floating ball, floating dialogue bubbles, customizable presets, conversation bookmarking, preset packages, network acceleration, query mode, input mode, mouse navigation, deep customization of ChatGPT Next Web, support for full-format libraries, online search, voice broadcasting, voice recognition, voice assistant, application plugins, multi-model support, online text and image generation, image recognition, frosted glass interface, light and dark theme adaptation for each language model, and free access to all language models except Chat0x0 with a key.
data:image/s3,"s3://crabby-images/5aaeb/5aaeb4fce7aac314b8303cdf1d5dfd9ebbc4d877" alt="monadic-chat Screenshot"
monadic-chat
Monadic Chat is a locally hosted web application designed to create and utilize intelligent chatbots. It provides a Linux environment on Docker to GPT and other LLMs, enabling the execution of advanced tasks that require external tools. The tool supports voice interaction, image and video recognition and generation, and AI-to-AI chat, making it useful for using AI and developing various applications. It is available for Mac, Windows, and Linux (Debian/Ubuntu) with easy-to-use installers.
data:image/s3,"s3://crabby-images/83afc/83afcd39fd69a41723dd590c7594d452ad40edd5" alt="VisionCraft Screenshot"
VisionCraft
The VisionCraft API is a free API for using over 100 different AI models. From images to sound.
data:image/s3,"s3://crabby-images/fa48f/fa48f2d0db61427023099414ac1c2eb560ac53b8" alt="openvino Screenshot"
openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference. It provides a common API to deliver inference solutions on various platforms, including CPU, GPU, NPU, and heterogeneous devices. OpenVINO™ supports pre-trained models from Open Model Zoo and popular frameworks like TensorFlow, PyTorch, and ONNX. Key components of OpenVINO™ include the OpenVINO™ Runtime, plugins for different hardware devices, frontends for reading models from native framework formats, and the OpenVINO Model Converter (OVC) for adjusting models for optimal execution on target devices.
data:image/s3,"s3://crabby-images/2c46a/2c46a6847fa78880c37f14cbbc4914aa400b3a09" alt="djl-demo Screenshot"
djl-demo
The Deep Java Library (DJL) is a framework-agnostic Java API for deep learning. It provides a unified interface to popular deep learning frameworks such as TensorFlow, PyTorch, and MXNet. DJL makes it easy to develop deep learning applications in Java, and it can be used for a variety of tasks, including image classification, object detection, natural language processing, and speech recognition.
For similar jobs
data:image/s3,"s3://crabby-images/7a828/7a828889d979cbf4be5a04454f679734bb36585f" alt="sweep Screenshot"
sweep
Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.
data:image/s3,"s3://crabby-images/cac11/cac1100b7e92d3c9c9529eacfe5a6e8d943d8f57" alt="teams-ai Screenshot"
teams-ai
The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.
data:image/s3,"s3://crabby-images/10f6b/10f6b939c21eecaacb4aeb678159f5a587a20256" alt="ai-guide Screenshot"
ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.
data:image/s3,"s3://crabby-images/8b8c3/8b8c30180bcfba25fde40a102b6ae98fd35704b8" alt="classifai Screenshot"
classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.
data:image/s3,"s3://crabby-images/c6b52/c6b52a0438e707c19f9dcb358608627496141f31" alt="chatbot-ui Screenshot"
chatbot-ui
Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.
data:image/s3,"s3://crabby-images/2fa15/2fa15d62e208bea0a119405a82ad37a6b24564c0" alt="BricksLLM Screenshot"
BricksLLM
BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students
data:image/s3,"s3://crabby-images/e597e/e597e24a3c2657c376591c1e0da9159b22cd2ff2" alt="uAgents Screenshot"
uAgents
uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.
data:image/s3,"s3://crabby-images/8ab69/8ab692a869eef895ffca840dda9b43d13f3cf958" alt="griptape Screenshot"
griptape
Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.