RookieAI_yolov8
AI self-aiming project based on YOLOv8
Stars: 190
RookieAI_yolov8 is an open-source project designed for developers and users interested in utilizing YOLOv8 models for object detection tasks. The project provides instructions for setting up the required libraries and Pytorch, as well as guidance on using custom or official YOLOv8 models. Users can easily train their own models and integrate them with the software. The tool offers features for packaging the code, managing model files, and organizing the necessary resources for running the software. It also includes updates and optimizations for better performance and functionality, with a focus on FPS game aimbot functionalities. The project aims to provide a comprehensive solution for object detection tasks using YOLOv8 models.
README:
Version | Python |
---|---|
2.4.3 or earlier | 3.7+ |
2.4.4.2 or later | 3.10+ |
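Foreword: to keep the environment maintainable, no ready-made packaged software is provided; the code is open source to encourage self-study. Be sure to read the parameter explanation document before use.
[!Warning]
If you only want to use the software, download the packaged exe file directly and read the parameter explanation document first.
If you only run this project locally, do not package it with pyinstaller.
The packaging method is not yet complete. If you run into problems, do not submit Issues; resolve them yourself.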
✨ Fast, painless download ✨
pip install -r requirements.txt -i https://pypi.doubanio.com/simple/
pip install torch torchvision torchaudio -f https://mirror.sjtu.edu.cn/pytorch-wheels/torch_stable.html --no-index
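When no model is found, the YOLOv8n model is downloaded automatically. You can also ⬇️
Visit the YOLOv8 GitHub page to get more official YOLOv8 models for a quick start
Visit the ultralytics website for the official help documentation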
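Open the software > select a model file > save the settings > close the software, then restart it.
The selected model file is then loaded.
Alternatively:
Change the default file path:
# Default model file path
default_model_file="yolov8n.pt"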
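The fallback from a saved model choice to the default file could look roughly like the sketch below. It is not the project's actual loading code: the function `resolve_model_file` and the settings key `"model_file"` are hypothetical names chosen for illustration, and only `default_model_file="yolov8n.pt"` comes from the text above.

```python
import json
from pathlib import Path

# Default model file path, as given in the README.
default_model_file = "yolov8n.pt"


def resolve_model_file(settings_path: str, key: str = "model_file") -> str:
    """Return the saved model path if it is usable, else the default.

    `key` is a hypothetical settings.json entry; the real key may differ.
    """
    path = Path(settings_path)
    if not path.is_file():
        return default_model_file
    try:
        settings = json.loads(path.read_text(encoding="utf-8"))
    except json.JSONDecodeError:
        # A corrupted settings file should not prevent startup.
        return default_model_file
    if not isinstance(settings, dict):
        return default_model_file
    chosen = settings.get(key)
    if isinstance(chosen, str) and Path(chosen).is_file():
        return chosen
    return default_model_file
```

Because the path is resolved once at startup, this also illustrates why a newly selected model only takes effect after saving and restarting.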
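This part of the tutorial has not been updated and is for reference only.
If you are a developer, you can skip this step and simply run the project from a terminal.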
pip install -r requirements.txt
pip install torch==2.2.0 torchvision==0.17.0 torchaudio==2.2.0 --index-url https://download.pytorch.org/whl/cu121
pip install pyinstaller
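pyinstaller xxxxx.py
Replace xxxxx with the actual name of the script.
More on packaging parameters: the most complete tutorial on packaging an exe with Python pyinstaller
❗ A problem you are certain to hit:
Q: At runtime you get FileNotFoundError: (cause: missing libraries).
A: Copy all the libraries from your Python virtual environment into the <_internal> folder of the packaged output.
Reference: fixing missing libraries after packaging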
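Training your own model is recommended.
Learning materials:
❗ After selecting a model file in the software, you must save and restart for it to take effect, because the model has to be reloaded.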
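Place
_internal (software environment/libraries)
body_photo.png (image 1 required by the software)
logo-bird.png (image 2 required by the software)
the program .exe (main program)
settings.json (saved parameters)
the model .pt file (model file)
in the same directory, then run the exe file directly.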
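Comparison of the oldGUI and newGUI versions:
🎉 All-new V2.0: new layout, better optimization, more settings
AtlasOS is a modification of Windows designed for gamers, offering higher in-game frame rates and lower latency. Running RookieAI on this system also uses GPU resources more efficiently and yields a higher inference frame rate.
boosterX is a system optimization tool that tunes Windows to reduce latency and raise FPS. Using it on AtlasOS allows further optimization.
Test configuration: screenshot mode: mss; screenshot height/width: 320; GPU: RTX4080M; model: YOLOv8s_TheFinals_teammate_enemy_04.engine
Stock Windows running RookieAI 2.4.3 at idle versus AtlasOS running RookieAI 2.4.3 at idle: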
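This project was originally intended as an aimbot for Apex and did not take other games into account. It may not work in games whose anti-cheat blocks the WIN32 mouse-movement method!
Known games: VALORANT
With the number of configuration-file parameters growing, I created a parameter explanation document that describes every parameter in the configuration file; see the parameter explanation document.
(;′⌒`) Sad news: the VALORANT-compatible version finished on the 23rd died on the spot because of Riot's 5/24 anti-cheat update :(
🎉🎉🎉 Many thanks to RicardoJoaquim for providing the special English version 🎉🎉🎉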
9/6/2024 update:
🎛️ Version update: V2.5.6.1(beta)
Summary: added several hotkeys
New hotkeys: Left Ctrl, Right Ctrl, Left Shift, Right Shift, Left Alt, Right Alt
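8/26/2024 update:
🎛️ Version update: V2.5.6(beta)
Summary: added support for onnx models
➕ Added: all-target detection. With 'Target selection - All', the model detects every target with a class ID from 0 to 9 (the upper limit can be changed in the code).
➕ Added: onnx model support. Use the 'cuDNN_download_V9.3_12.6.bat' tool to set up the onnx environment quickly, which enables GPU-accelerated onnx models.
➖ Removed: "pixel prediction", together with its adjustable options.
❗ Note: onnx models require cuDNN. Use the cuDNN_download_V9.3_12.6.bat tool in the tool folder to set up cuDNN quickly, then run pip install onnxruntime-gpu in your environment to download the onnx runtime, and finally move the onnx and onnxruntime-gpu folders into the program's runtime folder _internal.
Tips: remove the CPU version of onnxruntime; it conflicts with the GPU version.
❗ Note: onnxruntime-gpu has been added to requirements.txt.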
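8/19/2024 update:
🎛️ Version update: V2.5.5(beta)
Summary: added detection of the target's movement direction to predict the aim position
➕ Added: target prediction. A sparse optical flow routine analyzes the movement direction of pixels in the scene to estimate the target's movement direction and predict the aim position. Enable it under "Prediction method > Automatic prediction". Manual prediction is in development.
➕ Added: mouse smoothing. Three processing stages produce smooth mouse movement: 1. detect and filter out reverse movement within a short time; 2. slow down when the target stops, for precise aiming; 3. exponential smoothing, which takes a weighted average of the previous frame's position and the current predicted position to reduce sudden, large, unnatural mouse movements.
➖ Removed: "pixel prediction", together with its adjustable options.
🛠️ Changed: the former "multiplier prediction" is now "Mouse acceleration method > Multiplier acceleration"
🛠️ Optimized: further optimized segmented aiming
❗ Note: mouse smoothing and target prediction cannot currently be enabled together; the aim position becomes misaligned.
🕳️ Planned for the full version: 1. settings page for sparse optical flow/prediction parameters 2. settings page for mouse smoothing parameters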
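8/16/2024 update:
🎛️ Version update: V2.5.3(beta)
Summary: V2.5.3(beta) optimizes multiplier prediction and adds aim smoothing (temporarily tied to multiplier prediction)
➕ Added: mouse movement smoothing
➖ Removed: the new segmented aiming mode; it did not seem useful
🛠️ Optimized: multiplier prediction (results are still not ideal)
🛠️ Optimized: segmented aiming logic: once the target enters the soft-lock range, mouse speed changes gradually with distance instead of abruptly, giving smoother, more natural aiming.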
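7/15/2024 update:
🎛️ Version update: V2.4.4.2(beta)/V2.5.0(beta)
Summary: V2.4.4.2(beta) adds the 'ignore color' feature; V2.5.0(beta) refactors the code and removes some unused content
➕ Added: V2.4.4.2(beta) ignore-color feature, which helps single-label models tell friend from foe.
➕ Added: V2.5.0(beta) auto-fire; target distance checking is being tested
➖ Removed: V2.5.0(beta) removes the random signature
🛠️ Optimized: V2.5.0(beta), @molanp optimized the V2.4.3 code
Python requirement: >=3.10
📑 New files: V2.5.0(beta)
👋 Thanks:
@molanp: refactored and optimized the V2.4.3 code.
@CougarP: support for the color-ignore feature.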
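7/8/2024 update:
🎛️ Version update: V2.4.3
Summary: V2.4.3 release, adds recoil-control assistance.
➕ Added: recoil-control assistance; its parameters can be adjusted on the corresponding settings page
➕ Added: set "deactivate_dxcam" to true in settings.json to disable loading of the dxcam module
🛠️ Optimized: update announcements are now requested through an anonymous API, which is faster and more stable; limited to 60 requests per hour.
❗ Note: the configuration file (settings.json) has changed; remember to replace it.
👋 Thanks:
@molanp: provided the method of requesting update announcements via the API, making requests and loading faster and more stable!
🕳️ V2.4.4 plans:
Auto-trigger, stop-aiming range.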
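How a flag such as "deactivate_dxcam" might gate an optional import is sketched below. This is an assumption about the mechanism, not the project's code: the function name `load_dxcam` is hypothetical, and only the settings key comes from the changelog entry above.

```python
import importlib
import json


def load_dxcam(settings: dict):
    """Return the dxcam module, or None when it is disabled or unavailable."""
    if settings.get("deactivate_dxcam", False):
        # The user asked not to load dxcam at all.
        return None
    try:
        return importlib.import_module("dxcam")
    except (ImportError, OSError):
        # dxcam is not installed or cannot initialise on this machine.
        return None


if __name__ == "__main__":
    settings = json.loads('{"deactivate_dxcam": true}')
    print(load_dxcam(settings))
```

Returning `None` in both cases lets the caller fall back to another screenshot method without special-casing the reason.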
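7/3/2024 update:
🎛️ Version update: V2.4.2
Summary: fixes bugs in the V2.4.1 release
🛠️ Fixed: serious bugs such as infinite loops, unexpected pop-ups and parameter resets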
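7/2/2024 update:
🎛️ Version update: V2.4.1
Summary: V2.4.1 release, adds and completes GUI pages for more features.
➕ Added: random signature
➕ Added: random window name
➕ Added: choice of mouse movement library:
- win32
- mouse
- Feiyilai USB device
- Logitech movement (requires an old version of GHUB; the installer is in the LGmouseControl folder)
➕ Added: x-axis aim offset
➕ Added: GUI for random aim offset (random aim point); set the random aim range and the position switching speed in the extra pop-up window
🛠️ Optimized: the console no longer floods with output (except when dragging sliders to tune parameters); the FPS display rate can be adjusted in the code
🛠️ Fixed: black screen with the Dxcam screenshot method
❗ Note: the configuration file (settings.json) has changed; remember to replace it.
❗ Note: some libraries were added to requirements.txt
📑 New files:
- Logitech movement DLL (MouseControl.dll)
- Logitech GHUB and related files (LGmouseControl folder)
👋 Thanks:
@loy0516: suggested adding a sleep to prevent repeated output while loading.
@CougarP: provided the English version and its exclusive features:
- PID control, making mouse movement smoother and more precise
- Color ignore: checks the color in the area above the detection box and ignores the box if it matches the selected color, among others. These features will be synced to the Chinese version later.
🕳️ V2.4.2 plans:
Vertical recoil control, auto-trigger, kmbox compatibility
🤔 Request: if you know of a better mouse movement library or movement method, please contact me; I will look into it and add it to the code! TY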
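6/21/2024 update:
🎛️ Version update: V2.4.1(test)
Summary: this version aims to improve the software's stability and security, bypass anti-cheat detection, and increase the randomness of the code.
➕ Added: random signature.
➕ Added: random window name.
➕ Added: random aim offset.
See the parameter explanation document for the aim offset parameters.
❗ Note: the configuration file has changed; remember to replace it.
❗ Note: the test version's GUI is incomplete; new features must be configured in the configuration file.
❗ Note: test versions have been moved to the Beta folder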
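6/4/2024 update:
🎛️ Version update: V2.3.3(test)
➕ Added: fine-tuning of the horizontal aim offset (testing).
❗ Note: the configuration file has changed; remember to replace it.
❗ Note: the test version's GUI is incomplete; new features must be configured in the configuration file.
❗❗ Known issue: TRT models may produce detection errors and abnormal bounding boxes; try several TRT models or use a PT model.
🕳️ V2.3.3 plans:
1. Add vertical recoil control;
2. Complete the GUI page for horizontal aim offset;
3. Compatible selection of inference parameters.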
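5/30/2024 update:
🎛️ Version update: V2.3.2
➕ Added: compatibility with TRT models. *.engine models can now be used for inference.
📑 V2.4.0 test version:
Yijianshu (易键鼠) version, based on v2.3.1.
Recommended only for developers who understand Yijianshu. This version is intended for games that block the win32 movement method (for example VALORANT, although that no longer works) and is harder to detect.
It is currently half-finished and requires you to buy a Yijianshu USB device yourself; not recommended for beginners.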
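5/12/2024 update:
🎛️ Version update: V2.3.1 (release)
➕ Added: GUI module for the four segmented-aiming parameters, for easier adjustment.
✨ Tip: the software window stays on top by default
🛠️ Optimized: cleaned up some code and pinned inference to the GPU (device="cuda:0").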
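4/27/2024 update:
🎛️ Version update: V2.3(test)
➕ Added:
- A faster screenshot method (bettercam, an optimized version based on DXcam); the screenshot method (MSS/bettercam) can be switched in the configuration file.
- Loading of the announcement can now be turned on or off in the configuration file.
- A button that opens the configuration file in one click, making it easier to edit.
- Segmented aiming (test version) for a better aiming experience; enable it in the configuration file.
📑 New: parameter explanation document (Parameter_explanation.txt)
This file explains the meaning of every parameter in settings.json
❗ Note:
- The test version's GUI is incomplete; new features must be enabled and configured in the configuration file.
- See the parameter explanation document for more notes.
- A new screenshot library (bettercam) was introduced and added to requirements.txt.
- The configuration file has changed; remember to replace it!
🛠️ Optimized: improved the loading process, but it is still unstable
🕳️ Future plans:
Complete the V2.3 GUI
Auto-trigger, mouse smoothing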
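4/22/2024 update:
➕ Added: online version check (depends on the GitHub README.md; a VPN may be needed to load it).
✨ Tip: an outdated version number is shown in red and the latest version number in green; click the version number to jump to the GitHub page.
❗ Note: loading may take longer and the loading time is unstable. The loading page may stop responding; waiting a few seconds resolves it. The issue is under investigation.
🕳️ Future plans:
Smooth mouse movement with Bézier curves
Auto-trigger, mouse smoothing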
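The red/green version indicator described in the 4/22/2024 entry can be approximated as follows. This is only a sketch: the helper names are hypothetical, and it assumes the check reads a "Current latest version:" line from the README text, as such a line appears at the end of this README; the real parsing logic may differ.

```python
import re


def parse_version(text: str) -> tuple[int, ...]:
    """Extract the numeric part of a version such as 'V2.4.1(test)'."""
    match = re.search(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError(f"no version number in {text!r}")
    return tuple(int(part) for part in match.group().split("."))


def latest_from_readme(readme: str) -> tuple[int, ...]:
    """Find the version announced on the 'Current latest version:' line."""
    match = re.search(r"Current latest version:\s*(\S+)", readme)
    if match is None:
        raise ValueError("README does not announce a latest version")
    return parse_version(match.group(1))


def version_color(local: str, readme: str) -> str:
    """Green when the local version is up to date, red when it is outdated."""
    if parse_version(local) >= latest_from_readme(readme):
        return "green"
    return "red"


if __name__ == "__main__":
    sample = "changelog text\nCurrent latest version: V2.4.3\n"
    print(version_color("V2.3.1", sample))
```

Comparing tuples of integers avoids the pitfall of string comparison, where "2.10" would sort before "2.9".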
4/21/2024 update:
🛠️ Fixed: the NameError problem; the software now loads reliably.
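4/20/2024 update:
🎉 Released: V2.2, with the settings pages streamlined and regrouped
➕ Added:
- Inference latency compensation settings (prediction)
  - Multiplier prediction
  - Pixel prediction
- Selection of a single detection target (requires model support, and you must edit the target name list in the code to match your model)
- A test view that can be toggled inside the software, for easier tuning
- One-click restart button (for quick restarts after changing the model or after an error)
- Update announcement panel (depends on the GitHub README.md; a VPN may be needed to load it)
🛠️ Optimized: the external "frame_" debug window is hidden by default (enable it in settings.json) to improve inference speed
❗ Note:
- The configuration file (settings.json) changed in "V2.2" and must be replaced.
- The GitHub repository has been tidied; historical versions and their matching configuration files are now in the historic_version folder.
❗❗ Known issue: a NameError may occur and will be fixed in a later version. For now, restarting the software resolves it; ticking "Run as administrator" and "Register this program for restart" in the file properties may also help.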
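4/8/2024 update:
🎉 Released: all-new GUI version, styled with the customtkinter library for a cleaner, more intuitive interface.
➕ Added: splash screen
📑 New: version numbers now start from V1.0 (everything before counts as testing)
❗ Note: the "newGUI" version needs two new image files; remember to replace them.
❗ Note: the image file for the "oldGUI" version has been removed, along with its background-image code.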
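4/2/2024 update:
✨ Improved: the side-button trigger switch is now independent, so it works with any key trigger mode and is easier to toggle.
🛠️ Optimized: tidied the trigger detection code so it is more consistent and easier to follow
❗ Note: the configuration file has changed (a side-button parameter was added); remember to replace it.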
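4/1/2024 update:
✨ Improved:
Aim offset. Previously a fixed pixel offset was used to set the vertical aim position, which ignored the effect of enemy distance: the offset was too low at close range and too high at long range.
The offset is now the distance from the target's center point to the top edge of its box multiplied by a user-set factor, so the offset lands in a suitable position at both close and long range.
🛠️ Optimized: reduced the code size, removed code for features not currently enabled, and added more comments for readability.
➖ Removed: recoil control is removed for now; the cause of its unexpected interruptions could not be found in testing, and it may be split out as a separate module.
🕳️ Future plans:
1. Support other file formats, such as ONNX.
2. Make the "side-button trigger" switch independent.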
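3/19/2024 update:
➕ Added: custom recoil control (it often fails, so treat it as absent; to be fixed later)
➕ Added: triggering with the mouse's lower side button, handy for close-range hip fire. (Requires the "shift + press" trigger mode.)
🎛️ Tested: the latest YOLOv9, YOLOv5 and YOLOv8 are supported. You can train your own YOLO model and use it.
📑 New: a recoil-control parameter file that you can adjust yourself. (Vertical recoil only, split into three segments; you can add more. There is a bug where recoil control stops working after a while.)
🕳️ Future plans:
1. Support other file formats, such as ONNX.
2. Make the "side-button trigger" switch independent.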
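2/27/2024 update:
➕ Added: custom fine-tuning of the aim position (aim offset)
➕ Added: the model file can be selected manually, widening the range of use
✨ Improved: semi-transparent UI
✨ Improved: the parameter tuning page stays on top
❗ Note: the configuration file has changed and must be replaced with the latest version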
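2/4/2024 update:
🛠️ Fixed: files could not be selected manually; a process conflict disabled the parameter tuning panel
📑 New: created the changelog; every update is recorded from now on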
2/25/2024 update:
📢 Project started: YOLOv8-based auto-aim software for FPS games
🤖 Update: implemented basic auto-aim functionality.
1/28/2024 release:
👼 First created as a YOLOv8-based local video preprocessing project
Current latest version: V2.4.3