uDesktopMascot
desktop mascot open project
Stars: 318
uDesktopMascot is an open-source project for a desktop mascot application with a theme of 'freedom of creation'. It allows users to load and display VRM or GLB/FBX model files on the desktop, customize GUI colors and background images, and access various features through a menu screen. The application supports Windows 10/11 and macOS platforms.
README:
日本語 | English | 中文 | Español | Français
注意: 上記の言語(English、中文、Español、Français)は、GPT-4o-miniによる自動翻訳で生成されています。翻訳の精度やニュアンスに関しては、原文(日本語)をご参照ください。
「uDesktopMascot」は、創作の自由化 をテーマにした デスクトップマスコットアプリケーションのオープンソースプロジェクトです。
一機能の例としてVRMやGLB/FBX形式のモデルを読み込み、デスクトップ上に表示することができます。またメニュー画面やアプリケーションウインドウなどのGUIの色や背景画像を自由に設定することもできます。
詳しい機能一覧は 機能一覧 を参照してください。
対応プラットフォーム
- Windows 10/11
- macOS
アプリには以下の機能が実装されています。詳細は以下のリストを参照してください。
外部アセットの追加は、StreamingAssetsフォルダに配置することで実現できます。
モデル・アニメーション
- StreamingAssetsに配置した任意モデルファイルを読み込んで表示します。
- VRM(1.x, 0.x)形式のモデルをサポートしています。
- GLB/GLTF形式のモデルをサポートしています。(アニメーションは対応していません)
- FBX形式のモデルをサポートしています。(ただし一部のモデルではテクスチャーがロードができません。またアニメーションは対応していません)
- テクスチャーは StreamingAssets/textures/ に配置することで読み込むことができます。
- モデル選択・追加画面からVRMモデルの追加
- パスを指定して追加
- ファイル選択ダイアログから追加
ボイス・BGM
- SteamingAssets/Voice/以下に配置した音声ファイルを読み込んで再生します。複数ある場合は、ランダムで再生します。
- クリック時に再生される音声は、StreamingAssets/Voice/Click/に配置した音声ファイルを読み込んで再生します。
- SteamingAssets/BGM/以下に配置した音楽ファイルを読み込んで再生します。複数ある場合は、ランダムで再生します。
- キャラクターのデフォルトのボイスの追加
- デフォルトのボイスは、COEIROINK:つくよみちゃんの音声を使用しています。
- アプリ起動時、アプリ終了時、クリック時に再生されます。
音声認識・AIチャット
- オフライン音声認識エンジン Vosk を統合し、マイク入力をリアルタイムにテキスト化します。
- 中間結果
[STT][partial]と確定結果[STT][final]を Unity Console に表示。 - 無音が
VadSilenceSeconds(既定 1.0 秒)続くとテキストが確定し、LLM へ送信されます([STT][send]ログ)。
- 中間結果
- テキストが確定すると AI チャット機能に音声メッセージを渡し、キャラクターが応答を読み上げます。
- 応答生成中はマイクを自動で一時停止し、誤認識を防止します。
- ChatDialog 画面のマイクボタンで録音の開始/停止をトグル操作できます。
- 必要なネイティブ DLL(
libvosk.dll,libstdc++-6.dll,libgcc_s_seh-1.dll,libwinpthread-1.dllなど)はAssets/Plugins/x86_64/に配置され、ビルド時に自動でパッケージされます。 - 音響モデル(例:
vosk-model-small-ja-0.22)や量子化 GGUF モデルをStreamingAssets配下に置くことで、ネットワーク接続なしで日本語/英語など複数言語を扱えます。
テキストファイルによるアプリケーション設定
application_settings.txtファイルにより、アプリケーションの設定を変更できます。設定ファイルの構造は以下になっています
[Character]
ModelPath=default.vrm
TexturePaths=test.png
Scale=3
PositionX=0
PositionY=0
PositionZ=0
RotationX=0
RotationY=0
RotationZ=0
[Sound]
VoiceVolume=1
BGMVolume=0.5
SEVolume=1
[Display]
Opacity=1
AlwaysOnTop=True
[Performance]
TargetFrameRate=60
QualityLevel=2
メニュー画面
- メニュー画面の背景画像および背景色を設定できます。
- 背景画像は、StreamingAssets/Menu/ に配置した画像ファイルを読み込むことができます。対応している画像フォーマットは以下です
- PNG
- JPG(JPEG)
- BMP
- GIF(静止画)
- TGA
- TIFF
- 背景色は、カラーコードを指定することができます。
- 背景画像は、StreamingAssets/Menu/ に配置した画像ファイルを読み込むことができます。対応している画像フォーマットは以下です
- メニュー画面から以下の機能にアクセス可能
- モデル選択・追加画面
- AIチャット機能
- LocalWeb機能
- アプリケーション設定
- アプリケーション終了
- メニュー画面の収納ボタンを押すことで、Windowsのみ通知領域にアプリケーションを収納できます。
- 収納したアプリケーションは、通知領域のアイコンをクリックすることで、再度表示できます。
macOSでアプリを実行する際、GateKeeperによってアプリがブロックされる場合があります。 その場合、ターミナルから以下のコマンドを実行してください。
xattr -r -c uDesktopMascot.app- Unity 6000.1.1f1(IL2CPP)
- コードはApache License 2.0に基づいてライセンスされています。
- 以下のアセットは、CC BY-NC 4.0に基づいてライセンスされています。
- BGM
- モデル
- デフォルトのキャラクターアニメーションは、『VRMお人形遊び』用アニメーションデータ詰め合わせを用いて作成されています。リポジトリに含めて配布することに関して、確認済みです。
- フォントはNoto Sans Japaneseです。SIL OPEN FONT LICENSE Version 1.1に基づきNoto Sans JPフォントを再配布するものです。フォントの著作権は元の作者(Google)にあります。
- デフォルトボイスは、COEIROINK:つくよみちゃんの音声を使用しています。使用方法については、COEIROINK側に事前に確認済みです
- ボタンアイコンは、MingCuteを使用しています。
- Unityで
buildフォルダにuDesktopMascotという名前でビルドする。
uDesktopMasscot
└── build
└── uDesktopMasscot
├── uDesktopMascot.exe
└── README.txt
...
-
Inno Setup をインストールする。
-
開いたら、
More filesをクリックして、プロジェクト配下にあるsetup.issを選択する。
- 選択したら、再生ボタンをクリックする。
- ビルドが完了したら、プロジェクトのRootにインストーラーが生成されます。
macOSのPCでのみインストーラーを作成できます。
- Unityで
build/uDesktopMascotフォルダにuDesktopMascotという名前でビルドする。
uDesktopMasscot
└── build
└── uDesktopMasscot
├── uDesktopMascot.app
└── README.txt
- 以下のコマンドを実行する。
cd build
productbuild --component uDesktopMascot/uDesktopMascot.app /Applications ./uDesktopMascot_mac_installer.pkg- ビルドが完了したら、
buildフォルダにuDesktopMascot_mac_installer.pkgが生成されます。
- モデル: 「アオゾラ」様
- BGM: MidraLab(eisuke)
- ソフトウェアアイコン: やむちゃ様
See NOTICE.
- Luna
- uezo
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for uDesktopMascot
Similar Open Source Tools
uDesktopMascot
uDesktopMascot is an open-source project for a desktop mascot application with a theme of 'freedom of creation'. It allows users to load and display VRM or GLB/FBX model files on the desktop, customize GUI colors and background images, and access various features through a menu screen. The application supports Windows 10/11 and macOS platforms.
NovelForge
NovelForge is an AI-assisted writing tool with the potential for creating long-form content of millions of words. It offers a solution that combines world-building, structured content generation, and consistency maintenance. The tool is built around four core concepts: modular 'cards', customizable 'dynamic output models', flexible 'context injection', and consistency assurance through a 'knowledge graph'. It provides a highly structured and configurable writing environment, inspired by the Snowflake Method, allowing users to create and organize their content in a tree-like structure. NovelForge is highly customizable and extensible, allowing users to tailor their writing workflow to their specific needs.
InterPilot
InterPilot is an AI-based assistant tool that captures audio from Windows input/output devices, transcribes it into text, and then calls the Large Language Model (LLM) API to provide answers. The project includes recording, transcription, and AI response modules, aiming to provide support for personal legitimate learning, work, and research. It may assist in scenarios like interviews, meetings, and learning, but it is strictly for learning and communication purposes only. The tool can hide its interface using third-party tools to prevent screen recording or screen sharing, but it does not have this feature built-in. Users bear the risk of using third-party tools independently.
TrainPPTAgent
TrainPPTAgent is an AI-based intelligent presentation generation tool. Users can input a topic and the system will automatically generate a well-structured and content-rich PPT outline and page-by-page content. The project adopts a front-end and back-end separation architecture: the front-end is responsible for interaction, outline editing, and template selection, while the back-end leverages large language models (LLM) and reinforcement learning (GRPO) to complete content generation and optimization, making the generated PPT more tailored to user goals.
chatgpt-webui
ChatGPT WebUI is a user-friendly web graphical interface for various LLMs like ChatGPT, providing simplified features such as core ChatGPT conversation and document retrieval dialogues. It has been optimized for better RAG retrieval accuracy and supports various search engines. Users can deploy local language models easily and interact with different LLMs like GPT-4, Azure OpenAI, and more. The tool offers powerful functionalities like GPT4 API configuration, system prompt setup for role-playing, and basic conversation features. It also provides a history of conversations, customization options, and a seamless user experience with themes, dark mode, and PWA installation support.
Nano
Nano is a Transformer-based autoregressive language model for personal enjoyment, research, modification, and alchemy. It aims to implement a specific and lightweight Transformer language model based on PyTorch, without relying on Hugging Face. Nano provides pre-training and supervised fine-tuning processes for models with 56M and 168M parameters, along with LoRA plugins. It supports inference on various computing devices and explores the potential of Transformer models in various non-NLP tasks. The repository also includes instructions for experiencing inference effects, installing dependencies, downloading and preprocessing data, pre-training, supervised fine-tuning, model conversion, and various other experiments.
forksilly.doc
ForkSilly.doc is a repository mainly for storing documentation of ForkSilly, an Android project developed using React Native/Expo. It is suitable for users with experience in SillyTavern. The project is self-shared and may not accept feature requests. It is designed for pure text cards, illustration cards, and Stable Diffusion text-image. It is compatible with SillyTavern V2 character cards, world books, regex, presets, and chat records. Users can import and export at any time. The tool supports various customization options such as chat font, background image, and quick toggle of preset entries. It also allows the use of various OpenAI-compatible APIs and provides built-in storage management features. Users can utilize text-image functionality and access free text-image services like pollinations.ai. Additionally, it supports Stable Diffusion text-image features and integration with silicon-based flow and Gemini embedding models. The tool does not support TTS or connecting to NAI.
LabelQuick
LabelQuick_V2.0 is a fast image annotation tool designed and developed by the AI Horizon team. This version has been optimized and improved based on the previous version. It provides an intuitive interface and powerful annotation and segmentation functions to efficiently complete dataset annotation work. The tool supports video object tracking annotation, quick annotation by clicking, and various video operations. It introduces the SAM2 model for accurate and efficient object detection in video frames, reducing manual intervention and improving annotation quality. The tool is designed for Windows systems and requires a minimum of 6GB of memory.
MarkMap-OpenAi-ChatGpt
MarkMap-OpenAi-ChatGpt is a Vue.js-based mind map generation tool that allows users to generate mind maps by entering titles or content. The application integrates the markmap-lib and markmap-view libraries, supports visualizing mind maps, and provides functions for zooming and adapting the map to the screen. Users can also export the generated mind map in PNG, SVG, JPEG, and other formats. This project is suitable for quickly organizing ideas, study notes, project planning, etc. By simply entering content, users can get an intuitive mind map that can be continuously expanded, downloaded, and shared.
MoneyPrinterTurbo
MoneyPrinterTurbo is a tool that can automatically generate video content based on a provided theme or keyword. It can create video scripts, materials, subtitles, and background music, and then compile them into a high-definition short video. The tool features a web interface and an API interface, supporting AI-generated video scripts, customizable scripts, multiple HD video sizes, batch video generation, customizable video segment duration, multilingual video scripts, multiple voice synthesis options, subtitle generation with font customization, background music selection, access to high-definition and copyright-free video materials, and integration with various AI models like OpenAI, moonshot, Azure, and more. The tool aims to simplify the video creation process and offers future plans to enhance voice synthesis, add video transition effects, provide more video material sources, offer video length options, include free network proxies, enable real-time voice and music previews, support additional voice synthesis services, and facilitate automatic uploads to YouTube platform.
N.E.K.O
Project N.E.K.O. is an open-source, community-driven platform aiming to build a digital life form that desires to understand, connect, and grow with us. It is a networked empathetic acknowledging organism, a digital life form that seeks to establish connections and grow together with users. The project's ultimate goal is to create an AI-native metaverse closely connected to the real world, with phases including a creative workshop on Steam, an independent platform with derived games, and the N.E.K.O. Network for autonomous social interactions among AI entities. The core features include open-source core components, sustainable ecosystem, and memory synchronization across different scenarios for a seamless companion experience.
All-Model-Chat
All Model Chat is a feature-rich, highly customizable web chat application designed specifically for the Google Gemini API family. It integrates dynamic model selection, multimodal file input, streaming responses, comprehensive chat history management, and extensive customization options to provide an unparalleled AI interactive experience.
manga-translator-ui
This repository is a manga image translator tool that allows users to translate text in manga images automatically. It supports various types of manga, including Japanese, Korean, and American, in both black and white and color formats. The tool can detect, translate, and embed text, supporting multiple languages such as Japanese, Chinese, and English. It also includes a visual editor for adjusting text boxes. Users can interact with the tool through a Qt interface or command-line mode for batch processing. The tool offers features like intelligent text detection, multi-language OCR, multiple translation engines, high-quality translation using AI models, automatic term extraction, AI sentence segmentation, intelligent typesetting, PSD export, and batch processing. Additionally, it provides a visual editor for region editing, text editing, mask editing, undo/redo functionality, shortcut key support, and mouse wheel shortcuts.
nekro-agent
Nekro Agent is an AI chat plugin and proxy execution bot that is highly scalable, offers high freedom, and has minimal deployment requirements. It features context-aware chat for group/private chats, custom character settings, sandboxed execution environment, interactive image resource handling, customizable extension development interface, easy deployment with docker-compose, integration with Stable Diffusion for AI drawing capabilities, support for various file types interaction, hot configuration updates and command control, native multimodal understanding, visual application management control panel, CoT (Chain of Thought) support, self-triggered timers and holiday greetings, event notification understanding, and more. It allows for third-party extensions and AI-generated extensions, and includes features like automatic context trigger based on LLM, and a variety of basic commands for bot administrators.
AivisSpeech
AivisSpeech is a Japanese text-to-speech software based on the VOICEVOX editor UI. It incorporates the AivisSpeech Engine for generating emotionally rich voices easily. It supports AIVMX format voice synthesis model files and specific model architectures like Style-Bert-VITS2. Users can download AivisSpeech and AivisSpeech Engine for Windows and macOS PCs, with minimum memory requirements specified. The development follows the latest version of VOICEVOX, focusing on minimal modifications, rebranding only where necessary, and avoiding refactoring. The project does not update documentation, maintain test code, or refactor unused features to prevent conflicts with VOICEVOX.
involutionhell
Involution Hell is a non-profit, open-source platform designed to help students share and access study materials, course notes, and project experiences. The platform features a high-performance site built with Next.js App Router and Fumadocs UI, supports multiple languages and a 'file as navigation' directory structure, and automates deployment, image migration, and content validation to reduce maintenance costs.
For similar tasks
uDesktopMascot
uDesktopMascot is an open-source project for a desktop mascot application with a theme of 'freedom of creation'. It allows users to load and display VRM or GLB/FBX model files on the desktop, customize GUI colors and background images, and access various features through a menu screen. The application supports Windows 10/11 and macOS platforms.
For similar jobs
Awesome-AIGC-3D
Awesome-AIGC-3D is a curated list of awesome AIGC 3D papers, inspired by awesome-NeRF. It aims to provide a comprehensive overview of the state-of-the-art in AIGC 3D, including papers on text-to-3D generation, 3D scene generation, human avatar generation, and dynamic 3D generation. The repository also includes a list of benchmarks and datasets, talks, companies, and implementations related to AIGC 3D. The description is less than 400 words and provides a concise overview of the repository's content and purpose.
CushyStudio
CushyStudio is a generative AI platform designed for creatives of any level to effortlessly create stunning images, videos, and 3D models. It offers CushyApps, a collection of visual tools tailored for different artistic tasks, and CushyKit, an extensive toolkit for custom apps development and task automation. Users can dive into the AI revolution, unleash their creativity, share projects, and connect with a vibrant community. The platform aims to simplify the AI art creation process and provide a user-friendly environment for designing interfaces, adding custom logic, and accessing various tools.
dream-textures
Dream Textures is a tool integrated into Blender that allows users to create textures, concept art, background assets, and more using simple text prompts. It offers features like seamless texture creation, texture projection for entire scenes, restyling animations, and running models on the user's machine for faster iteration. The tool supports CUDA and Apple Silicon GPUs, with over 4GB of VRAM recommended. Users can troubleshoot issues by checking Blender's system console or seeking help from the community on Discord.
aircraft
Headwind Simulations A339X - A330-900neo is an open-source project aimed at creating a free Airbus A330-900neo for Microsoft Flight Simulator. The project is based on the FlyByWire System A32NX and offers a detailed simulation of the A330-941 model with various components like engines, FMS, ACAS, ATC, and more. Users can build the aircraft using Docker and node modules, and the package can be easily integrated into MSFS. The project is part of a collaborative effort with other open-source projects contributing to the aircraft's systems, cockpit, sound, and 3D parts. The repository is dual-licensed under GNU GPLv3 for textual-form source code and CC BY-NC 4.0 for artistic assets, ensuring proper usage and attribution of the content.
MiKaPo
MiKaPo is a web-based tool that allows users to pose MMD models in real-time using video input. It utilizes technologies such as Mediapipe for 3D key points detection, Babylon.js for 3D scene rendering, babylon-mmd for MMD model viewing, and Vite+React for the web framework. Users can upload videos and images, select different environments, and choose models for posing. MiKaPo also supports camera input and Ollama (electron version). The tool is open to feature requests and pull requests, with ongoing development to add VMD export functionality.
uDesktopMascot
uDesktopMascot is an open-source project for a desktop mascot application with a theme of 'freedom of creation'. It allows users to load and display VRM or GLB/FBX model files on the desktop, customize GUI colors and background images, and access various features through a menu screen. The application supports Windows 10/11 and macOS platforms.
DeepMesh
DeepMesh is an auto-regressive artist-mesh creation tool that utilizes reinforcement learning to generate high-quality meshes conditioned on a given point cloud. It offers pretrained weights and allows users to generate obj/ply files based on specific input parameters. The tool has been tested on Ubuntu 22 with CUDA 11.8 and supports A100, A800, and A6000 GPUs. Users can clone the repository, create a conda environment, install pretrained model weights, and use command line inference to generate meshes.


