
uDesktopMascot
desktop mascot open project
Stars: 318

uDesktopMascot is an open-source project for a desktop mascot application with a theme of 'freedom of creation'. It allows users to load and display VRM or GLB/FBX model files on the desktop, customize GUI colors and background images, and access various features through a menu screen. The application supports Windows 10/11 and macOS platforms.
README:
日本語 | English | 中文 | Español | Français
注意: 上記の言語(English、中文、Español、Français)は、GPT-4o-miniによる自動翻訳で生成されています。翻訳の精度やニュアンスに関しては、原文(日本語)をご参照ください。
「uDesktopMascot」は、創作の自由化
をテーマにした デスクトップマスコットアプリケーションのオープンソースプロジェクトです。
一機能の例としてVRMやGLB/FBX形式のモデルを読み込み、デスクトップ上に表示することができます。またメニュー画面やアプリケーションウインドウなどのGUIの色や背景画像を自由に設定することもできます。
詳しい機能一覧は 機能一覧 を参照してください。
対応プラットフォーム
- Windows 10/11
- macOS
アプリには以下の機能が実装されています。詳細は以下のリストを参照してください。
外部アセットの追加は、StreamingAssetsフォルダに配置することで実現できます。
モデル・アニメーション
- StreamingAssetsに配置した任意モデルファイルを読み込んで表示します。
- VRM(1.x, 0.x)形式のモデルをサポートしています。
- GLB/GLTF形式のモデルをサポートしています。(アニメーションは対応していません)
- FBX形式のモデルをサポートしています。(ただし一部のモデルではテクスチャーがロードができません。またアニメーションは対応していません)
- テクスチャーは StreamingAssets/textures/ に配置することで読み込むことができます。
- モデル選択・追加画面からVRMモデルの追加
- パスを指定して追加
- ファイル選択ダイアログから追加
ボイス・BGM
- SteamingAssets/Voice/以下に配置した音声ファイルを読み込んで再生します。複数ある場合は、ランダムで再生します。
- クリック時に再生される音声は、StreamingAssets/Voice/Click/に配置した音声ファイルを読み込んで再生します。
- SteamingAssets/BGM/以下に配置した音楽ファイルを読み込んで再生します。複数ある場合は、ランダムで再生します。
- キャラクターのデフォルトのボイスの追加
- デフォルトのボイスは、COEIROINK:つくよみちゃんの音声を使用しています。
- アプリ起動時、アプリ終了時、クリック時に再生されます。
音声認識・AIチャット
- オフライン音声認識エンジン Vosk を統合し、マイク入力をリアルタイムにテキスト化します。
- 中間結果
[STT][partial]
と確定結果[STT][final]
を Unity Console に表示。 - 無音が
VadSilenceSeconds
(既定 1.0 秒)続くとテキストが確定し、LLM へ送信されます([STT][send]
ログ)。
- 中間結果
- テキストが確定すると AI チャット機能に音声メッセージを渡し、キャラクターが応答を読み上げます。
- 応答生成中はマイクを自動で一時停止し、誤認識を防止します。
- ChatDialog 画面のマイクボタンで録音の開始/停止をトグル操作できます。
- 必要なネイティブ DLL(
libvosk.dll
,libstdc++-6.dll
,libgcc_s_seh-1.dll
,libwinpthread-1.dll
など)はAssets/Plugins/x86_64/
に配置され、ビルド時に自動でパッケージされます。 - 音響モデル(例:
vosk-model-small-ja-0.22
)や量子化 GGUF モデルをStreamingAssets
配下に置くことで、ネットワーク接続なしで日本語/英語など複数言語を扱えます。
テキストファイルによるアプリケーション設定
application_settings.txtファイルにより、アプリケーションの設定を変更できます。設定ファイルの構造は以下になっています
[Character]
ModelPath=default.vrm
TexturePaths=test.png
Scale=3
PositionX=0
PositionY=0
PositionZ=0
RotationX=0
RotationY=0
RotationZ=0
[Sound]
VoiceVolume=1
BGMVolume=0.5
SEVolume=1
[Display]
Opacity=1
AlwaysOnTop=True
[Performance]
TargetFrameRate=60
QualityLevel=2
メニュー画面
- メニュー画面の背景画像および背景色を設定できます。
- 背景画像は、StreamingAssets/Menu/ に配置した画像ファイルを読み込むことができます。対応している画像フォーマットは以下です
- PNG
- JPG(JPEG)
- BMP
- GIF(静止画)
- TGA
- TIFF
- 背景色は、カラーコードを指定することができます。
- 背景画像は、StreamingAssets/Menu/ に配置した画像ファイルを読み込むことができます。対応している画像フォーマットは以下です
- メニュー画面から以下の機能にアクセス可能
- モデル選択・追加画面
- AIチャット機能
- LocalWeb機能
- アプリケーション設定
- アプリケーション終了
- メニュー画面の収納ボタンを押すことで、Windowsのみ通知領域にアプリケーションを収納できます。
- 収納したアプリケーションは、通知領域のアイコンをクリックすることで、再度表示できます。
macOSでアプリを実行する際、GateKeeperによってアプリがブロックされる場合があります。 その場合、ターミナルから以下のコマンドを実行してください。
xattr -r -c uDesktopMascot.app
- Unity 6000.1.1f1(IL2CPP)
- コードはApache License 2.0に基づいてライセンスされています。
- 以下のアセットは、CC BY-NC 4.0に基づいてライセンスされています。
- BGM
- モデル
- デフォルトのキャラクターアニメーションは、『VRMお人形遊び』用アニメーションデータ詰め合わせを用いて作成されています。リポジトリに含めて配布することに関して、確認済みです。
- フォントはNoto Sans Japaneseです。SIL OPEN FONT LICENSE Version 1.1に基づきNoto Sans JPフォントを再配布するものです。フォントの著作権は元の作者(Google)にあります。
- デフォルトボイスは、COEIROINK:つくよみちゃんの音声を使用しています。使用方法については、COEIROINK側に事前に確認済みです
- ボタンアイコンは、MingCuteを使用しています。
- Unityで
build
フォルダにuDesktopMascot
という名前でビルドする。
uDesktopMasscot
└── build
└── uDesktopMasscot
├── uDesktopMascot.exe
└── README.txt
...
-
Inno Setup をインストールする。
-
開いたら、
More files
をクリックして、プロジェクト配下にあるsetup.iss
を選択する。
- 選択したら、再生ボタンをクリックする。
- ビルドが完了したら、プロジェクトのRootにインストーラーが生成されます。
macOSのPCでのみインストーラーを作成できます。
- Unityで
build/uDesktopMascot
フォルダにuDesktopMascot
という名前でビルドする。
uDesktopMasscot
└── build
└── uDesktopMasscot
├── uDesktopMascot.app
└── README.txt
- 以下のコマンドを実行する。
cd build
productbuild --component uDesktopMascot/uDesktopMascot.app /Applications ./uDesktopMascot_mac_installer.pkg
- ビルドが完了したら、
build
フォルダにuDesktopMascot_mac_installer.pkg
が生成されます。
- モデル: 「アオゾラ」様
- BGM: MidraLab(eisuke)
- ソフトウェアアイコン: やむちゃ様
See NOTICE.
- Luna
- uezo
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for uDesktopMascot
Similar Open Source Tools

uDesktopMascot
uDesktopMascot is an open-source project for a desktop mascot application with a theme of 'freedom of creation'. It allows users to load and display VRM or GLB/FBX model files on the desktop, customize GUI colors and background images, and access various features through a menu screen. The application supports Windows 10/11 and macOS platforms.

gzm-design
Gzm Design is a free and open-source poster designer developed using the latest mainstream technologies such as Vue3, Vite4, TypeScript, etc. It provides features like PSD import, JSON import, multiple pages support, shortcut key support, template import, layer management, ruler tool, pen tool, element editing, preview, file download, canvas zooming and dragging, border stroke, filling, blending modes, text formatting, group handling, canvas size modification, rich text support, masking, shadow effects, undo/redo functionality, QR code tool, barcode tool, and ruler line npm package encapsulation.

Nano
Nano is a Transformer-based autoregressive language model for personal enjoyment, research, modification, and alchemy. It aims to implement a specific and lightweight Transformer language model based on PyTorch, without relying on Hugging Face. Nano provides pre-training and supervised fine-tuning processes for models with 56M and 168M parameters, along with LoRA plugins. It supports inference on various computing devices and explores the potential of Transformer models in various non-NLP tasks. The repository also includes instructions for experiencing inference effects, installing dependencies, downloading and preprocessing data, pre-training, supervised fine-tuning, model conversion, and various other experiments.

AivisSpeech
AivisSpeech is a Japanese text-to-speech software based on the VOICEVOX editor UI. It incorporates the AivisSpeech Engine for generating emotionally rich voices easily. It supports AIVMX format voice synthesis model files and specific model architectures like Style-Bert-VITS2. Users can download AivisSpeech and AivisSpeech Engine for Windows and macOS PCs, with minimum memory requirements specified. The development follows the latest version of VOICEVOX, focusing on minimal modifications, rebranding only where necessary, and avoiding refactoring. The project does not update documentation, maintain test code, or refactor unused features to prevent conflicts with VOICEVOX.

AIResume
AIResume is an open-source resume creation platform that helps users easily create professional resumes, integrating AI technology to assist users in polishing their resumes. The project allows for template development using Vue 3, Vite, TypeScript, and Ant Design Vue. Users can edit resumes, export them as PDFs, switch between multiple resume templates, and collaborate on template development. AI features include resume refinement, deep optimization based on individual projects or experiences, and simulated interviews for user practice. Additional functionalities include theme color switching, high customization options, dark/light mode switching, real-time preview, drag-and-drop resume scaling, data export/import, data clearing, sample data prefilling, template market showcasing, and more.

rime_wanxiang
Rime Wanxiang is a pinyin input method based on deep optimized lexicon and language model. It features a lexicon with tones, AI and large corpus filtering, and frequency addition to provide more accurate sentence output. The tool supports various input methods and customization options, aiming to enhance user experience through lexicon and transcription. Users can also refresh the lexicon with different types of auxiliary codes using the LMDG toolkit package. Wanxiang offers core features like tone-marked pinyin annotations, phrase composition, and word frequency, with customizable functionalities. The tool is designed to provide a seamless input experience based on lexicon and transcription.

AiNiee
AiNiee is a tool focused on AI translation, capable of automatically translating RPG SLG games, Epub TXT novels, Srt Lrc subtitles, and more. It provides features for configuring AI platforms, proxies, and translation settings. Users can utilize this tool for translating game scripts, novels, and subtitles efficiently. The tool supports multiple AI platforms and offers tutorials for beginners. It also includes functionalities for extracting and translating game text, with options for customizing translation projects and managing translation tasks effectively.

AI-CloudOps
AI+CloudOps is a cloud-native operations management platform designed for enterprises. It aims to integrate artificial intelligence technology with cloud-native practices to significantly improve the efficiency and level of operations work. The platform offers features such as AIOps for monitoring data analysis and alerts, multi-dimensional permission management, visual CMDB for resource management, efficient ticketing system, deep integration with Prometheus for real-time monitoring, and unified Kubernetes management for cluster optimization.

AivisSpeech-Engine
AivisSpeech-Engine is a powerful open-source tool for speech recognition and synthesis. It provides state-of-the-art algorithms for converting speech to text and text to speech. The tool is designed to be user-friendly and customizable, allowing developers to easily integrate speech capabilities into their applications. With AivisSpeech-Engine, users can transcribe audio recordings, create voice-controlled interfaces, and generate natural-sounding speech output. Whether you are building a virtual assistant, developing a speech-to-text application, or experimenting with voice technology, AivisSpeech-Engine offers a comprehensive solution for all your speech processing needs.

AirPower4T
AirPower4T is a development base library based on Vue3 TypeScript Element Plus Vite, using decorators, object-oriented, Hook and other front-end development methods. It provides many common components and some feedback components commonly used in background management systems, and provides a lot of enums and decorators.

LabelQuick
LabelQuick_V2.0 is a fast image annotation tool designed and developed by the AI Horizon team. This version has been optimized and improved based on the previous version. It provides an intuitive interface and powerful annotation and segmentation functions to efficiently complete dataset annotation work. The tool supports video object tracking annotation, quick annotation by clicking, and various video operations. It introduces the SAM2 model for accurate and efficient object detection in video frames, reducing manual intervention and improving annotation quality. The tool is designed for Windows systems and requires a minimum of 6GB of memory.

aituber-kit
AITuber-Kit is a tool that enables users to interact with AI characters, conduct AITuber live streams, and engage in external integration modes. Users can easily converse with AI characters using various LLM APIs, stream on YouTube with AI character reactions, and send messages to server apps via WebSocket. The tool provides settings for API keys, character configurations, voice synthesis engines, and more. It supports multiple languages and allows customization of VRM models and background images. AITuber-Kit follows the MIT license and offers guidelines for adding new languages to the project.

Tianji
Tianji is a free, non-commercial artificial intelligence system developed by SocialAI for tasks involving worldly wisdom, such as etiquette, hospitality, gifting, wishes, communication, awkwardness resolution, and conflict handling. It includes four main technical routes: pure prompt, Agent architecture, knowledge base, and model training. Users can find corresponding source code for these routes in the tianji directory to replicate their own vertical domain AI applications. The project aims to accelerate the penetration of AI into various fields and enhance AI's core competencies.

bailing
Bailing is an open-source voice assistant designed for natural conversations with users. It combines Automatic Speech Recognition (ASR), Voice Activity Detection (VAD), Large Language Model (LLM), and Text-to-Speech (TTS) technologies to provide a high-quality voice interaction experience similar to GPT-4o. Bailing aims to achieve GPT-4o-like conversation effects without the need for GPU, making it suitable for various edge devices and low-resource environments. The project features efficient open-source models, modular design allowing for module replacement and upgrades, support for memory function, tool integration for information retrieval and task execution via voice commands, and efficient task management with progress tracking and reminders.

get_jobs
Get Jobs is a tool designed to help users find and apply for job positions on various recruitment platforms in China. It features AI job matching, automatic cover letter generation, multi-platform job application, automated filtering of inactive HR and headhunter positions, real-time WeChat message notifications, blacklisted company updates, driver adaptation for Win11, centralized configuration, long-lasting cookie login, XPathHelper plugin, global logging, and more. The tool supports platforms like Boss直聘, 猎聘, 拉勾, 51job, and 智联招聘. Users can configure the tool for customized job searches and applications.

NGCBot
NGCBot is a WeChat bot based on the HOOK mechanism, supporting scheduled push of security news from FreeBuf, Xianzhi, Anquanke, and Qianxin Attack and Defense Community, KFC copywriting, filing query, phone number attribution query, WHOIS information query, constellation query, weather query, fishing calendar, Weibei threat intelligence query, beautiful videos, beautiful pictures, and help menu. It supports point functions, automatic pulling of people, ad detection, automatic mass sending, Ai replies, rich customization, and easy for beginners to use. The project is open-source and periodically maintained, with additional features such as Ai (Gpt, Xinghuo, Qianfan), keyword invitation to groups, automatic mass sending, and group welcome messages.
For similar tasks

uDesktopMascot
uDesktopMascot is an open-source project for a desktop mascot application with a theme of 'freedom of creation'. It allows users to load and display VRM or GLB/FBX model files on the desktop, customize GUI colors and background images, and access various features through a menu screen. The application supports Windows 10/11 and macOS platforms.
For similar jobs

Awesome-AIGC-3D
Awesome-AIGC-3D is a curated list of awesome AIGC 3D papers, inspired by awesome-NeRF. It aims to provide a comprehensive overview of the state-of-the-art in AIGC 3D, including papers on text-to-3D generation, 3D scene generation, human avatar generation, and dynamic 3D generation. The repository also includes a list of benchmarks and datasets, talks, companies, and implementations related to AIGC 3D. The description is less than 400 words and provides a concise overview of the repository's content and purpose.

CushyStudio
CushyStudio is a generative AI platform designed for creatives of any level to effortlessly create stunning images, videos, and 3D models. It offers CushyApps, a collection of visual tools tailored for different artistic tasks, and CushyKit, an extensive toolkit for custom apps development and task automation. Users can dive into the AI revolution, unleash their creativity, share projects, and connect with a vibrant community. The platform aims to simplify the AI art creation process and provide a user-friendly environment for designing interfaces, adding custom logic, and accessing various tools.

dream-textures
Dream Textures is a tool integrated into Blender that allows users to create textures, concept art, background assets, and more using simple text prompts. It offers features like seamless texture creation, texture projection for entire scenes, restyling animations, and running models on the user's machine for faster iteration. The tool supports CUDA and Apple Silicon GPUs, with over 4GB of VRAM recommended. Users can troubleshoot issues by checking Blender's system console or seeking help from the community on Discord.

aircraft
Headwind Simulations A339X - A330-900neo is an open-source project aimed at creating a free Airbus A330-900neo for Microsoft Flight Simulator. The project is based on the FlyByWire System A32NX and offers a detailed simulation of the A330-941 model with various components like engines, FMS, ACAS, ATC, and more. Users can build the aircraft using Docker and node modules, and the package can be easily integrated into MSFS. The project is part of a collaborative effort with other open-source projects contributing to the aircraft's systems, cockpit, sound, and 3D parts. The repository is dual-licensed under GNU GPLv3 for textual-form source code and CC BY-NC 4.0 for artistic assets, ensuring proper usage and attribution of the content.

MiKaPo
MiKaPo is a web-based tool that allows users to pose MMD models in real-time using video input. It utilizes technologies such as Mediapipe for 3D key points detection, Babylon.js for 3D scene rendering, babylon-mmd for MMD model viewing, and Vite+React for the web framework. Users can upload videos and images, select different environments, and choose models for posing. MiKaPo also supports camera input and Ollama (electron version). The tool is open to feature requests and pull requests, with ongoing development to add VMD export functionality.

uDesktopMascot
uDesktopMascot is an open-source project for a desktop mascot application with a theme of 'freedom of creation'. It allows users to load and display VRM or GLB/FBX model files on the desktop, customize GUI colors and background images, and access various features through a menu screen. The application supports Windows 10/11 and macOS platforms.

DeepMesh
DeepMesh is an auto-regressive artist-mesh creation tool that utilizes reinforcement learning to generate high-quality meshes conditioned on a given point cloud. It offers pretrained weights and allows users to generate obj/ply files based on specific input parameters. The tool has been tested on Ubuntu 22 with CUDA 11.8 and supports A100, A800, and A6000 GPUs. Users can clone the repository, create a conda environment, install pretrained model weights, and use command line inference to generate meshes.