uDesktopMascot

desktop mascot open project

Stars: 318

Visit

uDesktopMascot is an open-source project for a desktop mascot application with a theme of 'freedom of creation'. It allows users to load and display VRM or GLB/FBX model files on the desktop, customize GUI colors and background images, and access various features through a menu screen. The application supports Windows 10/11 and macOS platforms.

README:

uDesktopMascot

日本語 | English | 中文 | Español | Français

注意: 上記の言語（English、中文、Español、Français）は、GPT-4o-miniによる自動翻訳で生成されています。翻訳の精度やニュアンスに関しては、原文（日本語）をご参照ください。

uDesktopMascot

概要

「uDesktopMascot」は、創作の自由化 をテーマにしたデスクトップマスコットアプリケーションのオープンソースプロジェクトです。一機能の例としてVRMやGLB/FBX形式のモデルを読み込み、デスクトップ上に表示することができます。またメニュー画面やアプリケーションウインドウなどのGUIの色や背景画像を自由に設定することもできます。詳しい機能一覧は機能一覧を参照してください。

対応プラットフォーム

Windows 10/11
macOS

機能一覧

アプリには以下の機能が実装されています。詳細は以下のリストを参照してください。

外部アセットの追加は、StreamingAssetsフォルダに配置することで実現できます。

モデル・アニメーション

StreamingAssetsに配置した任意モデルファイルを読み込んで表示します。
- VRM(1.x, 0.x)形式のモデルをサポートしています。
- GLB/GLTF形式のモデルをサポートしています。(アニメーションは対応していません)
- FBX形式のモデルをサポートしています。(ただし一部のモデルではテクスチャーがロードができません。またアニメーションは対応していません)
  - テクスチャーは StreamingAssets/textures/ に配置することで読み込むことができます。
モデル選択・追加画面からVRMモデルの追加
- パスを指定して追加
- ファイル選択ダイアログから追加

ボイス・BGM

SteamingAssets/Voice/以下に配置した音声ファイルを読み込んで再生します。複数ある場合は、ランダムで再生します。
- クリック時に再生される音声は、StreamingAssets/Voice/Click/に配置した音声ファイルを読み込んで再生します。
SteamingAssets/BGM/以下に配置した音楽ファイルを読み込んで再生します。複数ある場合は、ランダムで再生します。
キャラクターのデフォルトのボイスの追加
- デフォルトのボイスは、COEIROINK:つくよみちゃんの音声を使用しています。
- アプリ起動時、アプリ終了時、クリック時に再生されます。

音声認識・AIチャット

オフライン音声認識エンジン Vosk を統合し、マイク入力をリアルタイムにテキスト化します。
- 中間結果 [STT][partial] と確定結果 [STT][final] を Unity Console に表示。
- 無音が VadSilenceSeconds（既定 1.0 秒）続くとテキストが確定し、LLM へ送信されます（[STT][send] ログ）。
テキストが確定すると AI チャット機能に音声メッセージを渡し、キャラクターが応答を読み上げます。
- 応答生成中はマイクを自動で一時停止し、誤認識を防止します。
ChatDialog 画面のマイクボタンで録音の開始／停止をトグル操作できます。
必要なネイティブ DLL（libvosk.dll, libstdc++-6.dll, libgcc_s_seh-1.dll, libwinpthread-1.dll など）は Assets/Plugins/x86_64/ に配置され、ビルド時に自動でパッケージされます。
音響モデル（例: vosk-model-small-ja-0.22）や量子化 GGUF モデルを StreamingAssets 配下に置くことで、ネットワーク接続なしで日本語／英語など複数言語を扱えます。

テキストファイルによるアプリケーション設定

application_settings.txtファイルにより、アプリケーションの設定を変更できます。

設定ファイルの構造は以下になっています

[Character]
ModelPath=default.vrm
TexturePaths=test.png
Scale=3
PositionX=0
PositionY=0
PositionZ=0
RotationX=0
RotationY=0
RotationZ=0

[Sound]
VoiceVolume=1
BGMVolume=0.5
SEVolume=1

[Display]
Opacity=1
AlwaysOnTop=True

[Performance]
TargetFrameRate=60
QualityLevel=2

メニュー画面

メニュー画面の背景画像および背景色を設定できます。
- 背景画像は、StreamingAssets/Menu/ に配置した画像ファイルを読み込むことができます。対応している画像フォーマットは以下です
  - PNG
  - JPG(JPEG)
  - BMP
  - GIF(静止画)
  - TGA
  - TIFF
- 背景色は、カラーコードを指定することができます。
メニュー画面から以下の機能にアクセス可能
- モデル選択・追加画面
- AIチャット機能
- LocalWeb機能
- アプリケーション設定
- アプリケーション終了
メニュー画面の収納ボタンを押すことで、Windowsのみ通知領域にアプリケーションを収納できます。
- 収納したアプリケーションは、通知領域のアイコンをクリックすることで、再度表示できます。

macOSでの実行について

macOSでアプリを実行する際、GateKeeperによってアプリがブロックされる場合があります。その場合、ターミナルから以下のコマンドを実行してください。

xattr -r -c uDesktopMascot.app

requirements

Unity 6000.1.1f1(IL2CPP)

license

コードはApache License 2.0に基づいてライセンスされています。
以下のアセットは、CC BY-NC 4.0に基づいてライセンスされています。
- BGM
- モデル

素材について

デフォルトのキャラクターアニメーションは、『VRMお人形遊び』用アニメーションデータ詰め合わせを用いて作成されています。リポジトリに含めて配布することに関して、確認済みです。
フォントはNoto Sans Japaneseです。SIL OPEN FONT LICENSE Version 1.1に基づきNoto Sans JPフォントを再配布するものです。フォントの著作権は元の作者（Google）にあります。
デフォルトボイスは、COEIROINK:つくよみちゃんの音声を使用しています。使用方法については、COEIROINK側に事前に確認済みです
ボタンアイコンは、MingCuteを使用しています。

インストーラー作成方法

Windows

UnityでbuildフォルダにuDesktopMascotという名前でビルドする。

uDesktopMasscot
└── build
    └── uDesktopMasscot
        ├── uDesktopMascot.exe
        └── README.txt
        ...

Inno Setup をインストールする。
開いたら、More filesをクリックして、プロジェクト配下にあるsetup.issを選択する。

選択したら、再生ボタンをクリックする。

ビルドが完了したら、プロジェクトのRootにインストーラーが生成されます。

macOS

macOSのPCでのみインストーラーを作成できます。

Unityでbuild/uDesktopMascotフォルダにuDesktopMascotという名前でビルドする。

uDesktopMasscot
└── build
    └── uDesktopMasscot
        ├── uDesktopMascot.app
        └── README.txt

以下のコマンドを実行する。

cd build
productbuild --component uDesktopMascot/uDesktopMascot.app /Applications ./uDesktopMascot_mac_installer.pkg

ビルドが完了したら、buildフォルダにuDesktopMascot_mac_installer.pkgが生成されます。

制作者クレジット

モデル: 「アオゾラ」様
BGM: MidraLab(eisuke)
ソフトウェアアイコン: やむちゃ様

3rd Party Notices

See NOTICE.

sponsor

Luna
uezo

For Tasks:

Click tags to check more tools for each tasks

customize desktop mascot load 3d models play background music access application settings create desktop installer

For Jobs:

graphic designer game developer ui/ux designer 3d modeler software engineer

Alternative AI tools for uDesktopMascot

Similar Open Source Tools

uDesktopMascot

github

: 318

NovelForge

NovelForge is an AI-assisted writing tool with the potential for creating long-form content of millions of words. It offers a solution that combines world-building, structured content generation, and consistency maintenance. The tool is built around four core concepts: modular 'cards', customizable 'dynamic output models', flexible 'context injection', and consistency assurance through a 'knowledge graph'. It provides a highly structured and configurable writing environment, inspired by the Snowflake Method, allowing users to create and organize their content in a tree-like structure. NovelForge is highly customizable and extensible, allowing users to tailor their writing workflow to their specific needs.

github

: 134

prompt-optimizer

Prompt Optimizer is a powerful AI prompt optimization tool that helps you write better AI prompts, improving AI output quality. It supports both web application and Chrome extension usage. The tool features intelligent optimization for prompt words, real-time testing to compare before and after optimization, integration with multiple mainstream AI models, client-side processing for security, encrypted local storage for data privacy, responsive design for user experience, and more.

github

: 15.5k

gzm-design

Gzm Design is a free and open-source poster designer developed using the latest mainstream technologies such as Vue3, Vite4, TypeScript, etc. It provides features like PSD import, JSON import, multiple pages support, shortcut key support, template import, layer management, ruler tool, pen tool, element editing, preview, file download, canvas zooming and dragging, border stroke, filling, blending modes, text formatting, group handling, canvas size modification, rich text support, masking, shadow effects, undo/redo functionality, QR code tool, barcode tool, and ruler line npm package encapsulation.

github

: 513

chatgpt-webui

ChatGPT WebUI is a user-friendly web graphical interface for various LLMs like ChatGPT, providing simplified features such as core ChatGPT conversation and document retrieval dialogues. It has been optimized for better RAG retrieval accuracy and supports various search engines. Users can deploy local language models easily and interact with different LLMs like GPT-4, Azure OpenAI, and more. The tool offers powerful functionalities like GPT4 API configuration, system prompt setup for role-playing, and basic conversation features. It also provides a history of conversations, customization options, and a seamless user experience with themes, dark mode, and PWA installation support.

github

: 79

rime_wanxiang

Rime Wanxiang is a pinyin input method based on deep optimized lexicon and language model. It features a lexicon with tones, AI and large corpus filtering, and frequency addition to provide more accurate sentence output. The tool supports various input methods and customization options, aiming to enhance user experience through lexicon and transcription. Users can also refresh the lexicon with different types of auxiliary codes using the LMDG toolkit package. Wanxiang offers core features like tone-marked pinyin annotations, phrase composition, and word frequency, with customizable functionalities. The tool is designed to provide a seamless input experience based on lexicon and transcription.

github

: 1.4k

Nano

Nano is a Transformer-based autoregressive language model for personal enjoyment, research, modification, and alchemy. It aims to implement a specific and lightweight Transformer language model based on PyTorch, without relying on Hugging Face. Nano provides pre-training and supervised fine-tuning processes for models with 56M and 168M parameters, along with LoRA plugins. It supports inference on various computing devices and explores the potential of Transformer models in various non-NLP tasks. The repository also includes instructions for experiencing inference effects, installing dependencies, downloading and preprocessing data, pre-training, supervised fine-tuning, model conversion, and various other experiments.

github

: 160

ClaraVerse

ClaraVerse is a privacy-first AI assistant and agent builder that allows users to chat with AI, create intelligent agents, and turn them into fully functional apps. It operates entirely on open-source models running on the user's device, ensuring data privacy and security. With features like AI assistant, image generation, intelligent agent builder, and image gallery, ClaraVerse offers a versatile platform for AI interaction and app development. Users can install ClaraVerse through Docker, native desktop apps, or the web version, with detailed instructions provided for each option. The tool is designed to empower users with control over their AI stack and leverage community-driven innovations for AI development.

github

: 3.4k

AivisSpeech

AivisSpeech is a Japanese text-to-speech software based on the VOICEVOX editor UI. It incorporates the AivisSpeech Engine for generating emotionally rich voices easily. It supports AIVMX format voice synthesis model files and specific model architectures like Style-Bert-VITS2. Users can download AivisSpeech and AivisSpeech Engine for Windows and macOS PCs, with minimum memory requirements specified. The development follows the latest version of VOICEVOX, focusing on minimal modifications, rebranding only where necessary, and avoiding refactoring. The project does not update documentation, maintain test code, or refactor unused features to prevent conflicts with VOICEVOX.

github

: 325

InterPilot

InterPilot is an AI-based assistant tool that captures audio from Windows input/output devices, transcribes it into text, and then calls the Large Language Model (LLM) API to provide answers. The project includes recording, transcription, and AI response modules, aiming to provide support for personal legitimate learning, work, and research. It may assist in scenarios like interviews, meetings, and learning, but it is strictly for learning and communication purposes only. The tool can hide its interface using third-party tools to prevent screen recording or screen sharing, but it does not have this feature built-in. Users bear the risk of using third-party tools independently.

github

: 88

AiNiee

AiNiee is a tool focused on AI translation, capable of automatically translating RPG SLG games, Epub TXT novels, Srt Lrc subtitles, and more. It provides features for configuring AI platforms, proxies, and translation settings. Users can utilize this tool for translating game scripts, novels, and subtitles efficiently. The tool supports multiple AI platforms and offers tutorials for beginners. It also includes functionalities for extracting and translating game text, with options for customizing translation projects and managing translation tasks effectively.

github

: 2.2k

GalTransl

GalTransl is an automated translation tool for Galgames that combines minor innovations in several basic functions with deep utilization of GPT prompt engineering. It is used to create embedded translation patches. The core of GalTransl is a set of automated translation scripts that solve most known issues when using ChatGPT for Galgame translation and improve overall translation quality. It also integrates with other projects to streamline the patch creation process, reducing the learning curve to some extent. Interested users can more easily build machine-translated patches of a certain quality through this project and may try to efficiently build higher-quality localization patches based on this framework.

github

: 1.8k

TrainPPTAgent

TrainPPTAgent is an AI-based intelligent presentation generation tool. Users can input a topic and the system will automatically generate a well-structured and content-rich PPT outline and page-by-page content. The project adopts a front-end and back-end separation architecture: the front-end is responsible for interaction, outline editing, and template selection, while the back-end leverages large language models (LLM) and reinforcement learning (GRPO) to complete content generation and optimization, making the generated PPT more tailored to user goals.

github

: 81

AivisSpeech-Engine

AivisSpeech-Engine is a powerful open-source tool for speech recognition and synthesis. It provides state-of-the-art algorithms for converting speech to text and text to speech. The tool is designed to be user-friendly and customizable, allowing developers to easily integrate speech capabilities into their applications. With AivisSpeech-Engine, users can transcribe audio recordings, create voice-controlled interfaces, and generate natural-sounding speech output. Whether you are building a virtual assistant, developing a speech-to-text application, or experimenting with voice technology, AivisSpeech-Engine offers a comprehensive solution for all your speech processing needs.

github

: 97

AirPower4T

AirPower4T is a development base library based on Vue3 TypeScript Element Plus Vite, using decorators, object-oriented, Hook and other front-end development methods. It provides many common components and some feedback components commonly used in background management systems, and provides a lot of enums and decorators.

github

: 280

LabelQuick

LabelQuick_V2.0 is a fast image annotation tool designed and developed by the AI Horizon team. This version has been optimized and improved based on the previous version. It provides an intuitive interface and powerful annotation and segmentation functions to efficiently complete dataset annotation work. The tool supports video object tracking annotation, quick annotation by clicking, and various video operations. It introduces the SAM2 model for accurate and efficient object detection in video frames, reducing manual intervention and improving annotation quality. The tool is designed for Windows systems and requires a minimum of 6GB of memory.

github

: 70

For similar tasks

uDesktopMascot

github

: 318

For similar jobs

Awesome-AIGC-3D

Awesome-AIGC-3D is a curated list of awesome AIGC 3D papers, inspired by awesome-NeRF. It aims to provide a comprehensive overview of the state-of-the-art in AIGC 3D, including papers on text-to-3D generation, 3D scene generation, human avatar generation, and dynamic 3D generation. The repository also includes a list of benchmarks and datasets, talks, companies, and implementations related to AIGC 3D. The description is less than 400 words and provides a concise overview of the repository's content and purpose.

github

: 516

armortools

3D content creation tools.

github

: 3.1k

CushyStudio

CushyStudio is a generative AI platform designed for creatives of any level to effortlessly create stunning images, videos, and 3D models. It offers CushyApps, a collection of visual tools tailored for different artistic tasks, and CushyKit, an extensive toolkit for custom apps development and task automation. Users can dive into the AI revolution, unleash their creativity, share projects, and connect with a vibrant community. The platform aims to simplify the AI art creation process and provide a user-friendly environment for designing interfaces, adding custom logic, and accessing various tools.

github

: 641

dream-textures

Dream Textures is a tool integrated into Blender that allows users to create textures, concept art, background assets, and more using simple text prompts. It offers features like seamless texture creation, texture projection for entire scenes, restyling animations, and running models on the user's machine for faster iteration. The tool supports CUDA and Apple Silicon GPUs, with over 4GB of VRAM recommended. Users can troubleshoot issues by checking Blender's system console or seeking help from the community on Discord.

github

: 7.7k

aircraft

Headwind Simulations A339X - A330-900neo is an open-source project aimed at creating a free Airbus A330-900neo for Microsoft Flight Simulator. The project is based on the FlyByWire System A32NX and offers a detailed simulation of the A330-941 model with various components like engines, FMS, ACAS, ATC, and more. Users can build the aircraft using Docker and node modules, and the package can be easily integrated into MSFS. The project is part of a collaborative effort with other open-source projects contributing to the aircraft's systems, cockpit, sound, and 3D parts. The repository is dual-licensed under GNU GPLv3 for textual-form source code and CC BY-NC 4.0 for artistic assets, ensuring proper usage and attribution of the content.

github

: 169

MiKaPo

MiKaPo is a web-based tool that allows users to pose MMD models in real-time using video input. It utilizes technologies such as Mediapipe for 3D key points detection, Babylon.js for 3D scene rendering, babylon-mmd for MMD model viewing, and Vite+React for the web framework. Users can upload videos and images, select different environments, and choose models for posing. MiKaPo also supports camera input and Ollama (electron version). The tool is open to feature requests and pull requests, with ongoing development to add VMD export functionality.

github

: 75

uDesktopMascot

github

: 318

DeepMesh

DeepMesh is an auto-regressive artist-mesh creation tool that utilizes reinforcement learning to generate high-quality meshes conditioned on a given point cloud. It offers pretrained weights and allows users to generate obj/ply files based on specific input parameters. The tool has been tested on Ubuntu 22 with CUDA 11.8 and supports A100, A800, and A6000 GPUs. Users can clone the repository, create a conda environment, install pretrained model weights, and use command line inference to generate meshes.

github

: 350