PhonemeX Pro – Offline TTS, Lip-Sync, Avatar & Face Rig System
Carruto
$76.99
(no ratings)
On-device Text-to-Speech for Unity with full phoneme data, modular lip-sync, avatar integration, and facial rig support.

✅ Works in HDRP inside the Editor, but Web builds with HDRP are not supported by Unity. Use URP or Built-in for Web. In Built-in RP, if UI input issues occur in the sample scene, select the EventSystem and click the “Fix” button in the Inspector to resolve them automatically.

PhonemeX Pro Edition is a modular, fully offline Text-to-Speech and character voice system built for Unity mobile, desktop, and web builds.

It provides real-time, on-device speech synthesis with full access to phoneme timing data at runtime, enabling speech generation, lip-sync, facial animation, and voice-driven character systems.

What’s New in Version 1.1.0

PhonemeX Pro has evolved from a standalone TTS component into a complete modular voice and animation framework. This update introduces a new internal architecture that separates voice generation, lip-sync, avatar driving, and facial rigging into independent runtime systems. Existing projects remain compatible, while advanced users can now build more structured and scalable character pipelines.

Core Systems Included

On-Device Text-to-Speech System
A cross-platform, fully offline TTS engine that runs entirely inside Unity. Generates real-time audio along with detailed phoneme timing data for runtime logic and animation.

Real-Time Lip-Sync System
A dedicated lip-sync pipeline that schedules phoneme data, blends viseme weights, and drives facial motion in real time. Designed for accurate speech-to-animation synchronization.

Modular Avatar System
PhonemeX Pro includes a flexible avatar integration layer compatible with the AvatarX character set.
The avatar system applies lip-sync and expression data to fully rigged characters using blendshapes or custom drivers, enabling speech-driven facial animation with minimal setup.

The system is designed to work with a wide range of humanoid characters and animation setups, allowing developers to integrate PhonemeX into existing projects without being locked to a specific character format.

Face Rig & Expression System
An optional facial rigging and expression framework for authoring, blending, and layering facial poses, emotions, and micro-expressions on top of lip-sync animation.

Modular Design

Each system can be used independently depending on your project needs:
- Use only the TTS system for speech playback
- Combine TTS and lip-sync for talking characters
- Integrate the avatar and face rig systems for full facial animation control

Modules are loosely coupled and designed to be extended or replaced as your project grows.

Key Features
- Fully Offline, On-Device TTS – No internet connection or server required by default
- Real-Time Phoneme Access – Phoneme timing data available at runtime
- Cross-Platform Support – Android, iOS, WebGL, Windows, macOS, and Linux
- Lightweight & Fast – Optimized for mobile and browser-based builds
- Multi-Voice Support – Easily switch between supported voice models
- Modular Runtime Architecture – Clean separation between voice, lip-sync, and animation systems
- Optional Online Phoneme Services – Can be integrated if desired, but are not required for core functionality
- Local AI Inference – Uses Unity’s official on-device AI inference packages
- Demo Scenes Included – Examples showcasing TTS, lip-sync, avatar integration, and face rig workflows

Supported Platforms
- Android (ARM / ARM64)
- iOS / iPadOS
- WebGL
- Windows (x86 / x64)
- macOS
- Linux

Dependencies / Requirements
- Unity 6 LTS or newer
- On-device AI inference package: Unity AI Inference Engine
- Unity Editor Coroutines package (com.unity.editorcoroutines)
- Newtonsoft Json package (com.unity.nuget.newtonsoft-json, included with Unity)

Required packages can be installed automatically via the PhonemeX installer.

Perfect For
- Games and visual novels
- Digital humans and virtual characters
- Interactive NPCs and AI agents
- Educational and accessibility applications
- Web-based interactive tools and simulations

Need Help or Want to Connect?
Join our official Carruto Support Discord: Join Here
Documentation | Support Email
Get updates, ask questions, and share your projects with the community.

Demo Builds
You can download prebuilt demo applications to test the asset before purchasing:
- Windows Demo: Download
- macOS Demo: Download

These demos showcase the real runtime behavior of the asset. No login or registration required.

PhonemeX Pro is designed as a scalable voice and character animation foundation, from simple on-device speech to fully animated, expressive characters.

Created by Carruto – professional tools for AI-driven audio and character systems.

Offline, On-Device Text-to-Speech
Real-time speech synthesis running entirely inside Unity builds, with no internet connection required for core functionality.

Real-Time Phoneme Data Access
Provides phoneme timing data alongside generated audio for accurate lip-sync, facial animation, and runtime logic.

Modular Runtime Architecture
Independent runtime modules for voice generation, lip-sync, avatar driving, and facial rig systems.

Cross-Platform Runtime Support
Designed for Android, iOS, WebGL, Windows, macOS, and Linux builds.

Lightweight & Performance-Oriented
Optimized runtime footprint suitable for mobile and browser-based applications.

Multi-Voice Model Support
Supports switching between multiple voice models (Piper-compatible format).

On-Device AI Inference
Uses Unity’s official on-device AI inference packages:
- Unity AI Inference / Inference Engine for Unity 6 and newer

Optional External Phoneme Integration
Supports optional external phoneme services for advanced workflows, without affecting offline core functionality.

Demo Scenes & Documentation Included
Example scenes and detailed documentation covering setup, architecture, and integration workflows.

This package integrates pre-trained AI models to provide fully offline text-to-speech functionality. Specifically, it uses a Piper TTS voice model (in Sentis format) for local speech synthesis.

Note: All inference runs locally on the target device (offline), and no user data is collected or transmitted.
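
The Real-Time Lip-Sync System is described as scheduling phoneme timing data and blending viseme weights. The listing does not document how PhonemeX does this internally, so the following is only a language-neutral sketch of the general idea, written in Python rather than Unity C#. Every name in it (`PhonemeEvent`, `PHONEME_TO_VISEME`, `viseme_weights`, the viseme labels, and the `fade` parameter) is hypothetical and is not part of the PhonemeX API. It assumes each phoneme arrives with a start and end time in seconds and cross-fades neighbouring visemes linearly around each boundary.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PhonemeEvent:
    """One timed phoneme, as a TTS engine might report it (hypothetical shape)."""
    phoneme: str
    start: float  # seconds from the start of the clip
    end: float    # seconds from the start of the clip


# Illustrative phoneme-to-viseme table; a real rig would define its own mapping.
PHONEME_TO_VISEME = {
    "p": "PP", "b": "PP", "m": "PP",
    "f": "FF", "v": "FF",
    "a": "AA", "o": "OH",
}


def viseme_weights(events, t, fade=0.05):
    """Return {viseme: weight} at playback time t (seconds).

    Each phoneme ramps in over `fade` seconds centred on its start and ramps
    out over `fade` seconds centred on its end, so two back-to-back phonemes
    each sit at 0.5 exactly on their shared boundary.
    """
    weights = {}
    for ev in events:
        ramp_in = (t - ev.start) / fade + 0.5
        ramp_out = (ev.end - t) / fade + 0.5
        w = max(0.0, min(1.0, ramp_in, ramp_out))
        if w > 0.0:
            # Unmapped phonemes fall back to a neutral "SIL" viseme.
            viseme = PHONEME_TO_VISEME.get(ev.phoneme, "SIL")
            # Sum contributions that share a viseme, capped at full weight.
            weights[viseme] = min(1.0, weights.get(viseme, 0.0) + w)
    return weights
```

In a Unity project the resulting weights would typically be sampled once per frame against the audio playback time and written to the character's blendshapes. Summing, rather than taking the maximum, keeps a viseme at full weight when two consecutive phonemes map to the same mouth shape.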




