
Challenges
Our client wanted to enable users to engage in real-time video calls with an avatar of an advisor, allowing for natural and interactive conversations.
HBLAB's Solutions
- Captured voice input via Zoom SDK API and used TTS to generate speech-based responses
- Applied the RAG method to generate answers based on user questions
- Generated videos with cloned voice-over tailored to the question, and streamed them directly to the streaming server
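The retrieval step of the RAG approach mentioned above can be sketched as follows. This is a minimal illustration, not the project's implementation: the source does not say which retriever, embedding model, or language model was used, so the example stands in a simple bag-of-words cosine similarity. The names `retrieve`, `build_prompt`, and `KNOWLEDGE_BASE`, and the sample passages, are all hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical knowledge base; the real project's documents are not described in the source.
KNOWLEDGE_BASE = [
    "Savings accounts pay interest monthly.",
    "Term deposits lock funds for a fixed period.",
    "Credit cards have an annual fee.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, documents, top_k=1):
    """Return the top_k documents most similar to the question."""
    q_vec = Counter(tokenize(question))
    ranked = sorted(
        documents,
        key=lambda doc: cosine(q_vec, Counter(tokenize(doc))),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(question, passages):
    """Combine the retrieved passages and the question into one prompt string."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"
```

In a pipeline like the one described, the question would be the transcribed voice input, and the prompt returned by `build_prompt` would be passed to a language model whose answer is then sent to TTS. A production system would more likely use dense embeddings and a vector index than word-count similarity.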
Project details
- Technologies used: STT, TTS, RAG, VAE, 3D Modeling
- Development team: 2 AI Engineers, 0.5 PM
- Duration: 4 months
Results