Inspiration
Traveling to foreign countries can be exciting, but language barriers and unfamiliar city logistics often make it stressful. Lingoso was inspired by the need for a personal AI companion that travelers can rely on to communicate, find local services, and navigate new places confidently, without worrying about language or cultural gaps.
What it does
Lingoso is an AI travel buddy that helps users:
Translate conversations in real-time using a friendly AI avatar (Angie)
Locate nearby services such as grocery stores, restaurants, salons, gyms, bars, and places of worship
Provide local guidance on transportation, events, and cultural etiquette
Offer currency conversion and recommend banking apps for easy payments abroad
Deliver personalized, voice-enabled assistance so travelers can explore freely
How we built it
Lingoso was built using:
OpenAI Realtime API for conversational AI and live text-to-speech interaction
Heygen AI avatars to provide a visual, expressive AI guide
Vue 3 + Composition API for a responsive, interactive frontend
Supabase / REST APIs for user data management and location-based services
Web Audio API & PCM16 streams for real-time AI voice playback
Integrated geolocation and mapping APIs to provide nearby places and directions
Challenges we ran into
Capturing and playing real-time AI audio while keeping latency low
Synchronizing AI speech and transcript captions for accessibility
Handling diverse languages and accents in real-time translation
Optimizing the UI for both voice and text interactions without overwhelming the user
Accomplishments that we're proud of
Successfully built a real-time AI travel companion that speaks and translates on the fly
Created a voice-enabled avatar interface that feels like a friendly local guide
Integrated multiple travel features (location, currency, events, banking apps) into one seamless experience
Delivered a system that can support multiple languages and maintain context during conversations
What we learned
The Realtime API is powerful, but streaming and handling audio data requires careful management of buffers and events
Real-time translation and TTS integration can be complex but extremely rewarding for user experience
Synchronizing audio, transcripts, and UI updates is key for creating a natural, conversational AI interface
User context (location, preferences, previous interactions) is crucial for personalized AI guidance
What's next for Lingoso
Expand language support and voice options for more inclusivity
Add offline capabilities for essential features like currency conversion and location lookup
Enhance the avatar’s personality and expressiveness using animation and gesture tracking
Implement personalized recommendations based on user behavior and travel patterns
Build a mobile app version for on-the-go travel assistance
Built With
- openai
- scss
- simli
- typescript
- vue
Log in or sign up for Devpost to join the conversation.