Total number of members in the team: 4

Name of your project: 1738

Have you used any Google technologies? (Please mention the names):

Have you used the Google Gemini API? (Yes/No): Yes

Describe the problem you identified that could exist in the year 2080. (Max 300 words)

By the year 2080, traditional sports commentary may face significant challenges due to shifting viewer preferences, declining availability of skilled human commentators, and a surge in personalized content consumption. As immersive and interactive technologies like augmented reality (AR), virtual reality (VR), and mixed reality become mainstream, audiences will expect real-time, adaptive commentary tailored to their specific interests, language, skill level, and even emotional tone. Human commentators, while skilled, cannot scale to meet these hyper-personalized expectations across a global audience.

Additionally, with the growth of grassroots, amateur, and niche sports—many of which lack professional coverage—the demand for high-quality, scalable, and affordable live commentary will outpace what traditional broadcast models can offer. This creates a gap in both accessibility and engagement, especially for underserved communities and emerging sports markets.

Describe your proposed solution to that problem. (Max 300 words)

Our proposed solution is an AI-powered commentary system that uses intelligent cameras and real-time text-to-speech (TTS) technology to automatically analyze and narrate sports gameplay. This system is designed to meet the future demand for scalable, personalized, and immersive sports experiences across all levels of play—from professional leagues to grassroots and recreational games.

The core of our solution is an AI camera equipped with advanced computer vision algorithms that track players, identify game events, and understand context in real time. As the action unfolds, the system generates live commentary using natural language generation (NLG) models trained specifically for sports. This narration is delivered through high-quality, customizable TTS voices, creating a seamless, human-like experience.

By automating the sports commentary pipeline, our solution solves the future challenge of delivering engaging, cost-effective, and customized coverage at scale. It ensures that every game, no matter how small or remote, can have professional-grade, real-time narration—bringing players and fans closer together in a world where sports remain a universal language.

Give a brief summary of your code. (Max 300 words)

Our code powers a real-time AI sports commentary system by combining computer vision, natural language generation, and text-to-speech technologies. It starts with an AI camera that uses machine learning to track players and detect key gameplay moments like goals, fouls, or substitutions. Once an event is recognized, the system generates natural-sounding commentary using a language model trained specifically on sports data. That commentary is then instantly converted to speech using a high-quality text-to-speech engine, allowing for real-time audio output that feels just like a live human commentator. Everything runs through a low-latency processing pipeline to ensure that the commentary stays perfectly in sync with the action on screen. Users can also customize the voice, tone, or language of the commentary to match their preferences. The system is modular and scalable, making it ideal for anything from local youth games to professional matches, and it’s built to work with future technologies like AR and VR.

Built With

Share this project:

Updates