Fusemachines AI Fellowship Capstone Project

Football Commentary Generation System

An end-to-end AI pipeline for automated football match commentary using computer vision, object tracking, LLM narration, and speech synthesis.

PythonRoboflowYOLOByteTrackGroq APIDeepGramComputer VisionNLP

Overview

The Football Commentary Generation System is an end-to-end AI pipeline built during the Fusemachines AI Fellowship. The system detects match entities, tracks movement, identifies football events, generates natural language commentary, and outputs speech.

What I Built

  • Used Roboflow’s pre-trained model with YOLO architecture for player, ball, and referee detection.
  • Integrated ByteTrack for multi-object tracking under occlusion and complex motion.
  • Designed rule-based event detection for passes, interceptions, blocks, and shots.
  • Generated play-by-play narration using Groq LLM with structured prompt engineering.
  • Integrated DeepGram Text-to-Speech for real-time audio commentary output.

Collaboration

This project was developed with project partner Sudip Shrestha as part of the Fusemachines AI Fellowship capstone.