About

FireTV is a cutting-edge, AI-driven video streaming platform built to highly personalize and socialize the viewing experience. It goes beyond traditional streaming by utilizing AI to detect a user's current mood and instantly recommending content tailored to their emotional state. Additionally, FireTV features intelligent video summarization for long-form content and robust real-time synchronization, allowing friends to host watch parties and stream videos together in perfect sync.

Tech Stack

Vite
FastAPI
Python
Google Gemini Flash API
WebSockets

Features

Mood-Based Recommendations

Analyzes user signals (text, audio, or video) to detect emotions and suggests contextually matching videos.

AI Video Summarizer

Automatically generates concise text summaries and highlights key moments of long videos using Gemini.

Synchronized Group Watching

Supports real-time, multi-user watch parties with perfectly synced play, pause, and seek controls.

Live Social Interaction

WebSocket integration broadcasts chat messages, mood shifts, and playback state changes instantly to all users in a viewing room.

Architecture

01

Client Layer

A blazing-fast frontend built with Vite handles the UI, video player logic, and capturing user signals for mood detection.

02

Real-Time Streaming Layer

A Python-based FastAPI backend manages active WebSocket connections, acting as the central hub for playback events and chat messages.

03

AI Integration Layer

The backend securely communicates with the Gemini Flash API to process video transcripts for summarization and evaluate user inputs for real-time emotional state classification.

Future Improvements

Integrated Voice Chat

Add WebRTC-based voice channels to the watch parties so users can talk to each other in real-time.

Granular Emotion Tracking

Introduce continuous emotion tracking via webcam that dynamically adjusts the recommended queue as the user's mood shifts.

Smart Chapter Segmentation

Expand the AI summarizer to automatically inject clickable visual chapters onto the video scrubber based on topic changes.