Skip to content

Daydream Cloud: Remote GPU Processing in Scope#397

Open
emranemran wants to merge 24 commits intomainfrom
backend-fal-v6
Open

Daydream Cloud: Remote GPU Processing in Scope#397
emranemran wants to merge 24 commits intomainfrom
backend-fal-v6

Conversation

@emranemran
Copy link

@emranemran emranemran commented Feb 4, 2026

Daydream Cloud: Remote GPU Processing in Scope

Summary

This PR adds a Cloud Mode toggle to Scope, enabling users to run video generation pipelines on remote cloud GPUs (H100s for now) instead of requiring a local GPU. Aside from Plugins and LoRAs, Local and Cloud mode are at feature parity.

Architecture Overview

Local Mode (Current)

┌─────────────────────────────────────────────────────────────┐
│                      User's Machine                          │
│  ┌─────────────┐     WebRTC      ┌────────────────────────┐ │
│  │   Browser   │ ◄─────────────► │    Scope Backend       │ │
│  │  (Frontend) │                 │  (Local GPU - RTX/etc) │ │
│  └─────────────┘                 └────────────────────────┘ │
│                                            │                 │
│                                     ┌──────┴──────┐          │
│                                     │    Spout    │          │
│                                     │  (Windows)  │          │
│                                     └─────────────┘          │
└─────────────────────────────────────────────────────────────┘

Cloud Mode

When to use: Desktop app where Spout I/O or local frame processing is needed.

How to enable: Start Scope, sign in, click "Connect to Cloud" in UI.

┌───────────────────────────────────────────────────────────────────────┐
│                           User's Machine                               │
│                                                                        │
│  ┌───────────┐        ┌─────────────────────────┐        ┌─────────┐  │
│  │  Browser  │ WebRTC │    Scope Backend        │ Spout  │ TouchD  │  │
│  │ (Frontend)│◄──────►│    (Client Mode)        │◄──────►│ Resolume│  │
│  └───────────┘        │                         │        │ etc.    │  │
│                       │  ┌───────────────────┐  │        └─────────┘  │
│                       │  │ CloudConnection   │  │                     │
│                       │  │ Manager           │  │                     │
│                       │  └────────┬──────────┘  │                     │
│                       └───────────┼─────────────┘                     │
└───────────────────────────────────┼───────────────────────────────────┘
                                    │
                          WebSocket │ + WebRTC
                          (signaling│ + video)
                                    │
                                    ▼
                       ┌─────────────────────────────────┐
                       │         Daydream Cloud            │
                       │  ┌───────────────────────────┐  │
                       │  │       fal_app.py          │  │
                       │  │  ┌─────────────────────┐  │  │
                       │  │  │   Scope Backend     │  │  │
                       │  │  │   (H100 GPU)        │  │  │
                       │  │  └─────────────────────┘  │  │
                       │  └───────────────────────────┘  │
                       └─────────────────────────────────┘

Flow:
1. User clicks "Connect to Cloud" in Settings panel
2. Local backend connects to Cloud backend via WebSocket
3. Local backend acts as WebRTC CLIENT to Cloud backend (role reversal!)
4. Frames from browser/Spout → local backend → Cloud → processed → back
5. Spout I/O works transparently (local backend handles it)

Key Components

Backend (scope-server)

File Purpose
cloud_connection.py WebSocket connection manager with request/response correlation
cloud_webrtc_client.py WebRTC peer connection (client role) to fal.ai
cloud_track.py MediaStreamTrack for bidirectional video relay
frame_processor.py Modified to route frames to cloud when connected
webrtc.py Extended to handle cloud mode offers

Cloud (fal.ai)

File Purpose
fal_app.py Serverless wrapper running Scope on fal.ai H100
FAL_DEPLOYMENT.md Deployment guide for fal.ai

Frontend

File Purpose
cloudAdapter.ts WebSocket-based API client for fal.ai
cloudContext.tsx React context for cloud mode state
useUnifiedWebRTC.ts Unified hook supporting both local and cloud modes
useApi.ts API hook that routes to local or cloud
CloudModeToggle.tsx UI component for enabling cloud mode

API Endpoints

New endpoints for cloud connection management:

POST /api/v1/cloud/connect    - Connect to fal.ai cloud
POST /api/v1/cloud/disconnect - Disconnect from cloud
GET  /api/v1/cloud/status     - Get connection status and stats

When connected to cloud, existing endpoints are automatically proxied:

  • POST /api/v1/pipeline/load → proxied to cloud
  • GET /api/v1/pipeline/status → proxied to cloud
  • GET /api/v1/pipelines/schemas → proxied to cloud
  • POST /api/v1/webrtc/offer → handled locally (relay mode)

Features

Core Functionality

  • Connect/disconnect to fal.ai cloud
  • WebRTC video relay through local backend
  • API call proxying via WebSocket
  • Spout input/output in cloud mode
  • Reference image upload to cloud
  • Recording download from cloud

Authentication

Cloud mode requires Daydream authentication:

  1. User signs in via OAuth flow
  2. Frontend passes user ID to backend
  3. Backend includes user ID in cloud connection
  4. Events are tracked with user context in Kafka (only on Cloud backend - Local backend does not track anything)

emranemran and others added 16 commits February 3, 2026 16:00
Add fal.ai serverless wrapper (fal_app.py) that runs Scope server
on fal.ai GPU infrastructure (H100):

- Start Scope server as subprocess on fal runner
- WebSocket endpoint for WebRTC signaling and API proxying
- Session data cleanup on disconnect (prevents data leakage)
- Connection ID for correlating logs across client and server
- Base64 encoding for binary file uploads/downloads
- Assets directory configuration for fal environment

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
Add modules enabling local Scope server to act as WebRTC client
to cloud for GPU processing:

- CloudConnectionManager: WebSocket connection with request/response correlation
- CloudWebRTCClient: Peer connection from local server to cloud
- CloudTrack: MediaStreamTrack for bidirectional video through cloud
- Request keyframe (PLI) on track received for VP8 decoder stability
- Cloud-related Pydantic schemas for API requests/responses

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
Modify FrameProcessor to support cloud mode (cloud processing):

- Route frames to cloud via WebRTC when connected
- Receive processed frames from cloud callback
- Spout sender/receiver work transparently in both modes
- WebRTC manager handles cloud mode offers
- Fix Spout integration in cloud mode

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
Add REST API endpoints for cloud connection management:

- POST /api/v1/cloud/connect - Connect to cloud
- POST /api/v1/cloud/disconnect - Disconnect from cloud
- GET /api/v1/cloud/status - Get connection status
- Proxy model download requests to cloud in cloud mode
- Proxy hardware API calls to cloud
- Reference image upload handling in cloud mode
- Recording download support in cloud mode

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
Add UI components for managing cloud connection:

- CloudModeToggle: Connection status, connect/disconnect buttons
- Connection ID display with copy-to-clipboard
- Disable controls during cloud connection attempt
- Disable LoRA selection in cloud mode (not supported)
- Show connecting/disconnecting state with spinner

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
Add React hooks and utilities for cloud mode WebRTC:

- useUnifiedWebRTC: Unified hook supporting both local and cloud modes
- useApi: API request hook with cloud mode support
- usePipelines: Refresh pipeline list on cloud mode toggle
- usePipeline: Remove duplicate status polling
- cloudAdapter/cloudContext: Cloud SDK integration

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
Pipeline changes for text-to-video mode:

- Skip video input for text-to-video pipelines
- Input mode detection in wan2_1 pipeline

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
Add GitHub Actions workflow to build Docker images for feature branches.

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
…connections

Implements a maximum connection duration limit to control cloud GPU costs.
Connections are automatically closed after 60 minutes regardless of activity.

Changes:
- Add MAX_CONNECTION_DURATION_SECONDS (3600s) and TIMEOUT_CHECK_INTERVAL_SECONDS (60s) constants
- Track connection_start_time when WebSocket connects
- Add check_max_duration_exceeded() helper that sends a graceful "closing" message with reason "max_duration" before disconnect
- Use asyncio.wait_for() with timeout on ws.receive_text() to enable periodic duration checks during idle periods
- Also check duration after each received message to catch high-activity scenarios

The client receives a JSON message {"type": "closing", "reason": "max_duration", "elapsed_seconds": ...} before disconnection, allowing graceful handling.

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
Integrates Daydream authentication and adds comprehensive event tracking
via Kafka for both local and cloud modes.

Frontend:
- Add OAuth sign-in flow with Daydream authentication
- Show cloud mode toggle only when authenticated
- Pass user ID to backend for log correlation

Backend:
- Add kafka_publisher module for async Kafka event publishing
- Publish stream lifecycle events (stream_started, stream_stopped, stream_error)
- Track user_id through cloud connections for log correlation

Cloud (fal_app.py):
- Add KafkaPublisher class for WebSocket event tracking
- Publish websocket_connected/disconnected events with user context
- Support set_user_id message for log correlation

Signed-off-by: emranemran <emran.mah@gmail.com>
Signed-off-by: Max Holland <max@livepeer.org>
- Add PipelinesProvider wrapper in App.tsx
- Fix ReportBugDialog prop (onClose vs onOpenChange)
- Add vaeTypes to PipelineInfo type
- Add refreshPipelines to PipelinesContext
- Install @radix-ui/react-tabs dependency
- Add DaydreamAccountSection component with auth + cloud mode toggle
- Move cloud mode from StreamPage right panel to Settings General tab
- Add Switch UI component for toggle
- Remove sign in/sign out buttons from Header (now in Settings)
- Delete unused CloudModeToggle component
load_params in PipelineLoadRequest is already dict[str, Any], not a
Pydantic model, so calling model_dump() on it fails.
The method is handle_offer_with_relay, not handle_offer_with_cloud.
Method was renamed in cloud_connection.py but frame_processor.py
was still calling the old name, causing frames not to be sent
in video mode.
@mjh1 mjh1 force-pushed the backend-fal-v6 branch 2 times, most recently from 5857021 to 73dac11 Compare February 4, 2026 16:59
@emranemran emranemran marked this pull request as ready for review February 4, 2026 22:43
@emranemran emranemran changed the title Backend fal v6 Daydream Cloud: Remote GPU Processing in Scope Feb 4, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants