Releases: OpenMind/OM1-video-processor

v1.0.1-beta.2

13 Feb 23:55

What's new in this release

The raw video stream is now published to the local MediaMTX server at the path /top_camera_raw for local use.
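
As a rough illustration of local use, the sketch below builds a URL for that path. It assumes the stream is read over RTSP from a server on the same machine, listening on MediaMTX's default RTSP port (8554). The host, port, protocol, and the helper name `raw_stream_url` are assumptions for illustration and are not stated in this release.

```python
# Path taken from the release note above.
STREAM_PATH = "/top_camera_raw"


def raw_stream_url(host: str = "localhost", port: int = 8554) -> str:
    """Build an RTSP URL for the raw camera stream.

    Assumption: the server is reached over RTSP; 8554 is MediaMTX's
    default RTSP port, and the host is hypothetical.
    """
    return f"rtsp://{host}:{port}{STREAM_PATH}"


if __name__ == "__main__":
    print(raw_stream_url())
```

The resulting URL can then be passed to any RTSP-capable client, for example a media player or OpenCV's `cv2.VideoCapture`, provided the deployment actually exposes the stream on that host and port.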

What's Changed

Full Changelog: v1.0.1-beta.1...v1.0.1-beta.2

v1.0.1-beta.1

09 Feb 22:00
4702eb2

What's new in this release

  • Switched the base Docker image from JetPack to CUDA 13.0.0 with Ubuntu 24.04
  • Updated the Python version from 3.10 to 3.12
  • Fixed the CUDA driver mismatch issue
  • Fixed TensorRT version

What's Changed

Full Changelog: v1.0.0...v1.0.1-beta.1

v1.0.0

18 Dec 19:06
7a272d8

What's new in this release

  • Updated the environment to support newer Python and CUDA versions, refined dependency management, and improved model handling. These changes improve compatibility with the latest libraries and streamline deployment, especially for NVIDIA Thor and for inference with a specific TensorRT version.
  • Updated Dockerfile for Thor support
  • Fixed CUDA driver mismatch
  • Added a venv to the Docker image

What's Changed

Full Changelog: v1.0.0-beta.1...v1.0.0

v1.0.0-beta.1

13 Nov 18:01
c085fa2

Overview

First beta release of the OM1 video processor. This release introduces the foundational features that developers and integrators need to build streaming and analytics solutions.

Features

  • Face Detection and Anonymization: Added face detection with real-time anonymization. Faces can now be automatically blurred or masked to protect privacy in live or recorded streams. This processing runs on the robot's edge device.
  • RTSP for Audio and Video Streaming: Introduced RTSP (Real-Time Streaming Protocol) support for streaming both audio and video. This allows integration with a wider range of cameras, streaming servers, and third-party applications. Note that RTSP manages the streaming sessions; the media data itself is typically carried over a separate transport such as RTP.
  • Support for Multiple Video Streams: Enhanced the system to support multiple concurrent video streams. Users can now view, process, and manage several input sources simultaneously without performance degradation.
  • Support for Local and Remote Video Streams: Added the ability to handle both local camera feeds and remote video sources. This provides greater flexibility for hybrid setups that combine on-premise and cloud-based video inputs.
  • Reduced Microphone Latency: Optimized the audio pipeline to significantly reduce microphone input latency. This ensures more natural and synchronized communication in real-time applications.
  • Dynamic FPS Support: Implemented dynamic frame rate adjustment to optimize performance and bandwidth usage. The system now automatically adapts FPS based on network conditions and processing load.
  • Noise Cancellation and Echo Reduction: Integrated advanced audio processing algorithms for noise suppression and echo reduction. This results in clearer, higher-quality sound for both streaming and recording scenarios.
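
The dynamic FPS behavior described above can be pictured with a minimal sketch. This is not the project's implementation: the class name `FpsController`, the thresholds, and the back-off factors are all hypothetical, and the sketch reacts only to per-frame processing time, whereas the release notes say the real system also considers network conditions.

```python
from dataclasses import dataclass


@dataclass
class FpsController:
    """Hypothetical controller: lowers FPS under load, raises it when idle."""

    min_fps: float = 5.0
    max_fps: float = 30.0
    fps: float = 30.0

    def update(self, processing_ms: float) -> float:
        """Adjust the target FPS from the last frame's processing time."""
        budget_ms = 1000.0 / self.fps  # time available per frame at current FPS
        if processing_ms > budget_ms:
            # Overloaded: back off multiplicatively so the pipeline recovers fast.
            self.fps *= 0.8
        elif processing_ms < 0.5 * budget_ms:
            # Plenty of headroom: step back up gradually.
            self.fps += 1.0
        # Keep the target inside the configured range.
        self.fps = max(self.min_fps, min(self.max_fps, self.fps))
        return self.fps


if __name__ == "__main__":
    controller = FpsController()
    for ms in (50.0, 50.0, 5.0, 5.0):
        print(round(controller.update(ms), 2))
```

Backing off multiplicatively and recovering additively is one common way to avoid oscillation; other policies would work equally well here.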

What's Changed

Full Changelog: https://github.com/OpenMind/OM1-video-processor/commits/v1.0.0-beta.1