Tool allows users to create multiple camera angles from a single camera
NVIDIA has added AI virtual cameras and speech recognition to its Holoscan For Media platform.
Holoscan For Media, launched last year, allows live media and video pipelines to run on the same infrastructure as AI, giving users access to AI applications alongside their production workloads.
The company has now added what it describes as AI reference applications for Holoscan, which can interface with uncompressed ST 2110 streams and add AI effects with minimal latency.
One of these is AI virtual cameras, built with PyTorch and the NVIDIA DeepStream SDK. It detects and tracks individuals in the stream, then creates multiple cropped virtual camera outputs focused on those individuals, meaning a user can generate several camera feeds from a single static camera.
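The cropping step behind such virtual cameras can be illustrated in a few lines. The sketch below is not NVIDIA's implementation; it simply shows how bounding boxes from an upstream person detector (e.g. a PyTorch model, assumed here) could be turned into fixed-aspect virtual camera crops of a frame. All function names and margin values are illustrative.

```python
import numpy as np

def virtual_camera_crops(frame, boxes, out_aspect=16 / 9, margin=0.2):
    """For each detected person box (x, y, w, h), return a crop of the
    frame centred on that person, expanded by a margin, forced to the
    output aspect ratio, and clamped to the frame bounds.

    Detection itself is assumed to happen upstream; the boxes passed
    in here are illustrative placeholders for detector output."""
    H, W = frame.shape[:2]
    crops = []
    for (x, y, w, h) in boxes:
        # Expand the box by the margin, then widen to the target aspect.
        cw = w * (1 + margin)
        ch = max(h * (1 + margin), cw / out_aspect)
        cw = ch * out_aspect
        # Centre the crop on the detection, clamped inside the frame.
        cx, cy = x + w / 2, y + h / 2
        x0 = int(max(0, min(cx - cw / 2, W - cw)))
        y0 = int(max(0, min(cy - ch / 2, H - ch)))
        x1, y1 = int(min(W, x0 + cw)), int(min(H, y0 + ch))
        crops.append(frame[y0:y1, x0:x1])
    return crops
```

Each returned crop can then be scaled to a standard output resolution and published as its own feed; a production pipeline would also smooth the crop position between frames to avoid jitter.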
In addition, NVIDIA has announced AI speech recognition: a web user interface that transcribes the incoming stream in real time, displaying live captions alongside a search field for finding words in the transcription.
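The searchable-transcript idea can be sketched simply. This is a toy in-memory model, not NVIDIA's service (whose API the article does not describe); the class and field names are illustrative. Timed caption segments arrive from a speech recogniser and are indexed so a user can look up every segment containing a word.

```python
from dataclasses import dataclass

@dataclass
class Caption:
    start_s: float  # timestamp of the caption within the stream
    text: str       # transcribed text for this segment

class TranscriptSearch:
    """Accumulates live captions and supports simple word search.
    A real deployment would stream segments from a speech-recognition
    backend and serve results to a web UI."""

    def __init__(self):
        self.captions = []

    def add(self, caption: Caption) -> None:
        # Called as each new caption segment arrives from the recogniser.
        self.captions.append(caption)

    def search(self, word: str) -> list:
        # Case-insensitive substring match over all stored segments.
        word = word.lower()
        return [c for c in self.captions if word in c.text.lower()]
```

Returning the timestamped segments (rather than bare text) is what lets the UI jump the operator to the point in the stream where a word was spoken.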
NVIDIA recommends the following setup for users of these tools:

- An AI workstation with an NVIDIA RTX Pro GPU and an NVIDIA ConnectX network interface card (with loopback cable or switch connectivity), or a certified multi-GPU system
- A functional NVIDIA Holoscan for Media environment, using either a local developer setup with Kubernetes or the platform reference deployment guide with a jump node
- Visual Studio Code or any other IDE for Linux platforms; the GNU Compiler Collection (GCC) can also be used