Software

NVIDIA Introduces Vision AI Agent Skills for Edge and Cloud

NVIDIA's new tools help developers build and deploy vision AI agents that turn video data into operational intelligence, with 95% average precision in defect detection using synthetic data.

Image: NVIDIA

NVIDIA has introduced new vision AI agent skills and blueprints to help developers create and deploy AI models that convert video data into actionable insights. These tools are designed for edge and cloud environments, where AI models must operate under strict latency, power, and connectivity constraints. By providing reusable workflows, NVIDIA aims to streamline the development and optimization of vision AI agents across various industries, including manufacturing, smart cities, and industrial operations. The tools integrate with NVIDIA Omniverse and Metropolis, enabling developers to generate synthetic data, fine-tune models, and deploy agentic video applications more efficiently. Source: nvidia

The new skills include the Defect Image Generation tool, which creates synthetic defect data to address the challenge of limited real-world training examples. This is particularly useful in manufacturing, where defect detection is critical for quality control. In a case study with Corning, a model trained on just eight real defect images, augmented with synthetic data from NVIDIA's tools, achieved 95% average precision and perfect recall on the most challenging defect class. This performance surpassed a baseline model trained solely on real data, significantly reducing the time needed for inspection projects. Source: nvidia

NVIDIA's approach also includes tools for video data augmentation, model fine-tuning, and video search and summarization (VSS) skills, which help developers create deployable workflows for alerts, reporting, and stream management. These tools are part of a broader strategy to address challenges in vision AI agent development, such as data gaps, lack of fine-tuning expertise, and complex deployment workflows. By leveraging OpenUSD and NVIDIA Omniverse, developers can build and test digital twins of real-world environments, enabling more accurate and adaptable AI models. Source: nvidia

Key points

NVIDIA's new vision AI agent skills help developers generate synthetic defect data for training models.
A model trained on eight real defect images and augmented with synthetic data achieved 95% average precision and perfect recall on the most challenging defect class.
NVIDIA's tools include video data augmentation, model fine-tuning, and video search and summarization (VSS) skills.
NVIDIA's approach uses OpenUSD and NVIDIA Omniverse to build and test digital twins of real-world environments.
In a case study with Corning, synthetic data from NVIDIA's tools surpassed a baseline model trained solely on real data.
NVIDIA's tools aim to streamline the development and optimization of vision AI agents across various industries.

Source: NVIDIA Read the original →

WRITTEN BY

Theo Almeida

AI Software & Developer Tools

Theo covers AI software, developer tools, frameworks, and the platforms builders use every day.