Thursday, April 16, 2026
  • Login
SB Crypto Guru News- latest crypto news, NFTs, DEFI, Web3, Metaverse
No Result
View All Result
  • HOME
  • BITCOIN
  • CRYPTO UPDATES
    • GENERAL
    • ALTCOINS
    • ETHEREUM
    • CRYPTO EXCHANGES
    • CRYPTO MINING
  • BLOCKCHAIN
  • NFT
  • DEFI
  • WEB3
  • METAVERSE
  • REGULATIONS
  • SCAM ALERT
  • ANALYSIS
CRYPTO MARKETCAP
  • HOME
  • BITCOIN
  • CRYPTO UPDATES
    • GENERAL
    • ALTCOINS
    • ETHEREUM
    • CRYPTO EXCHANGES
    • CRYPTO MINING
  • BLOCKCHAIN
  • NFT
  • DEFI
  • WEB3
  • METAVERSE
  • REGULATIONS
  • SCAM ALERT
  • ANALYSIS
No Result
View All Result
SB Crypto Guru News- latest crypto news, NFTs, DEFI, Web3, Metaverse
No Result
View All Result

NVIDIA Unveils Generative AI-Powered Visual AI Agents for Edge Deployment

by SB Crypto Guru News
July 17, 2024
in Blockchain
Reading Time: 3 mins read
0 0
A A
0




Timothy Morano
Jul 17, 2024 18:22

NVIDIA introduces Vision Language Models (VLMs) for dynamic video analysis, enhancing AI capabilities at the edge with Jetson Orin platform.



NVIDIA Unveils Generative AI-Powered Visual AI Agents for Edge Deployment

An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis, according to NVIDIA Technical Blog. VLMs enable users to interact with image and video input using natural language, making the technology more accessible and adaptable. These models can run on the NVIDIA Jetson Orin edge AI platform or discrete GPUs through NIMs.

What is a Visual AI Agent?

A visual AI agent is powered by a VLM where users can ask a broad range of questions in natural language and get insights that reflect true intent and context in a recorded or live video. These agents can be interacted with through easy-to-use REST APIs and integrated with other services and mobile apps. This new generation of visual AI agents helps to summarize scenes, create a wide range of alerts, and extract actionable insights from videos using natural language.

NVIDIA Metropolis brings visual AI agent workflows, which are reference solutions that accelerate the development of AI applications powered by VLMs, to extract insights with contextual understanding from videos, whether deployed at the edge or cloud.

For cloud deployment, developers can use NVIDIA NIM, a set of inference microservices that include industry-standard APIs, domain-specific code, optimized inference engines, and enterprise runtime, to power the visual AI Agents. Get started by visiting the API catalog to explore and try the foundation models directly from a browser.

Building Visual AI Agents for the Edge

Jetson Platform Services is a suite of prebuilt microservices that provide essential out-of-the-box functionality for building computer vision solutions on NVIDIA Jetson Orin. Included in these microservices are AI services with support for generative AI models such as zero-shot detection and state-of-the-art VLMs. VLMs combine a large language model with a vision transformer, enabling complex reasoning on text and visual input.

The VLM of choice on Jetson is VILA, given its state-of-the-art reasoning capabilities and speed by optimizing the tokens per image. By combining VLMs with Jetson Platform Services, a VLM-based visual AI agent application can be created that detects events on a live-streaming camera and sends notifications to the user through a mobile app.

Integration with Mobile App

The full end-to-end system can now integrate with a mobile app to build the VLM-powered Visual AI Agent. To get video input for the VLM, the Jetson Platform Services networking service and VST automatically discover and serve IP cameras connected to the network. These are made available to the VLM service and mobile app through the VST REST APIs.

From the app, users can set custom alerts in natural language such as “Is there a fire” on their selected live stream. Once the alert rules are set, the VLM will evaluate the live stream and notify the user in real-time through a WebSocket connected to the mobile app. This will trigger a popup notification on the mobile device, allowing users to ask follow-up questions in chat mode.

Conclusion

This development highlights the potential of VLMs combined with Jetson Platform Services to build advanced Visual AI Agents. The full source code for the VLM AI service is available on GitHub, providing a reference for developers to learn how to use VLMs and build their own microservices.

For more information, visit the NVIDIA Technical Blog.

Image source: Shutterstock




Source link

Tags: AgentsAIPoweredBitcoin NewsCrypto NewsCrypto UpdatesdeploymentEdgeGenerativeLatest News on CryptoNvidiaSB Crypto Guru Newsunveilsvisual
Previous Post

How to Overcome the Challenges of Remote Work in the Professional Services Industry

Next Post

Analyst Predicts Price Will Rocket To $0.00004128 ATH

Related Posts

INJ Futures Launch on CFTC-Regulated Bitnomial, ETF Clock Starts

INJ Futures Launch on CFTC-Regulated Bitnomial, ETF Clock Starts

by SB Crypto Guru News
April 15, 2026
0

Caroline Bishop Apr 15, 2026 22:29 Bitnomial debuts US-regulated Injective futures, beginning the six-month track record needed for Canary Capital's...

Paxos Labs Secures M for Crypto Yield Platform Amplify

Paxos Labs Secures $12M for Crypto Yield Platform Amplify

by SB Crypto Guru News
April 14, 2026
0

Terrill Dicki Apr 14, 2026 21:55 Blockchain Capital leads funding round as Paxos Labs expands Amplify platform offering yield, lending...

Digital Asset Compliance: Why It Matters More Than Ever

Digital Asset Compliance: Why It Matters More Than Ever

by SB Crypto Guru News
April 14, 2026
0

Digital assets are gradually becoming a part of everyday finance and enterprise operations in many ways. The cryptocurrency market has...

GIGGLE Price Prediction: Overbought Rally Eyes  Resistance – 60% Chance of Pullback to

GIGGLE Price Prediction: Overbought Rally Eyes $52 Resistance – 60% Chance of Pullback to $30

by SB Crypto Guru News
April 13, 2026
0

Iris Coleman Apr 13, 2026 16:25 GIGGLE's explosive 34.5% surge has pushed RSI deep into overbought territory at 71.66, while...

AAVE Price Prediction: Recovery to -96 by Late April Despite Current Oversold Conditions

AAVE Price Prediction: Recovery to $94-96 by Late April Despite Current Oversold Conditions

by SB Crypto Guru News
April 12, 2026
0

Iris Coleman Apr 12, 2026 09:17 AAVE price prediction shows potential recovery to $94-96 range by month-end as RSI remains...

Load More
Next Post
Analyst Predicts Price Will Rocket To alt=

Analyst Predicts Price Will Rocket To $0.00004128 ATH

this photograph of Trump exists within a rich art history

this photograph of Trump exists within a rich art history

Facebook Twitter LinkedIn Tumblr RSS

CATEGORIES

  • Altcoin
  • Analysis
  • Bitcoin
  • Blockchain
  • Crypto Exchanges
  • Crypto Updates
  • DeFi
  • Ethereum
  • Metaverse
  • Mining
  • NFT
  • Regulations
  • Scam Alert
  • Uncategorized
  • Web3

SITE MAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2022 - SB Crypto Guru News.
SB Crypto Guru News is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • HOME
  • BITCOIN
  • CRYPTO UPDATES
    • GENERAL
    • ALTCOINS
    • ETHEREUM
    • CRYPTO EXCHANGES
    • CRYPTO MINING
  • BLOCKCHAIN
  • NFT
  • DEFI
  • WEB3
  • METAVERSE
  • REGULATIONS
  • SCAM ALERT
  • ANALYSIS

Copyright © 2022 - SB Crypto Guru News.
SB Crypto Guru News is not responsible for the content of external sites.