• About
  • Landing Page
  • Buy JNews
SB Crypto Guru News- latest crypto news, NFTs, DEFI, Web3, Metaverse
  • HOME
  • BITCOIN
  • CRYPTO UPDATES
    • GENERAL
    • ALTCOINS
    • ETHEREUM
    • CRYPTO EXCHANGES
    • CRYPTO MINING
  • BLOCKCHAIN
  • NFT
  • DEFI
  • WEB3
  • METAVERSE
  • REGULATIONS
  • SCAM ALERT
  • ANALYSIS
No Result
View All Result
  • HOME
  • BITCOIN
  • CRYPTO UPDATES
    • GENERAL
    • ALTCOINS
    • ETHEREUM
    • CRYPTO EXCHANGES
    • CRYPTO MINING
  • BLOCKCHAIN
  • NFT
  • DEFI
  • WEB3
  • METAVERSE
  • REGULATIONS
  • SCAM ALERT
  • ANALYSIS
No Result
View All Result
SB Crypto Guru News- latest crypto news, NFTs, DEFI, Web3, Metaverse
No Result
View All Result

NVIDIA Delves into RAPIDS cuVS IVF-PQ for Accelerated Vector Search

SB Crypto Guru News by SB Crypto Guru News
July 18, 2024
in Blockchain
0 0
0
NVIDIA Delves into RAPIDS cuVS IVF-PQ for Accelerated Vector Search




Zach Anderson
Jul 18, 2024 20:12

NVIDIA explores the RAPIDS cuVS IVF-PQ algorithm, enhancing vector search performance through compression and GPU acceleration.



NVIDIA Delves into RAPIDS cuVS IVF-PQ for Accelerated Vector Search

In a detailed blog post, NVIDIA has provided insights into their RAPIDS cuVS IVF-PQ algorithm, which aims to accelerate vector search by leveraging GPU technology and advanced compression techniques. This is part one of a two-part series that continues from their previous exploration of the IVF-Flat algorithm.

IVF-PQ Algorithm Introduction

The blog post introduces IVF-PQ (Inverted File Index with Product Quantization), an algorithm designed to enhance search performance and reduce memory usage by storing data in a compressed form. This method, however, comes at the cost of some accuracy, a trade-off that will be further explored in the second part of the series.

IVF-PQ builds upon the concepts of IVF-Flat, which uses an inverted file index to limit the search complexity to a smaller subset of data through clustering. Product quantization (PQ) adds another layer of compression by encoding database vectors, making the process more efficient for large datasets.

Performance Benchmarks

NVIDIA shared benchmarks using the DEEP dataset, which contains a billion records and 96 dimensions, amounting to 360 GiB in size. A typical IVF-PQ configuration compresses this into an index of 54 GiB without significantly impacting search performance, or as small as 24 GiB with a slight slowdown. This compression allows the index to fit into GPU memory.

Comparisons with the popular CPU algorithm HNSW on a 100-million subset of the DEEP dataset show that cuVS IVF-PQ can significantly accelerate both index building and vector search.

Algorithm Overview

IVF-PQ follows a two-step process: a coarse search and a fine search. The coarse search is identical to IVF-Flat, while the fine search involves calculating distances between query points and vectors in probed clusters, but with the vectors stored in a compressed format.

This compression is achieved through PQ, which approximates a vector using two-level quantization. This allows IVF-PQ to fit more data into GPU memory, enhancing memory bandwidth utilization and speeding up the search process.

Optimizations and Performance

NVIDIA has implemented various optimizations in cuVS to ensure the IVF-PQ algorithm performs efficiently on GPUs. These include:

  • Fusing operations to reduce output size and optimize memory bandwidth utilization.
  • Storing the lookup table (LUT) in GPU shared memory when possible for faster access.
  • Using a custom 8-bit floating point data type in the LUT for faster data conversion.
  • Aligning data in 16-byte chunks to optimize data transfers.
  • Implementing an “early stop” check to avoid unnecessary distance computations.

NVIDIA’s benchmarks on a 100-million scale dataset show that IVF-PQ outperforms IVF-Flat, particularly with larger batch sizes, achieving up to 3-4 times the number of queries per second.

Conclusion

IVF-PQ is a robust ANN search algorithm that leverages clustering and compression to enhance search performance and throughput. The first part of NVIDIA’s blog series provides a comprehensive overview of the algorithm’s workings and its advantages on GPU platforms. For more detailed performance tuning recommendations, NVIDIA encourages readers to explore the second part of their series.

For more information, visit the NVIDIA Technical Blog.

Image source: Shutterstock




Source link

Tags: acceleratedBitcoin NewsCrypto NewsCrypto UpdatescuVSDelvesIVFPQLatest News on CryptoNvidiaRAPIDSSB Crypto Guru NewsSearchvector
Previous Post

Streamly: Community Banks, Credit Unions, and Big Tech Partnerships

Next Post

Pindrop Raises $100 Million to Fight Deepfakes

Next Post
Pindrop Raises 0 Million to Fight Deepfakes

Pindrop Raises $100 Million to Fight Deepfakes

  • Trending
  • Comments
  • Latest
Meta Pumps a Further  Million into Horizon Metaverse

Meta Pumps a Further $50 Million into Horizon Metaverse

February 24, 2025
Big XR News from Google, Samsung, Qualcomm, Sony, XREAL, Magic Leap, Lynx, Meta, Microsoft, TeamViewer, Haply

Big XR News from Google, Samsung, Qualcomm, Sony, XREAL, Magic Leap, Lynx, Meta, Microsoft, TeamViewer, Haply

December 13, 2024
Meta Quest Pro Discontinued! Enterprise-Grade MR Headset is No Longer Available

Meta Quest Pro Discontinued! Enterprise-Grade MR Headset is No Longer Available

January 6, 2025
How to Get Token Prices with an RPC Node – Moralis Web3

How to Get Token Prices with an RPC Node – Moralis Web3

September 3, 2024
How to Get NFT Balances with One RPC Call – Moralis Web3

How to Get NFT Balances with One RPC Call – Moralis Web3

August 30, 2024
Exploring Moonbeam – Why Build on Moonbeam? – Moralis Web3

Exploring Moonbeam – Why Build on Moonbeam? – Moralis Web3

September 11, 2024
Chinese Company Moves To Buy 0 Million Worth Of XRP, SEC Filing Shows

Chinese Company Moves To Buy $300 Million Worth Of XRP, SEC Filing Shows

0
Trump’s Bill Gets Roasted, Elon Musk Inspires M Token

Trump’s Bill Gets Roasted, Elon Musk Inspires $53M Token

0
HTX Jumps Two Spots to #8 in Kaiko’s Q2 Exchange Ranking

HTX Jumps Two Spots to #8 in Kaiko’s Q2 Exchange Ranking

0
Corporate Bitcoin Buying Binge Could Trigger Crash: StanChart

Corporate Bitcoin Buying Binge Could Trigger Crash: StanChart

0
Reddit Sues AI Startup Anthropic Over Alleged AI Training

Reddit Sues AI Startup Anthropic Over Alleged AI Training

0
ethduti.es: Ethereum Validator Duties Tracker | Track current & upcoming duties for your validators

ethduti.es: Ethereum Validator Duties Tracker | Track current & upcoming duties for your validators

0
Chinese Company Moves To Buy 0 Million Worth Of XRP, SEC Filing Shows

Chinese Company Moves To Buy $300 Million Worth Of XRP, SEC Filing Shows

June 5, 2025
Trump’s Bill Gets Roasted, Elon Musk Inspires M Token

Trump’s Bill Gets Roasted, Elon Musk Inspires $53M Token

June 5, 2025
HTX Jumps Two Spots to #8 in Kaiko’s Q2 Exchange Ranking

HTX Jumps Two Spots to #8 in Kaiko’s Q2 Exchange Ranking

June 5, 2025
Mastercard Ready To Abandon Manual Card Transactions For Tokenized Transactions By 2030

Mastercard Ready To Abandon Manual Card Transactions For Tokenized Transactions By 2030

June 5, 2025
Amazon AI Project Brings Jobs and Tech to North Carolina

Amazon AI Project Brings Jobs and Tech to North Carolina

June 5, 2025
Corporate Bitcoin Buying Binge Could Trigger Crash: StanChart

Corporate Bitcoin Buying Binge Could Trigger Crash: StanChart

June 5, 2025
SB Crypto Guru News- latest crypto news, NFTs, DEFI, Web3, Metaverse

Find the latest Bitcoin, Ethereum, blockchain, crypto, Business, Fintech News, interviews, and price analysis at SB Crypto Guru News.

CATEGORIES

  • Altcoin
  • Analysis
  • Bitcoin
  • Blockchain
  • Crypto Exchanges
  • Crypto Updates
  • DeFi
  • Ethereum
  • Metaverse
  • Mining
  • NFT
  • Regulations
  • Scam Alert
  • Uncategorized
  • Web3

SITE MAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

© 2025 JNews - Premium WordPress news & magazine theme by Jegtheme.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • HOME
  • BITCOIN
  • CRYPTO UPDATES
    • GENERAL
    • ALTCOINS
    • ETHEREUM
    • CRYPTO EXCHANGES
    • CRYPTO MINING
  • BLOCKCHAIN
  • NFT
  • DEFI
  • WEB3
  • METAVERSE
  • REGULATIONS
  • SCAM ALERT
  • ANALYSIS

© 2025 JNews - Premium WordPress news & magazine theme by Jegtheme.