integrate quantized data to storages #1311

IvanPleshkov · 2023-01-03T09:55:40Z

This PR is an integration of the quantization library into Qdrant:
https://github.com/qdrant/quantization

Quantization is a simplification of vector data. It helps reduce memory usage and improve scoring performance.
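
As a rough illustration of the idea (not the code of the quantization library), here is a minimal Rust sketch of linear scalar quantization. The `ScalarQuantizer` type, the min/max range fitting, and the 8-bit code width are all assumptions made for this example; the library linked above may use a different scheme.

```rust
/// Hypothetical scalar quantizer: maps each f32 component to a u8 code
/// using a linear scale over the fitted range [min, max].
struct ScalarQuantizer {
    min: f32,
    step: f32, // width of one quantization bucket
}

impl ScalarQuantizer {
    /// Fit the value range on sample data (assumes non-empty, finite values).
    fn fit(data: &[Vec<f32>]) -> Self {
        let mut min = f32::INFINITY;
        let mut max = f32::NEG_INFINITY;
        for v in data {
            for &x in v {
                min = min.min(x);
                max = max.max(x);
            }
        }
        // Guard against a zero range so that encode never divides by zero.
        let range = (max - min).max(f32::EPSILON);
        ScalarQuantizer { min, step: range / 255.0 }
    }

    /// Encode one vector: one byte per component, out-of-range values are clamped.
    fn encode(&self, v: &[f32]) -> Vec<u8> {
        v.iter()
            .map(|&x| ((x - self.min) / self.step).round().clamp(0.0, 255.0) as u8)
            .collect()
    }

    /// Decode back to approximate f32 values.
    fn decode(&self, q: &[u8]) -> Vec<f32> {
        q.iter().map(|&c| self.min + c as f32 * self.step).collect()
    }
}
```

In this sketch each component takes one byte instead of the four bytes of an `f32`. The price is a rounding error of roughly half a bucket width per component for values inside the fitted range.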

Quantization is enabled at segment creation through a separate quantization config. For now, the config has only an enable flag.

curl -X PUT "http://$QDRANT_HOST/collections/test_collection" \
  -H 'Content-Type: application/json' \
  --data-raw '{
      "vectors": { ... },
      "quantization_config": {
        "enable": true
      }
    }' | jq

Quantized data is kept in RAM only; there is no mmap support for quantized data for now. Quantized data is used for building the HNSW index and for HNSW search. Any search that does not use the HNSW index (plain search, exact search, etc.) does not use quantized data.

To increase accuracy, the user can disable the use of quantized data with the ignore_quantization flag in the search parameters. Because the HNSW index is built from quantized data, ignore_quantization can only improve accuracy; it cannot restore the accuracy of an index built on the original data.
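
The rules above for when quantized data is used can be summarized in a small Rust sketch. Only the `ignore_quantization` flag comes from this description; `ScorerKind`, `SearchParams`, and `choose_scorer` are hypothetical names for illustration and are not taken from the Qdrant code base.

```rust
/// Which vector representation is used for scoring (hypothetical enum).
#[derive(Debug, PartialEq)]
enum ScorerKind {
    Quantized,
    Original,
}

/// Hypothetical subset of the search parameters.
struct SearchParams {
    ignore_quantization: bool,
}

/// Quantized scoring is chosen only when the search goes through the HNSW
/// index, quantized data exists for the segment, and the user has not
/// opted out with `ignore_quantization`.
fn choose_scorer(uses_hnsw: bool, has_quantized_data: bool, params: &SearchParams) -> ScorerKind {
    if uses_hnsw && has_quantized_data && !params.ignore_quantization {
        ScorerKind::Quantized
    } else {
        ScorerKind::Original
    }
}
```

Note that the flag only changes which data is scored at search time. The graph itself was still built from quantized data, which is why the accuracy of the original data is not fully recovered.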

If the search uses quantized data, the final scores are recalculated using the original data.
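
The rescoring step can be sketched as follows, assuming a dot-product metric and candidates that were already selected with approximate (quantized) scores. The `dot` and `rescore` functions and the id-based layout are assumptions for this example, not the actual implementation in this PR.

```rust
/// Dot product on the original f32 vectors.
fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}

/// Hypothetical second stage of a quantized search: `candidates` holds the
/// point ids picked with approximate scores. Their final scores are
/// recomputed on the original vectors and returned best-first.
fn rescore(query: &[f32], originals: &[Vec<f32>], candidates: &[usize]) -> Vec<(usize, f32)> {
    let mut scored: Vec<(usize, f32)> = candidates
        .iter()
        .map(|&id| (id, dot(query, &originals[id])))
        .collect();
    // Sort by exact score, highest first.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored
}
```

With this split, the quantized scores only decide which points become candidates, while the scores returned to the user are exact for those candidates.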

For memmap storage, a RAM copy of the deleted flags is kept if quantization is enabled, because quantization in RAM is not effective when the deleted flags are memmapped.

Performance and accuracy tests on real and random datasets are reported here:
https://www.notion.so/qdrant/7bit-quantization-in-hnsw-9f0467b5010849dcb359d0da93d844e0

All Submissions:

Have you followed the guidelines in our Contributing document?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?

New Feature Submissions:

Does your submission pass tests?
Have you linted your code locally using the cargo fmt command prior to submission?
Have you checked your code using the cargo clippy command?

Changes to Core Features:

Have you added an explanation of what your changes do and why you'd like us to include them?
Have you written new tests for your core changes, as applicable?
Have you successfully run tests with your changes locally?

* integrate quantized data to storages * revert gitignore * are you happy clippy * quantize in optimizer * provide flag * fix segfault * skip quantization flag, update scores * use quantization flag * are you happy fmt * use quantization flag * quantized search test * are you happy fmt * refactor test, refactor scorer choosing * are you happy fmt * run quantization on segment builder * decrease testing parameters * simplify segment * update version * remove use_quantization flag * provide quantization config * quantization version up * euclid dist * add euclid test * saveload * fix initialization bugs * quantization lib version up * fix arm build * refactor scorer selecting * quant lib version up * are you happy fmt * are you happy fmt * are you happy clippy * add save/load test for simple storage * add comments * quantiles * quantization mmap * remove f32 * mmap test * fix mmap slice * fix mmap test * use chunks for quantization storage * fix build * are you happy fmt * update quantization library * update quantization lib * update quantization lib * integrate api changes * are you happy fmt * change quantization api * additional checks in tests * update quantization version * fix unit tests * add quantization to storage config * use quantization for all cardinality search cases * Integrate quantization suggestions 2 (#1520) * review api * wip: refactor quantization integrations * wip: refactor quantization integrations * wip: fmt * include quantization into snapshot * fmt --------- Co-authored-by: Andrey Vasnetsov <andrey@vasnetsov.com>

IvanPleshkov added 30 commits January 3, 2023 10:54

integrate quantized data to storages

6959df4

revert gitignore

63691b4

are you happy clippy

6abff6d

quantize in optimizer

d70caab

provide flag

6749beb

fix segfault

d851e1d

skip quantization flag, update scores

f736ce4

use quantization flag

817efd1

are you happy fmt

0e4490d

use quantization flag

dfd69de

quantized search test

82dc7d2

are you happy fmt

708c4cc

refactor test, refactor scorer choosing

ff4d359

are you happy fmt

d1892f7

run quantization on segment builder

59113ab

decrease testing parameters

9f47e8d

simplify segment

fa3c3a8

update version

2926b0c

remove use_quantization flag

1b1d5b1

provide quantization config

5ca3e12

quantization version up

40a8960

euclid dist

77263c1

add euclid test

ae44533

saveload

8c36fa5

fix initialization bugs

e08fafb

quantization lib version up

48ff551

fix arm build

e4be928

refactor scorer selecting

d628c52

quant lib version up

6333fff

are you happy fmt

f7e54eb

IvanPleshkov added 14 commits February 8, 2023 08:04

quantiles

303cf8f

quantization mmap

b668b96

remove f32

970ac92

mmap test

4cd30c3

fix mmap slice

f77fba8

fix mmap test

265c3e2

use chunks for quantization storage

6a04dff

Merge branch 'dev' into integrate-quantization

d6074f5

fix build

f26c5d5

are you happy fmt

b2a7bfc

update quantization library

7079de7

update quantization lib

03f8dfc

update quantization lib

a6baf83

Merge branch 'dev' into integrate-quantization

e88b238

IvanPleshkov mentioned this pull request Feb 13, 2023

Shared deleted flags #1468

Closed

IvanPleshkov and others added 10 commits February 16, 2023 13:37

integrate api changes

e23bba5

are you happy fmt

adf633d

change quantization api

62d9352

additional checks in tests

63830a6

update quantization version

31ea0d8

Merge branch 'dev' into integrate-quantization

39218a8

fix unit tests

ce4e385

add quantization to storage config

f217f34

use quantization for all cardinality search cases

86f916f

Integrate quantization suggestions 2 (#1520)

a1c282e

* review api * wip: refactor quantization integrations * wip: refactor quantization integrations * wip: fmt * include quantization into snapshot * fmt

generall approved these changes Mar 3, 2023

generall merged commit 5174388 into dev Mar 3, 2023

generall mentioned this pull request Apr 19, 2023

upd wal commit #1749

Closed

8 tasks

agourlay deleted the integrate-quantization branch July 12, 2023 15:46
