Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

integrate quantized data to storages #1311

Merged
merged 59 commits into from
Mar 3, 2023
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
59 commits
Select commit Hold shift + click to select a range
6959df4
integrate quantized data to storages
IvanPleshkov Jan 3, 2023
63691b4
revert gitignore
IvanPleshkov Jan 3, 2023
6abff6d
are you happy clippy
IvanPleshkov Jan 3, 2023
d70caab
quantize in optimizer
IvanPleshkov Jan 4, 2023
6749beb
provide flag
IvanPleshkov Jan 4, 2023
d851e1d
fix segfault
IvanPleshkov Jan 5, 2023
f736ce4
skip quantization flag, update scores
IvanPleshkov Jan 6, 2023
817efd1
use quantization flag
IvanPleshkov Jan 17, 2023
0e4490d
are you happy fmt
IvanPleshkov Jan 17, 2023
dfd69de
use quantization flag
IvanPleshkov Jan 17, 2023
82dc7d2
quantized search test
IvanPleshkov Jan 18, 2023
708c4cc
are you happy fmt
IvanPleshkov Jan 18, 2023
ff4d359
refactor test, refactor scorer choosing
IvanPleshkov Jan 18, 2023
d1892f7
are you happy fmt
IvanPleshkov Jan 18, 2023
59113ab
run quantization on segment builder
IvanPleshkov Jan 18, 2023
9f47e8d
decrease testing parameters
IvanPleshkov Jan 18, 2023
fa3c3a8
simplify segment
IvanPleshkov Jan 19, 2023
2926b0c
update version
IvanPleshkov Jan 19, 2023
1b1d5b1
remove use_quantization flag
IvanPleshkov Jan 19, 2023
5ca3e12
provide quantization config
IvanPleshkov Jan 19, 2023
40a8960
quantization version up
IvanPleshkov Jan 23, 2023
77263c1
euclid dist
IvanPleshkov Jan 25, 2023
ae44533
add euclid test
IvanPleshkov Jan 25, 2023
8c36fa5
saveload
IvanPleshkov Jan 26, 2023
e08fafb
fix initialization bugs
IvanPleshkov Jan 26, 2023
48ff551
quantization lib version up
IvanPleshkov Jan 26, 2023
e4be928
fix arm build
IvanPleshkov Jan 26, 2023
d628c52
refactor scorer selecting
IvanPleshkov Jan 26, 2023
6333fff
quant lib version up
IvanPleshkov Jan 26, 2023
f7e54eb
are you happy fmt
IvanPleshkov Jan 26, 2023
ba65b49
Merge branch 'dev' into integrate-quantization
IvanPleshkov Jan 30, 2023
f36c443
are you happy fmt
IvanPleshkov Jan 30, 2023
6773f03
are you happy clippy
IvanPleshkov Jan 30, 2023
6958d04
add save/load test for simple storage
IvanPleshkov Jan 30, 2023
f796067
add comments
IvanPleshkov Jan 30, 2023
303cf8f
quantiles
IvanPleshkov Feb 8, 2023
b668b96
quantization mmap
IvanPleshkov Feb 8, 2023
970ac92
remove f32
IvanPleshkov Feb 8, 2023
4cd30c3
mmap test
IvanPleshkov Feb 9, 2023
f77fba8
fix mmap slice
IvanPleshkov Feb 9, 2023
265c3e2
fix mmap test
IvanPleshkov Feb 9, 2023
6a04dff
use chunks for quantization storage
IvanPleshkov Feb 9, 2023
d6074f5
Merge branch 'dev' into integrate-quantization
IvanPleshkov Feb 9, 2023
f26c5d5
fix build
IvanPleshkov Feb 9, 2023
b2a7bfc
are you happy fmt
IvanPleshkov Feb 9, 2023
7079de7
update quantization library
IvanPleshkov Feb 9, 2023
03f8dfc
update quantization lib
IvanPleshkov Feb 9, 2023
a6baf83
update quantization lib
IvanPleshkov Feb 9, 2023
e88b238
Merge branch 'dev' into integrate-quantization
IvanPleshkov Feb 10, 2023
e23bba5
integrate api changes
IvanPleshkov Feb 16, 2023
adf633d
are you happy fmt
IvanPleshkov Feb 16, 2023
62d9352
change quantization api
IvanPleshkov Feb 17, 2023
63830a6
additional checks in tests
IvanPleshkov Feb 19, 2023
31ea0d8
update quantization version
IvanPleshkov Feb 27, 2023
39218a8
Merge branch 'dev' into integrate-quantization
IvanPleshkov Feb 27, 2023
ce4e385
fix unit tests
IvanPleshkov Feb 27, 2023
f217f34
add quantization to storage config
IvanPleshkov Feb 28, 2023
86f916f
use quantization for all cardinality search cases
IvanPleshkov Mar 1, 2023
a1c282e
Integrate quantization suggestions 2 (#1520)
generall Mar 3, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
quantization mmap
  • Loading branch information
IvanPleshkov committed Feb 8, 2023
commit b668b96e9a6997d1fa9adef141b7b1a0a63a5ad5
2 changes: 1 addition & 1 deletion Cargo.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

1 change: 1 addition & 0 deletions lib/api/src/grpc/proto/collections.proto
Original file line number Diff line number Diff line change
Expand Up @@ -157,6 +157,7 @@ message OptimizersConfigDiff {
message QuantizationConfig {
bool enable = 1;
optional float quantile = 2;
optional bool always_ram = 3;
}

message CreateCollection {
Expand Down
2 changes: 2 additions & 0 deletions lib/api/src/grpc/qdrant.rs
Original file line number Diff line number Diff line change
Expand Up @@ -176,6 +176,8 @@ pub struct QuantizationConfig {
pub enable: bool,
#[prost(float, optional, tag = "2")]
pub quantile: ::core::option::Option<f32>,
#[prost(bool, optional, tag = "3")]
pub always_ram: ::core::option::Option<bool>,
}
#[allow(clippy::derive_partial_eq_without_eq)]
#[derive(Clone, PartialEq, ::prost::Message)]
Expand Down
2 changes: 2 additions & 0 deletions lib/collection/src/operations/conversions.rs
Original file line number Diff line number Diff line change
Expand Up @@ -215,6 +215,7 @@ impl From<CollectionInfo> for api::grpc::qdrant::CollectionInfo {
api::grpc::qdrant::QuantizationConfig {
enable: x.enable,
quantile: x.quantile,
always_ram: x.always_ram,
}
}),
}),
Expand Down Expand Up @@ -380,6 +381,7 @@ impl TryFrom<api::grpc::qdrant::CollectionConfig> for CollectionConfig {
quantization_config: config.quantization_config.map(|x| QuantizationConfig {
enable: x.enable,
quantile: x.quantile,
always_ram: x.always_ram,
}),
})
}
Expand Down
2 changes: 1 addition & 1 deletion lib/segment/src/segment_constructor/segment_builder.rs
Original file line number Diff line number Diff line change
Expand Up @@ -222,7 +222,7 @@ impl SegmentBuilder {
vector_data.vector_storage.borrow_mut().quantize(
&quantized_meta_path,
&quantized_data_path,
quantization.quantile,
quantization,
)?;
}
}
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -115,9 +115,11 @@ fn create_segment(
let quantized_meta_path = vector_storage_path.join(QUANTIZED_META_PATH);
let quantized_data_path = vector_storage_path.join(QUANTIZED_DATA_PATH);
if quantized_meta_path.exists() && quantized_data_path.exists() {
vector_storage
.borrow_mut()
.load_quantization(&quantized_meta_path, &quantized_data_path)?;
vector_storage.borrow_mut().load_quantization(
&quantized_meta_path,
&quantized_data_path,
quantization_config,
)?;
}
}
}
Expand Down
14 changes: 13 additions & 1 deletion lib/segment/src/types.rs
Original file line number Diff line number Diff line change
Expand Up @@ -289,11 +289,23 @@ fn default_max_indexing_threads() -> usize {
0
}

#[derive(Default, Debug, Deserialize, Serialize, JsonSchema, Clone, PartialEq)]
#[derive(Default, Debug, Deserialize, Serialize, JsonSchema, Clone)]
#[serde(rename_all = "snake_case")]
pub struct QuantizationConfig {
pub enable: bool,
pub quantile: Option<f32>,
/// If true, quantized data is stored in RAM even if original data is memmapped.
/// If false, quantized data is stored like original data.
/// Default value is false.
pub always_ram: Option<bool>,
}

impl PartialEq for QuantizationConfig {
fn eq(&self, other: &Self) -> bool {
self.enable == other.enable
&& self.quantile == other.quantile
&& self.always_ram == other.always_ram
}
}

impl Eq for QuantizationConfig {}
Expand Down
27 changes: 16 additions & 11 deletions lib/segment/src/vector_storage/memmap_vector_storage.rs
Original file line number Diff line number Diff line change
Expand Up @@ -9,14 +9,13 @@ use std::sync::Arc;

use atomic_refcell::AtomicRefCell;

use super::quantized_vector_storage::QuantizedRawScorer;
use crate::common::Flusher;
use crate::data_types::vectors::VectorElementType;
use crate::entry::entry_point::{check_process_stopped, OperationResult};
use crate::spaces::metric::Metric;
use crate::spaces::simple::{CosineMetric, DotProductMetric, EuclidMetric};
use crate::spaces::tools::peek_top_largest_iterable;
use crate::types::{Distance, PointOffsetType, ScoreType};
use crate::types::{Distance, PointOffsetType, QuantizationConfig, ScoreType};
use crate::vector_storage::mmap_vectors::MmapVectors;
use crate::vector_storage::{RawScorer, ScoredPointOffset, VectorStorage, VectorStorageSS};

Expand Down Expand Up @@ -262,11 +261,7 @@ where
if let Some(quantized_data) = &mmap_store.quantized_vectors {
if let Some(deleted_ram) = &mmap_store.deleted_ram {
let query = TMetric::preprocess(vector).unwrap_or_else(|| vector.to_owned());
Some(Box::new(QuantizedRawScorer {
query: quantized_data.encode_query(&query),
quantized_data,
deleted: deleted_ram,
}))
Some(quantized_data.raw_scorer(&query, deleted_ram))
} else {
None
}
Expand All @@ -279,15 +274,25 @@ where
&mut self,
meta_path: &Path,
data_path: &Path,
quantile: Option<f32>,
quantization_config: &QuantizationConfig,
) -> OperationResult<()> {
let mmap_store = self.mmap_store.as_mut().unwrap();
mmap_store.quantize(TMetric::distance(), meta_path, data_path, quantile)
mmap_store.quantize(
TMetric::distance(),
meta_path,
data_path,
quantization_config,
)
}

fn load_quantization(&mut self, meta_path: &Path, data_path: &Path) -> OperationResult<()> {
fn load_quantization(
&mut self,
meta_path: &Path,
data_path: &Path,
quantization_config: &QuantizationConfig,
) -> OperationResult<()> {
let mmap_store = self.mmap_store.as_mut().unwrap();
mmap_store.load_quantization(meta_path, data_path)
mmap_store.load_quantization(meta_path, data_path, quantization_config)
}

fn score_points(
Expand Down
88 changes: 65 additions & 23 deletions lib/segment/src/vector_storage/mmap_vectors.rs
Original file line number Diff line number Diff line change
Expand Up @@ -9,13 +9,14 @@ use memmap2::{Mmap, MmapMut, MmapOptions};
use parking_lot::{RwLock, RwLockReadGuard};
use quantization::encoder::EncodingParameters;

use super::quantized_vector_storage::EncodedVectors;
use super::quantized_mmap_storage::{QuantizedMmapStorage, QuantizedMmapStorageBuilder};
use super::quantized_vector_storage::QuantizedVectorStorage;
use crate::common::error_logging::LogError;
use crate::common::Flusher;
use crate::data_types::vectors::VectorElementType;
use crate::entry::entry_point::{OperationError, OperationResult};
use crate::madvise;
use crate::types::{Distance, PointOffsetType};
use crate::types::{Distance, PointOffsetType, QuantizationConfig};

const HEADER_SIZE: usize = 4;
const DELETED_HEADER: &[u8; 4] = b"drop";
Expand All @@ -29,7 +30,7 @@ pub struct MmapVectors {
deleted_mmap: Arc<RwLock<MmapMut>>,
pub deleted_count: usize,
pub deleted_ram: Option<BitVec>,
pub quantized_vectors: Option<EncodedVectors>,
pub quantized_vectors: Option<Box<dyn QuantizedVectorStorage>>,
}

fn open_read(path: &Path) -> OperationResult<Mmap> {
Expand Down Expand Up @@ -101,39 +102,80 @@ impl MmapVectors {
self.deleted_ram = Some(deleted);
}

fn create_quantized_storage<TStorage, TStorageBuilder>(
&self,
meta_path: &Path,
data_path: &Path,
encoding_parameters: EncodingParameters,
) -> OperationResult<Box<dyn QuantizedVectorStorage>>
where
TStorage: quantization::encoder::Storage,
TStorageBuilder: quantization::encoder::StorageBuilder<TStorage>,
{
let vector_data_iterator = (0..self.num_vectors as u32).map(|i| {
let offset = self.data_offset(i as PointOffsetType).unwrap_or_default();
self.raw_vector_offset(offset)
});
let quantized_vectors = quantization::encoder::EncodedVectors::<Vec<u8>>::encode(
vector_data_iterator,
Vec::new(),
encoding_parameters,
)
.map_err(|e| OperationError::service_error(format!("Cannot quantize vector data: {e}")))?;
quantized_vectors.save(data_path, meta_path)?;
Ok(Box::new(quantized_vectors))
}

pub fn quantize(
&mut self,
distance: Distance,
meta_path: &Path,
data_path: &Path,
quantile: Option<f32>,
quantization_config: &QuantizationConfig,
) -> OperationResult<()> {
self.enable_deleted_ram();
let quantized_vectors = EncodedVectors::encode(
(0..self.num_vectors as u32).map(|i| {
let offset = self.data_offset(i as PointOffsetType).unwrap_or_default();
self.raw_vector_offset(offset)
}),
Vec::new(),
EncodingParameters {
distance_type: match distance {
Distance::Cosine => quantization::encoder::SimilarityType::Dot,
Distance::Euclid => quantization::encoder::SimilarityType::L2,
Distance::Dot => quantization::encoder::SimilarityType::Dot,
},
invert: distance == Distance::Euclid,
quantile,
let encoding_parameters = EncodingParameters {
distance_type: match distance {
Distance::Cosine => quantization::encoder::SimilarityType::Dot,
Distance::Euclid => quantization::encoder::SimilarityType::L2,
Distance::Dot => quantization::encoder::SimilarityType::Dot,
},
)
.map_err(|e| OperationError::service_error(format!("Cannot quantize vector data: {e}")))?;
quantized_vectors.save(data_path, meta_path)?;
invert: distance == Distance::Euclid,
quantile: quantization_config.quantile,
};
let quantized_vectors = if quantization_config.always_ram == Some(true) {
self.create_quantized_storage::<Vec<u8>, Vec<u8>>(
meta_path,
data_path,
encoding_parameters,
)?
} else {
self.create_quantized_storage::<QuantizedMmapStorage, QuantizedMmapStorageBuilder>(
meta_path,
data_path,
encoding_parameters,
)?
};
self.quantized_vectors = Some(quantized_vectors);
Ok(())
}

pub fn load_quantization(&mut self, meta_path: &Path, data_path: &Path) -> OperationResult<()> {
pub fn load_quantization(
&mut self,
meta_path: &Path,
data_path: &Path,
quantization_config: &QuantizationConfig,
) -> OperationResult<()> {
self.enable_deleted_ram();
self.quantized_vectors = Some(EncodedVectors::load(data_path, meta_path)?);
self.quantized_vectors = if quantization_config.always_ram == Some(true) {
Some(Box::new(
quantization::encoder::EncodedVectors::<Vec<u8>>::load(data_path, meta_path)?,
))
} else {
Some(Box::new(quantization::encoder::EncodedVectors::<
QuantizedMmapStorage,
>::load(data_path, meta_path)?))
};
Ok(())
}

Expand Down
1 change: 1 addition & 0 deletions lib/segment/src/vector_storage/mod.rs
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
pub mod chunked_vectors;
pub mod memmap_vector_storage;
mod mmap_vectors;
pub mod quantized_mmap_storage;
mod quantized_vector_storage;
pub mod simple_vector_storage;
mod vector_storage_base;
Expand Down
74 changes: 74 additions & 0 deletions lib/segment/src/vector_storage/quantized_mmap_storage.rs
Original file line number Diff line number Diff line change
@@ -0,0 +1,74 @@
use std::path::Path;

use memmap2::{Mmap, MmapMut};
use quantization::encoder::{EncodingParameters, Storage, StorageBuilder};

use crate::madvise;

pub struct QuantizedMmapStorage {
mmap: Mmap,
}

pub struct QuantizedMmapStorageBuilder {
mmap: MmapMut,
cursor_pos: usize,
}

impl Storage for QuantizedMmapStorage {
fn ptr(&self) -> *const u8 {
self.mmap.as_ptr()
}

fn from_file(path: &std::path::Path) -> std::io::Result<QuantizedMmapStorage> {
let file = std::fs::OpenOptions::new()
.read(true)
.write(false)
.create(false)
.open(path)?;
let mmap = unsafe { Mmap::map(&file)? };
madvise::madvise(&mmap, madvise::get_global())?;
Ok(Self { mmap })
}

fn save_to_file(&self, _path: &Path) -> std::io::Result<()> {
// do nothing because mmap is already saved
Ok(())
}
}

impl StorageBuilder<QuantizedMmapStorage> for QuantizedMmapStorageBuilder {
fn build(self) -> QuantizedMmapStorage {
self.mmap.flush().unwrap();
let mmap = self.mmap.make_read_only().unwrap(); // TODO: remove unwrap
QuantizedMmapStorage { mmap }
}

fn extend_from_slice(&mut self, other: &[u8]) {
self.mmap[self.cursor_pos..other.len()].copy_from_slice(other);
self.cursor_pos += other.len();
}
}

impl QuantizedMmapStorageBuilder {
pub fn new(
path: &Path,
size: usize,
dim: usize,
encoding_parameters: &EncodingParameters,
) -> std::io::Result<Self> {
let encoded_storage_size = encoding_parameters.estimate_encoded_storage_size(dim, size);
path.parent().map(std::fs::create_dir_all);
let file = std::fs::OpenOptions::new()
.read(true)
.write(true)
.create(true)
.open(path)?;
file.set_len(encoded_storage_size as u64)?;
let mmap = unsafe { MmapMut::map_mut(&file) }?;
madvise::madvise(&mmap, madvise::get_global())?;
Ok(Self {
mmap,
cursor_pos: 0,
})
}
}
Loading