Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Updated Jun 1, 2022 (Java)
Up to 200x faster dot products and similarity metrics for Python, Rust, C, JS, and Swift, supporting f64, f32, and f16 real and complex, i8, and bit vectors, using SIMD for AVX2, AVX-512, NEON, SVE, and SVE2 📐
📚 String comparison and edit distance algorithms library, featuring: Levenshtein, LCS, Hamming, Damerau-Levenshtein (OSA and adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc.
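Several of the libraries listed above implement Levenshtein distance. As a rough illustration of what that metric computes (not the code of any listed repository), here is a minimal Python sketch of the standard two-row dynamic programming approach. The function name `levenshtein` is hypothetical, and unit costs for insertion, deletion, and substitution are assumed.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance assuming unit-cost insert, delete, and substitute."""
    if len(a) < len(b):
        a, b = b, a  # keep the stored rows as short as possible
    # previous[j] holds the distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]
```

Keeping only two rows brings memory down to the length of the shorter string, which is a common choice when only the distance, and not the edit script, is needed.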
A .NET port of java-string-similarity
Scalable Time Series Data Analytics
Chinese intelligent customer-service chatbot demo with two parts, chit-chat and domain-specific Q&A (FAQ), with support for custom components.
Quantify the difference between two arbitrary curves in space
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
Financial time series (predictive analysis / similarity / data processing)
Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.
Information Theory and Distance Quantification with R
Free hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms.
Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures
MTSS-GAN: Multivariate Time Series Simulation with Generative Adversarial Networks (by @firmai)
vips-powered Ruby gem to measure image similarity, implementing dHash and IDHash algorithms
Parallel Barnes-Hut t-SNE implementation written in Rust.
Formed trajectories of sets of points. Experimented on finding similarities between trajectories based on DTW (Dynamic Time Warping) and LCSS (Longest Common SubSequence) algorithms. Modeled trajectories as strings based on a Grid representation. Benchmarked KNN, Random Forest, and Logistic Regression classification algorithms to classify efficiently t…
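For readers unfamiliar with DTW, the following is a minimal Python sketch of the classic dynamic programming formulation. It is a simplification and not the repository's code: it assumes one-dimensional numeric sequences and an absolute-difference local cost, whereas trajectories of points would need a point-to-point distance instead. The name `dtw_distance` is hypothetical.

```python
import math

def dtw_distance(xs, ys):
    """Dynamic Time Warping cost between two 1-D numeric sequences.

    Assumes absolute difference as the local cost and no window constraint.
    """
    n, m = len(xs), len(ys)
    # cost[i][j] is the best alignment cost of xs[:i] with ys[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(xs[i - 1] - ys[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # xs advances, ys element is reused
                cost[i][j - 1],      # ys advances, xs element is reused
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]
```

Because elements may be reused, a sequence and a time-stretched copy of it align at zero cost, which is the property that makes DTW attractive for comparing trajectories sampled at different rates.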
Building a recommendation system using graph search methodologies. We compare these different approaches and closely observe the limitations of each.
Romanian WordNet (Data + API for Python)