- Toronto, Canada
-
05:15
(UTC -04:00) - thedataquarry.com
- https://orcid.org/0000-0002-4944-3756
- @tech_optimist
- in/prrao87
-
-
knowledge-table Public
Forked from whyhow-ai/knowledge-tableKnowledge Table is an open-source package designed to simplify extracting and exploring structured data from unstructured documents.
Python MIT License UpdatedOct 16, 2024 -
show-notes Public
Forked from thechangelog/show-notesChangelog episode show notes in Markdown format 📝
UpdatedSep 26, 2024 -
llama_index Public
Forked from run-llama/llama_indexLlamaIndex is a data framework for your LLM applications
Python MIT License UpdatedSep 23, 2024 -
pydantic-benchmarks Public
Benchmarks testing the performance of various releases of Pydantic v2 🦀
-
fine-grained-sentiment Public
A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
-
kuzudb-study Public
Benchmark study on KùzuDB, an embedded OLAP graph database, on an artificial social network dataset
-
awesome-duckdb Public
Forked from davidgasquez/awesome-duckdb🦆 A curated list of awesome DuckDB resources
Creative Commons Zero v1.0 Universal UpdatedJun 10, 2024 -
langchain Public
Forked from langchain-ai/langchain🦜🔗 Build context-aware reasoning applications
-
spacy-nlp Public
Natural Language Processing experiments using the spaCy library
Jupyter Notebook MIT License UpdatedMay 3, 2024 -
topic-modelling Public
Comparing the scalability and quality of topic models in Gensim and PySpark
-
kuzu-rdflib Public
Forked from DerwenAI/kuzu-rdflibAn integration of KùzuDB and RDFlib.
Python MIT License UpdatedApr 26, 2024 -
AeonG Public
Forked from hououou/AeonGAeonG: An Efficient Built-in Temporal Support in Graph Databases
C++ Other UpdatedApr 11, 2024 -
rustworkx Public
Forked from Qiskit/rustworkxA high performance Python graph library implemented in Rust.
Rust Apache License 2.0 UpdatedFeb 22, 2024 -
this-week-in-rust Public
Forked from rust-lang/this-week-in-rustData for this-week-in-rust.org
HTML UpdatedFeb 15, 2024 -
kuzu-docs Public
Forked from kuzudb/kuzu-docsJavaScript Creative Commons Attribution Share Alike 4.0 International UpdatedJan 14, 2024 -
lancedb-study Public
Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search
-
lancedb Public
Forked from lancedb/lancedbServerless, low-latency vector database for AI applications
Python Apache License 2.0 UpdatedDec 4, 2023 -
prrao87.github.io Public archive
Archived. My blog is now moved to https://github.com/thedataquarry
-
db-hub-fastapi Public
Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients
-
mteb-validation Public
Compare different embedding models from MTEB leaderboard
-
meilisearch-python-sdk Public
Forked from sanders41/meilisearch-python-sdkAn async and sync Python client for the Meilisearch API
-
rag-data-ops Public
Code for data ops when building RAG applications using LangChain and LlamaIndex
-
kuzu-ui Public
Forked from kuzudb/explorerBrowser-based user interface for Kùzu graph database
Vue MIT License UpdatedOct 13, 2023 -
VectorHub Public
Forked from superlinked/VectorHubVectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.
Other UpdatedSep 26, 2023 -
duckdb-study Public
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
-
weaviate-io Public
Forked from weaviate/weaviate-ioWebsite for the Weaviate vector database
MDX UpdatedAug 25, 2023 -
qdrant-client Public
Forked from qdrant/qdrant-clientPython client for Qdrant vector search engine
-
gh-action-test Public
Test GitHub actions and pre-commit hooks for experimenting with CI/CD and auto-linting workflows.
Python MIT License UpdatedJul 20, 2023 -
neo4j-python-fastapi Public
Bulk ingest data into Neo4j using sync or async Python, and expose the data via FastAPI