Skip to content

A Open-Source alternative to the Windows-Recall feature. Utilizing DocTR for OCR, JinaAI embedding models for embedding generations, ChromaDB to store and retrieve embeddings, and your LLM of choice for answering any questions

License

Notifications You must be signed in to change notification settings

data-max-hq/open-memory

Repository files navigation

jinaai-demo

Introduction

In light of concerns surrounding Windows' new 'Windows Recall' feature, which records screens and performs OCR, we present an open-source alternative. Our tool offers the same functionality while ensuring user data remains local. Windows Recall on Windows laptops continuously records screens and performs OCR. Despite the embedded NPUs in these devices, users worry about mandatory cloud data transmission. This worry is well-founded, as transmitting data to the cloud involves risks such as unauthorized access, data breaches, and potential misuse of personal information. To address these concerns, we have created an open-source tool that keeps all data processing local. Our solution prioritizes user privacy and security. By keeping all data processing on the user's device, we eliminate the need for any data to be transmitted to external servers. This approach ensures that users have full control over their data, significantly reducing the risk of unauthorized access or misuse. Leveraging the Power of JinaAI Our project showcases the power of open-source tools in creating robust, privacy-conscious alternatives to proprietary solutions. By leveraging libraries like PyAutoGUI, DocTR, Ollama, and JinaAI, we provide a feature-rich solution that meets user needs without compromising privacy. JinaAI plays a crucial role in our implementation by providing high-quality embedding and re-ranking capabilities. Here's how: Embedding Model: JinaAI's embedding model creates efficient and high-quality vector representations of the text extracted from screenshots. These embeddings capture the semantic meaning of the text, enabling precise and relevant information retrieval. Reranker Model: The reranker model from JinaAI enhances the accuracy of the retrieved data. After performing a standard semantic search, the reranker model re-evaluates the results to select the most relevant chunks. This ensures that the information presented to the user is not only contextually appropriate but also the most pertinent to their query. The combination of JinaAI’s embedding and reranking models significantly boosts the performance of our tool. These models are designed to work seamlessly together, providing an efficient and effective method for processing and retrieving data.

Project Overview

Our implementation leverages several open-source tools to replicate the functionality of Windows Recall without the need to send data to external servers. Here's a breakdown of the components:

  • PyAutoGUI: This tool is used for capturing screenshots at regular intervals.
  • DocTR: A state-of-the-art OCR tool that performs text recognition on the captured images.
  • Ollama: Used for local LLM’s.
  • JinaAI: JinaAI's embedding model and reranker enhances the accuracy of the retrieved data.
  • ChromaDB: A database used to store embeddings for efficient retrieval.

Setup Guide

Install Requirements: Follow this guide to install DocTR. Then, download and extract the project files into your working directory. Install Ollama: Choose your preferred LLM (we recommend QWEN2 for this use case). Run the Application: Using Flask: Flask --app app.py run Using Python: python app.py Access the UI: Open your browser and navigate to localhost:9876. Click "Toggle Recording" to start capturing your screen every 30 seconds. Press the red button to stop recording.

Functionality

Toggle Recording: To turn recording your screen on or off Query ChromaDB: Test the database to retrieve relevant pieces of context for a specific query. LLM Prompt: Pass a query to QWEN2:1.5b to get an explanation of what the user was doing based on the retrieved context. Summary In summary, our open-source alternative to Windows Recall offers a secure, private, and efficient way to continuously record screens and perform OCR, ensuring that all data remains under the user's control. By leveraging the high-quality embedding and reranking models from JinaAI, we enhance the accuracy and relevance of the information retrieved, providing users with a powerful and privacy-conscious solution. This project demonstrates the potential of open-source tools to address privacy concerns while delivering robust functionality.

About

A Open-Source alternative to the Windows-Recall feature. Utilizing DocTR for OCR, JinaAI embedding models for embedding generations, ChromaDB to store and retrieve embeddings, and your LLM of choice for answering any questions

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published