DocumentGPT is a web application that allows you to chat over your research document using OpenAI's chat API and perform semantic search using vector databases. This tool provides a seamless interface for interacting with your research document, exploring search results, and engaging in a conversation with an AI chatbot.
- Upload your PDF document and view its contents within the web app.
- Chat with an AI chatbot powered by OpenAI's chat API, using the content of your research document.
- Perform Google searches directly from the chat interface to quickly find relevant information.
- Search for YouTube videos related to your research topic and watch them within the app.
- Get semantic search results from your research document using vector databases.
- Easy-to-use interface with multiple pages for a smooth user experience.
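To make the semantic search feature more concrete, here is a minimal sketch of the retrieval idea: embed the query and each document chunk as vectors, then rank chunks by cosine similarity. The `embed` function is a toy bag-of-words stand-in chosen so the example runs without an API key. It is an assumption for illustration, not the embedding model or vector database that DocumentGPT actually uses, and the names `embed`, `cosine_similarity`, and `semantic_search` are hypothetical.

```python
import math
from collections import Counter
from typing import List, Tuple


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector.

    Assumption: a real setup would call an embedding model here instead.
    """
    return Counter(text.lower().split())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def semantic_search(query: str, chunks: List[str], top_k: int = 2) -> List[Tuple[float, str]]:
    """Return the top_k chunks most similar to the query, best match first."""
    query_vec = embed(query)
    scored = [(cosine_similarity(query_vec, embed(chunk)), chunk) for chunk in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


if __name__ == "__main__":
    document_chunks = [
        "transformers use attention layers",
        "the dataset contains ten thousand images",
        "attention weights are computed with softmax",
    ]
    for score, chunk in semantic_search("how is attention computed", document_chunks):
        print(f"{score:.3f}  {chunk}")
```

A vector database performs the same ranking step, but stores precomputed embeddings and uses an index so that the search stays fast on large documents.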
- Clone the repository:
git clone https://github.com/your-username/DocumentGPT.git
- Install the required dependencies:
pip install -r requirements.txt
- Set up your OpenAI API key and provide it in the web app.
- Run the application:
streamlit run main.py
- Access the web app in your browser at http://localhost:8501.
- ✅ Vector Database Retrieval Tool: Finds answers from the vector database.
- ✅ Arxiv Tool: Searches scientific articles on arxiv.org for specific topics.
- ✅ Search Tool: Searches the internet for general web articles.
- ✅ Summarization Tool: Summarizes the entire document when requested.
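The tools listed above can be thought of as named functions that the chatbot chooses between. The following is a rough sketch of that pattern using a plain Python registry. It does not use the LangChain API, and the tool names, the `Tool` class, and the placeholder implementations are all hypothetical; the real tools would query a vector database, arxiv.org, a web search service, and a summarization chain.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Tool:
    name: str
    description: str  # Shown to the model so it can decide when to use the tool
    run: Callable[[str], str]


def build_registry() -> Dict[str, Tool]:
    """Build a name -> Tool mapping with placeholder implementations."""
    tools = [
        Tool("vector_db_retrieval", "Finds answers from the vector database.",
             lambda q: f"[vector db results for: {q}]"),
        Tool("arxiv", "Searches scientific articles on arxiv.org for specific topics.",
             lambda q: f"[arxiv results for: {q}]"),
        Tool("search", "Searches the internet for general web articles.",
             lambda q: f"[web results for: {q}]"),
        Tool("summarization", "Summarizes the entire document when requested.",
             lambda q: "[document summary]"),
    ]
    return {tool.name: tool for tool in tools}


def dispatch(registry: Dict[str, Tool], tool_name: str, tool_input: str) -> str:
    """Run the requested tool, or report the available tools if the name is unknown."""
    tool = registry.get(tool_name)
    if tool is None:
        known = ", ".join(sorted(registry))
        return f"Unknown tool '{tool_name}'. Available tools: {known}"
    return tool.run(tool_input)


if __name__ == "__main__":
    registry = build_registry()
    print(dispatch(registry, "arxiv", "diffusion models"))
    print(dispatch(registry, "calculator", "1 + 1"))
```

Returning a message for an unknown tool, rather than raising, gives the model a chance to correct itself on the next turn.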
Here are some areas for further improvement in DocumentGPT:
- AI Alignment: Improve the model behaviour by various techniques in prompt engineering.
- Tools: Add more helpful tools for better model responses.
- Parse Errors: Improve the handling of output parse errors.
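As one possible direction for the parse error item, the sketch below parses a model reply and falls back to treating it as a final answer when no tool call can be found, instead of raising an exception. The `Action:` / `Action Input:` layout is an assumed ReAct-style format, not something confirmed for DocumentGPT, and `ParsedStep` and `parse_model_output` are hypothetical names.

```python
import re
from typing import NamedTuple, Optional


class ParsedStep(NamedTuple):
    tool: Optional[str]
    tool_input: Optional[str]
    final_answer: Optional[str]


# Assumed output format: "Action: <tool name>" followed by "Action Input: <text>"
ACTION_PATTERN = re.compile(
    r"Action:\s*(?P<tool>.+?)\s*\nAction Input:\s*(?P<input>.*)",
    re.DOTALL,
)


def parse_model_output(text: str) -> ParsedStep:
    """Extract a tool call if present; otherwise return the text as the final answer."""
    match = ACTION_PATTERN.search(text)
    if match:
        return ParsedStep(match.group("tool").strip(), match.group("input").strip(), None)
    # Fallback: unparseable output is shown to the user rather than crashing the app
    return ParsedStep(None, None, text.strip())


if __name__ == "__main__":
    print(parse_model_output("Thought: I need papers\nAction: arxiv\nAction Input: diffusion models"))
    print(parse_model_output("The paper proposes a new optimizer."))
```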
You can try out the chatbot by visiting the deployed app on Streamlit Cloud: Deployed App
Note: PDF display works fine locally, but unfortunately in the deployed app it only works in certain browsers, such as Safari and Firefox :(
Contributions to DocumentGPT are welcome! If you have any feedback, suggestions, or bug reports, please create an issue in the GitHub repository. You can also contribute to the project by submitting pull requests with your enhancements.
This project is licensed under the MIT License.
DocumentGPT is built using Streamlit, the OpenAI Chat API, LangChain, and various open-source libraries. I would like to acknowledge the developers and contributors of these libraries.