Document to CSV Generator is a robust application designed to facilitate the efficient transformation of document content into structured CSV files. This tool allows users to easily upload multiple documents and define specific headers for the CSV columns directly within the application. Each header can be accompanied by a detailed column description to clarify the data extraction process.
- Streamlit Framework: Built on Streamlit to provide a smooth and interactive user experience, facilitating quick setup and real-time data processing.
- Watsonx AI and Llama 3 by Meta: Integrates Watsonx AI with Llama 3, Meta’s latest large language model, ensuring top-notch accuracy and efficiency in text analysis and data extraction.
- Column Customization: Users can define column names and provide detailed descriptions for each, enhancing understanding and control over data extraction.
- Multiple Document Upload: The app supports uploading several documents simultaneously, allowing for bulk data processing.
- Intelligent Data Extraction: Leveraging advanced algorithms, the application extracts relevant data from the uploaded documents and aligns it under the designated headers in the CSV.
Document to CSV Generator streamlines data extraction and organization, making it an indispensable tool for data analysis and management tasks.
- Installation
- Setting Up the Python Environment
- Activating the Python Environment
- Running the Streamlit App
- Usage
- Contributing
- License
- Python 3.7 or higher
- pip (Python package installer)
- virtualenv (for creating isolated Python environments)
-
Clone the repository:
git clone https://github.com/yourusername/your-repo-name.git cd your-repo-name
-
Install virtualenv if you don't have it:
pip install virtualenv
-
Create a virtual environment:
virtualenv venv
-
Install the required dependencies:
pip install -r requirements.txt
```sh
.\venv\Scripts\activate
```
```sh
source venv/bin/activate
```
-
Make sure your virtual environment is activated:
source venv/bin/activate # On macOS and Linux .\venv\Scripts\activate # On Windows
-
Run the Streamlit app:
streamlit run watsonx-app.py