Dantone

Data Scraping Project

Main Scripts

Scrapes key ratio data from morningstar and saves in a json format. Refer to notebooks/MstarDataUsage.ipynb for data usage examples.

Key points:

Specify the list of tickers to scrape in input/Stocklist.xlsx File path/type can be modified in ScrapeData.py
Monitor logs/logs.txt for status updates, errors.
On completion of script, #scraped, #pending, #errors printed in logs/progress. Results saved in output/Results-[timestamp].json
logs/pending contains list of tickers where error occured. ScrapeData.py first checks for pending stocks in logs/pending before loading StockList.xlsx

Two key standalone functionalities:

accumulate_results - Accumulate all result json files in a given directory into a single json results file.
merge_attributes - Merge key stats and fin stats into a single json object/dict for all the stocks in a result file.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
adapter		adapter
constants		constants
input		input
logs		logs
notebooks		notebooks
portfolio		portfolio
read		read
scrape		scrape
screen		screen
test/data		test/data
utils		utils
write		write
.gitignore		.gitignore
README.md		README.md
Test.py		Test.py
__init__.py		__init__.py
chromedriver.exe		chromedriver.exe
merge.py		merge.py
requirements.txt		requirements.txt
scrape.py		scrape.py
screen.py		screen.py