generated from CS601-F21/Side-Project
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathWork Log.txt
56 lines (37 loc) · 1.41 KB
/
Work Log.txt
1
Side project work log: total about 51-52hours total hours on each item:Design the work flowHours: 1read in yelp review and business info in jsonHours: 3Chanlleges: json has nested format make it hard to read in filesÊinterveted index addÊhours: 3challenges: while adding the term to interted index, we are pre-calculating sentiment typeprocess yelp data:Hours: 5challenges: the raw data has mixed business and their reviews, but I only need restaurant data and business info, so I used python to filter and pre-process business data from the category and then filter out the corresponding reviewssentiment analysis:hours: 6challenges: research on sentiment analysis and tried but its too slowredesign to boost performance:hours: 3redesign the work flow and made changes:hours: 5Challenges: the sentiment analysis script is slow to process all sentences since I was pre-calculating them, so I re-design it in a way i can compute sentiment analysis until I know the term instead of computing it upfrontProcess data againHours: 3challenges: noticed the data doesnt contains resturants in CA, so I picked portland and also filter out the data 10 years ago since they are not relatable any moreintegrate html with code:Hours: 3Êjupyer notebook:Hours: 6Spotcheck on Sentiment analysis:Hour: 3Debugging:Hour: 4design html logo and font:Hour: 2clean code + refactoringHours: 4ÊÊÑÑtotal Ê51~52 hours