socialsent-frequent-words
Folders and files
Name | Name | Last commit date | ||
---|---|---|---|---|
parent directory.. | ||||
######################### SocialSent Sentiment Data ######################### This directory contains historical English sentiment lexicons for all decades in the range 1850-2000. Each decade lexicon contains sentiment scores for the top-5000 words in that decade (excluding stop-words.) See http://nlp.stanford.edu/projects/socialsent for links to the accompanying paper, with details on the algorithm, seeds words, and data sources. All files are .tsv's of the form: <word> <mean_sentiment> <std_sentiment> where mean_sentiment is the averaged inferred sentiment across bootstrap-sampled SentProp runs and std_sentiment is the standard deviation of these samples. SentProp was run with the following hyperparameters: num nearest neighbors k=25 random walk beta=0.9 50 bootstrap samples of size 7