sotu-db/nlp-libs/socialsent-frequent-words at master · taylorcate/sotu-db

History

Name		Name	Last commit message	Last commit date
parent directory ..
1850.tsv		1850.tsv
1860.tsv		1860.tsv
1870.tsv		1870.tsv
1880.tsv		1880.tsv
1890.tsv		1890.tsv
1900.tsv		1900.tsv
1910.tsv		1910.tsv
1920.tsv		1920.tsv
1930.tsv		1930.tsv
1940.tsv		1940.tsv
1950.tsv		1950.tsv
1960.tsv		1960.tsv
1970.tsv		1970.tsv
1980.tsv		1980.tsv
1990.tsv		1990.tsv
2000.tsv		2000.tsv
README.txt		README.txt

README.txt

#########################
SocialSent Sentiment Data
#########################

This directory contains historical English sentiment lexicons for all decades in the range 1850-2000. 
Each decade lexicon contains sentiment scores for the top-5000 words in that decade (excluding stop-words.)
See http://nlp.stanford.edu/projects/socialsent for links to the accompanying paper, with details on the algorithm, seeds words, and data sources.

All files are .tsv's of the form:

<word> <mean_sentiment> <std_sentiment>

where mean_sentiment is the averaged inferred sentiment across bootstrap-sampled SentProp runs 
and std_sentiment is the standard deviation of these samples.

SentProp was run with the following hyperparameters:

num nearest neighbors k=25
random walk beta=0.9
50 bootstrap samples of size 7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

socialsent-frequent-words

socialsent-frequent-words

README.txt

Files

socialsent-frequent-words

Directory actions

More options

Directory actions

More options

Latest commit

History

socialsent-frequent-words

Folders and files

parent directory

README.txt