add comments
supercoderhawk committed Jan 17, 2020
1 parent 754716c commit 2f83ba1
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions wsdm_digg/data_process/raw_data_formatter.py
@@ -8,6 +8,10 @@


class RawDataFormatter(object):
    """
    1. Transform the raw candidate paper data and the train, validation, and test data into JSON Lines format.
    2. Extract the citation sentences that contain the citation indicator '[**##**]'.
    """
    nlp = spacy.load('en', disable=['ner', 'parser', 'textcat'])

    def __init__(self, dirname):
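The two steps named in the docstring can be illustrated with a minimal sketch. It is not the code from `raw_data_formatter.py`, whose body is not shown in this diff. The function names, the record field names (`id`, `text`, `citation_sentences`), and the regex-based sentence splitting are all assumptions made for illustration; only the indicator string '[**##**]' comes from the docstring.

```python
import json
import re

CITATION_INDICATOR = '[**##**]'  # marker quoted in the docstring


def extract_citation_sentences(text, indicator=CITATION_INDICATOR):
    """Return the sentences of `text` that contain `indicator`.

    Assumption: sentences are split with a naive regex on ., ! or ?
    followed by whitespace; the real class may split differently.
    """
    sentences = re.split(r'(?<=[.!?])\s+', text.strip())
    return [s for s in sentences if indicator in s]


def to_json_lines(records):
    """Serialize an iterable of dicts as JSON Lines (one object per line)."""
    return '\n'.join(json.dumps(r, ensure_ascii=False) for r in records)


# Hypothetical record; the field names are illustrative only.
record = {
    'id': 'demo-1',
    'text': 'Prior work studied ranking. '
            'BERT improves retrieval [**##**]. We extend it.',
}
record['citation_sentences'] = extract_citation_sentences(record['text'])
print(to_json_lines([record]))
```

Keeping one JSON object per line lets each record be read and parsed independently, which suits large train, validation, and test files.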
