-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Help regarding dataset. #3
Comments
Hi,I will give you a simple explain about this 4 files. Fist of all, this model is for single turn dialogue, which means we have two sentences here, One is source sentence and the other is target sentence, just like ask and answer.I split the dialogues into 2 files, they are source.txt and target.txt. The i-th line in source.txt is corresponding to the i-th line in target.txt and the same for the category.txt and the choice.txt. 1.category: target sentence emotion category 2.choice: target sentence emotional word annotation 3.source: source sentence 4.target: target sentence |
Hey thank you so much for the help, now I'm able to understand the data atleast. |
I haven't use his emotion classifier so I'm not quite sure. However, I think you can try his module, it seems works well. |
So what module do you suggest me to use? because that module does not create emotional word dictionary or uses it in any way. |
OK, what you said means that you don't need a emotion classifier. So, now you just need an English emotional word dictionary to get the file choice.txt. I think you can find one on the Internet, since I get one Chinese emotional word dictionary using Google search. |
Okay thanks, will look for it, and I guess after that it's just searching for a word, if emotion word exists it is 1, else 0. |
Sorry, I can't provide you my classifier. You can use this |
I would like to know more about how the text data available on NTCIR Short Text Conversation Task(STC-3) Chinese Emotional Conversation Generation (CECG) Subtask (http://coai.cs.tsinghua.edu.cn/hml/challenge/dataset_description/) was processed to the 4 files:
category: target sentence emotion category
choice: target sentence emotional word annotation
source: source sentence
target: target sentence
For which i looked at https://github.com/AaronYALai/Seq2seqAttn_ECM as well.
Hence, more information or guidance would be a great help. since this will help me in processing English dataset as well.
Regards
The text was updated successfully, but these errors were encountered: