Observe before Generate: Emotion-Cause aware Video Caption for Multimodal Emotion Cause Generation in Conversations
Fanfan Wang, Heqing Ma, Xiangqing Shen, Jianfei Yu*, Rui Xia*
This repository contains the code for ObG, a multimodal pipeline framework that first generates emotion-cause aware video captions (Observe) and then generates the abstractive emotion causes conditioned on those captions (Generate).
Multimodal Emotion Cause Generation in Conversations (MECGC) aims to generate the abstractive causes of given emotions based on the multimodal context. The ECGF dataset is constructed by manually annotating an abstractive cause for each emotion labeled in the existing ECF dataset.
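To make the input/output format concrete, here is a minimal sketch of what one instance looks like; the field names are illustrative, not the actual ECGF schema:

```python
# Hypothetical shape of one MECGC instance; the real ECGF files
# may use different field names and structure.
instance = {
    "conversation": [
        {"speaker": "A", "utterance": "...", "video": "dia1utt1.mp4"},
        {"speaker": "B", "utterance": "...", "video": "dia1utt2.mp4"},
    ],
    "emotion_utterance_index": 1,  # which utterance carries the emotion
    "emotion": "joy",
    # target output: an abstractive (free-form) cause, not a text span
    "cause": "B is happy because ...",
}
```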
conda env create -f environment.yml
conda activate obg
# install nlg-eval for evaluation
pip install git+https://github.com/Maluuba/nlg-eval.git
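# one-time download of the data files nlg-eval needs (see the nlg-eval README)
nlg-eval --setup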
Gemini-Pro-Vision is used to generate emotion-cause aware video captions, which serve as supervision for training ECCap. For the detailed instruction template, please refer to Figure 3 in our paper.
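A minimal sketch of this captioning step, assuming the google-generativeai Python SDK with sampled frames standing in for the clip; the prompt wording and the function name are placeholders, and the real instruction template is Figure 3 of the paper:

```python
import google.generativeai as genai
from PIL import Image

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-pro-vision")

def caption_clip(frame_paths, speaker, utterance, emotion):
    # Placeholder prompt; replace with the template from Figure 3.
    prompt = (
        f'{speaker} says "{utterance}" and feels {emotion}. '
        "Describe what is happening in the video that may cause this emotion."
    )
    # Sampled frames stand in for the video clip.
    frames = [Image.open(p) for p in frame_paths]
    response = model.generate_content([prompt] + frames)
    return response.text
```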
# set data_dir and output_dir in ECCap.sh, then train the caption model
bash ECCap.sh
# set data_dir and output_dir in CGM.sh, then train the cause generation model
bash CGM.sh
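At inference time, the two trained models run in the Observe-before-Generate order. A schematic sketch, with hypothetical names that are not the repository's actual API:

```python
# Schematic of the ObG pipeline at inference time; `eccap` and `cgm`
# denote the two trained models, and these names/signatures are
# hypothetical, not the repository's actual API.
def generate_cause(conversation, target_utterance, emotion, video_clip):
    # Observe: produce an emotion-cause aware caption for the video
    caption = eccap.generate(video_clip, conversation, emotion)
    # Generate: produce the abstractive cause, conditioned on the caption
    return cgm.generate(conversation, target_utterance, emotion, caption)
```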
@inproceedings{wang2024obg,
title={Observe before Generate: Emotion-Cause aware Video Caption for Multimodal Emotion Cause Generation in Conversations},
author={Wang, Fanfan and Ma, Heqing and Shen, Xiangqing and Yu, Jianfei and Xia, Rui},
booktitle={Proceedings of the 32nd ACM International Conference on Multimedia},
pages={},
year={2024}
}
@article{ma2024monica,
author={Ma, Heqing and Yu, Jianfei and Wang, Fanfan and Cao, Hanyu and Xia, Rui},
journal={IEEE Transactions on Affective Computing},
title={From Extraction to Generation: Multimodal Emotion-Cause Pair Generation in Conversations},
year={2024},
volume={},
number={},
pages={},
doi={10.1109/TAFFC.2024.3446646}
}
Our code benefits from VL-T5 and CICERO. We appreciate their valuable contributions.