Future plan to train the model #2

Conless · 2024-11-09T04:03:34Z

Hi, thanks for the great work! I'm wondering if there are any plans to train the model on some datasets in the near future. Since there are currently no released weights for this model architecture, it would be incredibly helpful if training could be done, even at a smaller scale. This would allow the community to experiment further and potentially build on this architecture.

VachanVY · 2024-11-09T13:12:48Z

Hi, yes I plan to train in it on the coco captions dataset, soon! Do you know any story generation datasets that contain both image and text modality (it should not be very large, just like we have MNIST for CV)?

Conless · 2024-11-09T13:25:09Z

Hi, yes I plan to train in it on the coco captions dataset, soon! Do you know any story generation datasets that contain both image and text modality (it should not be very large, just like we have MNIST for CV)?

Yes I have found a small one: https://huggingface.co/datasets/sil-ai/bloom-vist

VachanVY · 2024-11-10T08:01:20Z

@Conless, could you prepare a preprocessing script for the dataset (please refer to the README for how the inputs are arranged for the model) and send a PR? If you're okay with it, otherwise I can start working on it in a couple of days as I’m busy. :)

VachanVY self-assigned this Nov 9, 2024

VachanVY added the enhancement New feature or request label Nov 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Future plan to train the model #2

Future plan to train the model #2

Conless commented Nov 9, 2024

VachanVY commented Nov 9, 2024

Conless commented Nov 9, 2024

VachanVY commented Nov 10, 2024

Future plan to train the model #2

Future plan to train the model #2

Comments

Conless commented Nov 9, 2024

VachanVY commented Nov 9, 2024

Conless commented Nov 9, 2024

VachanVY commented Nov 10, 2024