Deep learning is evolving quickly. Important new developments are appearing daily. This group attempts to keep up by reading and discussing current deep learning literature. This meetup uses discussion among the participants to speed understanding of current research results. That requires that some participants read the paper before attending. Anyone is welcome to attend and listen without reading the paper. If nobody reads the paper the meeting will be short.
Paper for January 14, 2025:
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
https://arxiv.org/pdf/2410.23168
There are many YouTubes including by Yannic Kilcher:
https://www.youtube.com/watch?v=gfU5y7qCxF0
and Gabriel Mongaras:
https://www.youtube.com/watch?v=4lGgbkD6Z0I
Papers that we're reading, code that participants generate and other random stuff can be found at github site for the group.
https://github.com/davidmacmillan/DeepLearningStudyGroup