Skip to content

For temporal attentionΒ #9

Open
Open
@rakesh-reddy95

Description

Thanks for the code. I want to know how we can integrate this with the temporal attention based models like tune-a-video to generate videos. As the svdiff will be trained for 2D image while in video generation we have an additional dimension for number of frames.

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions