  • Tongji University
  • Shanghai, China

Starred repositories

This repo takes the initial step towards leveraging text learning for online action detection without explicit human supervision.

1 Updated Oct 28, 2024

Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos

Python 9 Updated Sep 9, 2024
Python 108 19 Updated Jun 27, 2021

A deep metric learning approach for action segmentation

Python 6 1 Updated Nov 15, 2023

LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets

Python 32 1 Updated Sep 30, 2024

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 695 46 Updated Nov 4, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,031 91 Updated May 8, 2024

This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.

Python 109 9 Updated Apr 25, 2024

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

Python 72 5 Updated Feb 9, 2024

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 3,814 345 Updated Oct 7, 2024

Code for Quiet-STaR

Python 632 87 Updated Aug 21, 2024

Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)

Python 39 2 Updated Aug 8, 2024

Chinese documentation for Hugging Face Transformers

Python 169 20 Updated Nov 8, 2023

Official PyTorch implementation of CODA-LM (https://arxiv.org/abs/2404.10595)

Python 66 2 Updated Nov 3, 2024

[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering

HTML 852 53 Updated Oct 9, 2024

A VideoQA dataset based on the videos from ActivityNet

Python 67 9 Updated Nov 22, 2020

✨✨Latest Advances on Multimodal Large Language Models

12,477 797 Updated Oct 29, 2024

Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.

Python 3,051 271 Updated Oct 16, 2024

For the paper "Learning Discriminative Action Representations in Videos via Embedding Distance Correlation"

1 Updated Sep 13, 2024

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 5,061 512 Updated Nov 4, 2024

PyTorch implementation of Depthwise Separable Convolution

Python 11 Updated Aug 28, 2022

Long context evaluation for large language models

Python 185 15 Updated Nov 4, 2024

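Free ChatGPT API Key: a free ChatGPT API with GPT-4 API support (free). A free ChatGPT forwarding API usable within mainland China, with direct connection and no proxy required. It can be paired with software and plugins such as ChatBox, greatly reducing API usage costs, and allows unrestricted chatting from within China.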

Python 23,954 1,826 Updated Oct 29, 2024

This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)

Python 123 5 Updated Sep 9, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,077 2,209 Updated Aug 12, 2024

Implementation of Depthwise Separable Convolution (pytorch)

Python 70 7 Updated Mar 11, 2020

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,961 157 Updated Oct 31, 2024