Skip to content
View shenben's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report shenben

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results
Python 7 Updated Jul 18, 2024

Convert the cookies from Chrome's Application -> Storage -> Cookies, into the Netscape format accepted by youtube-dl

JavaScript 145 25 Updated Dec 19, 2021

code for AdaScale MLSys'19 https://proceedings.mlsys.org/book/275.pdf

Python 5 1 Updated Jun 24, 2020

A scalable inference server for models optimized with OpenVINO™

C++ 671 211 Updated Nov 4, 2024

2023年最新整理 c++后端开发,1000篇优秀博文,含内存,网络,架构设计,高性能,数据结构,基础组件,中间件,分布式相关

1,138 268 Updated Mar 17, 2023

The ffmpegcv is a ffmpeg backbone for open-cv like Video Reader and Writer

Python 166 25 Updated Sep 20, 2024

Intel developer staging area for unmerged upstream patch contributions to FFmpeg

97 33 Updated Nov 4, 2024

A small ffmpeg-based framebuffer media player

C 68 12 Updated Nov 2, 2024

SEPE-8K-Dataset

9 Updated Apr 26, 2024

Thoughts on Go performance optimization

10,682 596 Updated Jan 5, 2022

Code samples related to Intel(R) AMX

C 29 12 Updated Apr 8, 2024

A curated list of OpenVINO based AI projects

105 29 Updated Oct 24, 2024

GStreamer学习以及资料整理

C 40 7 Updated Dec 12, 2019

Efficient virtual system-on-chip on heterogeneous hardware

C 3 Updated Aug 31, 2024

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)

Python 79 11 Updated Jul 14, 2023

Rcmp: Reconstructing RDMA-based Memory Disaggregation via CXL

C++ 43 11 Updated Dec 26, 2023

Low-latency and memory-efficient paging over RDMA networks

C++ 9 Updated Dec 20, 2021

This is object detection demo using DLStreamer and OpenVINO to run on Intel® CPU and iGPU

Python 9 1 Updated Apr 29, 2024

Deployed YOLOX to DL Streamer.

Python 2 1 Updated Jun 6, 2022

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 10,326 1,020 Updated Nov 3, 2024

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,134 211 Updated Oct 8, 2024

Fast Inference of MoE Models with CPU-GPU Orchestration

Python 170 16 Updated Oct 30, 2024

A benchmark framework for decision forest inferences

Python 10 1 Updated Jan 19, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,770 192 Updated Nov 1, 2024

LLM Inference benchmark

Python 347 28 Updated Jul 23, 2024

Supplementary material to paper "Extract-Transform-Load for Video Streams"

Python 6 1 Updated Apr 21, 2024

EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions

Jupyter Notebook 6 1 Updated Jun 24, 2024

(Research) reduce the use of expensive object detectors when searching large video repos via clever sampling algorithms that account for redundancy in video

Jupyter Notebook 3 Updated Nov 22, 2021

Perceptual video quality assessment based on multi-method fusion.

Python 4,610 752 Updated Oct 14, 2024
Jupyter Notebook 190 15 Updated Sep 4, 2024
Next