YOLO-Streaming

This repository documents how to push live video streams that have been processed by ultra-lightweight detection networks. The general pipeline is: OpenCV opens the camera on a board (such as a Raspberry Pi), each captured frame is passed to an ultra-lightweight network such as Yolo-Fastest, YOLOv4-tiny, or YOLOv5s (ONNX), and the processed frames are then pushed to a web page with the lightweight Flask framework. With the faster models this keeps the stream close to real time; see the run times listed below.

Requirements

Install the following packages first (for dnn):

Linux, macOS, or Windows
python >= 3.6.0
opencv-python >= 4.2.x
flask >= 1.0.0

Install the following packages first (for ncnn):

Linux, macOS, or Windows
Visual Studio 2019
cmake-3.16.5
protobuf-3.4.0
opencv-3.4.0
vulkan-1.14.8

Inference

YOLOv3-Fastest: https://github.com/dog-qiuqiu/Yolo-Fastest Model: Yolo-Fastest-1.1-xl

Equipment	Computing backend	System	Framework	input_size	Run time
Raspberry Pi 3B	4xCortex-A53	Linux (arm64)	dnn	320	89ms
Intel	Core i5-4210	Windows 10 (x64)	dnn	320	21ms

YOLOv4-Tiny: https://github.com/AlexeyAB/darknet Model: yolov4-tiny.weights

Equipment	Computing backend	System	Framework	input_size	Run time
Raspberry Pi 3B	4xCortex-A53	Linux (arm64)	dnn	320	315ms
Intel	Core i5-4210	Windows 10 (x64)	dnn	320	41ms

YOLOv5s-onnx: https://github.com/ultralytics/yolov5 Model: yolov5s.onnx

Equipment	Computing backend	System	Framework	input_size	Run time
Raspberry Pi 3B	4xCortex-A53	Linux (arm64)	dnn	320	673ms
Intel	Core i5-4210	Windows 10 (x64)	dnn	320	131ms
Raspberry Pi 3B	4xCortex-A53	Linux (arm64)	ncnn	160	716ms
Intel	Core i5-4210	Windows 10 (x64)	ncnn	160	197ms

NanoDet: https://github.com/RangiLyu/nanodet Model: nanodet.onnx

Equipment	Computing backend	System	Framework	input_size	Run time
Raspberry Pi 3B	4xCortex-A53	Linux (arm64)	dnn	320	113ms
Intel	Core i5-4210	Windows 10 (x64)	dnn	320	23ms
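
To make the run times above easier to compare, they can be converted into an upper bound on frames per second. The sketch below only does that arithmetic on the dnn rows of the tables; the helper name fps_upper_bound is hypothetical, and the real streaming frame rate will be lower because capture, drawing, and JPEG encoding also take time.

```python
# Run times (ms per frame) copied from the dnn rows of the tables above.
RUN_TIME_MS = {
    "Yolo-Fastest-1.1-xl": {"Raspberry Pi 3B": 89, "Core i5-4210": 21},
    "yolov4-tiny": {"Raspberry Pi 3B": 315, "Core i5-4210": 41},
    "yolov5s.onnx": {"Raspberry Pi 3B": 673, "Core i5-4210": 131},
    "nanodet.onnx": {"Raspberry Pi 3B": 113, "Core i5-4210": 23},
}


def fps_upper_bound(run_time_ms: float) -> float:
    """Frames per second if inference were the only per-frame cost."""
    return 1000.0 / run_time_ms


# Print one line per (model, device) pair.
for model, devices in RUN_TIME_MS.items():
    for device, ms in devices.items():
        print(f"{model:20s} {device:16s} {ms:4d} ms  ~{fps_upper_bound(ms):.1f} FPS")
```

By this estimate, Yolo-Fastest and NanoDet are the only models in the tables that stay near or above 10 FPS on the Raspberry Pi 3B.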

More benchmarks will be added.

Demo

I have tested this demo on Windows, macOS, and Linux, and it works on all three.

Run v3_fastest.py

Image inference: python v3_fastest.py --image dog.jpg
Video inference: python v3_fastest.py --video test.mp4
Webcam inference: python v3_fastest.py --fourcc 0

Run v4_tiny.py

Image inference: python v4_tiny.py --image person.jpg
Video inference: python v4_tiny.py --video test.mp4
Webcam inference: python v4_tiny.py --fourcc 0

Run v5_dnn.py

Image inference: python v5_dnn.py --image person.jpg
Video inference: python v5_dnn.py --video test.mp4
Webcam inference: python v5_dnn.py --fourcc 0

Run NanoDet.py

Image inference: python NanoDet.py --image person.jpg
Video inference: python NanoDet.py --video test.mp4
Webcam inference: python NanoDet.py --fourcc 0
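
All four scripts take the same three flags. As a rough illustration of how such a command line could be parsed, here is a minimal argparse sketch. Only the flag names --image, --video, and --fourcc come from the commands above; the functions build_parser and describe_source, and the choice to make the flags mutually exclusive, are assumptions and may differ from what the scripts actually do.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Parser accepting exactly one input source (assumed behaviour)."""
    parser = argparse.ArgumentParser(description="Run a detector on one input source")
    source = parser.add_mutually_exclusive_group(required=True)
    source.add_argument("--image", help="path to an image, e.g. dog.jpg")
    source.add_argument("--video", help="path to a video file, e.g. test.mp4")
    source.add_argument("--fourcc", type=int, help="camera index, e.g. 0")
    return parser


def describe_source(args: argparse.Namespace) -> str:
    """Return a short label for the selected input source."""
    if args.image is not None:
        return f"image:{args.image}"
    if args.video is not None:
        return f"video:{args.video}"
    # Compare against None above so that camera index 0 is not treated as "unset".
    return f"camera:{args.fourcc}"


# Example: parse an explicit argument list instead of sys.argv.
example = build_parser().parse_args(["--image", "dog.jpg"])
print(describe_source(example))
```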

Run app.py (push streaming online)

v3-fastest: python app.py --model v3_fastest
v4-tiny: python app.py --model v4_tiny
v5-dnn: python app.py --model v5_dnn
NanoDet: python app.py --model NanoDet

⚡ Please note: the device viewing the stream must be on the same LAN as the machine running app.py.
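
For readers unfamiliar with pushing frames to a browser, one common technique is an MJPEG stream: each JPEG-encoded frame is sent as one part of a multipart/x-mixed-replace HTTP response. The sketch below shows only the framing of those parts. It is an assumption about how such a stream can be built, not a copy of app.py; the names BOUNDARY and mjpeg_parts are hypothetical.

```python
from typing import Iterable, Iterator

# Assumed boundary name; it must match the boundary in the Content-Type header.
BOUNDARY = b"frame"


def mjpeg_parts(jpeg_frames: Iterable[bytes]) -> Iterator[bytes]:
    """Wrap each JPEG-encoded frame as one part of a multipart/x-mixed-replace body."""
    for jpeg in jpeg_frames:
        header = b"--" + BOUNDARY + b"\r\nContent-Type: image/jpeg\r\n\r\n"
        yield header + jpeg + b"\r\n"


# Example with placeholder bytes standing in for real JPEG data.
for part in mjpeg_parts([b"<jpeg 1>", b"<jpeg 2>"]):
    print(part)
```

In a Flask view, a generator like this would typically be returned as Response(mjpeg_parts(frames), mimetype="multipart/x-mixed-replace; boundary=frame"), with each frame encoded via cv2.imencode(".jpg", frame). For other devices on the LAN to reach the page, the Flask server also has to listen on an address they can reach, for example app.run(host="0.0.0.0").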

Demo Effects

Run v3_fastest.py

image→video→capture→push stream

Run v4_tiny.py

image→video→capture→push stream

This model still needs optimization; a quantized version will be added later. To be updated...

Run v5_dnn.py

image(473 ms / Inference Image / Core i5-4210)→video→capture(213 ms / Inference Image / Core i5-4210)→push stream

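2021-04-26 note: Interestingly, when the v5s model is called through ONNX + dnn, inference on a single image takes about twice as long as processing a camera frame. I looked at this for a long time and could not find the cause. If anyone reading this can look at the code and point out the problem, I would be grateful.

2021-05-01 update: Today I found the cause. v5_dnn.py had a feature that drew the inference time onto each frame (cv2.putText(frame, "TIME: " + str(localtime), (8, 30), cv2.FONT_HERSHEY_SIMPLEX, 1, (0, 0, 0), 2)), and this feature accounted for about 2/3 of the per-frame post-processing time (roughly 50-80 ms per frame). It has been removed from all later versions and replaced with terminal output, which brought the per-frame time down from 190 ms to 130 ms. A surprisingly large difference.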
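
A slowdown like this is easier to spot when each stage of the loop is timed separately and reported in the terminal. The following is a minimal sketch of that idea using only the standard library; the names timed, stage_totals, and report are hypothetical, and the time.sleep calls are stand-ins for the real inference and drawing steps.

```python
import time
from collections import defaultdict
from contextlib import contextmanager

# Seconds accumulated per stage name.
stage_totals = defaultdict(float)


@contextmanager
def timed(stage: str):
    """Add the wall-clock time spent inside the with-block to stage_totals[stage]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        stage_totals[stage] += time.perf_counter() - start


def report(frames: int) -> str:
    """Average milliseconds per frame for every recorded stage."""
    return ", ".join(
        f"{name}: {1000 * total / frames:.1f} ms/frame"
        for name, total in stage_totals.items()
    )


FRAMES = 3
for _ in range(FRAMES):
    with timed("inference"):
        time.sleep(0.01)   # stand-in for the network forward pass
    with timed("draw"):
        time.sleep(0.002)  # stand-in for drawing boxes and text on the frame
print(report(FRAMES))
```

Printing the per-stage averages keeps the measurement itself out of the frame, so the timing code does not add drawing cost of its own.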

Supplement

This is a DNN repository that integrates current detection algorithms. You may ask: why call the models through DNN instead of simply cloning each framework with git? In practice, it is wiser to keep training and inference separate. When you deploy models on a customer's production line, the customer's platform only needs inference, not training. If you package the training code and its training environment for every set of models, you will run out of room after a few sets. As an example, compare two Docker images for the same version of yolov5: the complete code and dependencies take 6.06 GB, while the inference-only code and dependencies take 0.4 GB. The complete image uses as much space as about 15 inference images (6.06 / 0.4 ≈ 15).

Thanks

Other

Tutorial (in Chinese): https://blog.csdn.net/weixin_45829462/article/details/115806322

