makesense.ai is a free-to-use online tool for labeling photos. Because it runs in the browser, it requires no complicated installation: just visit the website and you are ready to go. It also doesn't matter which operating system you're running; we do our best to be truly cross-platform. It is well suited to small computer vision deep learning projects, making dataset preparation much easier and faster. Prepared labels can be downloaded in one of several supported formats. The application is written in TypeScript and is based on the React/Redux duo.
You can find out more about our tool from the newly released documentation.
For AI to be free we need not just Open Source, but also a strong Open Data movement.
Andrew Ng
Figure 1. Basic version of the application - without AI support
makesense.ai strives to significantly reduce the time we have to spend on labeling photos. To achieve this, we are going to use many different AI models that will be able to give you recommendations as well as automate repetitive and tedious activities.
- An SSD model pretrained on the COCO dataset, which does some of the work of drawing bboxes on photos for you and, in some cases, also suggests a label.
- A PoseNet model, a vision model that estimates the pose of a person in an image or video by locating key body joints.
In the future, we also plan to add, among other things, models that classify photos and models that detect whole faces as well as characteristic facial features. The engine that drives our AI functionality is TensorFlow.js, the JavaScript version of TensorFlow, a widely used framework for training neural networks. This choice allows us not only to speed up your work but also to protect the privacy of your data: unlike other commercial and open source tools, makesense.ai does not need to transfer your photos to a server. This time AI comes to your device!
Figure 2. SSD model - allows you to detect multiple objects, speeding up the bbox labeling process
Figure 3. PoseNet model - allows you to detect people's poses in photos, automating point labeling in some use cases
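As a rough illustration of how detector output can become bbox suggestions, here is a minimal sketch. It assumes predictions shaped like the objects returned by the `detect` call of the `@tensorflow-models/coco-ssd` package (`bbox` as `[x, y, width, height]` in pixels, plus `class` and `score`). The `RectSuggestion` type, the `toRectSuggestions` helper, and the `0.5` default threshold are hypothetical names and values chosen for this example, not the application's actual code.

```typescript
// One prediction, mirroring the shape documented for coco-ssd's detect():
// bbox is [x, y, width, height] in image pixels.
interface Detection {
  bbox: [number, number, number, number];
  class: string;
  score: number;
}

// Hypothetical suggestion type used only in this sketch.
interface RectSuggestion {
  x: number;
  y: number;
  width: number;
  height: number;
  labelName: string;
}

// Keep confident detections and clip each box to the image bounds.
function toRectSuggestions(
  detections: Detection[],
  imageWidth: number,
  imageHeight: number,
  minScore: number = 0.5 // assumed threshold, tune for your data
): RectSuggestion[] {
  return detections
    .filter((d) => d.score >= minScore)
    .map((d) => {
      const [x, y, w, h] = d.bbox;
      const left = Math.max(0, x);
      const top = Math.max(0, y);
      const right = Math.min(imageWidth, x + w);
      const bottom = Math.min(imageHeight, y + h);
      return {
        x: left,
        y: top,
        width: right - left,
        height: bottom - top,
        labelName: d.class,
      };
    })
    // Drop boxes that fall entirely outside the image.
    .filter((r) => r.width > 0 && r.height > 0);
}
```

Low-scoring predictions are dropped rather than shown, on the assumption that a wrong suggestion costs more time to correct than drawing the box by hand.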
```bash
# clone repository
git clone https://github.com/SkalskiP/make-sense.git

# navigate to main dir
cd make-sense

# install dependencies
npm install

# serve with hot reload at localhost:3000
npm start
```
To run the application properly on your machine, npm 6.x.x and Node.js v12.x.x are required. More information about this problem is available in issue #16.
```bash
# Build Docker Image
docker build -t make_sense docker/

# Run Docker Image as Service
docker run -dit -p 3000:3000 --restart=always --name=make_sense make_sense

# Get Docker Container IP
docker inspect -f '{{range .NetworkSettings.Networks}}{{.IPAddress}}{{end}}' make_sense
# Go to `<DOCKER_CONTAINER_IP>:3000`

# Get Docker Container Logs
docker logs make_sense
```
Functionality | Context | Mac | Windows / Linux |
---|---|---|---|
Polygon autocomplete | Editor | Enter | Enter |
Cancel polygon drawing | Editor | Escape | Escape |
Delete currently selected label | Editor | Backspace | Delete |
Load previous image | Editor | ⌥ + Left | Ctrl + Left |
Load next image | Editor | ⌥ + Right | Ctrl + Right |
Zoom in | Editor | ⌥ + + | Ctrl + + |
Zoom out | Editor | ⌥ + - | Ctrl + - |
Move image | Editor | Up / Down / Left / Right | Up / Down / Left / Right |
Select Label | Editor | ⌥ + 0-9 | Ctrl + 0-9 |
Exit popup | Popup | Escape | Escape |
Table 1. Supported keyboard shortcuts
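The platform split in Table 1 (Option on Mac, Ctrl elsewhere) can be sketched as a small lookup. This is an illustrative sketch covering only a subset of the shortcuts, not the application's actual handler. The `KeyPress` type, the `EditorAction` names, and `resolveEditorAction` are hypothetical; `key`, `altKey`, and `ctrlKey` follow the standard `KeyboardEvent` fields.

```typescript
// Minimal subset of the standard KeyboardEvent fields used in this sketch.
interface KeyPress {
  key: string;
  altKey: boolean; // Option key on Mac
  ctrlKey: boolean;
}

// Hypothetical action names for a subset of the shortcuts in Table 1.
type EditorAction =
  | "autocompletePolygon"
  | "cancelPolygon"
  | "deleteLabel"
  | "previousImage"
  | "nextImage";

// Map a key press to an editor action, or null if the sketch does not cover it.
function resolveEditorAction(e: KeyPress, isMac: boolean): EditorAction | null {
  // Mac uses Option as the modifier, Windows / Linux use Ctrl.
  const modifierHeld = isMac ? e.altKey : e.ctrlKey;
  if (modifierHeld) {
    if (e.key === "ArrowLeft") return "previousImage";
    if (e.key === "ArrowRight") return "nextImage";
    return null;
  }
  switch (e.key) {
    case "Enter":
      return "autocompletePolygon";
    case "Escape":
      return "cancelPolygon";
    case "Backspace":
      return isMac ? "deleteLabel" : null; // Mac deletes with Backspace
    case "Delete":
      return isMac ? null : "deleteLabel"; // Windows / Linux delete with Delete
    default:
      return null;
  }
}
```

In a browser, such a function could be called from a `keydown` listener, passing the event and a platform flag. Zoom and label selection are left out because the `key` value reported for symbol and digit keys can change when Option is held.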
| | CSV | YOLO | VOC XML | VGG JSON | COCO JSON | PIXEL MASK |
---|---|---|---|---|---|---|
Point | β | β | β | β | β | β |
Line | β | β | β | β | β | β |
Rect | β | β | β | β | β | β |
Polygon | β | β | β | β | β | β |
Label | β | β | β | β | β | β |
Table 2. The matrix of supported label export formats, where:
- β - supported format
- β - not yet supported format
- β - format does not make sense for a given label type
You can find examples of export files along with a description and schema on our Wiki.
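To make one of these formats concrete, here is a minimal sketch of turning a rectangle label into a line of the widely used YOLO text format: a class index followed by the box center and size, each normalized by the image dimensions. The `Rect` type and the `rectToYoloLine` helper are hypothetical names for this example, and the six-decimal precision is an assumption. The exact files produced by makesense.ai are described on the Wiki.

```typescript
// Rectangle in image pixels, with (x, y) at the top-left corner.
interface Rect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Build "<class> <x_center> <y_center> <width> <height>" with values in [0, 1].
function rectToYoloLine(
  rect: Rect,
  classIndex: number,
  imageWidth: number,
  imageHeight: number
): string {
  const xCenter = (rect.x + rect.width / 2) / imageWidth;
  const yCenter = (rect.y + rect.height / 2) / imageHeight;
  const width = rect.width / imageWidth;
  const height = rect.height / imageHeight;
  // Six decimals is an assumed precision for this sketch.
  const values = [xCenter, yCenter, width, height].map((v) => v.toFixed(6));
  return [classIndex, ...values].join(" ");
}

// Example: a 100x200 box at (50, 100) in a 400x400 image, class index 2.
const line = rectToYoloLine({ x: 50, y: 100, width: 100, height: 200 }, 2, 400, 400);
// line is "2 0.250000 0.500000 0.250000 0.500000"
```

Normalizing by the image size keeps the label valid if the image is later resized, which is one reason the format is convenient for training pipelines.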
| | CSV | YOLO | VOC XML | VGG JSON | COCO JSON | PIXEL MASK |
---|---|---|---|---|---|---|
Point | β | β | β | β | β | β |
Line | β | β | β | β | β | β |
Rect | β | β | β | β | β | β |
Polygon | β | β | β | β | β | β |
Label | β | β | β | β | β | β |
Table 3. The matrix of supported label import formats
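Importing works in the opposite direction: an annotation line has to be mapped back to pixel coordinates for the image it belongs to. The sketch below parses a single line in the common YOLO text layout (class index, then normalized center and size). `PixelRect`, `ParsedYoloLabel`, and `parseYoloLine` are hypothetical names used only here, and returning `null` for malformed lines is a design choice of this example, not documented application behavior.

```typescript
// Rectangle in image pixels, with (x, y) at the top-left corner.
interface PixelRect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Hypothetical result type for this sketch.
interface ParsedYoloLabel {
  classIndex: number;
  rect: PixelRect;
}

// Parse "<class> <x_center> <y_center> <width> <height>" (normalized values).
// Returns null when the line does not have exactly five numeric fields.
function parseYoloLine(
  line: string,
  imageWidth: number,
  imageHeight: number
): ParsedYoloLabel | null {
  const parts = line.trim().split(/\s+/);
  if (parts.length !== 5) return null;
  const [classIndex, xCenter, yCenter, w, h] = parts.map(Number);
  if (!Number.isInteger(classIndex)) return null;
  if ([xCenter, yCenter, w, h].some((v) => !Number.isFinite(v))) return null;
  const width = w * imageWidth;
  const height = h * imageHeight;
  return {
    classIndex,
    rect: {
      x: xCenter * imageWidth - width / 2,
      y: yCenter * imageHeight - height / 2,
      width,
      height,
    },
  };
}
```

Because the stored values are relative, the image dimensions must be known before the label can be drawn, so the images need to be loaded before their annotations are parsed.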
We don't store your images, because we don't send them anywhere in the first place.
Our application is being actively developed. Check out the improvements we are planning for the near future on our Wiki. If you have an idea for a new feature, please reach out to us on Twitter or Gitter, or create an issue describing your concept.
If you are just starting your adventure with deep learning and would like to learn and create something cool along the way, makesense.ai can help you with that. Use our bounding box labeling functionality to prepare a dataset, then use it to train your first state-of-the-art object detection model. Follow the instructions and examples but, most importantly, free your creativity.
Figure 4. Detection of players moving around the basketball court, based on YouTube-8M dataset.
Feel free to file issues or pull requests.
Please cite Make Sense in your publications if this is useful for your research. Here is an example BibTeX entry:
@MISC{make-sense,
author = {Piotr Skalski},
title = {{Make Sense}},
howpublished = "\url{https://github.com/SkalskiP/make-sense/}",
year = {2019},
}
This project is licensed under the GPL-3.0 License - see the LICENSE file for details. Copyright © 2019 Piotr Skalski.