“Apaji’s vision was to educate girls; why not extend it to the specially-abled?”
-> Some people with speech and hearing impairments cannot easily express their feelings and emotions, and so struggle to communicate with others. They rely on sign language, which most people find hard to understand. As a result, even capable individuals miss out on opportunities.
-> Thus, we came up with the idea of ‘DivyaAnuvadak’, which takes a hand gesture as input and translates it into equivalent text that people who do not know sign language can understand. This could further help both our community and our university build a startup to educate specially-abled girls, which would eventually boost their self-esteem and help them become independent.
-> We are trying to widen the vision of Apaji (the founder of Banasthali Vidyapith University), which was to educate every girl, so that it includes specially-abled girls. This would ultimately help these Shantas shine too.
-> The main problem the project aims to solve is the communication gap between people with speech or hearing impairments and people without them.
-> It converts Indian Sign Language (ISL) into the equivalent text.
-> DivyaAnuvadak serves people with speech and hearing impairments by converting their gestures into the equivalent text.
-> The predefined gestures are stored in memory and compared with the gesture performed by the user, which is then converted into its corresponding text.
- Capture the gestures made by the user through a web camera.
- Recognize the gestures using the TensorFlow Object Detection API by identifying features.
- Pre-process the captured data.
- Feed the data to the model.
- Predict the word using the trained model.
- Select the word with the highest probability.
- Convert the word into text and display it on the screen.
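The last two steps above (selecting the most likely word and turning it into text) can be sketched in plain Python. This is only an illustration, not the project's actual code: the function name `best_detection`, the `min_score` threshold of 0.5, and the example labels are assumptions, and the inputs are assumed to be plain lists of class ids and confidence scores like those an object detection model typically returns.

```python
def best_detection(classes, scores, label_names, min_score=0.5):
    """Return the text of the highest-scoring detection, or None.

    classes     -- list of predicted class ids, one per detection
    scores      -- list of confidence scores, aligned with `classes`
    label_names -- dict mapping class id to the word to display
    min_score   -- hypothetical cut-off below which nothing is shown
    """
    if not scores:
        return None  # the model detected nothing in this frame
    # Index of the detection with the highest confidence score.
    best = max(range(len(scores)), key=lambda i: scores[i])
    if scores[best] < min_score:
        return None  # too uncertain to display a word
    return label_names[classes[best]]


if __name__ == "__main__":
    # Hypothetical label ids and gesture names, for illustration only.
    names = {1: "Hello", 2: "Yes", 3: "No"}
    print(best_detection([2, 1, 3], [0.31, 0.92, 0.10], names))
```

Returning `None` for low-confidence frames is one possible design choice; it keeps the screen from flickering between words when no gesture is clearly visible.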
- cv2
- os
- time
- uuid
- TensorFlow
- Python
- OpenCV (Open Source Computer Vision Library)
- Python Notebook
- Visual Studio Code
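A minimal sketch of how the `cv2`, `os`, `time`, and `uuid` modules listed above might fit together to collect webcam images for training. The folder name `collected_images`, the number of images per label, the pause length, and the helper names are all assumptions made for illustration; the project's own collection script may differ.

```python
import os
import time
import uuid


def image_path(root, label):
    """Build a unique file path of the form <root>/<label>/<label>.<uuid>.jpg."""
    return os.path.join(root, label, f"{label}.{uuid.uuid1()}.jpg")


def collect_images(labels, root="collected_images", per_label=15, pause=2.0):
    """Capture `per_label` webcam frames for each label and save them to disk."""
    import cv2  # imported here so image_path() works without OpenCV installed

    cap = cv2.VideoCapture(0)  # device 0 is assumed to be the webcam
    try:
        for label in labels:
            os.makedirs(os.path.join(root, label), exist_ok=True)
            print(f"Collecting images for {label}")
            time.sleep(pause)  # time to get the hand into position
            for _ in range(per_label):
                ok, frame = cap.read()
                if not ok:
                    continue  # skip frames the camera failed to deliver
                cv2.imwrite(image_path(root, label), frame)
                time.sleep(pause)  # time to vary the gesture slightly
    finally:
        cap.release()  # always free the camera, even on error
```

Calling `collect_images(["hello", "yes", "no"])` (hypothetical gesture names) would save the frames into one sub-folder per label, which keeps the later labelling step organised.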
- Collect images for deep learning using a webcam and OpenCV.
- Label these images for sign language detection using the LabelImg tool.
- Split the images into training and testing datasets.
- Create the model directory for training and set up all the paths.
- Create a label map in which each gesture label has a unique id.
- Set up the TensorFlow Object Detection pipeline configuration.
- Create TFRecords using the TensorFlow API.
- Download a pretrained TensorFlow model from the TensorFlow Models repository.
- Copy the model configuration to the training folder.
- Update this configuration file for transfer learning.
- Use transfer learning to train the deep learning model.
- Train the model by executing the training command in the terminal.
- Finally, load the trained model from its checkpoints.
- Tada! Now the model can detect gestures in real time using OpenCV and convert them to the equivalent text.
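Two of the preparation steps above, splitting the images and creating the label map, can be sketched with the standard library alone. The function names, the 80/20 split, the fixed seed, and the example labels are assumptions for illustration. The label map text follows the `item { name ... id ... }` layout used by the TensorFlow Object Detection API, with ids starting at 1.

```python
import random


def label_map_pbtxt(labels):
    """Return label map text with one item per label and ids starting at 1."""
    items = []
    for idx, name in enumerate(labels, start=1):
        items.append(f"item {{\n  name: '{name}'\n  id: {idx}\n}}\n")
    return "".join(items)


def split_dataset(filenames, test_fraction=0.2, seed=42):
    """Shuffle the file names reproducibly and return (train, test) lists."""
    files = sorted(filenames)  # sort first so the result ignores input order
    random.Random(seed).shuffle(files)
    n_test = max(1, round(len(files) * test_fraction)) if files else 0
    return files[n_test:], files[:n_test]


if __name__ == "__main__":
    # Hypothetical gesture labels and image names, for illustration only.
    print(label_map_pbtxt(["Hello", "Yes", "No"]))
    train, test = split_dataset([f"img{i}.jpg" for i in range(10)])
    print(len(train), "training images,", len(test), "test images")
```

With LabelImg, each image normally has a matching annotation file, so in practice the annotation would be moved together with its image into the same train or test folder.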
- LabelImg - https://github.com/heartexlabs/labelImg
- TensorFlow Object Detection API Installation - https://tensorflow-object-detection-api-tutorial.readthedocs.io/en/latest/install.html
- TensorFlow Models - https://github.com/tensorflow/models