face_recognition.sh
demonstrates face recognition pipeline on a video stream.
./face_recognition.sh
-i --input
is an optional flag, a path to an input source (video file / camera device)--print-gst-launch
prints the ready gst-launch command without running it--show-fps
is an optional flag that enables printing FPS on screen--network
to set which network to use. choose from [scrfd_10g, scrfd_2.5g], default is scrfd_10g"
scrfd_10g
: scrfd pre-trained on Hailo's dataset - https://github.com/hailo-ai/hailo_model_zoo/blob/master/hailo_model_zoo/cfg/networks/scrfd_10g.yamlscrfd_2.5g
: scrfd pre-trained on Hailo's dataset - https://github.com/hailo-ai/hailo_model_zoo/blob/master/hailo_model_zoo/cfg/networks/scrfd_2.5g.yamlarcface_mobilefacenet
: https://github.com/hailo-ai/hailo_model_zoo/blob/master/hailo_model_zoo/cfg/networks/arcface_mobilefacenet.yaml
cd $TAPPAS_WORKSPACE/apps/h8/gstreamer/general/face_recognition/
./face_recognition.sh
The output should look like:
The pipeline is divided to 4 steps:
- Face detection and landmarks:
- Detect faces and predict the locations of key facial landmarks (such as eyes, nose, and mouth), in the video stream. The scrfd_10g network is more accurate but computationally heavier, while the scrfd_2.5g network is less accurate but more lightweight, providing better performance.
- Face alignment:
- This step involves using the detected landmarks and the original video frame to compute an affine transformation that aligns the face with a predefined destination matrix. This ensures that the face is consistently positioned in the same way for the next step in the pipeline.
- Embedding matrix:
- Run Arcface network to generate an embedding matrix for each aligned face. An embedding is a compact representation of the face that captures its unique characteristics. This embedding can then be compared to other embeddings to determine the similarity between faces.
- Gallery:
- Use the generated embeddings to find the closest matching face in the local database (named "Local Gallery" and stored in a JSON file). This allows the application to identify the person in the video stream.
The local gallery file face_recognition_local_gallery.json
is stored under apps/h8/gstreamer/general/face_recognition/resources/gallery
directory.
It contains the embeddings of the faces.
To add faces to the gallery, you can use the save_face.sh
script.
cd $TAPPAS_WORKSPACE/apps/h8/gstreamer/general/face_recognition/
./save_faces.sh
Options:
--network
to set which network to use. choose from [scrfd_10g, scrfd_2.5g], default is scrfd_10g--clean
to clean the local gallery file enitrely
The script goes over the .png
files in resources/faces
directory, and saves each face into the gallery.
The name of the face is determined by the file name.
To use your own video sources and faces, add your images to the resources/faces
directory and remove the original ones.
Make sure to use .png
format image files and a file name including the name of the person.
Also use --clean option to order the script to clean the gallery file before saving the new faces.
Note
It is recommended to first read the Retraining TAPPAS Models page.
Retraining Dockers (available on Hailo Model Zoo), can be used to replace the following models with ones that are trained on your own dataset:
scrfd_10g
- No retraining docker is available.
- Post process CPP file edit update post-processing:
- Update face_detection.cpp
(
scrfd()
function) with your new parameters, then recompile to createlibface_detection_post.so
- Update face_detection.cpp
(
scrfd_2.5g
- No retraining docker is available.
- Post process CPP file edit update post-processing:
- Update face_detection.cpp
(
scrfd()
function) with your new parameters, then recompile to createlibface_detection_post.so
- Update face_detection.cpp
(
arcface_mobilefacenet
- Retraining docker
- TAPPAS changes to replace model:
- Update HEF_PATH on the .sh file
- Update arcface.cpp
with your new parameters, then recompile to create
libface_recognition_post.so