pip install torch
pip install numpy
pip install Pillow
pip install opencv-python
pip install torchvision
1. You can change the last line of the run.py as you want.
For example : evaluate(input_path='files/1.JPG', output_path='files/1_black.jpg', mod='gold')
input_path : input image path
output_path : output image path
mode : color of the hair (There are 3 types : black, gold, red)
2. Run run.py
python run.py
This code is used for facial image parsing, which involves dividing an input image into different regions based on the different features present in the image. The code starts by importing necessary libraries including PyTorch, Numpy, PIL (Python Imaging Library), and OpenCV.
The similar function calculates the similarity of two colors by comparing their Green (G), Blue (B), and Red (R) values. If the difference between G, B or R value of both colors exceeds a certain threshold, then the function returns false, indicating that the colors are not similar. Otherwise, it calculates the ratio between each color component (G/B, B/R, and G/R) and appends them to an array ar. If the length of this array is less than 1, it means that the function was not able to calculate any ratios and returns False. Otherwise, it checks if the minimum value in the array is equal to zero, if yes, it returns False since division by zero is not defined. Finally, it calculates the brightness ratio (br) between the two colors and checks if the maximum ratio in the array divided by the minimum ratio is less than 1.05 and the brightness ratio is between 0.7 and 1.4, if yes, it returns true, indicating that the colors are similar.
The CFAR function determines the hair region from the face based on brightness and color percentage. The function takes six arguments- G, B, and R, which are the average hair color values, g, b, and r, which are the color values to be considered, pro, which is the hair percentage from the face, and bri, which is the hair brightness. It first creates an empty array ar and appends the ratio between each color component (G/g, B/b, and R/r) to this array, only if the g, b, and r values are greater than a certain threshold. If the length of this array is equal to 0, it means that the function was not able to calculate any ratios and returns True, indicating that the current pixel belongs to the hair region. Otherwise, it checks the brightness value and based on different parameters, returns True or False.
The vis_parsing_maps function takes input image, origin image, parsing annotation, stride as arguments and performs facial parsing.
vis_parsing_anno does face segmentation using face segmentation model. For example, if vis_parsing_anno[_x][_y] = 1 then it will be face region, if vis_parsing_anno[_x][_y] = 17 then it will be hair region. But that is not exact answer. The hair region will contain all of the hair region but it will contain another region including facial area around the scalp. So we need to apply hair detection on the hair region. So we applied CFAR function for detection exact hair region for that.
Finally, it changes the color of the detected hair region and saves the output image.
The evaluate function uses the pre-trained BiSeNet model loaded from a file and passes an input image to it for parsing using the vis_parsing_maps function. The output image is then saved to the specified output path. The mode argument specifies the type of colored mask applied to the output image- gold, red, or black.