This project involves the development of two Pix2Pix models designed to translate between infrared (IR) and visible light images. The primary objective was to explore the potential correlation between these two types of images, hypothesizing that certain visible characteristics (like bright spots or red shades) might correspond to thermal signatures in the IR spectrum.
The underlying hypothesis was that there exists a discernible pattern or fundamental rule connecting IR images with their visible counterparts. For example, it was speculated that areas of heat in IR images might directly correlate with specific color intensities or patterns in visible light images.
The dataset used for this experiment consists of approximately 12,000 paired images of IR and visible light spectrums, taken from the LLVIP dataset. Two separate Pix2Pix models were trained:
- IR to Visible Image Translation
- Visible to IR Image Translation
These models were expected to identify and learn any potential correlations between the two image types during the training process.
Upon evaluating the models' performance and analyzing the output, the results were significantly divergent from the expected correlations. The predicted images from both models did not align with their respective ground truths, indicating a lack of consistent patterns between the IR and visible datasets as hypothesized.
This outcome suggests that there is no straightforward or inherent rule that can predict the IR version of an image based solely on its visible spectrum representation, and vice versa. Therefore, the initial hypothesis that visible light features could directly inform or predict infrared characteristics was not supported by the experimental data.
We concluded that there is no discernible pattern or rule linking IR and visible images based solely Pix2Pix model producing poorly. Reason for drawning this conclusion is, Pix2Pix an image-to-image translation model, excels when there is a consistent underlying rule or pattern in the image translation process. If such a pattern exists, the model typically begins to produce accurate results within a few epochs, even with a relatively small dataset.
However, this does not imply that IR-visible image translation is unachievable. With a sufficiently large and diverse dataset, successful translation might still be possible. The basis for this would not be an inherent pattern between IR and visible conversions, but rather the model's ability to identify objects and assign colors based on its learned experience from past data.
The dataset used in this project is taken from the LLVIP dataset.