From 91f67002c2065386a9c865b6f2c793e47de7a398 Mon Sep 17 00:00:00 2001 From: Pattrigue Date: Fri, 26 Jan 2024 11:00:26 +0100 Subject: [PATCH] Fix figure alignments --- .../src/sections/data_overview.tex | 75 +++++++++---------- 1 file changed, 37 insertions(+), 38 deletions(-) diff --git a/report_pre_thesis/src/sections/data_overview.tex b/report_pre_thesis/src/sections/data_overview.tex index 7e8e9378..c796bd5b 100644 --- a/report_pre_thesis/src/sections/data_overview.tex +++ b/report_pre_thesis/src/sections/data_overview.tex @@ -4,8 +4,15 @@ \section{Data Overview}\label{sec:data_overview} For each sample, the data is split into five datasets, one for each location on the sample that was shot at by the laser. Each dataset contains CCS data stored in a \texttt{.csv} file. +\begin{figure*}[b] + \centering + \includegraphics[width=0.85\textwidth]{images/oxide_corr.png} + \caption{Correlation matrix of the composition data calculated using the Pearson correlation coefficient, illustrated as a heatmap.} + \label{fig:oxide_corr} +\end{figure*} + \begin{figure}[ht] - \scalebox{0.8}{ + \scalebox{0.9}{ \begin{forest} for tree={ font=\ttfamily, @@ -48,7 +55,6 @@ \section{Data Overview}\label{sec:data_overview} \label{fig:directory_structure} \end{figure} - Each \texttt{.csv} file represents a location on the sample that was shot at by the laser. They contain the following columns: @@ -76,34 +82,6 @@ \section{Data Overview}\label{sec:data_overview} \label{tab:ccs_data_example} \end{table*} -The rows in the location dataset represent which wavelength the intensity measurements were taken at. -There are $6144$ rows and $N$ columns, where $N$ is the number of shots taken for a given sample. -While $N=50$ for each sample in the calibration data, the number of shots taken on Mars for each sample can vary but is typically between $30$ and $50$\cite{maurice_chemcam_2016}. - -Table \ref{tab:ccs_data_example} shows an example of the CCS data for the first location for the \texttt{cadillac} sample. -As can be seen in the table, the second final row of the \texttt{cadillac} sample contains negative values, which is not physically possible. -These negative values represent noise and are a result of the initial preprocessing steps applied to the raw LIBS data. - -Figure \ref{fig:masked_regions} shows a spectral plot of the CCS data for the \texttt{cadillac} sample. -Note how it comprises of three different spectral regions - ultra-violet (UV), violet (VIO), and visible and near infrared (VNIR). -Separate instruments were used for each of these regions. -Consequently, the edges of the spectral regions are noisy because pixels at the edges of the CCD\footnote{A charge-coupled device (CCD) is a light-sensitive electronic detector that converts incoming photons into an electronic signal, commonly used in digital imaging and astronomy\cite{radionuclide_imaging}.} usually exhibit lower sensitivity compared to those at the center, and the optics vary in their reflective and absorptive properties at different wavelengths. -These regions, which also contain no unique major element diagnostic peaks, are masked out to enhance the accuracy and reliability of the quantitative analysis\cite{cleggRecalibrationMarsScience2017}. -Specifically, the masked ranges are defined in \citet{cleggRecalibrationMarsScience2017} as 240.811 --- 246.635, 338.457 --- 340.797, 382.138 --- 387.859, 473.184 --- 492.427, and 849 --- 905.574 nm and are highlighted in blue in Figure~\ref{fig:masked_regions}. - -\begin{figure} - \centering - \includegraphics[width=0.5\textwidth]{images/masked_regions.png} - \caption{Spectral plot of the CCS data for the \texttt{cadillac} sample. The blue regions represent the noisy edges of the spectral regions.} - \label{fig:masked_regions} -\end{figure} - -\subsection{Composition Data}\label{subsec:composition_data} -In addition to these datasets, there is also a \\ \texttt{ccam\_calibration\_compositions.csv} file that contains ground truth data for each major oxide in each sample. -There are a total of eight major oxides: \ce{SiO2}, \ce{TiO2}, \ce{Al2O3}, \ce{FeOT}, \ce{MnO}, \ce{MgO}, \ce{CaO}, \ce{Na2O}, and \ce{K2O}. -For each of these oxides, the data specifies their respective concentrations in each sample, expressed as a weight percentage (wt. \%) of the total composition. -An excerpt of this dataset is shown in Table~\ref{tab:composotion_data_example}. - \begin{table*}[!b] \centering \begin{tabular}{lllllllllllll} @@ -117,23 +95,44 @@ \subsection{Composition Data}\label{subsec:composition_data} TB2 & --- & --- & 60.4 & 0.93 & 20.5 & 11.6536 & 0.047 & 1.86 & 0.2 & 1.29 & 3.86 & 100.7406 \\ \bottomrule \end{tabular} -\caption{Exert from the composition dataset.} +\caption{Excerpt from the composition dataset.} \label{tab:composotion_data_example} \end{table*} -\begin{figure*} +The rows in the location dataset represent which wavelength the intensity measurements were taken at. +There are $6144$ rows and $N$ columns, where $N$ is the number of shots taken for a given sample. +While $N=50$ for each sample in the calibration data, the number of shots taken on Mars for each sample can vary but is typically between $30$ and $50$\cite{maurice_chemcam_2016}. + +Table \ref{tab:ccs_data_example} shows an example of the CCS data for the first location for the \texttt{cadillac} sample. +As can be seen in the table, the second final row of the \texttt{cadillac} sample contains negative values, which is not physically possible. +These negative values represent noise and are a result of the initial preprocessing steps applied to the raw LIBS data. + +\begin{figure}[t] + \centering + \includegraphics[width=0.5\textwidth]{images/masked_regions.png} + \caption{Spectral plot of the CCS data for the \texttt{cadillac} sample. The blue regions represent the noisy edges of the spectral regions.} + \label{fig:masked_regions} +\end{figure} + +Figure \ref{fig:masked_regions} shows a spectral plot of the CCS data for the \texttt{cadillac} sample. +Note how it comprises of three different spectral regions - ultra-violet (UV), violet (VIO), and visible and near infrared (VNIR). +Separate instruments were used for each of these regions. +Consequently, the edges of the spectral regions are noisy because pixels at the edges of the CCD\footnote{A charge-coupled device (CCD) is a light-sensitive electronic detector that converts incoming photons into an electronic signal, commonly used in digital imaging and astronomy\cite{radionuclide_imaging}.} usually exhibit lower sensitivity compared to those at the center, and the optics vary in their reflective and absorptive properties at different wavelengths. +These regions, which also contain no unique major element diagnostic peaks, are masked out to enhance the accuracy and reliability of the quantitative analysis\cite{cleggRecalibrationMarsScience2017}. +Specifically, the masked ranges are defined in \citet{cleggRecalibrationMarsScience2017} as 240.811 --- 246.635, 338.457 --- 340.797, 382.138 --- 387.859, 473.184 --- 492.427, and 849 --- 905.574 nm and are highlighted in blue in Figure~\ref{fig:masked_regions}. + +\subsection{Composition Data}\label{subsec:composition_data} +\begin{figure*}[t] \centering \includegraphics[width=0.85\textwidth]{images/composition_box_plot.png} \caption{Box plot of the composition data. The orange line represents the median, the black boxes represent the interquartile range for each oxide, and the whiskers represent the range of the data. The black circles represent outliers.} \label{fig:composition_box_plot} \end{figure*} -\begin{figure*} - \centering - \includegraphics[width=0.85\textwidth]{images/oxide_corr.png} - \caption{Correlation matrix of the composition data calculated using the Pearson correlation coefficient, illustrated as a heatmap.} - \label{fig:oxide_corr} -\end{figure*} +In addition to these datasets, there is also a \\ \texttt{ccam\_calibration\_compositions.csv} file that contains ground truth data for each major oxide in each sample. +There are a total of eight major oxides: \ce{SiO2}, \ce{TiO2}, \ce{Al2O3}, \ce{FeOT}, \ce{MnO}, \ce{MgO}, \ce{CaO}, \ce{Na2O}, and \ce{K2O}. +For each of these oxides, the data specifies their respective concentrations in each sample, expressed as a weight percentage (wt. \%) of the total composition. +An excerpt of this dataset is shown in Table~\ref{tab:composotion_data_example}. Figure \ref{fig:composition_box_plot} shows a box plot of the composition data.