Skip to content

Commit

Permalink
Fix figure alignments
Browse files Browse the repository at this point in the history
  • Loading branch information
Pattrigue committed Jan 26, 2024
1 parent 5b6ccff commit 91f6700
Showing 1 changed file with 37 additions and 38 deletions.
75 changes: 37 additions & 38 deletions report_pre_thesis/src/sections/data_overview.tex
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,15 @@ \section{Data Overview}\label{sec:data_overview}
For each sample, the data is split into five datasets, one for each location on the sample that was shot at by the laser.
Each dataset contains CCS data stored in a \texttt{.csv} file.

\begin{figure*}[b]
\centering
\includegraphics[width=0.85\textwidth]{images/oxide_corr.png}
\caption{Correlation matrix of the composition data calculated using the Pearson correlation coefficient, illustrated as a heatmap.}
\label{fig:oxide_corr}
\end{figure*}

\begin{figure}[ht]
\scalebox{0.8}{
\scalebox{0.9}{
\begin{forest}
for tree={
font=\ttfamily,
Expand Down Expand Up @@ -48,7 +55,6 @@ \section{Data Overview}\label{sec:data_overview}
\label{fig:directory_structure}
\end{figure}


Each \texttt{.csv} file represents a location on the sample that was shot at by the laser.
They contain the following columns:

Expand Down Expand Up @@ -76,34 +82,6 @@ \section{Data Overview}\label{sec:data_overview}
\label{tab:ccs_data_example}
\end{table*}

The rows in the location dataset represent which wavelength the intensity measurements were taken at.
There are $6144$ rows and $N$ columns, where $N$ is the number of shots taken for a given sample.
While $N=50$ for each sample in the calibration data, the number of shots taken on Mars for each sample can vary but is typically between $30$ and $50$\cite{maurice_chemcam_2016}.

Table \ref{tab:ccs_data_example} shows an example of the CCS data for the first location for the \texttt{cadillac} sample.
As can be seen in the table, the second final row of the \texttt{cadillac} sample contains negative values, which is not physically possible.
These negative values represent noise and are a result of the initial preprocessing steps applied to the raw LIBS data.

Figure \ref{fig:masked_regions} shows a spectral plot of the CCS data for the \texttt{cadillac} sample.
Note how it comprises of three different spectral regions - ultra-violet (UV), violet (VIO), and visible and near infrared (VNIR).
Separate instruments were used for each of these regions.
Consequently, the edges of the spectral regions are noisy because pixels at the edges of the CCD\footnote{A charge-coupled device (CCD) is a light-sensitive electronic detector that converts incoming photons into an electronic signal, commonly used in digital imaging and astronomy\cite{radionuclide_imaging}.} usually exhibit lower sensitivity compared to those at the center, and the optics vary in their reflective and absorptive properties at different wavelengths.
These regions, which also contain no unique major element diagnostic peaks, are masked out to enhance the accuracy and reliability of the quantitative analysis\cite{cleggRecalibrationMarsScience2017}.
Specifically, the masked ranges are defined in \citet{cleggRecalibrationMarsScience2017} as 240.811 --- 246.635, 338.457 --- 340.797, 382.138 --- 387.859, 473.184 --- 492.427, and 849 --- 905.574 nm and are highlighted in blue in Figure~\ref{fig:masked_regions}.

\begin{figure}
\centering
\includegraphics[width=0.5\textwidth]{images/masked_regions.png}
\caption{Spectral plot of the CCS data for the \texttt{cadillac} sample. The blue regions represent the noisy edges of the spectral regions.}
\label{fig:masked_regions}
\end{figure}

\subsection{Composition Data}\label{subsec:composition_data}
In addition to these datasets, there is also a \\ \texttt{ccam\_calibration\_compositions.csv} file that contains ground truth data for each major oxide in each sample.
There are a total of eight major oxides: \ce{SiO2}, \ce{TiO2}, \ce{Al2O3}, \ce{FeOT}, \ce{MnO}, \ce{MgO}, \ce{CaO}, \ce{Na2O}, and \ce{K2O}.
For each of these oxides, the data specifies their respective concentrations in each sample, expressed as a weight percentage (wt. \%) of the total composition.
An excerpt of this dataset is shown in Table~\ref{tab:composotion_data_example}.

\begin{table*}[!b]
\centering
\begin{tabular}{lllllllllllll}
Expand All @@ -117,23 +95,44 @@ \subsection{Composition Data}\label{subsec:composition_data}
TB2 & --- & --- & 60.4 & 0.93 & 20.5 & 11.6536 & 0.047 & 1.86 & 0.2 & 1.29 & 3.86 & 100.7406 \\
\bottomrule
\end{tabular}
\caption{Exert from the composition dataset.}
\caption{Excerpt from the composition dataset.}
\label{tab:composotion_data_example}
\end{table*}

\begin{figure*}
The rows in the location dataset represent which wavelength the intensity measurements were taken at.
There are $6144$ rows and $N$ columns, where $N$ is the number of shots taken for a given sample.
While $N=50$ for each sample in the calibration data, the number of shots taken on Mars for each sample can vary but is typically between $30$ and $50$\cite{maurice_chemcam_2016}.

Table \ref{tab:ccs_data_example} shows an example of the CCS data for the first location for the \texttt{cadillac} sample.
As can be seen in the table, the second final row of the \texttt{cadillac} sample contains negative values, which is not physically possible.
These negative values represent noise and are a result of the initial preprocessing steps applied to the raw LIBS data.

\begin{figure}[t]
\centering
\includegraphics[width=0.5\textwidth]{images/masked_regions.png}
\caption{Spectral plot of the CCS data for the \texttt{cadillac} sample. The blue regions represent the noisy edges of the spectral regions.}
\label{fig:masked_regions}
\end{figure}

Figure \ref{fig:masked_regions} shows a spectral plot of the CCS data for the \texttt{cadillac} sample.
Note how it comprises of three different spectral regions - ultra-violet (UV), violet (VIO), and visible and near infrared (VNIR).
Separate instruments were used for each of these regions.
Consequently, the edges of the spectral regions are noisy because pixels at the edges of the CCD\footnote{A charge-coupled device (CCD) is a light-sensitive electronic detector that converts incoming photons into an electronic signal, commonly used in digital imaging and astronomy\cite{radionuclide_imaging}.} usually exhibit lower sensitivity compared to those at the center, and the optics vary in their reflective and absorptive properties at different wavelengths.
These regions, which also contain no unique major element diagnostic peaks, are masked out to enhance the accuracy and reliability of the quantitative analysis\cite{cleggRecalibrationMarsScience2017}.
Specifically, the masked ranges are defined in \citet{cleggRecalibrationMarsScience2017} as 240.811 --- 246.635, 338.457 --- 340.797, 382.138 --- 387.859, 473.184 --- 492.427, and 849 --- 905.574 nm and are highlighted in blue in Figure~\ref{fig:masked_regions}.

\subsection{Composition Data}\label{subsec:composition_data}
\begin{figure*}[t]
\centering
\includegraphics[width=0.85\textwidth]{images/composition_box_plot.png}
\caption{Box plot of the composition data. The orange line represents the median, the black boxes represent the interquartile range for each oxide, and the whiskers represent the range of the data. The black circles represent outliers.}
\label{fig:composition_box_plot}
\end{figure*}

\begin{figure*}
\centering
\includegraphics[width=0.85\textwidth]{images/oxide_corr.png}
\caption{Correlation matrix of the composition data calculated using the Pearson correlation coefficient, illustrated as a heatmap.}
\label{fig:oxide_corr}
\end{figure*}
In addition to these datasets, there is also a \\ \texttt{ccam\_calibration\_compositions.csv} file that contains ground truth data for each major oxide in each sample.
There are a total of eight major oxides: \ce{SiO2}, \ce{TiO2}, \ce{Al2O3}, \ce{FeOT}, \ce{MnO}, \ce{MgO}, \ce{CaO}, \ce{Na2O}, and \ce{K2O}.
For each of these oxides, the data specifies their respective concentrations in each sample, expressed as a weight percentage (wt. \%) of the total composition.
An excerpt of this dataset is shown in Table~\ref{tab:composotion_data_example}.


Figure \ref{fig:composition_box_plot} shows a box plot of the composition data.
Expand Down

0 comments on commit 91f6700

Please sign in to comment.