DOC: Adds Iñigo's w10 and w11 blog posts #63
Conversation
Also adds necessary images
Previously 100 dpi, now 600 dpi
🪓 PR closed, deleted preview at https://github.com/dipy/preview-html/tree/main/dipy.org/pull/63/
Hi @itellaetxe!
Good work and nice clear articles. LGTM
Thanks for this @itellaetxe.
A couple of inline comments.
Let's break down how the AAE works. For those not familiar with generative adversarial networks (GANs), the idea is to have two networks, a generator and a discriminator, that play a game. The generator tries to generate samples that look like the real data (e.g., pictures of animals), while the discriminator tries to distinguish between real and generated samples. The generator is trained to fool the discriminator, and the discriminator is trained not to be fooled. This way, the generator learns to generate samples that look like the real data. The adversarial loss (:math:`\mathcal{L}_{adv}`) is computed as shown in the lowest rectangle.
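To make the adversarial game concrete, here is a minimal NumPy sketch of the two binary cross-entropy terms involved (illustrative only, not the actual training code from the posts; the probabilities ``d_real`` and ``d_fake`` are made-up discriminator outputs):

.. code-block:: python

   import numpy as np

   def bce(p, target):
       # Binary cross-entropy between predicted probabilities p and a constant label
       eps = 1e-7
       p = np.clip(p, eps, 1.0 - eps)
       return -np.mean(target * np.log(p) + (1.0 - target) * np.log(1.0 - p))

   d_real = np.array([0.9, 0.8, 0.95])  # discriminator outputs on real samples
   d_fake = np.array([0.2, 0.4, 0.1])   # discriminator outputs on generated samples

   # Discriminator loss: label real samples 1 and generated samples 0
   loss_disc = bce(d_real, 1.0) + bce(d_fake, 0.0)

   # Generator adversarial loss: try to get generated samples labeled as real (1)
   loss_adv = bce(d_fake, 1.0)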
In our case, the generator is the encoder :math:`\mathcal{G}`, which generates a latent representation of the input data, which the discriminator :math:`\mathcal{D}` tries to distinguish from "real" latent representations sampled from a given prior distribution. The trick to introduce the information of the kind of animal to which the photo belongs is to concatenate the latent representation with the one-hot encoded bundle and attribute vectors. This way, the decoder can generate samples conditioned on a categorical variable. The reconstruction loss (:math:`\mathcal{L}_{MSE}`) is computed as shown in the middle rectangle, and it ensures that the samples reconstructed from the latent representation are as close as possible to the original data.
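As a rough sketch of this conditioning trick (again illustrative NumPy, with made-up shapes and three hypothetical classes), the decoder input is simply the latent code with the one-hot and attribute vectors appended:

.. code-block:: python

   import numpy as np

   rng = np.random.default_rng(0)

   z = rng.normal(size=(4, 8))        # latent codes from the encoder (batch of 4)
   labels = np.array([0, 2, 1, 2])    # categorical class of each sample
   one_hot = np.eye(3)[labels]        # one-hot encoded class vectors, shape (4, 3)
   attr = rng.uniform(size=(4, 1))    # continuous attribute of each sample

   # Concatenate along the feature axis to condition the decoder
   decoder_input = np.concatenate([z, one_hot, attr], axis=1)  # shape (4, 12)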
The trick to introduce the information of the kind of animal to which the photo belongs (...) the one-hot encoded bundle (...) : talking about photos, then bundles, previously about pictures of animals. Can we be consistent when choosing the example domain?
Oops, true. Will stick to animals to keep it simple.
As for the attribute regularization (AR), we try to tie a continuous attribute of choice found in the data space (fur length, age, size, etc.) to a specific dimension of the latent space. To do this, we compute an attribute-distance matrix :math:`D_a` in the data space, and a distance matrix :math:`D_r` from the chosen dimension of the latent space. By minimizing the mean absolute error (MAE) between the two matrices, we force the latent space to be organized in such a way that the chosen dimension is related to the chosen attribute. This way, we can generate samples conditioned on the attribute of choice. The AR loss (:math:`\mathcal{L}_{AR}`) is computed as shown in the top rectangle.
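A simplified NumPy sketch of the AR term follows (illustrative only; formulations in the literature may add extra scaling or a tanh, so treat this as the bare idea, with made-up sample values):

.. code-block:: python

   import numpy as np

   def pairwise_dist(v):
       # Pairwise absolute differences of a 1-D vector -> (n, n) distance matrix
       v = np.asarray(v, dtype=float)
       return np.abs(v[:, None] - v[None, :])

   attribute = np.array([1.0, 3.5, 2.0, 0.5])   # e.g. fur length of each sample
   latent_dim = np.array([0.2, 0.9, 0.4, 0.1])  # values of the regularized latent dimension

   D_a = pairwise_dist(attribute)   # distance matrix in attribute (data) space
   D_r = pairwise_dist(latent_dim)  # distance matrix along the chosen latent dimension

   # AR loss: mean absolute error between the two distance matrices
   loss_AR = np.mean(np.abs(D_a - D_r))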
Related to the above: (fur length, age, size, etc.)
Got it, thanks :)
Thank you for addressing the comments. All good, merging.
DOC: Adds Iñigo's w10 and w11 blog posts 95ccd31
Adds Iñigo's week 10 and week 11 blog posts.
Also adds necessary images to render properly.
I tried the inline LaTeX capabilities of RST in the w11 blog post; it should render fine (I believe).