This repository has been archived by the owner on Jul 22, 2024. It is now read-only.

Use ALS instead of SVD? #56

Open
dc-thanh opened this issue Apr 28, 2020 · 1 comment

Comments

@dc-thanh

This repo is great! But I still don't understand why we should use ALS instead of SVD (Step 3 in the Jupyter notebook). Could you explain the reasoning behind this choice?

@MLnick
Contributor

MLnick commented May 19, 2020

Hi there

Indeed, SVD and ALS are very similar in concept: both factorize the ratings matrix into lower-rank factor matrices. Which one is better is debatable and really depends on the situation. In fact, for most real-world problems one should try a few algorithms and approaches to see which best fits the objectives.
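
To make the factorization idea concrete, here is a minimal pure-Python sketch of alternating least squares, simplified to a single latent factor (rank 1). It is not the Spark implementation or the notebook's code; the toy ratings, the `reg` value, and the function names are all assumptions chosen for illustration. The updates only touch observed ratings, so missing entries never need to be filled in.

```python
# Rank-1 ALS on a tiny, partially observed ratings matrix (illustrative only).
# Observed ratings as (user, item, rating); all values are made up.
ratings = [
    (0, 0, 5.0), (0, 1, 4.0),
    (1, 0, 4.0), (1, 2, 1.0),
    (2, 1, 2.0), (2, 2, 5.0),
]
n_users, n_items = 3, 3
reg = 0.1  # L2 regularization strength (hypothetical value)


def objective(u, v):
    """Regularized squared error over the observed entries only."""
    err = sum((r - u[i] * v[j]) ** 2 for i, j, r in ratings)
    return err + reg * (sum(x * x for x in u) + sum(x * x for x in v))


def update(fixed, n, own_idx, other_idx):
    """Closed-form ridge update for one side while the other side is held fixed."""
    num = [0.0] * n
    den = [reg] * n
    for row in ratings:
        own, other, r = row[own_idx], row[other_idx], row[2]
        num[own] += r * fixed[other]
        den[own] += fixed[other] ** 2
    return [num[k] / den[k] for k in range(n)]


def als(n_iters=20):
    """Alternate between solving for user factors and item factors."""
    u = [1.0] * n_users
    v = [1.0] * n_items
    history = [objective(u, v)]
    for _ in range(n_iters):
        u = update(v, n_users, 0, 1)  # fix item factors, solve for users
        v = update(u, n_items, 1, 0)  # fix user factors, solve for items
        history.append(objective(u, v))
    return u, v, history


if __name__ == "__main__":
    u, v, history = als()
    print("objective before:", history[0], "after:", history[-1])
```

Each half-step is a closed-form ridge-regression solve, so the regularized objective recorded in `history` never increases from one iteration to the next.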

In this case, the ALS implementation in Spark is performant, scalable, and the easiest to get started with. By contrast, the SVD implementation available in the older RDD-based Spark APIs is not very scalable. That is why we use ALS here.
