Add similarity search of datasets in OpenML database #34

Open
MorrisNein opened this issue Jul 14, 2023 · 0 comments
Labels
enhancement New feature or request

Comments

@MorrisNein
Collaborator

We could add a dataset similarity search that works as follows:

  1. Extract the necessary metafeatures from a custom dataset (implement the corresponding MetaFeaturesExtractor in Adopt to OpenML research #3 or in another PR)
  2. Load the full database of OpenML datasets
  3. Find the N nearest datasets using DatasetsSimilarityAssessor
  4. Load all evaluations for the closest datasets from OpenML (implement the corresponding ModelsLoader in Adopt to OpenML research #3). Keep only the best M models for each dataset
  5. Provide a final report to the user
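
A minimal sketch of steps 3 to 5, assuming the metafeatures are already available as plain numeric vectors and the evaluations as (model name, score) pairs. The names `nearest_datasets`, `best_models` and `build_report` are hypothetical and are not the project's `DatasetsSimilarityAssessor` or `ModelsLoader` APIs. The z-score scaling and Euclidean distance are also assumptions, since the proposal does not fix a metric.

```python
import math
from typing import Dict, List, Sequence, Tuple


def column_stats(rows: Sequence[Sequence[float]]) -> Tuple[List[float], List[float]]:
    """Per-feature mean and standard deviation (a zero std is replaced by 1)."""
    n = len(rows)
    means = [sum(col) / n for col in zip(*rows)]
    stds = []
    for col, mean in zip(zip(*rows), means):
        std = math.sqrt(sum((x - mean) ** 2 for x in col) / n)
        stds.append(std if std > 0 else 1.0)
    return means, stds


def nearest_datasets(
    query: Sequence[float],
    known: Dict[str, Sequence[float]],
    n: int,
) -> List[Tuple[str, float]]:
    """Return the n (dataset_id, distance) pairs closest to `query`.

    Features are z-scored with statistics of the known datasets so that
    large-valued metafeatures do not dominate the Euclidean distance.
    """
    means, stds = column_stats(list(known.values()))

    def scale(vec: Sequence[float]) -> List[float]:
        return [(x - mu) / sd for x, mu, sd in zip(vec, means, stds)]

    scaled_query = scale(query)
    distances = [
        (ds_id, math.dist(scaled_query, scale(vec))) for ds_id, vec in known.items()
    ]
    return sorted(distances, key=lambda pair: pair[1])[:n]


def best_models(
    evaluations: Sequence[Tuple[str, float]], m: int
) -> List[Tuple[str, float]]:
    """Keep the m highest-scoring (model_name, score) pairs."""
    return sorted(evaluations, key=lambda pair: pair[1], reverse=True)[:m]


def build_report(
    query: Sequence[float],
    known: Dict[str, Sequence[float]],
    evaluations: Dict[str, Sequence[Tuple[str, float]]],
    n: int,
    m: int,
) -> List[dict]:
    """Combine the steps: nearest datasets, then the best models for each."""
    return [
        {
            "dataset": ds_id,
            "distance": distance,
            "models": best_models(evaluations.get(ds_id, []), m),
        }
        for ds_id, distance in nearest_datasets(query, known, n)
    ]


# Toy data, invented for illustration only (not real OpenML values).
# Each vector is (number of instances, number of features).
known_metafeatures = {
    "a": [100.0, 10.0],
    "b": [120.0, 12.0],
    "c": [10000.0, 300.0],
}
known_evaluations = {
    "a": [("rf", 0.91), ("knn", 0.85), ("logit", 0.88)],
    "b": [("knn", 0.80), ("rf", 0.79)],
    "c": [("rf", 0.70)],
}
report = build_report([105.0, 10.0], known_metafeatures, known_evaluations, n=2, m=2)
```

In the real pipeline the toy dictionaries would be replaced by the output of the metafeature extractor and the evaluations loaded from OpenML. An exhaustive scan like this is linear in the number of datasets, which may be acceptable at OpenML's scale but is not something the sketch verifies.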
MorrisNein self-assigned this on Jul 14, 2023
DRMPN added the enhancement (New feature or request) label on Aug 2, 2023
2 participants