Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Shall we deprecate Scholar.Preprocessing? #228

Open
josevalim opened this issue Dec 31, 2023 · 4 comments
Open

Shall we deprecate Scholar.Preprocessing? #228

josevalim opened this issue Dec 31, 2023 · 4 comments
Labels
suggestion ideas and/or plans to put forward for consideration.

Comments

@josevalim
Copy link
Contributor

Today the module only contain defdelegate to functions in other modules. Should we remove these delegations? Should we keep them?

@krstopro @msluszniak @polvalente

@krstopro
Copy link
Member

krstopro commented Dec 31, 2023

Given that each of these modules contains fit_transform/2, I think it is safe to deprecate/remove Scholar.Preprocessing. Only Binarizer module remains to be implemented.

Just my opinion, curious to see what the others think.

@msluszniak
Copy link
Contributor

Yes, I agree that after moving the binarizer into a separate module we'll be able to delete Scholar.Preprocessing

@josevalim
Copy link
Contributor Author

There is another option here to remove the duplication:

  1. Remove the fit_transform functions from Scholar.Preprocesing.* modules
  2. Keep the current functions in Scholar.Preprocessing as the fit_transform variant

But I don't like much because the documentation for supported options are all in the actual module. So we would either need a way to share the options or redirect users to a separate module, which is not the best user experience. I thought I'd mention it for completeness tho.

@msluszniak
Copy link
Contributor

We might make a more general struct Preprocessing with two fields operation and data. For practical purposes, all functions will have fit and predict parts. Then using predict, based on the operation field, it will call the right function. However, this makes the fields like standard deviation in normalization less explicit.

@josevalim josevalim added the question Further information is requested label Mar 5, 2024
@josevalim josevalim added suggestion ideas and/or plans to put forward for consideration. and removed question Further information is requested labels May 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
suggestion ideas and/or plans to put forward for consideration.
Projects
None yet
Development

No branches or pull requests

3 participants