Describe the solution you'd like
Phi_k is a new correlation coefficient between categorical, ordinal, and interval variables with Pearson characteristics. Paper: https://arxiv.org/pdf/1811.11440.pdf
Phi_k seems to be based on the Chi2 contingency test and to use Brent's optimization, both of which seem to be already available in this library. Perhaps it wouldn't be too hard to implement it in smile.
Besides working with categorical, ordinal, and interval variables, it also captures non-linear dependency, so it would be a powerful tool for data analysis.
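To give a sense of scale, here is a rough Python sketch of the Chi2 contingency ingredient only. It is not taken from PhiK or smile: the function names `contingency_table` and `chi2_statistic` are hypothetical, and the step that turns the statistic into a Phi_k value (which, as I understand the paper, is where the Brent root-finding comes in) is deliberately left out.

```python
from collections import Counter


def contingency_table(xs, ys):
    """Count co-occurrences of the categories in xs and ys.

    Hypothetical helper for illustration; rows and columns follow
    the sorted order of the distinct category labels.
    """
    rows = sorted(set(xs))
    cols = sorted(set(ys))
    counts = Counter(zip(xs, ys))
    return [[counts[(r, c)] for c in cols] for r in rows]


def chi2_statistic(table):
    """Pearson chi-square statistic of an r x c table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of the two variables.
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    return chi2


# Perfectly dependent toy data: "a" always pairs with "x", "b" with "y".
table = contingency_table(["a", "a", "b", "b"], ["x", "x", "y", "y"])
print(table, chi2_statistic(table))
```

A table with identical counts in every cell gives a statistic of zero, which is the independence case that Phi_k would map to no correlation.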
Describe alternatives you've considered
There is an official Python implementation based on NumPy and SciPy available here: https://github.com/KaveIO/PhiK. It is well integrated with pandas and allows computing correlation matrices of pandas data frames.
--
I'm curious to know if this is something you'd like to include in smile.
Thanks for the reply. I'm not sure I'll be able to do it in the next month or so given my current project priorities, but I may be able to in the future. I first wanted to gauge whether this was already on your radar and whether you would be interested in adding it to SMILE.