Expected Behavior
There are a variety of applications in which zero-indexing would be preferred for the OrdinalEncoder. One example is preparing features for a PyTorch model with categorical embeddings, in which case the ordinal label is used to index rows of an embedding matrix. Note also that the sklearn OrdinalEncoder is zero-indexed.
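To illustrate that last point, here is a quick check of sklearn's behaviour (assuming scikit-learn is installed; the category values are arbitrary):

```python
import numpy as np
from sklearn.preprocessing import OrdinalEncoder

# sklearn assigns ordinal codes starting at 0, in sorted category order
enc = OrdinalEncoder()
X = np.array([["a"], ["b"], ["c"], ["a"]])
codes = enc.fit_transform(X)
print(codes.ravel())  # the first category maps to 0.0, not 1.0
```

Labels produced this way can be used directly as indices, with no off-by-one adjustment.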
One option would be to add an argument to __init__() that specifies the starting index (e.g., self.index_start), so that the ordinal_encoding() method could do something like:
data = pd.Series(index=index, data=range(self.index_start, len(index) + self.index_start))
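A minimal sketch of how that could look (the class name and wiring here are hypothetical, not the library's actual implementation; the real encoder has more machinery around fitting and transforming):

```python
import pandas as pd

class IndexedOrdinalEncoder:
    """Hypothetical sketch: ordinal encoding with a configurable starting index."""

    def __init__(self, index_start=1):
        # index_start=1 preserves the current one-indexed behaviour;
        # index_start=0 gives sklearn-style zero-indexed labels.
        self.index_start = index_start

    def ordinal_encoding(self, index):
        # Map each category in `index` to consecutive integers
        # beginning at self.index_start.
        return pd.Series(
            index=index,
            data=range(self.index_start, len(index) + self.index_start),
        )

zero_based = IndexedOrdinalEncoder(index_start=0).ordinal_encoding(["a", "b", "c"])
one_based = IndexedOrdinalEncoder().ordinal_encoding(["a", "b", "c"])
```

Defaulting index_start to 1 would keep the change backwards-compatible, while index_start=0 would yield labels suitable as embedding-matrix row indices.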
Actual Behavior
The ordinal_encoding() method imposes one-indexing in this line:
data = pd.Series(index=index, data=range(1, len(index) + 1))
Specifications
Version: 2.2.2