[FIX] Preprocess: pickle when saving #2289
Conversation
Codecov Report
@@            Coverage Diff             @@
##           master    #2289      +/-   ##
==========================================
- Coverage   73.29%   73.29%   -0.01%
==========================================
  Files         317      317
  Lines       55512    55532      +20
==========================================
+ Hits        40687    40701      +14
- Misses      14825    14831       +6
How can this issue be reproduced?
@kernc: Added the steps above.
"Indicators, FirstAsBase, FrequentAsBase,"
"Remove, RemoveMultinomial, ReportError, AsOrdinal,"
"AsNormalizedOrdinal, Leave")
class ContinuizeDV(Enum):
Before, you'd get:
>>> from Orange.preprocess import Continuize
>>> preproc = Continuize(multinomial_treatment=Continuize.FrequentAsBase)
>>> preproc
Continuize(multinomial_treatment=Continuize.FrequentAsBase)
With this change, it's:
>>> preproc = Continuize(multinomial_treatment=Continuize.FrequentAsBase)
>>> preproc
Continuize(multinomial_treatment=2)
Is there some way to keep the customized Enum and still pickle plain numbers?
Or, as an alternative, sklearn uses expressive strings; I wouldn't mind that either.
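One possible answer to the question above is to keep the readable Enum in Python code but override `__reduce_ex__` so members pickle as plain ints. A stdlib-only sketch: `MultinomialTreatment` here is a hypothetical stand-in for the Continuize constants, not Orange's actual class.

```python
from enum import IntEnum
import pickle


class MultinomialTreatment(IntEnum):
    # Hypothetical stand-in for Continuize's treatment constants.
    Indicators = 0
    FirstAsBase = 1
    FrequentAsBase = 2

    def __repr__(self):
        # Keep the readable repr from the old behaviour.
        return "Continuize.%s" % self.name

    def __reduce_ex__(self, protocol):
        # Pickle as a plain int, so saved workflows carry only the
        # number and never reference the enum class itself.
        return int, (int(self),)


member = MultinomialTreatment.FrequentAsBase
restored = pickle.loads(pickle.dumps(member))
print(repr(member))  # Continuize.FrequentAsBase
print(restored)      # 2
```

The trade-off: unpickled values come back as bare ints, so comparisons still work (`IntEnum` members equal their int values) but the name is lost on the loaded side.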
@kernc: I made some changes.
ReportError = "ReportError"
AsOrdinal = "AsOrdinal"
AsNormalizedOrdinal = "AsNormalizedOrdinal"
Leave = "Leave"
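As a side note on the diff above: string-valued Enum members round-trip through pickle by value, so with values equal to the member names the saved workflow stores a readable string. A stdlib sketch with a hypothetical `Treatment` enum mirroring the members in the diff:

```python
from enum import Enum
import pickle


class Treatment(Enum):
    # Hypothetical mirror of the string-valued members in the diff.
    ReportError = "ReportError"
    AsOrdinal = "AsOrdinal"
    AsNormalizedOrdinal = "AsNormalizedOrdinal"
    Leave = "Leave"


# Enum members pickle by value; unpickling calls Treatment("AsOrdinal"),
# which resolves back to the same member object.
restored = pickle.loads(pickle.dumps(Treatment.AsOrdinal))
assert restored is Treatment.AsOrdinal
```

This requires the enum class to stay importable under the same name, but renaming a *value* string would break previously saved workflows, which is the usual caveat with string-keyed settings.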
I don't oppose strings at all. They are simple and robust. Sklearn is all strings in params, and nobody is complaining. But sklearn checks their user input for invalid values. Could we check the passed argument is valid? E.g.:
assert multinomial_treatment in self._MultinomialTreatment
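A `ValueError` with an explicit message is usually preferable to the `assert` suggested above, since asserts are stripped under `python -O`. A sketch of sklearn-style validation, with a hypothetical `check_treatment` helper (not part of Orange's API):

```python
# Allowed values, taken from the option list quoted in this thread.
VALID_TREATMENTS = frozenset({
    "Indicators", "FirstAsBase", "FrequentAsBase",
    "Remove", "RemoveMultinomial", "ReportError",
    "AsOrdinal", "AsNormalizedOrdinal", "Leave",
})


def check_treatment(multinomial_treatment):
    # Reject invalid input with a message listing the valid choices,
    # the way sklearn estimators validate string parameters.
    if multinomial_treatment not in VALID_TREATMENTS:
        raise ValueError(
            "multinomial_treatment must be one of %s, got %r"
            % (sorted(VALID_TREATMENTS), multinomial_treatment))
    return multinomial_treatment
```

Calling `check_treatment("Leave")` passes the value through; `check_treatment("bogus")` raises a `ValueError` that names every valid option, so typos surface immediately instead of failing later.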
"Indicators, FirstAsBase, FrequentAsBase,"
"Remove, RemoveMultinomial, ReportError, AsOrdinal,"
"AsNormalizedOrdinal, Leave")
class ContinuizeDV(Enum):
Can we name this MultinomialTreatment, since it's an enumeration of such methods?
Solved in #2409.
Issue
The Preprocess widget is unable to save Randomize, Normalize Features, and Continuize Discrete Variables, because pickling these preprocessors fails when the workflow is saved.
Steps to reproduce the behavior
1. Add Randomize, Normalize Features, or Continuize Discrete Variables.
2. Change some parameters.
3. Save the workflow and close Orange.
4. Run Orange again and open the workflow.
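For context on how a pickle error like this can arise at all: pickle locates a class by looking it up by name in its module, so an Enum created at runtime (or otherwise not reachable under its own name) cannot be pickled. A stdlib-only sketch of that failure mode; this illustrates the general mechanism and is not necessarily Orange's exact cause here:

```python
import pickle
from enum import Enum


def make_enum():
    # Enum built via the functional API inside a function; the class
    # is named "Local" but is never bound to that name in the module.
    return Enum("Local", "A B C")


Hidden = make_enum()  # bound under a different name than "Local"

try:
    pickle.dumps(Hidden.A)
    outcome = "pickled fine"
except Exception as exc:
    # pickle tries to find "Local" in this module and fails.
    outcome = "pickle failed: %s" % exc

print(outcome)
```

Binding the class at module level under its own name (`Local = Enum("Local", "A B C")`) makes the same member picklable, which is why moving such enums to module scope, or replacing them with plain strings as this PR does, resolves the error.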
Description of changes
Includes