Added Preprocessing Bundle to ML_Core #25
base: master
@RogerDev Please review
@vzeufack Good code and testing! There are a few merge conflicts and minor typos. Please resolve.
Preprocessing/StandardScaler.ecl
Outdated
SHARED numberLayout := Preprocessing.Types.numberLayout;

/**
 * Computes averages and stdevs for each feature in baseData.
Please try to avoid shorthand: stdevs = standard deviations (stdevs)
Preprocessing/LabelEncoder.ecl
Outdated
RETURN Result;
ENDMACRO;

<<<<<<< HEAD
Remove line 59. It looks like a leftover from a merge conflict
$.TestOneHotEncoder.RunOneHotEncoderTests;
$.TestStandardScaler.RunStandardScalerTests;
$.TestMinMaxScaler.RunMinMaxScalerTests;
<<<<<<< HEAD
Please resolve the merge conflict
$.TestAreEqualRows.TestDifferentRows();

$.TestCompare.TestEqualData();
<<<<<<< HEAD
Please resolve the merge conflict
END;

/**
<<<<<<< HEAD
Please resolve the merge conflict
PROJECT(ROWS(LEFT), XF(LEFT)));

#UNIQUENAME(comparisonResult)
<<<<<<< HEAD
Please resolve the merge conflict
e6dff78 to 92ac0e2
Some minor typos need correction. A few descriptions need more details.
Great job! @vzeufack
Preprocessing/LabelEncoder.ecl
Outdated
 * </pre>
 */
EXPORT GetMapping(key) := FUNCTIONMACRO
IMPORT Preprocessing.Utils.LabelEncoder;
This absolute import may break the code. Please use a relative path.
Preprocessing/LabelEncoder.ecl
Outdated
 * <p> Data with categorical values replaced by numbers.
 */
EXPORT Encode(dataToEncode, key) := FUNCTIONMACRO
IMPORT Preprocessing.Utils;
This absolute import may break the code. Please use a relative path.
Preprocessing/LabelEncoder.ecl
Outdated
 * <p> Data with categorical values replaced by their original labels.
 */
EXPORT Decode(dataToDecode, encoderKey) := FUNCTIONMACRO
IMPORT Preprocessing.Utils;
This absolute import may break the code. Please use a relative path.
IMPORT STD;
IMPORT $.Files;

spray := STD.File.SprayDelimited('192.168.56.101',
Please abstract the IP address
IMPORT STD;
IMPORT $.Files;

STD.File.SprayDelimited('192.168.56.101',
Please abstract the IP address
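One possible way to address this, sketched under assumptions: define the address once as a named constant and pass that name to the spray call. The names `LandingZoneIP`, `SourcePath`, `TargetCluster`, and `TargetFile`, along with the path, cluster, and logical file name values, are hypothetical and do not come from the pull request. Only the IP address itself is taken from the diff above.

```ecl
IMPORT STD;

// Hypothetical names and values, for illustration only.
LandingZoneIP := '192.168.56.101';                            // single place to change the address
SourcePath    := '/var/lib/HPCCSystems/mydropzone/data.csv';  // assumed landing zone path
TargetCluster := 'mythor';                                    // assumed destination group
TargetFile    := '~preprocessing::test::data';                // assumed logical file name

// The four skipped arguments (max record size, separator,
// terminator, quote) keep their default values.
STD.File.SprayDelimited(LandingZoneIP, SourcePath,,,,, TargetCluster, TargetFile);
```

Placing the constant in a shared module such as the existing `Files` module would let both spray scripts reference the same definition, though whether that module is the right home for it is a judgment call for the author.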
 *
 * @param partialKey: same record structure as the key (see below).
 * <p> Mapping between feature names and categories.
 * Some names are mapped to empty categories such that
Please add a description for the case where the names are mapped to non-empty categories.
Preprocessing/MinMaxScaler.ecl
Outdated
t_FieldReal := MLC.types.t_FieldReal;

/**
 * shifts the values in a range [min, max].
Minor typo: Shift
Preprocessing/MinMaxScaler.ecl
Outdated

/**
 * scales the data using the following formula:
Typo: Scale
Preprocessing/OneHotEncoder.ecl
Outdated
END;

/**
 * Produces a mapping between numbers when encoded to numbers when decoded.
Please rephrase this line for better description of the function.
Preprocessing/StratifiedSplit.ecl
Outdated
END;

/**
 * Determines y stats from full data.
Please explain y stats
Is this branch going to be merged? Analytics would like to use the preprocessing module to prepare data for deep learning training. I have cloned @vzeufack's repository but run into this error when trying to import the module: Not sure whether merging the branch would resolve this issue. Please advise!
@Zelazny7 If the ML_Core bundle is already installed, it may conflict with the downloaded ML_Core bundle. Renaming the downloaded bundle should solve the issue. Please let me know if anything is unclear.
The current version of the Preprocessing Bundle includes: