This project provides examples on how to use the Datumbox Machine Learning Framework v0.8.3-SNAPSHOT (Build 20201014).
Copyright (c) 2013-2020 Vasilis Vryniotis.
The code is licensed under the Apache License, Version 2.0.
The code uses Maven Project Structure and contains the following code examples:
- Classification.java: Contains an example on how to perform Classification.
- Clustering.java: It is an example that runs Cluster Analysis.
- Regression.java: Shows how to run Regression Analysis.
- DataModeling.java: Explains how to use the convenience Modeler class.
- TextClassification.java: Uses the convenience TextClassifier class.
All of the above files contain a main() method. To use it just clone the project on your workspace and run any of the above files.
The project contains also 5 configuration files in the resources folder:
- datumbox.configuration.properties: It defines for the default storage engine (required).
- datumbox.concurrencyconfiguration.properties: It controls the concurrency levels (required).
- datumbox.inmemoryconfiguration.properties: It contains the configurations for the InMemory storage engine (required).
- datumbox.mapdbconfiguration.properties: It contains the configurations for the MapDB storage engine (optional).
- logback.xml: It contains the configuration file for the logger (optional).
Finally in the resources folder there are several real world datasets which are used for testing.