Skip to content

Ranler/boostingPL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BoostingPL - Scalable and Parallel Boosting with MapReduce


What is BoostingPL?

  BoostingPL is a scalable and parallel machine learning tools for Boosting
  (What is Boosting? http://en.wikipedia.org/wiki/Boosting).

  This Project is based on this paper:
 
      Indranil Palit and Chandan K. Reddy, "Scalable and Parallel Boosting
      with MapReduce", IEEE Transactions on Knowledge and Data Engineering
      (TKDE), 2012.
   
  If you want to know the theory and demonstration of algorithms in BoostingPL
  , this paper provides references for further reading.

  BoostingPL use the classifiers in WEKA(http://www.cs.waikato.ac.nz/ml/weka/)
  as weak classifiers, so it depends on weka.

  The MapReduce platform we used is Apache Hadoop(http://hadoop.apache.org/)
  , which is an opensource project and is more popular than other 
  implementations of MapReduce. You can deploy it on Amazon EC2 or your own 
  cluster.

  The cgl-mapreduce version for BoostingPL is also vailable from the TKDE paper 
  authors, you can download "CGL-MapReduce-BoostingPL.zip" in "extra/" directory.
  This cgl-mapreduce version depends on these libraries: Weka, twister-0.8, 
  NaradaBrokering-4.2.2. You can download these from source websites.

  BoostingMR( https://github.com/Ranler/boostingMR ): another project which 
  provide an simple web-based interface for hadoop MR jobs.


License:

  BoostingPL is open source software issued under the GNU General Public 
  License (GPLv3). See the LICENSE included in this directory for more 
  information.



Boosting Classifiers:

  At present we have implemented these classifiers:

    * Boosting Classifiers
      - AdaBoost
      - AdaBoostPL
      - LogitBoost
      - LogitBoostPL

    * Weak Classifiers:
      - DecisionStump


A Simple Testing for AdaBoostPL:

https://github.com/Ranler/boostingPL/wiki/A-Simple-Testing-for-AdaBoostPL


Getting Started:

   See http:///



Documentation:

   See http:///



Experimental Results:

   See http:///


About

BoostingPL - Scalable and Parallel Boosting with MapReduce

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published