A comparison of shogun with the popular machine learning toolboxes weka, kernlab, dlib, nieme, orange, java-ml, pyML, mlpy, pybrain, torch3, scikit-learn. A '?' denotes unkown, '-' feature is missing. This table is availabe as a google spreadsheet. If you have additions please share an updated spreadsheet with us and we will integrate the changes.

createdlast updatedmain languagemain focus
shogun199910-2013C++General Purpose ML Package with particular focus on large scale learning; Kernel Methods; Interfaces to various languages
weka199707-2013javaGeneral Purpose ML Package
kernlab04-200411-2013RKernel Based Classification/Dimensionality Reduction
dlib200610-2013C++Portability; Correctness
nieme09-200603-2009C++Linear Regression; Ranking; Classification
orange06-200411-2013pythonVisual Data Analysis
java-ml08-200807-2012javaFeature Selection
pyML08-200409-2013C++; pythonKernel Methods
mlpy02-200803-2012pythonBasic Algorithms
pybrain10-200802-2013pythonReinforcement Learning
torch701-200211-2013C++;luaNeural Networks
scikit-learn200708-2013python; cythonGeneral Purpose with simple API and numpy / scipy idioms

shogun
weka
kernlab
dlib
nieme
orange
java-ml
pyML
mlpy
pybrain
torch3
scikit-learn
General FeaturesGraphical User Interfacecrosstickcrosstickticktickcrosscrosscrossticktickcross
One Class Classificationticktickticktickcrosscrosscrosstickcrosscrosscrosstick
Classificationticktickticktickticktickticktickticktickticktick
Multiclass classificationtickticktickcrosstickcrossticktickticktickticktick
Regressionticktickticktickticktickcrosstickcrosstickticktick
Structured Output Learningtickcrosscrosscrosstickcrosscrosscrosscrosscrosscrosscross
Pre-Processingtickticktickticktickticktickticktickcrossticktick
Built-in Model Selection Strategiesticktickticktickcrosstickticktickcrosscrosscrosstick
Visualizationcrosstickcrosscrossticktickcrosstickticktickticktick
Test Frameworkticktickcrossticktickuntestedtickcrosscrosscrosscrosstick
Large Scale Learningtickcrosscrossticktickcrosscrosscrosstickcrosscrosscross
Semi-supervised Learningcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscrosspartial
Multitask Learningtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
Domain Adaptationtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
Serializationticktickticktickticktickticktickticktickcrosstick
Parallelized Codeticktickcrosstickcrosscrosscrosscrosscrosscrosscrosstick
Performance Measures (auROC etc)ticktickcrosstickticktickticktickticktickticktick
Image Processingcrosscrosscrosstickcrosscrosscrosscrosscrosscrosscrosspartial
Supported Operating SystemsLinuxticktickticktickticktickticktickticktickticktick
Windowstickticktickticktickticktickcrossticktickticktick
Mac OSXtickticktickticktickticktickticktickcrossticktick
Other Unixtickticktickticktickticktickcrosstickcrossticktick
Language BindingsPythontickcrosscrosscrossticktickcrosstickticktickcrosstick
Rtickcrosstickcrosscrosscrosscrosscrosscrosscrosscrosscross
Matlabtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
Octavetickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
C/C++tickcrosscrossticktickcrosscrosscrosscrosscrosstickcross
Command Linetickcrosscrosscrosscrosscrosscrosscrosstickticktickcross
Javaticktickcrosscrosstickcrosstickcrosscrosscrosscrosscross
C#tickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
Luatickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
Rubytickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
SVM SolversSVMLightticktickcrosscrosscrosscrosscrosscrosscrosscrosscrosscross
LibSVMticktickticktickticktickticktickcrosstickcrosstick
SVM Ocastickcrosscrosstickcrosscrosscrosscrosscrosscrosscrosscross
LibLinearticktickcrosscrosscrosscrosscrosscrosscrosscrosscrosstick
BMRMtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
LaRanktickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
SVMPegasoscrosstickcrossticktickcrosscrosscrosscrosscrosscrosscross
SVM SGDtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosstick
othertickcrosstickcrosscrosscrosscrossticktickcrosstickcross
RegressionKernel Ridge Regressiontickcrosscrosscrosscrosscrosscrosstickcrosscrosscrosstick
Support Vector Regressiontickticktickcrosscrosscrosscrosstickcrosscrossticktick
Gaussian Processestickticktickcrosscrosscrosscrosscrosscrosscrosscrosstick
Relevance Vector Machinecrosstickticktickcrosscrosscrosscrosscrosscrosscrosscross
Multiple Kernel LearningMKLtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
q-norm MKLtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
multiclass MKLtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
ClassifiersNaive Bayesticktickcrosscrosscrosstickcrosscrosscrosstickticktick
Bayesian Networkscrosstickcrosstickcrosscrosscrosscrosscrosstickcrosscross
Multi Layer Perceptroncrosstickcrossticktickcrosscrosscrosscrossticktickcross
RBF Networkscrosstickcrosstickcrosscrosscrosscrosscrosstickcrosscross
Logistic Regressionticktickuntestedcrossticktickcrosscrosscrosscrosscrosstick
LASSOtickcrossuntestedcrosstickcrosscrosscrosscrosscrosscrosstick
Decision Treescrosstickcrosscrosscrossticktickcrosscrosscrosscrosspartial
k-NNticktickticktickcrosstickticktickticktickticktick
Gaussian Process Classificationtickticktickcrosscrosscrosstickcrosscrosscrosscrosstick
Linear ClassifiersLinear Programming Machinetickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
LDAtickcrosscrosscrosscrosscrosscrosscrosstickcrosscrosstick
DistributionsMarkov Chainstickcrosscrosscrosscrosscrosstickcrosscrosscrosscrosscross
Hidden Markov Modelstickcrosscrosscrosscrosscrosscrosscrosscrosscrossticktick
Dimension ReductionPCAticktickticktickcrosstickcrosscrosstickcrosscrosstick
Kernel PCAtickcrosstickcrosscrosscrosscrosscrosstickcrosscrosstick
Isomaptickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosstick
Multidimensional scalingtickcrosscrosscrosscrosstickcrosscrosscrosscrosscrosstick
Sammon mappingcrosscrosscrosstickcrosstickcrosscrosscrosscrosscrosscross
Locally Linear Embeddingtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosstick
Diffusion Maptickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosstick
Local Tangent Space Alignmenttickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosstick
Laplacian Eigenmapstickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosstick
Barnes-Hut t-SNEtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
Independent Component AnalysisFIXMEuntesteduntesteduntesteduntesteduntesteduntesteduntesteduntesteduntesteduntesteduntesteduntested
KernelsLinearticktickticktickticktickticktickticktickticktick
Gaussianticktickticktickcrosstickticktickticktickticktick
Polynomialticktickticktickcrosstickticktickticktickticktick
String Kernelstickticktickcrosscrosscrosscrosstickcrosscrosscrosscross
Sigmoid Kernelticktickcrosstickcrosstickcrosscrosscrosscrosscrosstick
Kernel Normalizertickuntestedtickcrosscrosscrosscrosstickcrosscrosscrossuntested
Feature SelectionForwardcrosstickcrossuntestedcrossticktickticktickcrosscrosstick
Wrapper methodscrosstickcrossuntestedcrossuntestedtickticktickcrosscrosscross
Recursive Feature Selectioncrosstickcrosstickcrossuntestedtickticktickcrosscrosstick
Missing FeaturesMean value imputationcrosstickcrosscrosscrossticktickcrosstickcrosscrosscross
EM-based/model based imputationcrosstickcrosscrosscrosstickcrosscrosscrosscrosscrosscross
ClusteringHierarchical Clusteringticktickcrosscrosscrosstickcrosscrosstickcrosscrosstick
k-meansticktickticktickcrosstickticktickticktickticktick
OptimizationBFGSticktickcrossticktickcrosscrosscrosscrosscrosscrosscross
conjugate gradientcrosscrosscrosstickcrosscrosscrosscrosscrosscrosscrosscross
gradient descenttickticktickcrosstickcrosscrosscrossticktickticktick
bindings to CPLEXtickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
bindings to Mosektickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
bindings to other solvertickcrosstickcrosscrosstickcrosstickcrosscrosscrosstick
Structural Output LearningLabel Sequence Learningtickcrossuntestedtickuntesteduntesteduntesteduntesteduntesteduntesteduntestedcross
Factor Graph Learningtickcrossuntestedtickuntesteduntesteduntesteduntesteduntesteduntesteduntestedcross
SO-SGDtickcrossuntestedcrossuntesteduntesteduntesteduntesteduntesteduntesteduntestedcross
Latent SO-SVMtickcrossuntestedcrossuntesteduntesteduntesteduntesteduntesteduntesteduntestedcross
Supported File FormatsBinaryticktickcrosscrosscrosscrosscrosscrosscrosstickcrosstick
Arffcrosstickcrosscrosscrosscrosstickcrosscrosscrosscrosscross
HDF5tickcrosstickcrosscrosscrosscrosscrosscrosscrosscrosscross
CSVtickticktickcrosscrossticktickticktickcrossticktick
libSVM/ SVMLight formatticktickcrossticktickcrosscrosstickcrosstickcrosstick
Excelcrosscrosstickcrosscrosstickcrosscrosscrosscrosscrosscross
Protobuftickcrosscrosscrosscrosscrosscrosscrosscrosscrosscrosscross
Supported Data TypesSparse Data Representationticktickcrosstickticktickticktickticktickcrosstick
Dense Matricesticktickticktickcrosstickticktickticktickticktick
Stringsticktickticktickcrosscrosscrosscrosscrosscrossticktick
Support for native (e.g. C) types (char, signed and unsigned int8, int16, int32, int64, float, double, long double)tickcrosscrosstickcrosscrosscrosscrosstickcrosscrosstick

What's New

Feb. 17, 2014 -> SHOGUN 3.2.0
Jan. 6, 2014 -> SHOGUN 3.1.1
Jan. 5, 2014 -> SHOGUN 3.1.0
Oct. 28, 2013 -> SHOGUN 3.0.0
March 17, 2013 -> SHOGUN 2.1.0
Sept. 1, 2012 -> SHOGUN 2.0.0
Dec. 1, 2011 -> SHOGUN 1.1.0