![](https://crypto4nerd.com/wp-content/uploads/2024/03/1EZMrOglV6CweC8yNG5Exzg-1024x427.png)
Machine Learning (ML) is the science (and art) of developing algorithms, executed as computer programs, so that the machine can learn from data and use what it has learned to provide meaningful outputs to its users. For example, an automated program that distinguishes spam from non-spam e-mails is an ML application — actually one of the first ML applications that became mainstream around the globe in the 1990s.
Classification problems are supervised learning tasks in which the instances in a dataset are mapped to pre-defined classes. The ML model learns from the training dataset so that it can correctly predict the class of newly introduced data. In this context, classes are also called labels, targets, or categories.
To introduce and implement ML algorithms that are commonly used for classification problems, the MNIST dataset is used in the following code examples. This dataset consists of 70,000 small images of digits handwritten by high school students and employees of the US Census Bureau. The label of each image is the digit it represents.
Let’s start by getting the MNIST dataset and preparing it for use in ML algorithms.
Scikit-Learn provides many helper functions to download popular datasets including the MNIST. The following code fetches the MNIST dataset from OpenML.org:
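A minimal version of that fetch, matching the description below (the variable names `X` and `y` are this sketch’s choice):

```python
from sklearn.datasets import fetch_openml

# Fetch MNIST from OpenML.org; as_frame=False returns NumPy arrays
# instead of a pandas DataFrame/Series.
mnist = fetch_openml("mnist_784", as_frame=False)
X, y = mnist.data, mnist.target

print(X.shape)  # (70000, 784)
print(y.shape)  # (70000,)  -- labels are strings, e.g. '5'
```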
In its default setting, the fetch_openml function returns the features as a pandas DataFrame and the labels as a pandas Series (unless the dataset is sparse). However, DataFrames are not ideal for the MNIST dataset, since it contains images. This is why as_frame is set to False in the code above, so that the data is retrieved as NumPy arrays.
There are 70,000 images in this dataset, and each image has 784 features: each image is 28 × 28 pixels, and each feature simply represents one pixel’s intensity, from 0 (white) to 255 (black).
The code below visualizes one digit from the dataset by reshaping the instance’s feature vector into a 28 × 28 array and displaying it with Matplotlib’s imshow() function. cmap="binary" gives a grayscale color map where 0 corresponds to white and 255 to black.
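A runnable sketch of that visualization (it re-fetches the data so it stands on its own; the first instance is used here):

```python
import matplotlib.pyplot as plt
from sklearn.datasets import fetch_openml

mnist = fetch_openml("mnist_784", as_frame=False)
X, y = mnist.data, mnist.target

# Reshape the 784-long feature vector into a 28 x 28 image.
some_digit = X[0]
some_digit_image = some_digit.reshape(28, 28)

plt.imshow(some_digit_image, cmap="binary")
plt.axis("off")
plt.show()

print(y[0])  # the label of the first instance, stored as a string
```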
This looks like a 5, and indeed that’s what the label indicates.
Below are some more examples from the MNIST dataset:
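A quick way to produce such a gallery (a sketch; the 5 × 5 grid of the first 25 instances is this example’s choice):

```python
import matplotlib.pyplot as plt
from sklearn.datasets import fetch_openml

mnist = fetch_openml("mnist_784", as_frame=False)
X, y = mnist.data, mnist.target

# Plot the first 25 digits in a 5 x 5 grid, with labels as titles.
fig, axes = plt.subplots(5, 5, figsize=(6, 6))
for ax, image, label in zip(axes.flat, X[:25], y[:25]):
    ax.imshow(image.reshape(28, 28), cmap="binary")
    ax.set_title(label, fontsize=8)
    ax.axis("off")
plt.tight_layout()
plt.show()
```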
One more step should be performed before implementing the ML classifiers: splitting the dataset into training and test (and sometimes validation) sets, so that performance can be measured both on the instances used to construct the model and on instances that are brand new to it.
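For MNIST, a common convention (assumed here) is that the data as fetched is already arranged so that the first 60,000 images serve as the training set and the last 10,000 as the test set:

```python
from sklearn.datasets import fetch_openml

mnist = fetch_openml("mnist_784", as_frame=False)
X, y = mnist.data, mnist.target

# Conventional MNIST split: first 60,000 for training, last 10,000 for test.
X_train, X_test = X[:60000], X[60000:]
y_train, y_test = y[:60000], y[60000:]

print(X_train.shape, X_test.shape)  # (60000, 784) (10000, 784)
```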
Now, the stage belongs to the ML algorithms to present their skills.
But wait, how do we compare these ML algorithms?!
To measure and compare the performance of different ML models on classification problems, there are several commonly used measures, namely accuracy, precision, recall, and F1 score. The details of these metrics are better left to another article, so that the focus of this one remains on the implementation of different ML models in Python. For now, the following code snippet is used to measure and compare the ML models:
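One plausible implementation of such a performance helper, matching the output format used throughout this article (average="weighted" for the multiclass metrics, and the returned dictionary, are this sketch’s assumptions):

```python
from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, f1_score)

def performance(y_train, y_train_pred, y_test, y_test_pred):
    """Print accuracy, precision, recall and F1 score for both datasets."""
    results = {}
    for name, y_true, y_pred in (("training", y_train, y_train_pred),
                                 ("test", y_test, y_test_pred)):
        scores = {
            "accuracy": accuracy_score(y_true, y_pred),
            # 'weighted' averaging combines the per-class scores,
            # weighted by each class's share of the instances.
            "precision": precision_score(y_true, y_pred, average="weighted"),
            "recall": recall_score(y_true, y_pred, average="weighted"),
            "f1 score": f1_score(y_true, y_pred, average="weighted"),
        }
        for metric, value in scores.items():
            print(f"{metric} on the {name} dataset: {round(value, 2)}")
        results[name] = scores
    return results
```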
Now, the stage really belongs to the ML algorithms!
Six ML algorithms commonly used in classification problems are Logistic Regression, Decision Tree, Random Forest, Gaussian Naive Bayes, Stochastic Gradient Descent, and Support Vector Machine. The same steps are followed in the implementation of each:
- import the related library from Scikit-Learn
- create a classifier of the ML algorithm
- train the classifier on the training dataset (remember that fit should never be called on any dataset other than the training set)
- use the trained classifier with the training dataset to obtain predictions for the training dataset
- use the trained classifier with the test dataset to obtain predictions for the test dataset
- call the function named performance that is defined above to obtain the performance of the ML model on the training and test datasets
(1) Logistic Regression
The idea behind Logistic Regression is to estimate the probability of an event occurring based on a given dataset. It uses a transformation function (the sigmoid, or logistic, function), which maps any real-valued number to a value between 0 and 1, to calculate that probability. In classification problems, the output of this transformation is interpreted as the probability of an instance belonging to a certain class.
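As a minimal runnable sketch of the steps listed earlier with a LogisticRegression classifier — it uses scikit-learn’s small built-in 8 × 8 digits dataset as a quick stand-in for MNIST, so it runs in seconds, and its numbers will differ from the full-dataset figures reported below:

```python
from sklearn.datasets import load_digits
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Small built-in digits dataset as a quick stand-in for MNIST.
X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

log_clf = LogisticRegression(max_iter=5000)  # raise max_iter so lbfgs converges
log_clf.fit(X_train, y_train)                # fit only on the training data

train_acc = accuracy_score(y_train, log_clf.predict(X_train))
test_acc = accuracy_score(y_test, log_clf.predict(X_test))
print(f"accuracy on the training dataset: {train_acc:.2f}")
print(f"accuracy on the test dataset: {test_acc:.2f}")
```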
The LogisticRegression classifier has the following performance measures:
accuracy on the training dataset: 0.94
precision on the training dataset: 0.94
recall on the training dataset: 0.94
f1 score on the training dataset: 0.94
accuracy on the test dataset: 0.92
precision on the test dataset: 0.92
recall on the test dataset: 0.92
f1 score on the test dataset: 0.92
(2) Decision Tree
The Decision Tree algorithm constructs a hierarchical structure in which decisions based on feature values recursively split the data into subsets. A prediction is then made by traversing the tree from the root (its starting point) to a leaf (the end point of a path through the tree).
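The same pattern applies with a DecisionTreeClassifier; again this sketch uses the small built-in digits dataset for speed, so its numbers will differ from the full-MNIST figures below:

```python
from sklearn.datasets import load_digits
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# An unconstrained tree grows until it fits the training data (nearly) perfectly.
tree_clf = DecisionTreeClassifier(random_state=42)
tree_clf.fit(X_train, y_train)

train_acc = accuracy_score(y_train, tree_clf.predict(X_train))
test_acc = accuracy_score(y_test, tree_clf.predict(X_test))
print(f"accuracy on the training dataset: {train_acc:.2f}")
print(f"accuracy on the test dataset: {test_acc:.2f}")
```

Note the gap between the two accuracies: like the full-MNIST results below, the tree overfits the training data.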
The DecisionTree classifier has the following performance measures:
accuracy on the training dataset: 1.0
precision on the training dataset: 1.0
recall on the training dataset: 1.0
f1 score on the training dataset: 1.0
accuracy on the test dataset: 0.87
precision on the test dataset: 0.87
recall on the test dataset: 0.87
f1 score on the test dataset: 0.87
(3) Random Forest
The Random Forest algorithm is an ensemble extension of Decision Trees, designed to overcome their main limitation: a single tree cannot be grown to arbitrary complexity without losing generalization accuracy on unseen data, and capping its complexity usually means suboptimal accuracy. Random Forest relies on stochastic modeling: many trees are constructed on random subsets of the data and features, and the combined capacity of the ensemble can be expanded to increase accuracy on both the training and test datasets.
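A sketch with RandomForestClassifier, which aggregates the predictions of many randomized trees (small digits dataset again, as a fast stand-in for MNIST):

```python
from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# 100 trees by default, each trained on a bootstrap sample of the data.
forest_clf = RandomForestClassifier(random_state=42)
forest_clf.fit(X_train, y_train)

train_acc = accuracy_score(y_train, forest_clf.predict(X_train))
test_acc = accuracy_score(y_test, forest_clf.predict(X_test))
print(f"accuracy on the training dataset: {train_acc:.2f}")
print(f"accuracy on the test dataset: {test_acc:.2f}")
```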
The RandomForest classifier has the following performance measures:
accuracy on the training dataset: 1.0
precision on the training dataset: 1.0
recall on the training dataset: 1.0
f1 score on the training dataset: 1.0
accuracy on the test dataset: 0.97
precision on the test dataset: 0.97
recall on the test dataset: 0.97
f1 score on the test dataset: 0.97
(4) Gaussian Naive Bayes
The Gaussian Naive Bayes algorithm belongs to the family of Naive Bayes algorithms, which use Bayes’ theorem as their foundation. This family relies on the “naive” assumption that the features in the dataset are conditionally independent given the class label. Despite this assumption, Naive Bayes algorithms are observed to perform very well in a variety of real-world scenarios.
The Gaussian variant additionally assumes that, given the class label, each feature follows a Gaussian (normal) distribution (and the features are still assumed to be conditionally independent).
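A sketch with GaussianNB on the small built-in digits dataset (a stand-in for MNIST); note that fit here simply estimates the per-class mean and variance of each feature:

```python
from sklearn.datasets import load_digits
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# No hyperparameters to tune: fit estimates Gaussian parameters per class.
gnb_clf = GaussianNB()
gnb_clf.fit(X_train, y_train)

train_acc = accuracy_score(y_train, gnb_clf.predict(X_train))
test_acc = accuracy_score(y_test, gnb_clf.predict(X_test))
print(f"accuracy on the training dataset: {train_acc:.2f}")
print(f"accuracy on the test dataset: {test_acc:.2f}")
```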
The GaussianNB classifier has the following performance measures:
accuracy on the training dataset: 0.55
precision on the training dataset: 0.68
recall on the training dataset: 0.55
f1 score on the training dataset: 0.51
accuracy on the test dataset: 0.55
precision on the test dataset: 0.68
recall on the test dataset: 0.55
f1 score on the test dataset: 0.51
(5) Stochastic Gradient Descent
The Stochastic Gradient Descent (SGD) algorithm is a variant of Gradient Descent, an optimization algorithm that iteratively searches for the optimal solution of an objective function. In ML, the goal of Gradient Descent is to identify the model parameters that minimize the loss function on the training dataset.
In SGD, instead of using the entire dataset in each iteration, a single randomly selected training example is used to calculate the gradient and update the model parameters. This random selection introduces randomness into the optimization process, which is why the algorithm is called “stochastic” Gradient Descent.
One of the main advantages of SGD is the computational efficiency it provides, especially when dealing with large datasets.
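A sketch with SGDClassifier, which by default trains a linear SVM-style model (hinge loss) with SGD updates (small digits dataset again, as a fast stand-in for MNIST):

```python
from sklearn.datasets import load_digits
from sklearn.linear_model import SGDClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# random_state fixes the shuffling of examples between epochs.
sgd_clf = SGDClassifier(random_state=42)
sgd_clf.fit(X_train, y_train)

train_acc = accuracy_score(y_train, sgd_clf.predict(X_train))
test_acc = accuracy_score(y_test, sgd_clf.predict(X_test))
print(f"accuracy on the training dataset: {train_acc:.2f}")
print(f"accuracy on the test dataset: {test_acc:.2f}")
```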
The SGD classifier has the following performance measures:
accuracy on the training dataset: 0.88
precision on the training dataset: 0.89
recall on the training dataset: 0.88
f1 score on the training dataset: 0.88
accuracy on the test dataset: 0.87
precision on the test dataset: 0.88
recall on the test dataset: 0.87
f1 score on the test dataset: 0.87
(6) Support Vector Machines
The Support Vector Machine (SVM) algorithm tries to find the optimal hyperplane in an N-dimensional feature space (where N is the number of features) that separates the data points of the different classes. The optimal hyperplane is the one that maximizes the margin between the closest points of different classes (the support vectors). The dimension of the hyperplane depends on the number of features: with two input features the hyperplane is just a line, with three it becomes a 2-D plane, and beyond three it becomes difficult to visualize.
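A sketch with SVC, scikit-learn’s kernelized SVM (RBF kernel by default), once more on the small built-in digits dataset as a fast stand-in for MNIST:

```python
from sklearn.datasets import load_digits
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# SVC with the default RBF kernel; multiclass is handled one-vs-one internally.
svm_clf = SVC(random_state=42)
svm_clf.fit(X_train, y_train)

train_acc = accuracy_score(y_train, svm_clf.predict(X_train))
test_acc = accuracy_score(y_test, svm_clf.predict(X_test))
print(f"accuracy on the training dataset: {train_acc:.2f}")
print(f"accuracy on the test dataset: {test_acc:.2f}")
```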
The SVC classifier has the following performance measures:
accuracy on the training dataset: 0.99
precision on the training dataset: 0.99
recall on the training dataset: 0.99
f1 score on the training dataset: 0.99
accuracy on the test dataset: 0.98
precision on the test dataset: 0.98
recall on the test dataset: 0.98
f1 score on the test dataset: 0.98
Based on these results on both the training and test datasets, the Random Forest and Support Vector Machine classifiers outperform the other ML algorithms implemented here.