By Alan Lehane, Developer
In this blog, I give a simple overview of decision trees: how they work and how they are created.
The Machine Learning Blog Series has so far covered the Logistic Regression and Linear Regression machine learning algorithms. Both of these are linear algorithms; in this article I am going to introduce an example of a non-linear algorithm: decision trees.
Above is an example of a simple decision tree. Each internal node represents an input variable (x) and a split point on that variable.
Each leaf represents an output variable (y), which is used to make a prediction.
The tree above takes an animal as its input variable. Depending on the features of that animal, a branch is selected, and this process is repeated until a leaf is reached. A minimal sketch of this branch-until-leaf process is shown below.
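To make this concrete, here is a small Python sketch of how a prediction traverses such a tree. The tree structure, the feature names (has_feathers, lives_in_water) and the predict function are all illustrative assumptions of mine, not code from the series:

```python
def predict(node, example):
    """Follow branches until a leaf is reached, then return its label."""
    # A leaf node carries the output variable (y) directly.
    if "label" in node:
        return node["label"]
    # An internal node tests one input variable (x) and selects a branch.
    branch = "yes" if example[node["feature"]] else "no"
    return predict(node[branch], example)

# A hypothetical tree: internal nodes ask yes/no questions about features.
tree = {
    "feature": "has_feathers",
    "yes": {"label": "bird"},
    "no": {
        "feature": "lives_in_water",
        "yes": {"label": "fish"},
        "no": {"label": "mammal"},
    },
}

print(predict(tree, {"has_feathers": False, "lives_in_water": True}))  # fish
```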
A tree can be derived from training data using recursive binary splitting. Different splits are tried and scored with a cost function: all input variables and all possible split points are evaluated in a greedy manner, and the best (lowest-cost) split is chosen.
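As a sketch of that greedy search (my own illustration, assuming a numeric dataset), the following evaluates every variable and every candidate split point and returns the lowest-cost split, using the sum of squared errors as the cost:

```python
import numpy as np

def best_split(X, y):
    """Greedily evaluate every variable and every candidate split point,
    returning the one with the lowest cost (sum of squared errors)."""
    best = (None, None, float("inf"))  # (variable index, split value, cost)
    for j in range(X.shape[1]):              # all input variables
        for value in np.unique(X[:, j]):     # all possible split points
            left = y[X[:, j] < value]
            right = y[X[:, j] >= value]
            if len(left) == 0 or len(right) == 0:
                continue
            # Cost: squared error of each group against its mean prediction.
            cost = (((left - left.mean()) ** 2).sum()
                    + ((right - right.mean()) ** 2).sum())
            if cost < best[2]:
                best = (j, value, cost)
    return best

X = np.array([[1.0], [2.0], [3.0], [4.0]])
y = np.array([1.0, 1.2, 3.8, 4.0])
print(best_split(X, y))  # best split on variable 0 at value 3.0
```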
The most common cost function is the mean squared error. For example, a tree is created with random splits, a supervised training set is fed into the tree, and the output (y) is returned for each example.
The squared error is calculated for each member of the training set:
(training set output – y)²
The average squared error across the entire training set is then calculated. Another random tree is created and the process repeats. After n trees have been created, the best tree, i.e. the tree with the smallest average squared error, is returned.
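That procedure can be sketched in a few lines. Here average_squared_error reuses the predict function from the first sketch (so leaf labels are assumed numeric), and make_random_tree is a hypothetical helper standing in for whatever builds a tree with random splits:

```python
def average_squared_error(tree, training_set):
    """Mean of (training set output - y)^2, where y is the tree's prediction.
    training_set is a list of (features, target) pairs; leaf labels are
    assumed numeric here."""
    errors = [(target - predict(tree, features)) ** 2
              for features, target in training_set]
    return sum(errors) / len(errors)

def best_of_n_random_trees(training_set, n, make_random_tree):
    """Build n candidate trees and keep the one with the smallest error.
    make_random_tree is a hypothetical helper that returns a tree built
    with random splits, as in the procedure described above."""
    best_tree, best_error = None, float("inf")
    for _ in range(n):
        candidate = make_random_tree(training_set)
        error = average_squared_error(candidate, training_set)
        if error < best_error:
            best_tree, best_error = candidate, error
    return best_tree
```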
Pruning a decision tree refers to removing nodes to improve the performance and readability of the tree.
The quickest and simplest way to prune a tree is to work through each node and determine the effect of removing it: remove the node, rerun the training set through the tree, and if the error has decreased or stayed the same, leave the node out; if the error has increased, keep the node. A sketch of this pass follows.
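Here is a rough sketch of that pruning pass, reusing predict and average_squared_error from the earlier snippets. For simplicity the candidate leaf predicts the overall mean of the training outputs; a fuller implementation would use the mean of only the examples that actually reach the pruned node:

```python
def prune(node, training_set, full_tree=None):
    """Try collapsing each internal node into a leaf, keeping the change
    only if the whole tree's error does not increase. Works bottom-up."""
    full_tree = full_tree or node
    if "label" in node:              # leaves cannot be pruned further
        return node
    # Prune the subtrees first so the pass works bottom-up.
    node["yes"] = prune(node["yes"], training_set, full_tree)
    node["no"] = prune(node["no"], training_set, full_tree)
    error_before = average_squared_error(full_tree, training_set)
    saved = dict(node)               # remember the internal node
    targets = [t for _, t in training_set]
    node.clear()
    node["label"] = sum(targets) / len(targets)  # crude candidate leaf
    if average_squared_error(full_tree, training_set) > error_before:
        node.clear()
        node.update(saved)           # error increased: restore the node
    return node
```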
This blog is a continuation of the Technical Machine Learning Series and introduced the idea of non-linear machine learning algorithms. In it, I gave a simple overview of decision trees: how they work and how they are created.
Alan Lehane, Software Developer
Alan has worked with Aspira/emagine for several years as a Software Developer, specialising in Data Analytics and Machine Learning. He has provided various services to Aspira's clients, including Software Development, Test Automation, Data Analysis and Machine Learning.
Ask us how we can help you succeed.