Back

Develop and implement machine learning algorithms

URN: TECIS805401

Business Sectors (Suites): IT(Data Science)

Developed by: e-skills

Approved on: 2020

Download as PDF Download as Word

Overview

This standard identifies the competences you need to develop machine learning algorithms. It includes the different approaches to machine learning and their implementation in accordance with approved procedures. Machine learning algorithms are used in a wide variety of applications where it is difficult or infeasible to develop a conventional algorithm to perform the task. This will involve the practical use of software tools for machine learning algorithm development. You will understand how to identify and select tasks suitable for machine learning and formulate machine learning problems in order to address them. Your underpinning knowledge will enable you to develop and test machine learning algorithms. You will be required to select and apply machine learning algorithms to build models for prediction, classification or clustering. You will undertake the process of training and validation in order to develop machine learning solutions. You will be able to assess the performance of a developed model and identify the role of training and test datasets in this process. You will understand the process of training and validating machine learning models. The standard will introduce the concepts of error and bias in model development and their importance in evaluating model performance. This activity can be increasingly found in any sector or organisation. This activity is likely to be undertaken by people working as Machine Learning Specialists or Machine Learning Engineers.

Performance criteria

You must be able to:

prepare datasets from multiple databases and other sources to input into machine learning models
capture, organise and prioritise requirements to describe organisational needs
evaluate datasets to identify quality issues to determine and document an approach to addressing them
translate business and technical requirements into machine learning problems to plan and develop solutions
conduct data cleaning of noisy, incomplete or data with established data quality issues using approved tools and techniques
select and develop data sets, algorithms and modelling techniques required to solve organisational data problems
create analytical models to produce machine learning solutions
evaluate and validate machine learning models to ensure no bias is introduced
apply best-practice techniques for output model testing and tuning to assess accuracy, fit, validity and robustness
design and implement dashboard and automated reporting systems to deliver updates on model performance
develop strategies for model improvement as well as improvements to data and retraining
create and disseminate reports, presentations and other documentation that provides storytelling and description of model development to confirm stakeholder approval for handover to implementation

Knowledge and Understanding

You need to know and understand:

the stages of the machine learning lifecycle and how to apply them
the characteristics of different machine learning methods and models including; supervised learning; unsupervised learning; text mining, reinforcement learning, ensemble learning; predictive modelling; classification models; regression models and clustering models
a wide range of statistical methods and best-practice modelling techniques and how to apply them
the required data cleaning techniques used to improve data quality
the dataset preparation activities that are required in the machine learning process including data collection, formatting, reduction, decomposition and rescaling
how to select and apply machine learning algorithms for classification, regression and clustering using existing libraries
the required machine learning procedures for text data
the steps involved in machine learning output model validation and how to apply them
the variables and features that impact model performance to test and validate output model performance
the factors that impact model validation such as the size of the data set and how it is segmented
the differences between structured and unstructured data
the required training and testing steps for data sets to produce accurate models
how to evaluate machine learning model performance
the tools, systems and procedures for developing machine learning models
the techniques for identifying and reducing bias in datasets and how to apply them

Develop and implement machine learning algorithms

Overview

Performance criteria

Knowledge and Understanding

Scope/range

Scope Performance

Scope Knowledge

Values

Behaviours

Skills

Glossary

Links To Other NOS

External Links

Version Number

Indicative Review Date

Validity

Status

Originating Organisation

Original URN

Relevant Occupations

SOC Code

Keywords