Build and implement data processing systems

URN: TECDT80841
Business Sectors (Suites): IT(Data Science)
Developed by: e-skills
Approved on: 30 Mar 2023

Overview

This standard is about building and implementing data processing systems.

Data engineers prepare data for analytical or operational uses. They are typically responsible for designing and building data pipelines to bring together information from different source systems. They integrate, consolidate, cleanse and structure data to make it easily accessible and in usable format. Processed data can then be used by business executives, data analysts and other end users to inform organisational processes and decision making.

Building and implementing data processing systems involves creating data storage to link data systems, and creating and deploying data pipelines and data processing infrastructure. This also includes implementing batch and stream processing solutions, building and running tests to confirm required operation and documenting implementations.

This standard is for those who need to build and implement data processing systems as part of their duties.


Performance criteria

You must be able to:

  1. Review data processing system designs to plan system development
  2. Plan data processing development activities in line with organisational procedures
  3. Build and debug data pipelines to process and deliver data to destination systems
  4. Integrate data processing systems with data storage systems and business processes
  5. Implement data security controls into data processing systems in line with organisational procedures

  6. Implement data flows and transformations to deliver data to users

  7. Configure data pipelines in line with organisational requirements

  8. Automate data processing to provide adaptive operation
  9. Build and run test suites to validate data pipeline functionality in line with organisational requirements
  10. Deploy data solutions into production environments to make them operational
  11. Manage commercial relationships with data engineering partners in line with organisational procedures
  12. Document data pipeline solutions in line with organisational standards

Knowledge and Understanding

You need to know and understand:

1. How data pipelines work 2. The steps involved in planning data pipeline development 3. Industry standard data pipeline architectures and how to apply them 4. How to build and test data pipelines and products to meet organisational needs 5. How to design, write and iterate code from prototype to production-ready for data processing solutions 6. Security requirements of data pipelines and how to address them 7. How to use and link to SQL (Structured Query Language) and NoSQL databases 8. Industry standard data pipeline engineering tools and how to apply them 9. The effective use of managed cloud services for data processing 10. How to expose data from data storage systems 11. How to link data from multiple systems and deliver batch and streaming services 12. The steps involved in data acquisition and import 13. How to automate data pipelines 14. The steps involved in testing data pipelines 15. How to configure data pipelines to extract, manipulate, transform, aggregate and abstract data

Scope/range


Scope Performance


Scope Knowledge


Values


Behaviours


Skills


Glossary


Links To Other NOS


External Links


Version Number

1

Indicative Review Date

30 Mar 2026

Validity

Current

Status

Original

Originating Organisation

ODAG Consultants Ltd.

Original URN

TECDT80841

Relevant Occupations

Information and Communication Technology Professionals

SOC Code

2134

Keywords

data engineering, data design, data processing, data cleansing