Troubleshoot and resolve cloud issues
URN: TECDT90347
Business Sectors (Suites): IT(Networking)
Developed by: ODAG
Approved on:
2024
Overview
This standard is about troubleshooting and resolving cloud issues.
Cloud troubleshooting and issue resolution involves identifying, diagnosing and resolving issues that may arise in cloud-based systems. This includes maintaining the availability, performance, and reliability of cloud services by promptly addressing incidents and delivering effective resolutions.
This standard is for those who need to troubleshoot and resolve cloud issues.
Performance criteria
You must be able to:
- Monitor cloud resources and services to detect potential issues
- Evaluate cloud issues to determine severity and impact in line with organisational procedures
- Investigate cloud issues to diagnose the root cause
- Resolve or escalate cloud issues in line with organisational procedures
- Document incident and resolution details in line with organisational procedures
- Update cloud issue knowledge base in line with organisational procedures
- Communicate effectively with users, stakeholders and external cloud vendors during incident resolution
- Troubleshoot incidents to identify emerging trends and patterns to support continuous improvement
- Collaborate with cloud architects and engineers to evaluate cloud resource modifications to reduce incident occurrences in cloud environments
Knowledge and Understanding
You need to know and understand:
- Cloud platform products and services used to deliver cloud applications and resources
- How to use cloud monitoring and diagnostic tools to identify issues and incidents in cloud environments
- How to analyse logs and related data sources to inform problem diagnostics
- The steps involved in identifying root causes of cloud issues and how to apply them
- How to evaluate cloud issues and determine their severity and impact on organisational systems
- How to investigate could issues to determine whether to escalate or resolve them
- The main issues that can occur in cloud environments and how to resolve them
- The importance of maintaining a knowledge base of issues and resolutions to inform future support activities
- The importance of clear and concise communication during the resolution of cloud incidents
- How to document issues, incidents, troubleshooting steps and resolutions clearly
- The importance of continuous improvement to maintain the quality and availability of cloud resources
- How to troubleshoot incidents, recurring issues and trends as part of continual improvement strategies
- The key principles of cloud and problem resolution
- The importance of collaborating with cloud architects and engineers to resolve underlying issues in cloud infrastructure
Scope/range
Scope Performance
Scope Knowledge
Values
Behaviours
Skills
Glossary
Links To Other NOS
External Links
Version Number
1
Indicative Review Date
2027
Validity
Current
Status
Original
Originating Organisation
ODAG
Original URN
TECDT90347
Relevant Occupations
Information and Communication Technology Professionals
SOC Code
2133
Keywords
Cloud support, cloud engineer, cloud analyst, cloud troubleshooting