NOC Engineer
The NOC Engineer is responsible for monitoring the organization’s network, compute, and storage infrastructure to ensure continuous uptime and optimal performance. This role involves real-time monitoring, troubleshooting, and escalation of issues related to network, compute, and storage systems. The NOC Engineer collaborates with the Network and Infrastructure Team to maintain and optimize infrastructure performance.
Key Responsibilities:
Monitor Network, Compute, and Storage Operations:
Continuously monitor the performance and stability of the organization’s network, compute, and storage infrastructure using monitoring tools, ensuring minimal downtime and optimal performance.Incident Management:
Respond to and troubleshoot incidents related to network, compute, and storage systems, identifying root causes and resolving low- to medium-severity issues as per the incident handling framework. Escalate critical incidents when necessary.Collaborate with the Network and Infrastructure Team:
Work closely with the Network and Infrastructure Team to escalate and resolve ongoing issues, contributing to long-term optimization strategies across all infrastructure components.Documentation and Reporting:
Maintain detailed logs and documentation of incidents, network status, and performance metrics for network, compute, and storage systems. Provide regular updates and reports to senior management.Escalation and Communication:
Submit incident reports and escalate critical incidents related to network, compute, and storage systems to senior engineers or management, ensuring all stakeholders are informed of infrastructure status and issues.
Job Requirements:
- Diploma or degree in network engineering, information technology, computer science, or a related field.
- Experience working in a NOC or similar real-time monitoring and operations role, with more than 2 years preferred.
- Strong understanding of network fundamentals.
- Basic knowledge of compute infrastructure, including virtualization platforms.
- Proficient with network monitoring tools like SolarWinds, Zabbix.
- Ability to troubleshoot network, compute, and storage systems and escalate critical incidents when necessary.
- Strong communication skills for effective incident reporting and collaboration.
- Ability to work in a 24/7 operational environment, including on-call rotations.
- Proficiency in written and verbal communication (English).
- Proficiency in Office 365 suite.