Key Qualifications
Incident Management: Experience in managing incidents, including response strategies and postmortem analysis, is critical for maintaining system reliability.
Troubleshooting: The ability to diagnose and resolve issues quickly is a key trait for any Incident Manager.
Networking Knowledge: A solid grasp of networking concepts helps in diagnosing issues and understanding how systems communicate.
Programming Skills: Proficiency in programming languages (such as .NET or Java) is important for review ideas / solution - automation and developing tools.
Monitoring and Observability: Skills in using monitoring tools (like App D, Azure App insights, EAGLE, Grafana) to track system performance and detect anomalies are essential.
Security Awareness: Understanding security best practices helps ensure that reliability solutions do not compromise system security.
Collaboration and Communication: Strong interpersonal skills are necessary for working effectively with development, Network, Firewall and Release operations teams.
Windows / Linux/Unix Proficiency: Understanding on Windows, Linux or Unix systems is fundamental, as many applications run on these platforms.
Cloud Computing: Familiarity with cloud services (like AWS, Azure, or Google Cloud) is crucial, given the prevalence of cloud-based architectures.
CI/CD Practices: Understanding Continuous Integration and Continuous Deployment (CI/CD) pipelines is vital for managing software releases and ensuring reliability.
Capacity Planning: Skills in forecasting system needs and scaling resources accordingly are important for maintaining performance.
Must have: Incident Manager having experience in handling Production Major Incident calls, RCA problem management.
Responsibilities
Incident Management: Experience in managing incidents, including response strategies and postmortem analysis, is critical for maintaining system reliability.
Troubleshooting: The ability to diagnose and resolve issues quickly is a key trait for any Incident Manager.
Networking Knowledge: A solid grasp of networking concepts helps in diagnosing issues and understanding how systems communicate.
Programming Skills: Proficiency in programming languages (such as .NET or Java) is important for review ideas / solution - automation and developing tools.
Monitoring and Observability: Skills in using monitoring tools (like App D, Azure App insights, EAGLE, Grafana) to track system performance and detect anomalies are essential.
Security Awareness: Understanding security best practices helps ensure that reliability solutions do not compromise system security.
Collaboration and Communication: Strong interpersonal skills are necessary for working effectively with development, Network, Firewall and Release operations teams.
Windows / Linux/Unix Proficiency: Understanding on Windows, Linux or Unix systems is fundamental, as many applications run on these platforms.
Cloud Computing: Familiarity with cloud services (like AWS, Azure, or Google Cloud) is crucial, given the prevalence of cloud-based architectures.
CI/CD Practices: Understanding Continuous Integration and Continuous Deployment (CI/CD) pipelines is vital for managing software releases and ensuring reliability.
Capacity Planning: Skills in forecasting system needs and scaling resources accordingly are important for maintaining performance.
About Cognizant:
Cognizant (Nasdaq: CTSH) engineers' modern businesses. We help our clients modernize technology, reimagine processes and transform experiences so they can stay ahead in our fast-changing world. Together, we're improving everyday life. See how atwww.cognizant.comor @cognizant.#J-18808-Ljbffr