Centene
Senior Site Reliability Engineer (Remote)
Senior Site Reliability Engineer | Centene | UnitedStates
You could be the one who changes everything for our 28 million membersby using technology to improve health outcomes around the world. As adiversified, national organization, Centene’s technologyprofessionals have access to competitive benefits including a freshperspective on workplace flexibility....
Senior Site Reliability Engineer | Centene | United States
You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. As a diversified, national organization, Centene’s technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility.
Position Purpose:
Helps lead projects that are focused on managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools, manual and automated procedures, documentation, people and processes and continuous delivery(CI/CD) tools, processes, and designs. Develops complex services to automate monitoring activities and provide critical information to facilitate response and resolution of performance and availability issues and incidents. Understands and advocates for standardized and scalable software tools to ensure that systems operate without interruption at optimum performance and leads project teams through out the deployment process. Troubleshoots and analyzes service disruptions to determine the root cause of issues and develop solutions for improved reliability.
- Troubleshoots and resolves more complex problems with systems and services and initiates regular deployment of new versions of the systems and their subcomponents.
- Ensues system design meets functional, quality, and security standards.
- Develop and maintain monitoring and alerting dashboards using our instrumentation tools (Dynatrace & Splunk).
- Review our overall architecture and make/recommend changes in the code and in the infrastructure to improve the reliability and performance of our sites.
- Advocates & collaborates on best practices with other reliability engineers, developers and architects.
- Prepares and performs Disaster Resiliency and Capacity activities.
- Eliminates toil.
- Maintain service level indicators / KPI’s (latency, errors, traffic, saturation).
- Lead major incident management calls.
- Facilitate blameless post-mortems.
- Leads more complex projects focused on building and maintaining observability/monitoring for the application, monitoring key performance indicators, maintaining alerting, and continuously improving visibility.
- Helps make decisions around periodic system validation and testing, service monitoring, and standing up new services/tools.
- Uses knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization.
- Identifies and implements necessary manual and automated procedures for improved collaborative response in real-time.
- Leads lower level Engineers in stress, security, and performance testing.
- Resolves issues that come up through support escalation.
- Keeps documentation and runbooks up to date to effectively deal with new incidents that might arise.
- Leads post incident reviews and documents findings for future informed decision making.
- Reviews proposals to optimize Software Development Life Cycle (SDLC) to boost service reliability and makes decisions around which proposals should move forward.
- Communicates complex topics with development teams to investigate and document issues and leads internal team to develop solutions to mitigate them.
- Performs other duties as assigned.
- Complies with all policies and standards.
Education/Experience:
A Bachelor’s degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science).
Requires 4 – 6 years of related experience.
Or equivalent experience acquired through accomplishments of applicable knowledge, duties, scope and skill reflective of the level of this position.
Technical Skills:
- One or more of the following skills are desired:
- HPUX
- MicroFocus Cobol
- Oracle
- Snowflake
- Centrify
- Pager Duty
- Apache Kafka
- Docker
- Windows 2016 / 2019
- Gitlab
- Snyk
- Ansible
- .NET Framework
- AXWAY
- Gremlins
- Dynatrace
- Splunk
- AWS
Soft Skills:
- Intermediate – Seeks to acquire knowledge in area of specialty
- Intermediate – Ability to identify basic problems and procedural irregularities, collect data, establish facts, and draw valid conclusions
- Intermediate – Ability to work independently
- Intermediate – Demonstrated analytical skills
- Intermediate – Demonstrated project management skills
- Intermediate – Demonstrates a high level of accuracy, even under pressure
- Intermediate – Demonstrates excellent judgment and decision making skills
Pay Range: $83,600.00 – $155,000.00 per year
Centene offers a comprehensive benefits package including: competitive pay, health insurance, 401K and stock purchase plans, tuition reimbursement, paid time off plus holidays, and a flexible approach to work with remote, hybrid, field or office work schedules. Actual pay will be adjusted based on an individual’s skills, experience, education, and other job-related factors permitted by law. Total compensation may also include additional forms of incentives.
Centene is an equal opportunity employer that is committed to diversity, and values the ways in which we are different. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or other characteristic protected by applicable law.
Related Jobs
See more All Other Remote Jobs-
NewSave
-
NewSave
-
NewSave
-
NewSave
-
NewSave
-
NewSave
-
NewSave
-
NewSave
-
NewSave