Senior Site Reliability Engineer (Remote)

Salary: Competitive Salary
Job Type: Full time
Experience: Senior Level

EPAM Systems

Senior Site Reliability Engineer | EPAM Systems | Argentina

We are in search of a committed Senior Site Reliability Engineer to improve the dependability and automation processes of our infrastructure.

The ideal candidate excels at resolving platform issues, is adept at automating development and deployment tasks, and has strong troubleshooting skills. Responsibilities include participating in sprint planning and story grooming, and engaging in technical discussions aimed at improving our application and deployment methods.

Responsibilities

  • Investigate and address issues across our platform
  • Independently develop, analyze, and improve deployment automation
  • Craft scripts that automate various tasks
  • Participate actively in sprint meetings and technical discussions
  • Monitor a production-grade APM tool, such as Datadog, and relay critical insights to the team
  • Oversee the collection and analysis of application logs
  • Respond to application and instance alerts related to site reliability
  • Engage in infrastructure architecture discussions during technical meetings
  • Maintain essential applications and libraries for the platform
  • Manage servers and methods for application code deployment
  • Guide and support other engineers
  • Conduct code reviews

Requirements

  • 3+ years of experience managing production application workloads in AWS
  • Understanding of public Cloud networks and VPC peering
  • Skills in AWS cloud services, including EC2, SNS/SQS, and RDS
  • Proficiency with container and orchestration technologies such as Docker, Kubernetes, and EKS
  • Background in managing technologies such as Elasticsearch, PostgreSQL, and Redis at scale
  • Proficiency in provisioning and configuration management using Terraform and Ansible
  • Competency in Linux or Windows Server administration
  • Knowledge of scripting languages such as Python, Groovy, PowerShell, or Ruby
  • Flexibility in integrating monitoring, logging, and alerting into development processes
  • Ability to troubleshoot complex issues in collaboration with peers
  • Ability to adapt swiftly to changing requirements and priorities
  • Fluent English communication skills at a B2+ level

Nice to have

  • Experience with monitoring tools like Datadog
  • Background in maintaining compliance with HIPAA and other regulatory standards

Technologies

  • Node.js/NestJS
  • React Native
  • Python/Django
  • PostgreSQL, Redis
  • CircleCI, Spinnaker, Expo
  • AWS
  • Datadog

We offer

  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

When applying, state that you found this job on Pangian.com Remote Network.