Site Reliability Engineer Immediate Joiner (Remote)

Other
Salary: $From ₹20,00,000 a year INR per Year
Job Type: Full time
Experience: Senior Level

Nanolytics Software Pvt. Ltd.

Site Reliability Engineer Immediate Joiner (Remote)

Site Reliability Engineer Immediate Joiner | Nanolytics SoftwarePvt. Ltd. | Worldwide

Senior Site Reliability Engineer is responsible for meaningfullycontributing and providing continuous feedback on site health, reliability,availability and user experience. This is a matrixed role where the SREwill work closely on a day-to-day basis with the product team whilereporting to the practice lead.

This role is expected to understand the product in depth, collect and...

Site Reliability Engineer Immediate Joiner | Nanolytics Software Pvt. Ltd. | Worldwide

Senior Site Reliability Engineer is responsible for meaningfully contributing and providing continuous feedback on site health, reliability, availability and user experience. This is a matrixed role where the SRE will work closely on a day-to-day basis with the product team while reporting to the practice lead.

This role is expected to understand the product in depth, collect and analyze meaningful measurements and provide feedback to the business, Software Engineering and Product teams. The SRE will work very closely with the key stakeholders to help drive changes to increase customer satisfaction, product availability, reliability, and the completion of strategic technical initiatives.

In addition to monitoring and integration with the observability platform, a heavy focus will be placed on automation opportunities and automating operational processes to maintain high availability of the product.

You will have to work from 7 PM to 4 AM IST and also on needed basis some weekend hours.

Technical

Infrastructure Management:

  • Design, build, and maintain scalable, resilient infrastructure using Azure cloud platforms .
  • Manage and optimize Kubernetes clusters, containers, and microservices.
  • Implement Infrastructure as Code (Iac) using tools like Terraform(Must). Advanced Terraform syntax, Ansible (syntax, tasks, playbook)
  • Monitoring Dynatrace, Azure App Insight, Prometheus, and Grafana: service catalog metrics and recording rules for alerts

Automation & CI/CD:

  • Maintain automated CI/CD pipelines to ensure rapid, safe, and reliable delivery of software.
  • Automate repetitive tasks, processes, and workflows to increase efficiency and reduce human error.
  • Implement and maintain monitoring, logging, and alerting systems to ensure visibility into system performance

*

  • Cost Optimization:
  • Set up monitoring and reporting tools to track cloud spending in real-time.
  • Regularly review the architecture and operations to identify areas where costs can be reduced. This includes evaluating new tools, services, or practices that could lead to further cost savings.
  • Collaborate with development teams to ensure that cost-efficient practices are followed in software design and deployment.
  • Recommend and manage the purchase of reserved instances, savings plans, or other discounts offered by cloud providers to reduce costs for long-term workloads.
  • Incident Response & Troubleshooting:
  • Respond to and resolve incidents in a timely manner, ensuring minimal downtime and impact on customers.
  • Perform root cause analysis and post-mortem reviews to prevent recurrence of issues.
  • Collaborate with development teams to improve system reliability through proactive issue identification and resolution.
  • Performance Optimization:
  • Monitor system performance and capacity and implement improvements to optimize efficiency and scalability.
  • Analyze and improve application performance, ensuring high availability and low latency.
  • Security & Compliance:
  • Ensure security best practices are followed across the infrastructure.
  • Implement security controls and monitoring to protect against vulnerabilities and threats.
  • Work with compliance teams to ensure systems adhere to regulatory requirements.
  • Collaboration & Communication:
  • Work closely with software engineers, product managers, platform team, Global Support and other stakeholders to ensure system reliability aligns with business goals.
  • Provide guidance and mentorship to other team members.
  • Document processes, procedures, and best practices for the broader team.”

Required Technical and Professional Expertise

  • Bachelor’s/Master’s degree in Computer Science, Engineering or another relevant field.
  • Prior experience of 5 to 8 years with Enterprise Backup/Storage solutions.

· Advanced knowledge of Azure cloud services.

  • Understanding of web hosting infrastructure and architecture in highly available environments
  • Working knowledge and experience C#, Javascript, and HTML
  • Familiarity with RESTful API and .Net Applications
  • Experience working with Dynatrace, Azure monitor, AppInsight, log analytics (highly Desirable)
  • Experience with scalable networking technologies, including Linux, software-defined networking, network virtualization, open protocols, App acceleration, Load Balancers, DNS, virtual private networks, and their application in PaaS and IaaS technologies
  • Good understanding of IT infrastructure and ITSS security standards which integrates or are used by backup or storage solution.
  • Highly motivated with a desire to improve processes and procedures that support the backup and storage infrastructure.
  • Develop and maintain a deep understanding of the risks and security vulnerabilities for backup/storage applications.
  • Awareness of penetration testing methods preferred.

Job Type: Full-time

Pay: From ₹2,000,000.00 per year

Benefits:

  • Leave encashment
  • Work from home

Schedule:

  • Evening shift
  • Monday to Friday
  • Night shift
  • Weekend availability

Application Question(s):

  • How many years of exp in How many years of experience do you have in Azure Iaas?
  • How many years of experience do you have in Terraform?
  • How many years of experience do you have in Site Reliability Engineering?
  • How many years of experience do you have in Docker?
  • What is your notice period
  • What is your current CTC in Lakhs per annum?
  • What is your expected CTC in Lakhs per annum?
  • How many years of experience do you have in Azure site backup and Disaster Receover?

Education:

  • Bachelor’s (Required)

Experience:

  • total work: 1 year (Preferred)

Language:

  • English (Required)

License/Certification:

  • Azure Certification (Preferred)

Location:

  • Remote (Preferred)

Shift availability:

  • Night Shift (Preferred)
  • Overnight Shift (Required)

Work Location: Remote

Tagged as: remote, remote job, virtual, Virtual Job, virtual position, Work at Home, work from home

Load more listings
When applying state you found this job on Pangian.com Remote Network.