Portainer.io
Platform Engineer Kubernetes Contractor Role Cst Tz (Remote)
Platform Engineer Kubernetes Contractor Role Cst Tz | Portainer.io | Argentina
We are seeking a highly skilled and experienced Platform Engineer to join our remote team. The ideal candidate will have extensive experience in Kubernetes/Swarm administration, troubleshooting across all components, infrastructure, observability, and platform engineering. This role involves managing large-scale Kubernetes environments, implementing and maintaining the platform, and ensuring its reliability and scalability. You will also be part of an on-call rotation to handle critical incidents.
The role includes (but may not be limited to) the following functions:
– Kubernetes Management:
- Manage and optimise large-scale Kubernetes clusters.
- Perform version updates, configuration changes, and troubleshoot issues.
- Assist with and maintain container orchestration using Kubernetes.
– Platform Engineering Services:
- Maintain and expand the platform solution to meet SLA/OLA requirements.
- Perform platform moves/adds/changes and monitor core platform metrics.
- Manage load across components and ensure normal operating parameters.
- Implement component updates for defect resolution and preventive maintenance.
– Operational Onboarding:
- Create and maintain documentation for service levels, roles, and responsibilities.
- Conduct platform reviews and tooling deployments.
– DevOps and SRE:
- Aid in the use of GitOps pipelines and assist in application deployment strategies.
- Provide guidance on namespace, cluster, access control, and isolation best practices.
- Implement blue/green deployment strategies and assist with performance issues.
– Automation and DR Planning:
- Develop automations for preventive maintenance and operational efficiency.
- Create and validate cluster recovery guides to ensure infrastructure recoverability.
– Emergency Support:
- Provide 24/7 emergency engineering support with a 1-hour response SLA.
- Analyse alerts and perform root cause analysis to prevent recurrence.
This section sets out the previous experience, technical abilities, and professional qualifications required to perform the role.
Experience:
- 6 years of total experience in IT and platform engineering.
- 4 years managing Kubernetes environments.
- Experience with Docker Swarm is an advantage.
- Experience in operations, virtualisation, cloud infrastructure (AWS, Azure, GCP), and DevOps practices.
- Familiarity with ITIL-based practices for incident management and service requests.
Technical skills:
- Expertise in Kubernetes, Docker, and container orchestration tools.
- Experience with monitoring and logging tools (Prometheus, Grafana, Loki, etc.).
- Proficient in scripting and automation (Python, Bash, Terraform, Ansible).
- Knowledge of CI/CD pipelines and GitOps practices.
- Knowledge of Virtualisation Technologies (VMware).
Soft Skills:
- Excellent problem-solving and troubleshooting skills.
- Strong communication and documentation skills.
- Ability to explain technical concepts to non-technical stakeholders.
- Willingness to learn and adapt to new technologies and methodologies.
- Flexible and adaptable to changing requirements and priorities.
- Ability to work independently and as part of a remote team.
- Ability to work effectively with cross-functional teams, including developers, operations, and security teams.
- Cultural awareness and sensitivity to cultural differences when managing international partnerships.
Additional information:
- This role requires participation in an on-call rotation to respond to critical incidents.
- Candidates must be able to work primarily within the CST time zone with some flexibility for other time zones.