Job Description
Job Description
Looking for a professional to design, build and maintain the platform that supports hundreds of development environments. The position involves working directly with internal infrastructure and tools to automate the creation, orchestration and management of these environments, ensuring efficiency, scalability and reliability.
Responsibilities:
Design, build and maintain the control plane that manages development environments.
Develop backend tools and services that automate the creation, orchestration and management of environments.
Produce and deliver infrastructure as a platform, including databases, caches, object storage and queuing systems.
Scale and tune the cluster to handle varying workloads, including Karpenter/VPA/KEDA policies, sizing, pod density, packaging, and scheduling strategies for diverse loads.
Constantly monitor and optimize costs, considering all aspects of computing, storage and data transfer.
Manage network configuration to replicate the production environment, optimizing the large-scale development experience, including traffic routing, interception, ingress tuning, and asset delivery.
Implement and evolve observability within development environments.
Collaborate with the development environment SRE team to ensure daily availability and reliability of the environments.
Work with Deel engineering to improve the development experience and transform daily needs into a self-service platform.
Requirements:
More than 8 years of experience in Software Engineering, Infrastructure or Platform Engineering.
Backend engineering experience: API design, Postgres, Kafka/Nats.
Knowledge of Node.js, Go or Python.
Experience with AWS, GCP or Azure.
Advanced experience with Kubernetes, including creating tools, controllers, or operators that extend its capabilities.
Experience with standard Kubernetes tools:
Networks (ingress controllers, CoreDNS, external-dns, AWS LBC, oauth2-proxy).
Secrets management (Vault, External Secrets).
Cluster autoscaling and resource tuning (Karpenter, VPA, goldilocks, KEDA).
Storage provisioning
Experience with Helm charts and GitOps.
Experience in managing and maintaining large Kubernetes clusters (1000 nodes, 30k+ pods).
Communication and collaboration skills with various teams.
Salary to receive
To agree