Site Reliability Engineer

  • Pune
  • Ncs Group

Are you looking for value-adding and impactful work?

Do you want to make a difference with your expertise?

With us, you’ll be able to make it happen.

NCS is a leading technology services firm, operating across Asia Pacific in over 20 countries, providing services and solutions in consulting, digital services, technology, and more.

We believe in utilizing the power of technology to make extraordinary things happen and to create lasting impact and value for our people, communities, and partners. Our diverse 12,000-strong workforce has delivered a wealth of large-scale, mission-critical, and multi-platform projects for governments and enterprises in Singapore and the APAC region.

What we do

We drive our passion for harnessing technology.

We bring people and technology together.

We advance communities and transform industries.

We’re searching for an Integration Lead DevOps/DevOps Architect to be part of our diverse team of talent here at NCS!

If you believe in going above and beyond, want to exemplify the best, and wish to bring people and technology together like never before, then we would love to talk with you!


What Will You Do?

  • & supporting the next generation Cloud Application Runtime Platform.
  • development squads on Kubernetes, cloud engineering, cloud native best practices including configuration & observability
  • self-service operations, delivery pipelines, primarily GitOps based with kubernetes controllers
  • continuous improvement for the platform covering areas such as: capacity planning, observability, monitoring, reliability, and resiliency

Perform system maintenance, patching and upgrades.

  • repetitive tasks, optimize processes and perform thorough testing to ensure quality.
  • to and troubleshoot incidents , providing post-mortem analysis/ areas of improvement.



You Will Be a Great Fit If You Have:

Bachelor's degree (computer science or related field) with good experience working with contemporary technologies and scripting languages.

  • 3 years of experience in a SRE (Site Reliability Engineer) role , Infrastructure Engineering or Application support with DevOps.
  • 3 years of experience in one or more programming languages – Golang / Java / python and configuration management/ IAC tools – Ansible – Good to Have / Terraform / Puppet
  • experience in a Continuous Integration/Continuous Delivery (CI/CD) with hands-on working knowledge in Jira, Confluence, Gitlab
  • building and deploying cloud native applications on Kubernetes (EKS, AKS, OCP)
  • understanding of Linux, networking & distributed systems – Not mandatory
  • tools, ELK ( Not Admin) , Prometheus, AWS CloudWatch, Grafana, Jaeger, OpenTelemetry, etc. - Knowledge
  • support of systems, for example, on a L3 support rota, etc





Good to have:

Professional certifications in Java, Linux, Networks, AWS or Kubernetes a bonus

  • Argo stack; ArgoCD for GitOps, Events, Workflows, Rollouts, etc
  • Experience analysing Java heap / thread dumps
  • Data Engineering & Analysis, ETL pipelines, etc.
  • Cloud Engineering on Hybrid multi-clouds using Terraform, Crossplane, AWS ACK & CDK
  • Network Mgmt. tools such as, Apigee, Kong, Envoy, NGINX, Istio
  • Application Performance Monitoring tools, such as ELK, Dynatrace, AppDynamics, Elastic APM
  • JBoss, Weblogic, Tomcat, Redis, PostgreSQL, MongoDB
  • Streaming Platform like Apache Kafka
  • Knowledge on OLAP and OLTP database