Site Reliability Engineer

  • Pune
  • Creon Private Limited

Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. “Tomorrow’s ideas, built today!” In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and methodologies to different clients and industries.


Job Title: Senior Site Reliability Engineer (SRE) - Banking Domain

Location: Pune (Hybrid)


Industry Focus: Banking (Priority), Fintech (Product Development background also considered)


Key Responsibilities:

  • Lead troubleshooting, analysis, and resolution of unexpected system behaviors that impact the quality of service, ensuring minimal disruption to operations.
  • Gather and analyze metrics to assist in performance tuning and fault finding, optimizing system performance and reliability.
  • Provide a proactive approach to our clients’ workloads by anticipating failures, automating tasks, ensuring availability, and delivering outstanding customer experiences.
  • Communicate issue/resolution status effectively to project teams and management through written reports and verbal updates.
  • Provide reactive break-fix support to address urgent issues and minimize downtime.
  • Collaborate closely with cross-functional teams to design and implement reliable, scalable, and secure solutions aligned with business objectives.
  • Mentor junior team members, sharing knowledge and best practices to foster continuous learning and development.


Required Skills and Experience:

  • Bachelor’s degree in Computer Science, Engineering, or related field.
  • Proven experience in the banking domain, with a focus on wealth management.
  • Hands-on experience with SRE tools and technologies.
  • Proficiency in CI/CD pipelines and infrastructure-as-code (IaaC) concepts, including Terraform, YAML, and Liquibase.
  • Expertise in Azure infrastructure services such as Service Bus, Databricks, Function Apps, and Logic Apps.
  • Strong grasp of monitoring and observability tools such as Grafana, Prometheus, KQL, and Application Insights for AKS.
  • Excellent problem-solving skills with the ability to analyze complex issues and develop effective solutions.
  • Exceptional communication skills, both written and verbal, with the ability to convey technical concepts to non-technical stakeholders.
  • Proven track record of delivering high-quality results in a fast-paced, dynamic environment.
  • Strong commitment to continuous improvement, with a proactive mindset towards automation and optimization.