As an SRE Engineer, you will be responsible for the Activate and Production Infrastructure. Your essential duties encompass ensuring the seamless operation and optimal performance of large-scale distributed software applications. Your role revolves around maintaining a robust and high-performing environment, contributing to the reliability of our services, and innovating solutions to guarantee 24/7 availability. By leveraging your technical expertise and dedication, you contribute to maintaining a seamless experience for our users while upholding the highest standards of operational excellence. Your specific responsibilities include:

Role and Responsibilities:

1. Monitoring and Alerting

a. Review existing and set up new monitoring tools and systems as needed to track system performance, key metrics.

2. Incident Management

a. monitor the alerts and logs to promptly identify incidents or anomalies.

b. Prioritize incidents based on severity and potential impact on stability and reliability.

c. Engage in effective incident resolution, applying necessary fixes and mitigations to restore normal operations.

3. On-Call Responsibilities

a. Organize on-call schedules to ensure 24/7 coverage for incident response.

b.Respond to alerts, troubleshoot issues, and coordinate with NOC and Engineering teams for incident resolution.

c. Conduct post-incident reviews to identify root causes, learn from incidents, and implement preventive measures.

4. Automation and Tooling

a.Review pre-existing and build new automation scripts and tools as needed to streamline repetitive tasks, enhance efficiency, and reduce manual errors.

b.Regularly update and maintain tools used for monitoring, deployment, and incident management to align with evolving needs.

5. Performance Optimization

a. Analyze application performance using profiling and monitoring tools to identify bottlenecks and areas for improvement.

b. Work on optimizations, infrastructure upgrades, and architectural improvements to enhance system performance and efficiency.

6. Capacity Planning and Scaling

a. Monitor resource utilization and trends to predict capacity needs and plan for scaling.

b. Scale resources, such as servers and databases, are based on usage patterns and anticipated growth to maintain performance and reliability. Also, automate the entire sizing process.

7. Disaster Recovery and Redundancy

a. Develop and maintain disaster recovery plans and procedures to ensure business continuity in case of failures or disasters.

b. Implement redundancy and failover strategies to minimize downtime and maintain service availability during failures.

8. Knowledge Sharing and Documentation

a. Create and maintain comprehensive documentation for configurations, procedures, incidents, and best practices.

b. Foster a culture of knowledge sharing within the team, conducting regular knowledge-sharing sessions and training programs.

9. Feedback Loop and Continuous Improvement

a. Collect feedback from incidents, post-mortems, and NOC/Dev team interactions to identify areas for improvement.

b. Continuously iterate on processes, tools, and systems based on feedback and lessons learned to drive continuous improvement.

10. Collaboration and Communication

a. Collaborate closely with Engineering and DC/NOC teams to align goals and priorities.

b. Ensure open and transparent communication within the team and with stakeholders, providing regular updates on incidents, progress, and initiatives.

Required Skills and Qualifications

Bachelor's degree in computer science or related disciplines
Total 3+ years' experience in software application/product support
Ability to program using programming languages like Go, Scripting languages like Shell or Python
Good to have prior experience in technical engineering
A proactive approach to identify the problems, performance bottlenecks, and areas of improvement
Must know, Networking, Database (MySQL) and Linux System concepts, Debugging and analyzing the core dumps
Hands-on experience with monitoring and observability tools like Grafana, Nagios, Influx, ELK, etc.
Familiarity with orchestration tools like Docker and Grafana and incident management systems like Zenduty
Excellent communication and collaboration skills, with the ability to work effectively across teams.
Self-motivated and positive mindset to examine any incidents

Apply now

Other job offers that may interest you

Cae engineer

PuneExpert Global Group2 days ago

.. appropriateness of test methods. Prepare technical CAE reports of results. Desired Skills Excellent understanding of solid mechanics and structural finite element analysis (FEA) Hands on Experience with Altair Hyperworks/ Hypermesh, Optinstruct & Radioss / Nastran CAE software Ansys Workbench CAE tool .. read more

Lead generation executive

PuneHummingbird Web Solutions Private LimitedToday

.. strategies. Qualifications and Skills: Bachelor's degree in Business Administration, Marketing, or related field. Proven experience in lead generation, within .. based on performance. Target Geographies - USA, Australia, Europe, Middle East (you will focus on one or two zones) Salary Range - 5L - 7 L fixed monthly .. read more

Asst manager - marketing

PuneSapphire Foods India Limited2 days ago

.. Ø MBA with a specialization in Marketing Ø Minimum 3yrs of experience into Retail Marketing Ø Willing to travel extensively (20- 22 days travel). Interested .. and regional sales team. Responsible for the New product launch marketing strategy for the region Manage Corporate Sales for the region. Explore opportunities .. read more

Quality assurance engineer

PuneTruein2 days ago

.. and control to the time & attendance process. We leverage Face recognition and AI technologies. We are backed by Investors and a high-potential team of 30 people and growing. Our Culture: At Truein, we genuinely care about every member we hire. You’ll learn new things regardless of your experience .. read more

Bigdata sre (on contract)

PunePubmaticYesterday

.. organizations provide, such as stock options, paternity/maternity leave, Healthcare insurance, broadband reimbursement. As well, when we’re back in the .. & data warehouses would be preferred. Hands-on software development for infrastructure that will perform at scale. Design and maintain automation tools .. read more

Associate vice president - stpl credit risk

PunePoonawalla FincorpToday

.. up-to-date with the latest regulatory guidelines and ensure the credit policy adheres to all relevant legal and regulatory requirements set forth by relevant authorities. e) Cross-functional Collaboration: Collaborate with various departments, including Finance, Operations, IT, Data Analytics, Business .. read more

Senior technical lead

PuneAvalaraYesterday

.. and support our highly critical Identity and Access Management platform. A successful candidate will be a well-rounded software development professional with a proven track record of delivering SaaS at scale in an Agile environment. As a Senior Technical Lead, you will join a team of seasoned engineers .. read more

Full stack engineer

PuneTech MahindraYesterday

.. net core for PAN India but looking only out of job or serving notice period and going to end in 10-15 days only. PFB the details EXP-5+ years Location- Anywhere in India Required Skills-.Net, C#, .Net core and react js Working days-5 days working Interested candidates please share their resume at **@techmahindra.com .. read more

Successfactors ec consultant

Pune_voisToday

.. Employee Central workshops for collecting business requirements and updating configurations workbooks Provide expert functional inputs to SuccessFactors Integrations team. Experience in BIB mapping and other SAP configuration required for Integrations preferred. Expertise in SuccessFactors API’s and .. read more

Quality engineer

PuneNcs Group2 days ago

.. harnessing technology. We bring people and technology together. We advance communities and transform industries. We’re searching for a Quality Engineering – Automation and Functional Engineer to be part of our diverse team of talent here at NCS! If you believe in going above and beyond, want to exemplify .. read more

Sme mechanical

PuneNxtra By AirtelToday

.. of multiple HVAC projects, Understanding & conceptualizing the HVAC of the projects as per application, Preparing/checking Headload calculation, Ventilation Calculation, Pump head calculation, Preparation of airflow diagram, Duct sizing, ESP Calculation, Zone Pressurization, Preparation of HVAC BOQs .. read more

Opentext

PuneLtimindtreeToday

.. : Immediately to 20 Days Job Description: Installation & Configuration, Administration of OpenText Extended ECM for Engineering 22.x • Experience in OpenText .. • Configuring OT Brava Viewer for Content Markup, Redaction, Brava Administration • SSO configuration using OTDS, SAP Authorization for OTCS business .. read more

Civil engineer

PuneS And J Buildcon Pvt. Ltd.Today

.. with AutoCAD and be able to utilize strong Site Handling skills in order to create visual aids. By utilizing strong organizational and communication skills, this candidate will also have the ability to execute a project based on the criteria outlined. Responsibilities Work closely with project managers .. read more

Sme electrical

PuneNxtra By Airtel2 days ago

.. outs. 6) Perform engineering duties in planning, designing, and overseeing Construction and maintenance of building structures, and facilities. 7) Teamwork, .. requirements by directing or coordinating installation, manufacturing, Construction, maintenance, documentation, support or testing activities. h) Coordination .. read more

Senior associate - mlops

PuneAxtria Ingenious InsightsToday

.. the product commercialization journey to drive sales growth and improve Healthcare outcomes for patients. We are seeking high energy, driven and innovative ML .. an eco-system for continuous learning & development. Write white papers, collaborate with academia and participate in relevant forums to continuously upgrade .. read more

Network risk and compliance analyst

PuneCaci LtdYesterday

.. past activity and reports Centralize compliance responses/data to improve audit response time and create consistent responses across teams Interact with Auditors and Regulators as needed Develop and conduct ongoing risk and compliance training and education Role Requirements: Bachelor’s degree in Computer .. read more

Mep bim modeler

PuneBimit India2 days ago

.. to supporting the AEC industry & delivering cutting-edge solutions in the Construction industry. With a focus on excellence and sustainability, we are seeking a .. conflicts between MEP systems and other building components. Generate Construction drawings, shop drawings, and other documentation from BIM models to support .. read more

Social studies teacher

Pimpri-ChinchwadVibgyor Group Of SchoolsToday

.. so as to best utilize the available time. Time management. B. STUDENT ADMINISTRATION The Teacher should ensure that the student growth and achievement is .. and guidelines and utilize the worksheets, materials, teaching aids and methods that contribute to a climate where students are actively engaged in meaningful .. read more

Interior designer

PuneLivceYesterday

.. Coordinating with internal & external agencies. Holds sales expertise in Sales Closure by way of logical & trusted Sales pitch, Driving and leading the Design meetings with the customers Holds behavioral attributes of Result oriented, Team work, Integrity & Ethics, Crisp & meaningful communication. Holds .. read more

Test engineer

PuneSynechronToday

.. culture, and Synechron is proud to be an equal opportunity workplace and an affirmative-action employer. Our Diversity, Equity, and Inclusion (DEI) initiative ‘Same Difference’ is committed to fostering an inclusive culture – promoting equality, diversity and an environment that is respectful to .. read more

Project executive

PuneLivce2 days ago

.. a weekly schedule and align vendors for resource allocation. Quality monitoring as per standards and specifications. Identify issues and de-bottleneck for smooth project execution. Coordinate and manage stakeholders. 100% adherence to internal Livspace processes. Keeping records for all site work. Project .. read more

Hcm fusion functional lead

PuneOracleToday

.. client facing role Bachelor of Engineering or master’s degree in business Administration (MBA) with 5 to 10 years of total experience in EBS or Cloud Application .. technical experience like data model of HCM Schemas, plsql. Experience in the preparation of Functional documents e.g. Requirement Gathering, Solution .. read more

Assistant manager- commercial property accounting

PuneWnsToday

.. preparation of annual audit requirements including completion of year-end files and CAM Statements. Responsible for the completion of quarterly reforecast and annual financial budgets. Ensure all Internal Control Policies and Procedures are followed, including the documentation and completion of proper .. read more

Dell boomi

PuneJade GlobalToday

.. and deployment experience (atoms, molecules, and atmosphere development). Hands on experience in EDI (X12 and EDIFACT), XSLT, XSD, Flat File (CSV, EXCEL, FIXED, JSON, XML), Web Services (REST or SOAP). Demonstrate expertise in Rosettanet integration standards and best practices. Design, develop, and .. read more

Windows specialist

PuneTeksystems2 days ago

.. 3 - 5 years Hyper-V - Strong knowledge and experience required for Installation , Configuration and end to end management of Hypervisor and VM running on Hyper-V. Troubleshooting skills and problem-solving ability required for Windows OS, Windows Cluster and Hyper-V. Good experience required on SCVMM .. read more

Quality assurance specialist

PuneNumeric TechnologiesToday

.. QA - 8-12 yrs experience Jr QA - 4-7 yrs experience WFO - Pune Location Shift timing - 3:00 Pm -12 AM JD Strong backend candidates having good experience in Backend, Restassured, Strong in Java Good experience in cucumber, docker, Jenkins Immediate joiners only Certifications are added advantage mail .. read more

Senior aws devops engineer

PuneIntelliasToday

.. of just selling boring banking products. We achieve this in a socially responsible way, driven by innovation and backed by a leading financial institution who are in it for the long run. Requirements: 5+ years of experience working as a DevOps Good working knowledge of Cloud platforms, services and principles .. read more

Teamcenter business analyst

PuneLarsen E Toubro2 days ago

.. Teamcenter product, how it is used and how the business operates more widely Having the analytical skills to analyze/synthesize large amounts of data and other business processes to form ideas and participate in solutioning Skilled in research – comfortable diving deeper into subject matter, translating .. read more

Appian lead

PunePersistent SystemsToday

.. personal accident, and Mediclaim hospitalization for self, spouse, two children, and parents Our company fosters a values-driven and people-centric work environment that enables our employees to Accelerate growth, both professionally and personally Impact the world in powerful, positive ways, using the .. read more

Hrbp

PuneZeno HealthYesterday

.. buys medicines. We are building India’s largest chain of generic medicine Retail stores with the aim of providing affordable Healthcare to millions. We .. - Mumbai, Pune, Surat - by March ** Zeno Health is now foraying into deeper Healthcare needs of our customers and building products and channels to serve them .. read more

Site Reliability Engineer (Activate)

Other job offers that may interest you