Linux Operations Engineer

  • Hyderabad
  • Chubb
Linux Operations Engineer Position Summary:

Chubb is seeking an experienced Linux Operations Engineer to join our growing Distributed IT Operations team. The Linux Operations Engineer will provide support for on-premises and Azure-based Red Hat Enterprise Linux (RHEL) and SUSE Linux Enterprise Server (SLES) systems. The role requires a minimum of 5 years of directly related experience supporting Linux operations, specifically RHEL and/or SUSE. Using a robust understanding of Linux, systems, and management tools, this position will drive system and business service stability, availability, and resilience by working closely with counterparts in Infrastructure, Application Support, and Enterprise Engineering, engaging as needed to repair any service-impacting issues. Activities include, but are not limited to, support and maintenance, monitoring, alerting, availability, and recovery.

Primary Job Responsibilities:

  • Investigate and diagnose incidents and problems relating to Red Hat (RHEL) and SUSE (SLES) server operating system stacks.
  • Participate in a follow-the-sun operational model supporting and maintaining the Linux server environment for North and Latin America systems (expanding globally throughout the year).
  • Work closely with network, security, development, application, and support teams in the implementation of infrastructure components that support emerging technologies and applications.
  • Provide training and mentorship to junior team members.
  • Train team members in best practices and act as subject matter expert and escalation contact for infrastructure-related issues.
  • Automate operational, monitoring, and integrity verification processes (e.g., runbooks) for hardware, server, and system resources and processes.
  • Proactively ensure the highest levels of systems and infrastructure availability.
  • Perform daily system monitoring, verifying the integrity and availability of systems and key processes and reviewing system and application logs.
  • Create and maintain system documentation for Linux infrastructure technologies, including installation, configuration, and appropriate troubleshooting steps.
  • Collaborate with other technology leads and support teams to ensure integrated end-to-end availability, reliability, and performance.
  • Improve existing processes through automation solutions to recurring problems and enhancements to existing solutions or documentation.
  • Provide on-call and after-hours support to address incidents, maintain infrastructure, and support operational efforts.
  • Identify and drive resolution of monitoring and alerting gaps.
  • Work across multiple projects, provide best practice advice, and contribute to technical tasks.
  • Solve problems relating to mission-critical services and create automation to prevent problem recurrence.
  • Assist in the development and execution of disaster recovery plans.
  • Participate in change, incident, and problem management.

Knowledge, Skills and Competencies:

  • Extensive and recent experience supporting virtualized Red Hat Enterprise Linux Server operating systems and technologies in a global multi-data center enterprise organization.
  • Advanced knowledge and experience with Linux server operating systems (Red Hat Enterprise Linux 8.x, 7.x, SUSE Linux Enterprise Server 15).
  • Advanced knowledge and experience with configuration management tools (Ansible, etc.).
  • Advanced knowledge and experience developing Bash scripts (other languages commonly used: Python, JavaScript, etc.).
  • Working knowledge of Red Hat Satellite Server systems management.
  • Working knowledge of networking principles including routing, switching, firewalls, load balancing, and VLANs.
  • Working knowledge and experience with Windows server operating systems (Windows 2019, 2022) is a plus.
  • Working knowledge and experience with IBM AIX is a plus.
  • Working knowledge and experience with virtualization hypervisors (VMware vSphere 7.x, 8.x, Azure).
  • Working knowledge and experience with containerization (VMware Tanzu, Docker, Kubernetes, Azure Kubernetes Service) is a plus.
  • Excellent problem solving and analytical thinking skills; ability to influence change and drive results.
  • Work within defined change control processes and procedures.
  • Ability to manage multiple projects in a dynamic development environment; demonstrated project delivery required.
  • Strong ability to identify, understand, and communicate business needs and application architectures for technical projects.
  • Excellent communication and collaboration skills; ability to effectively communicate across all levels is required.
  • Working knowledge of common information security concepts and practices.
  • Self-driven with the ability to manage workload without direct supervision.
  • Ability to collaborate with different technology towers to achieve common goals.
  • Ability to manage multiple strands of work simultaneously.
  • Excellent verbal and written communication skills.