Principal Data Engineer [T500-**]

  • Hyderabad
  • Bristol Myers Squibb

Job Description:

Working with Us

Challenging. Meaningful. Life-changing. Those aren’t words that are usually associated with a job. But working at Bristol Myers Squibb is anything but usual. Here, uniquely interesting work happens every day, in every department. From optimizing a production line to the latest breakthroughs in cell therapy, this is work that transforms the lives of patients, and the careers of those who do it. You’ll get the chance to grow and thrive through opportunities uncommon in scale and scope, alongside high-achieving teams rich in diversity. Take your career farther than you thought possible.


Bristol Myers Squibb recognizes the importance of balance and flexibility in our work environment. We offer a wide variety of competitive benefits, services and programs that provide our employees with the resources to pursue their goals, both at work and in their personal lives. Read more: careers.bms.com/working-with-us.


Key Responsibilities:

  • The Data Engineer will be responsible for designing, building, maintaining, and evolving data products, and for using the data architecture best suited to our organization's data needs.
  • Serves as the Subject Matter Expert on Data & Analytics Solutions.
  • Accountable for delivering high-quality data products and analytics-ready data solutions.
  • Develop and maintain ETL/ELT pipelines for ingesting data from various sources into our data warehouse.
  • Develop and maintain data models to support our reporting and analysis needs.
  • Optimize data storage and retrieval to ensure efficient performance and scalability.
  • Collaborate with data architects, data analysts and data scientists to understand their data needs and ensure that the data infrastructure supports their requirements.
  • Ensure data quality and integrity through data validation and testing.
  • Implement and maintain security protocols to protect sensitive data.
  • Stay up to date with emerging trends and technologies in data engineering and analytics.
  • Closely partner with the Enterprise Data and Analytics Platform team, other functional data teams and Data Community lead to shape and adopt data and technology strategy.
  • Accountable for evaluating Data enhancements and initiatives, assessing capacity and prioritization along with onshore and vendor teams.
  • Knowledgeable about evolving trends in data platforms and product-based implementation.
  • Manage and provide guidance for the data engineers supporting projects, enhancements, and break/fix efforts.
  • Has an end-to-end ownership mindset in driving initiatives through to completion.
  • Comfortable working in a fast-paced environment with minimal oversight
  • Mentors and provides career guidance to other team members to help them unlock their full potential.
  • Prior experience working in an Agile/product-based environment.
  • Provides strategic feedback to vendors on service delivery and balances workload with vendor teams.


Qualifications & Experience

  • 10+ years of hands-on experience implementing and operating data capabilities and cutting-edge data solutions, preferably in a cloud environment. Breadth of experience is needed in technology capabilities that span the full life cycle of data management, including data lakehouses, master/reference data management, data quality, and analytics/AI/ML.
  • Ability to craft and architect data solutions and the automation pipelines needed to productionize them.
  • Hands-on experience developing and delivering data and ETL solutions with technologies such as AWS data services (Glue, Redshift, Athena, Lake Formation, etc.). Experience with Cloudera Data Platform or Tableau labs is a plus.
  • Create and maintain optimal data pipeline architecture; assemble large, complex data sets that meet functional and non-functional business requirements.
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
  • Strong programming skills in languages and frameworks such as Python, PySpark, R, PyTorch, Pandas, Scala, etc.
  • Experience with SQL and database technologies such as MySQL, PostgreSQL, Presto, etc.
  • Experience with cloud-based data technologies such as AWS, Azure, or GCP (preferably strong in AWS).
  • Strong analytical and problem-solving skills
  • Excellent communication and collaboration skills.
  • Functional knowledge of or prior experience in the Life Sciences Research and Development domain is a plus.
  • Experience and expertise in establishing agile and product-oriented teams that work effectively with teams in the US and at other global BMS sites.
  • Initiates challenging opportunities that build strong capabilities for self and team
  • Demonstrates a focus on improving processes, structures, and knowledge within the team. Leads in analyzing current states, delivers strong recommendations that account for the complexity of the environment, and executes to bring complex solutions to completion.
  • AWS Data Engineering/Analytics certification is a plus.