Data Engineer - Big Data GCP

  • Gurugram
  • Treyas Infotech And Consulting Pvt Ltd
JOB DESCRIPTION : As Manager, Data Engineering, you will be responsible for translating client requirements into design, architecting, and implementing GCP Cloud-based big data solutions for clients. Your role will be focused on delivering high-quality solutions by independently driving design discussions related to Data Ingestion, Transformation & Consumption, Data Storage and Computation Frameworks, Performance Optimizations, Infrastructure, Automation & Cloud Computing, and Data Governance & Security. The role requires a hands-on technologist with expertise in Big Data solution architecture and with a strong programming background in Java / Scala / Python. Your Impact: Provide technical leadership and hands-on implementation role in the areas of data engineering including data ingestion, data access, modeling, data processing, visualization, design, and implementation. Lead a team to deliver high quality big data technologies-based solutions on GCP Cloud. Manage functional & nonfunctional scope and quality. Help establish standard data practices like governance and address other non-functional issues like data security, privacy, and quality. Manage and provide technical leadership to a data program implementation based on the requirement using agile technologies. Participate in workshops with clients and align client stakeholders to optimal solutions. Consulting, Soft Skills, Thought Leadership, Mentorship etc. People management, contributing to hiring and capability building. QUALIFICATIONS Overall 8+ years of IT experience with 3+ years in Data related technologies, and expertise of 1+ years in data-related GCP Cloud services and delivered at least 1 project as an architect. Mandatory to have knowledge of Big Data Architecture Patterns and experience in the delivery of end-to-end Big Data solutions on GCP Cloud. Expert in programming languages like Java/ Scala and good to have Python Expert in at least one distributed data processing framework: Spark (Core, Streaming, SQL), Storm or Flank. Expert in Hadoop eco-system with GCP cloud distribution and worked at least on one or more big data ingestion tools (Sqoop, Flume, NiFI, etc), distributed messaging and ingestion frameworks (Kafka, Pulsar, Pub/Sub, etc.) and good to know traditional tools like Informatica, Talend, etc. Should have worked on any NoSQL solutions like Mongo DB, Cassandra, HBase, etc, or Cloud-based NoSQL offerings like DynamoDB, Big Table, etc. Good Exposure in development with CI / CD pipelines. Knowledge of containerization, orchestration, and Kubernetes engine would be an added advantage. Set Yourself Apart With: Certification on GCP cloud platform or big data technologies. Strong analytical and problem-solving skills. Excellent understanding of data technologies landscape/ecosystem.