Data Engineer II
Design and build resilient, scalable data pipelines using Spark (PySpark), Hive/Impala, and SQL to move and transform data at scale. Orchestrate data workflows with Airflow or NiFi, ensuring reliable scheduling, SLAs, retries, and alerting. Ingest and integrate data from batch and streaming sources (files, APIs, RDBMS, NoSQL) and support bronze/silver/gold data modeling. Troubleshoot data quality issues, performance bottlenecks, and lineage problems, and modernize legacy pipelines. Collaborate with data scientists, analysts, and platform teams to enable self-service analytics and downstream consumption in SQL/NoSQL and BI tools. Operate in a cloud-enabled environment (Databricks) to deliver large-scale analytical capabilities across global business use cases.
Similar offers · 5
Save your favorite offers
Sign in to add this offer to your favorites.
