Senior Data Engineer
Zeta Global
full-remoteseniorpermanentbackenddata Full remote Yesterday via WTTJ
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
SQLData ModelingPythonSparkStreamingKafkaAirflowAWSData QualityAdTech
About the role
Role Overview
Join Zeta as a Senior Data Engineer to design, build, and operate high-scale data pipelines and aggregates for its AdTech platform. You’ll partner with backend, ML, and product teams to deliver reliable, well-modeled data with strong performance, quality, and observability.
Responsibilities
- Design, build, and operate batch and streaming data pipelines and aggregates
- Create data aggregates and marts to support platform use cases
- Define data modeling and data contracts
- Ensure data quality and reliability (tests, monitoring, lineage, SLAs/SLOs)
- Implement validation, anomaly detection, backfills, and reconciliation for completeness/correctness/timeliness
- Optimize performance and cost for large-scale processing
- Build orchestration and automation for data workloads
- Support measurement workflows (e.g., attribution, incrementality, lift, experimentation analytics)
Requirements
- Strong SQL and data modeling (dimensional modeling; star/snowflake; event modeling)
- Strong data quality/observability practices (monitoring, lineage, SLAs/SLOs)
- Proficiency in one or more data engineering languages: Python, Java, Scala, or Go
- Familiarity with lakehouse/table formats (e.g., Parquet, Iceberg/Hudi/Delta)
- Experience with ML feature stores or offline/online feature generation and/or training datasets
- Experience with real-time analytics stores (e.g., Druid, ClickHouse, Pinot) and high-cardinality aggregation
- Experience with data orchestration/workflow tools (e.g., Airflow, Dagster, Step Functions) and CI/CD for data workloads
- Experience with distributed processing at scale (Spark, Flink, or equivalent) and performance tuning
- Experience with SQL + NoSQL data stores and selecting the right store for the use case
- Hands-on streaming systems experience (e.g., Kafka preferred) and/or AWS Kinesis
- AdTech/programmatic advertising domain knowledge (DSP/SSP/exchange/RTB concepts)
- 5+ years building production data pipelines/data products (batch and/or streaming) in high-scale environments
- Clear communicator; able to translate needs into reliable data interfaces
- Experience with AWS data services and cloud-native patterns (S3, Glue/EMR, Athena, Redshift, etc.)
- Deep knowledge of data governance/privacy, including PII handling and consent-aware processing
Nice to Have
- Open-source contributions, publications, or conference speaking
- BS/MS in CS/Engineering or equivalent practical experience
About Zeta Global
Zeta Global is an AdTech platform that supports programmatic advertising through data, targeting, and analytics. The company builds high-scale data pipelines and products to power trusted, well-modeled data across advertising and downstream teams including ML and product.
Scraped 5/14/2026