Data Engineer
Cohere
hybridseniorpermanentbackenddata San Francisco, CA 7 days ago via LinkedIn
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
Data EngineeringPythonSQLApache BeamApache SparkApache FlinkDistributed Data ProcessingKubernetesBigQuerydbt
About the role
Role Overview
Join Cohere’s Analytics & Data Insights team as a Data Engineer. You’ll build foundational data and storage infrastructure that supports product launches and enterprise experiences around AI capabilities.
Responsibilities
- Work on storage infrastructure and data pipelines powering AI-driven product launches and customer experiences
- Run end-to-end implementations and drive initiatives to measurable outcomes
- Collaborate daily with researchers and engineers to deliver production-grade data processing
- Partner across research, marketing, sales, and finance to inform growth strategy with data recommendations
Requirements
- 5+ years building production-grade data processing systems
- Strong command of Python and SQL
- Experience with distributed data processing frameworks such as Apache Beam, Spark, or Flink
- Ability to transform unstructured data into performant datasets across diverse storage backends including S3, GCS, and POSIX
Nice to Have
- Experience with modern orchestration platforms, especially Kubernetes
- Familiarity with analytics tooling such as BigQuery, Airflow, or dbt
- Knowledge of Java or Golang
Additional Signals
- Strong curiosity and excitement about AI research, with willingness to operate at the edge of what’s known
- Enjoys hands-on problem solving and building new systems rather than only optimizing existing ones
Perks (Highlights)
- Full-time benefits including health/dental and mental health budget
- 100% parental leave top-up (up to 6 months)
- Remote-flexible work with offices in multiple cities and coworking stipend
- 6 weeks vacation and weekly lunch stipend
About Cohere
Cohere builds and deploys frontier AI models for developers and enterprises, enabling applications like content generation, semantic search, RAG, and agents. The company brings together researchers, engineers, and designers to scale intelligence and drive real customer value through advanced AI infrastructure.
Scraped 4/22/2026