Data Engineer
Clearsense
full-remotemidpermanentbackenddata United States 47 days ago via LinkedIn
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
PythonSQLAWSETL/ELTData EngineeringHealthcare DataFHIRHL7C-CDAData Quality
About the role
Role Overview
Data Engineer (mid-level, 100% remote, full-time) supporting large-scale one-time data ingestions and migrations into the Clearsense platform. The focus is on finite, high-impact projects—building extraction, transformation, validation, and delivery for clean, validated, production-ready healthcare datasets.
Responsibilities
- Data ingestion & transformation
- Perform one-time extractions from healthcare sources (files, APIs, databases, EHR replicas)
- Build and optimize ETL/ELT workflows using Python and SQL
- Transform complex clinical/operational data into structured datasets for reporting/analytics
- Healthcare data mapping (critical)
- Work with healthcare modalities such as HL7, FHIR, and C-CDA
- Normalize and map data across systems while preserving clinical meaning
- Reconcile local code sets/structures into standardized formats
- Data quality & validation
- Implement validation checks (e.g., row counts, reconciliation, integrity, completeness)
- Ensure strict acceptance criteria and reporting accuracy
- Produce clear data quality and reconciliation reports
- Performance & delivery
- Support large-scale batch processing and historical backfills
- Optimize performance (partitioning, parallelism, memory usage)
- Build reliable, restartable ingestion workflows
- Collaboration
- Partner with Integration Engineers, Product, and QA on mapping logic, validation rules, and output requirements
- Troubleshoot issues across ingestion workflows
Requirements
- 3–5 years of experience in Data Engineering / ETL / Data Integration
- Strong SQL and Python
- Hands-on experience building data pipelines or batch workflows
- Experience with AWS (e.g., S3, Glue, Athena) or similar cloud environment
- Required healthcare data experience with at least one modality: HL7 v2/v3, FHIR, C-CDA, EHR/EMR structures
- Note: Not a fit if you’ve never worked with HL7/FHIR/healthcare data, or if you only build dashboards.
Preferred Skills
- PySpark / Spark experience
- Experience with EHR systems or healthcare data warehouses
- Familiarity with healthcare code sets (ICD, CPT, LOINC, RxNorm)
- Data quality tooling familiarity (Great Expectations, Deequ, etc.)
Extras / Incentives
- Competitive compensation package
- Strong onboarding and ramp-up support
- Opportunity to grow into senior-level ownership
About Clearsense
Clearsense is a Healthcare IT Data & Analytics company focused on delivering advanced data capabilities for healthcare. The role is within a creative, collaborative, fast-paced environment, working on high-impact data projects for the Clearsense platform.
Scraped 4/2/2026