Job Description
❤️ Want to make a tangible difference in the future of healthcare? ❤️ Love applying the latest technologies to solve tough problems? ❤️ Looking for a fulfilling career with plenty of learning and growth? About 1Phi 🤩 1Phi makes healthcare more accessible by lowering the cost and shrinking the time to preventive care and treatment. Through 1Access, we enable digital health apps and devices to accept insurance and integrate into the traditional clinical system. Through 1Sage, our healthcare navigator, we help patients find the treatment options with the best clinical outcomes for their unique situation. 1Phi (1st Principles Healthcare Institute) reflects our commitment to rethinking healthcare from the ground up. As a Public Benefit Corporation, we combine deep expertise in payer models, government policy, provider incentives, and patient advocacy. By unlocking data historically accessible only to industry insiders, 1Phi gives patients and healthcare companies the tools to make better decisions, and brings high-quality care within reach for more people. You would be joining an early stage startup team of engineers, healthcare experts and dreamers. You’ll help define the culture, the tech stack, and the future of our industry. Tight-knit team. Zero bureaucracy. Project ownership. Room to experiment. Accelerated growth. Requirements 🤓 What You’ll Do: Build and maintain data pipelines that ingest, transform, and validate large-scale Medicare claims data using SQL, Python, and Databricks (Spark). You'll work with patient-level records across billions of claim lines. Write and optimize complex SQL — multi-step transformations, window functions, joins across large datasets, aggregations with suppression rules. SQL is the primary language of the work. Automate and operationalize recurring data workflows — building reliable, repeatable pipelines that process CMS data extracts, dimension tables, and derived provider metrics. Ensure data quality by designing validation checks, reconciling source data against expected schemas, and investigating anomalies when numbers don't add up. Collaborate with data scientists and product engineers to define output schemas, deliver clean datasets, and support downstream analytics and application features. Work in cloud infrastructure — primarily Databricks on AWS, with exposure to S3, Unity Catalog, and related services. Learn the healthcare data domain — you'll develop working knowledge of claims data structures, medical coding systems (ICD-10, HCPCS, DRG), and CMS data programs. You're a fit if: You have strong SQL skills. Coursework, internships, or projects where you wrote non-trivial queries — joins, CTEs, window functions, aggregations. You can reason about query performance. You're comfortable with Python. You've used it for data manipulation (pandas, PySpark, or similar). You don't need to be a software engineer, but you can write clean, functional code. You understand data pipeline concepts — ETL/ELT, idempotency, schema management, data validation. Exposure through coursework, capstone projects, or internships counts. You're detail-oriented and methodical. Healthcare data has strict rules around suppression, privacy, and accuracy. You care about getting the numbers right. You're a fast learner who's comfortable ramping up on unfamiliar domains. You'll be learning Medicare claims data, CMS programs, and healthcare coding systems on the job. You have a BS or MS in Computer Science, Data Science, Information Systems, Statistics, or a related field. Even better if: You've worked with Spark, Databricks, or other distributed compute environments (even in a class or personal project). You have exposure to cloud platforms (AWS, GCP, or Azure) — S3, IAM, or managed database services. You've touched healthcare data in any capacity — claims, EHR, public health datasets, MIMIC, CMS public use files. You're familiar with version control (Git) and collaborative development workflows. You've built a data project end-to-end — ingestion through delivery — even if it was small. Salary: $60k- $95k depending on experience Location: Remote (United States Only) or Hybrid in the Chicago office is available. Hiring Process 😎 At 1Phi, we believe hiring should be fair, transparent, and collaborative. We work hard to find the best mix of respecting the candidates time, while also giving our team an efficient way to learn if a candidate is a good fit. Since we also strive to simulate the work environment throughout the process, we encourage you to use all the AI/LLM tools and outside resources a