Careers

About PreOncology  

PreOncology is building the world’s first multi-omic, AI-powered platform dedicated to ultra-early cancer detection and prevention. We combine whole-body MRI, liquid biopsy, genomics, lifestyle data, and social determinants of health (SDOH) to define a new specialty in oncology: preventing, detecting, and treating Stage Alpha. Our mission is to stop cancer before it becomes clinically detectable.

Current Openings

Senior Data Engineer 

Role Overview

The Senior Data Engineer is primarily responsible for building and maintaining the data resources that drive this platform. These include terabytes of research data feeding the deep learning algorithms that identify core cancer risk elements, as well as the transactional data infrastructure supporting the PreOncology software platform and its interactions with patients and supporting medical personnel.

This role also covers some reporting and analytics work to support corporate management and the evolution of the product during a period of significant growth. The engineer will collaborate closely with the Java, Machine Learning, and DevOps engineers working on other aspects of the platform infrastructure.


Key Responsibilities

  • Produce a high-quality data architecture for both the patient-facing application and the core machine-learning infrastructure.
  • Design operational disciplines to manage data resources at scale, including terabyte-scale volumes.
  • Collaborate with architecture, cloud operations, and product management to deliver high-quality, supportable data infrastructure.
  • Work with the Java (Spring)-based engineering team in a scrum environment to refine the database schema.
  • Communicate effectively and promote best-practice IT disciplines within a fast-growth environment.
  • Develop data infrastructure from concept (greenfield) to working product for PreOncology’s platform.
  • Collaborate with Product Management, Cloud Operations, and Engineering Leads to achieve business goals and solve customer problems.
  • Understand PreOncology’s current software requirements, particularly data assets supporting the business model.
  • Continuously evolve data architecture within a cloud-native and microservices framework.
  • Reinforce best-practice development guidelines, processes, tools, and design patterns.
  • Mentor peers and help deliver high-quality software to the market.

Qualifications


Required Experience

  • Bachelor’s degree in Computer Science or equivalent knowledge of computer science principles.
  • 5–7 years of experience as a Data Engineer building and evolving large-scale data architectures.
  • Hands-on experience with developmental DBA disciplines supporting software teams.
  • Experience developing within high-performance, Agile software teams.
  • Experience managing data operations within a high-growth software platform.

Knowledge

  • Deep understanding of computer science design concepts, patterns, algorithmic thinking, and coding standards.
  • Familiarity with Agile software development and how to apply it effectively within a startup environment.

Skills & Abilities

  • Strong interpersonal and communication skills across all organizational levels.
  • Passionate about well-crafted, scalable, high-quality code.
  • Adaptable and motivated by organizational growth and change.
  • Team-oriented with a desire to succeed collaboratively.
  • Creative problem-solver who enjoys finding innovative solutions.
  • Curious and eager to learn while applying critical thinking.

Why Join Us?

  • Be part of the founding science team shaping a new specialty within oncology.
  • Work at the intersection of AI, epidemiology, and clinical translation.
  • Collaborate with leaders in machine learning, imaging, and oncology.
  • Receive equity and a competitive compensation package.
  • Join a mission-driven company helping prevent cancer before it starts.

Machine Learning Pipeline Engineer (Nextflow + Omics)

We are looking for an engineer who can bridge machine learning and workflow engineering. You should be comfortable training, tuning, and validating ML and deep-learning models, and also building the robust pipelines needed to deploy them. You will build and optimize Nextflow-based workflows for large-scale cancer genomics, integrating model training, calibration, and deployment into production environments. 
 
This role blends pipeline development with hands-on ML implementation. It is ideal for someone who enjoys building real systems and seeing models move from development into clinical impact. This position is fully remote within the United States. 

What You’ll Do

  • Design, build, and maintain Nextflow pipelines for large-scale genomics and ML workflows.
  • Integrate Python-based ML and deep-learning models into reproducible, production-ready pipelines.
  • Train, tune (Bayesian optimization, hyperparameter sweeps), and validate survival and risk-prediction models (e.g., Cox, DeepSurv, RSF, gradient boosting, CNNs).
  • Engineer genomic and longitudinal features (e.g., PRS, rare variant burden, temporal trajectories) and integrate them into model pipelines.
  • Run pipelines on cloud computing systems (AWS preferred).
  • Implement observability (logging, metrics, alerts) and enforce data quality and reproducibility.
  • Package and deploy reproducible inference endpoints and artifacts with Docker or Singularity.

Must-Have Qualifications

  • 2+ years building production pipelines in Nextflow.
  • Strong Python skills for data processing, ML training, and pipeline integration.
  • Proven experience training, tuning, and validating ML and deep-learning models, ideally for risk prediction or survival analysis.
  • Hands-on experience with omics or clinical genomics data, ideally cancer-related.
  • Experience running pipelines on cloud computing systems (AWS preferred).
  • Experience with containerization (Docker, Singularity) and artifact/version tracking (e.g., MLflow, DVC).
  • Understanding of data security, reproducibility, and workflow quality control.

Work Authorization

Candidates must already be authorized to work in the United States. We are unable to provide visa sponsorship now or in the future.


Why Join PreOncology

  • Impact: Your work won’t sit in a journal—it will shape clinical protocols for cancer prevention.
  • Mentorship: Learn directly from oncologists, epidemiologists, and startup leaders.
  • Visibility: Opportunities to co-author papers, present at conferences, and help set industry standards.
  • Growth: Early role in a venture-backed company with rapid career advancement.
  • Equity & Upside: Competitive comp plus meaningful equity in a company defining a new medical specialty.

How to Apply

Email your resume to Luke.Stetson@preoncology.com and include brief answers (1–2 sentences each) to the following:

  • The largest Nextflow pipeline you have built or maintained (steps, scale, infrastructure).
  • Your omics experience (data types, tools, and any cancer-specific work).
  • The machine learning or deep learning models you have trained, tuned, or validated, and how they were applied.

Join Our Talent Community

At PreOncology, we’re building breakthrough technologies and redefining cancer care through ultra-early detection and personalized prevention. We’re a startup on a mission, and we know that the strength of our team will define our success. Even if you don’t see the perfect role posted, we want to meet talented people who share our drive to make a difference.

Why Join PreOncology?

  • Impact First: Your work will directly accelerate progress in cancer research and patient outcomes.

  • Startup Energy: We move fast, experiment boldly, and value action over bureaucracy.

  • Innovation Culture: You’ll collaborate with scientists, engineers, and entrepreneurs solving complex challenges every day.

  • Growth Opportunity: As an early team member, you’ll shape not only your role but the future of the company.


Who We’re Looking For

  • Mission-driven and inspired to tackle cancer through technology.

  • Adaptable, resourceful, and excited by startup pace.

  • Creative problem-solvers who thrive on collaboration and curiosity.

  • Passionate about applying their skills to real-world breakthroughs in science and technology.

 

Areas of Opportunity

  • Cancer Modeling & Research

  • Computational Biology & Data Science

  • Software & Platform Engineering

  • Operations 

  • Business Development & Partnerships

How to Apply

If you’re ready to bring your talent to the fight against cancer, we’d love to hear from you. Share your resume and a quick note about what excites you — whether it’s science, engineering, operations, or something we haven’t thought of yet.

© 2025 PreOncology