Glyphic Biotechnologies

Director, Data Science

Department
Research
Job Type / Location
Berkeley
Experience Required
10+ years

About Glyphic Biotechnologies

At Glyphic Biotechnologies, we are developing a massively parallel, single-molecule proteome sequencing platform that will transform life science discovery and usher in a new era of insights into human biology and disease. We have raised >$80M from venture partners and non-dilutive grants to achieve our vision of next-generation proteome sequencing.

About the Role

We are looking for a Director-level technical leader to build and lead Glyphic’s Data Science function. This is a “player-coach” role: you will need to be technically deep enough to guide signal-processing and ML strategy for a novel nanopore-based protein sequencing platform, while also building the team culture, processes, and infrastructure needed to scale a larger data organization. You will report to the VP of R&D and work closely with team leads in assay development, chemistry, and automation. This is a hybrid role; expect to spend up to about 20% of your time, on average, on-site with the team in Berkeley, CA, to build a fuller understanding of Glyphic’s technology and stay calibrated with the on-site research team. Additional on-site collaboration may be needed as projects require.

What You'll Do

Technical Leadership

  • Set the technical direction for ML model development: amino acid classification from nanopore current signals, signal segmentation, stall detection, temporal modeling, and multi-cycle analysis.
  • Drive improvements to classification accuracy through better architectures (such as transformers and other deep learning models), training strategies, and feature engineering.
  • Own the roadmap for data infrastructure: pipeline automation, data lake architecture, metadata standards, and self-serve analytics for the broader scientific team.
  • Make strategic build-vs-buy decisions for tooling, compute, and third-party platforms.

People Management

  • Provide technical and professional management to a team of data scientists and engineers to enable an end-to-end analysis pipeline.
  • Create an environment where high-autonomy individual contributors thrive: clear goals, minimal process overhead, rapid feedback loops.
  • Foster a culture of rigorous, reproducible analysis and clear communication of results to non-computational audiences.

Cross-Functional Partnership

  • Translate wet-lab experimental goals into computational strategies and vice versa, surfacing data-driven insights that reshape assay design and instrument operation.
  • Work with assay development to design experiments that generate high-quality training data and enable systematic evaluation of new chemistries (expanders, linkers, barcodes).
  • Collaborate with the Head of Automation and hardware teams on instrument data integration and real-time analysis capabilities.
  • Represent Data Science in management discussions, communicating progress, risks, and resource needs clearly.

AI Strategy

  • Champion the adoption of AI coding and analysis tools (Claude, Claude Code, etc.) across the data team and the broader organization.
  • Evaluate how generative AI and LLMs can accelerate internal workflows: automated reporting, data exploration, code generation, and literature review.

What You Need

Required

  • MS or PhD in a quantitative field (Computer Science, Electrical Engineering, Computational Biology, Bioinformatics, Statistics, or related).
  • 10+ years of post-academic experience in the omics space (genomics, proteomics, or related fields).
  • 4+ years of experience managing technical teams (data scientists, ML engineers, or bioinformaticians), including hiring responsibility.
  • Ability and willingness to operate as a player-coach: setting strategy while remaining hands-on with data, code, and models.
  • Exceptional ability to identify, hire, and develop talent while establishing and enforcing standards of excellence in data science.
  • Capacity to develop both individual contributors and future managers within the team.
  • Deep expertise in at least one of the following:
    • Primary sequencing data analysis
    • Machine learning applied to biological data
    • Pipeline infrastructure and bioinformatics tooling
  • Solid understanding of signal processing, classification, and machine learning techniques (transformers, CNNs, RNNs) and comfort applying them to sequencing or time-series data.
  • Practical familiarity with AWS, Nextflow, and modern bioinformatics tooling.
  • Demonstrated ability to work at the bench-to-computation interface in collaborative research environments.
  • Ability to present complex technical results to non-technical stakeholders and to translate biological questions into computational approaches.

Nice to Have

  • Direct experience with sequencing data, basecalling, read-level QC, or nanopore signal-level analysis (strongly preferred).
  • Experience building data infrastructure and analytics platforms in early-stage biotech.

What You Can Expect from This Role

Work Environment

  • Collaborative culture where your ideas and expertise are valued.
  • Direct impact on product development and company direction.

Professional Growth

  • Build Glyphic's first dedicated data science management function, defining team structure, standards, and culture.
  • Help define the technical standards and best practices for omics data analysis while mentoring the next generation of data scientists who will adopt and advance these approaches.
  • Work with proprietary, information-rich data at a scale few organizations possess, with the opportunity to develop novel approaches and methodologies that set benchmarks for the field.
