Skip to content

Instantly share code, notes, and snippets.

@inc0
Last active July 1, 2024 23:45
Show Gist options
  • Save inc0/fcef6dfc3cbdc0eaf1b5465357104acd to your computer and use it in GitHub Desktop.
Save inc0/fcef6dfc3cbdc0eaf1b5465357104acd to your computer and use it in GitHub Desktop.
resume

Michal Jastrzebski

Summary

I am a seasoned professional with a strong background in machine learning, bioinformatics, and cloud computing. My diverse experience includes designing MLOps platforms, contributing to leading open-source projects such as OpenStack and Kubeflow, and developing advanced bioinformatics tools. I’m an expert at MLOps, Python, Rust, and Kubernetes.

Professional Experience

Ginkgo Bioworks (12.2023 - current)

As a Principal Machine Learning Engineer at Ginkgo Bioworks, I am responsible for training, managing, and deploying ML models for both internal and customer projects. This includes overseeing a cluster of over 100 GPUs to build applied ML models in organism, protein, and DNA design. Our team leverages vast internal metagenomic datasets to train large language models (LLMs) for proteins and DNA, fine-tuning them for scientific purposes and optimizing the Design-Build-Test-Learn (DBTL) cycle for customer projects at Ginkgo. Additionally, our team deploys and operates tools like RFDiffusion and ESMFold to support the day-to-day tasks of protein design.

VantAI (11.2021 - 03.2023)

At VantAI, I held the position of Senior Staff Software Engineer, where I played a pivotal role in advancing the field of bioinformatics within the pharmaceutical industry, particularly in the development of protein degraders known as PROTACs (PROteolysis TAgeting Chimeras). My work at VantAI involved the creation of internal infrastructure and research tools, which were specifically designed to address the unique challenges of drug design and induced proximity in the context of protein-protein interactions. I played a crucial role in implementing machine learning and data-driven techniques to identify potential drugs for protein targets and optimize the design of PROTAC and glue molecules. This involved processing and analyzing large-scale biological datasets, as well as leveraging state-of-the-art techniques in molecular dynamics simulations and docking studies.

Bytewax (11.2020 - 10.2021)

As a co-founder and CTO of Bytewax, an early-stage startup, I played a critical role in shaping the company’s strategic direction and technical vision. Bytewax is a cutting-edge stream processing platform built on Kubernetes, specializing in data pipeline management and machine learning inference. In my capacity as CTO, I was responsible for designing the platform’s architecture, ensuring seamless integration with various data sources, and optimizing the performance and scalability of the system. Additionally, I managed a talented team of engineers and coordinated closely with other co-founders to set product roadmaps, evaluate potential partnerships, and pitch our innovative solutions to investors and clients. My experience in this role allowed me to develop a deep understanding of the challenges and opportunities inherent in building a successful startup from the ground up, while driving the development of a robust and scalable platform that empowers businesses to harness the power of real-time data analytics and machine learning effectively.

GitHub (10.2018 - 10.2020)

As a Staff Data Engineer at GitHub, I spearheaded the development of an internal MLOps platform on Kubernetes, streamlining the machine learning lifecycle for our data team. I designed the platform’s architecture, integrated various data sources and tools, and facilitated rapid model iteration and deployment. I was involved in projects like internal spam filtering and Copilot. Within GitHub I also helped the Institute for Disease Modeling which resulted in the publication of the Covasim paper.

Intel (08.2014 - 9.2018)

During my tenure at Intel, I held various roles, including Machine Learning Architect and OpenStack Kolla Project Technical Leader, contributing significantly to both the OpenStack and Kubeflow open-source communities. I led efforts to implement machine learning on Kubernetes, worked on upstream OpenStack development, and participated in core reviews, driving advancements in cloud computing and machine learning infrastructure for a wide range of projects and organizations.

Multiple roles, Poland (08.2008 - 07.2014)

Throughout my career in Poland, I held various roles across diverse companies, beginning as a Web Developer at Goniec.com and eventually becoming a Senior Python Developer at Allegro Group. I gained valuable experience in web development, cloud computing, and software engineering, working with technologies such as Django, OpenStack, and Python. My accomplishments during this time included developing payment and authorization systems, maintaining web-based systems, and creating an open-source infrastructure management system and internal cloud infrastructure.

Achievements

  • Publication in Nature Communication: Covasim: an agent-based model of COVID-19 dynamics and interventions (Link)
  • Speaker: CloudNativeCon Copenhagen 2018: Intro to Kubeflow
  • Keynote Speaker: OpenStack Summit 2016.11 in Barcelona
  • Multiple sessions: OpenStack Summits
  • Elected Project Technical Leader: OpenStack-Kolla (3 releases)
  • Speaker: PyKonik 2011, PyCon UA 2012, and PyCon PL 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment