zenml
  • Home
  • Tags
  • ZenML

ZenML | blog

The latest news, opinions and technical guides from ZenML.

Deploy your ML models with KServe and ZenML

August 04, 2022 - Safoine El khabich - 14 mins read


How to use ZenML and KServe to deploy serverless ML models in just a few steps.

What's New in v0.12.0: Serverless Inferencing on Kubernetes with KServe

August 02, 2022 - Dickson Neoh Tze How - 4 mins read


This release brings KServe integration to ZenML.

What's New in v0.11.0: Label All The Things!

July 19, 2022 - Alex Strick van Linschoten - 6 mins read


This release brings the first iteration of the ZenML annotation stack component and an integration with Label Studio, the popular open-source tool that supports many annotation types. We've also made significant updates to our documentation.

ZenML sets up Great Expectations for continuous data validation in your ML pipelines

July 07, 2022 - Stefan Nica - 18 mins read


ZenML combines forces with Great Expectations to add data validation to the list of continuous processes automated with MLOps. Discover why data validation is an important part of MLOps and try the new integration with a hands-on tutorial.

Podcast: ML Monitoring with Emeli Dral

July 07, 2022 - Alex Strick van Linschoten - 1 min read


This week I spoke with Emeli Dral, co-founder and CTO of Evidently, an open-source tool tackling the problem of monitoring of models and data for machine learning. We discussed the challenges around building a tool that is both straightforward to use while also customizable and powerful.

Podcast: Edge Computer Vision with Karthik Kannan

June 30, 2022 - Alex Strick van Linschoten - 1 min read


I spoke with Karthik Kannan, cofounder and CTO of Envision, a company that builds on top of the Google Glass and using Augmented Reality features of phones to allow visually impaired people to better sense the environment or objects around them.

How to run production ML workflows natively on Kubernetes

June 29, 2022 - Felix Altenberger - 12 mins read


Getting started with distributed ML in the cloud: How to orchestrate ML workflows natively on Amazon Elastic Kubernetes Service (EKS).

What's New in v0.10.0: A Kubernetes Native Orchestrator!

June 28, 2022 - Jayesh Sharma - 3 mins read


This release brings the highly-requested Kubernetes Orchestrator and a Vault secret manager to ZenML! We have also added Data Validators as a new stack component and an implementation for Great Expectations to kick things off.

Serverless MLOps with Vertex AI

June 27, 2022 - Alexej Penner - 12 mins read


How ZenML lets you have the best of both worlds, serverless managed infrastructure without the vendor lock in.

Podcast: Humans in the Loop with Iva Gumnishka

June 23, 2022 - Alex Strick van Linschoten - 1 min read


This week I spoke with Iva Gumnishka, the founder of Humans in the Loop. They are an organization that provides data annotation and collection services. Their teams are primarily made up of those who have been affected by conflict and now are asylum seekers or refugees.

Move over Kubeflow, there's a new sheriff in town: Github Actions 🤠

June 20, 2022 - Michael Schuster - 17 mins read


This tutorial presents an easy and quick way to use GitHub Actions to run ML pipelines in the cloud. We showcase this functionality using Microsoft's Azure Cloud but you can use any cloud provider you like.

What's New in v0.9.0: Everyone Gets an Orchestrator!

June 13, 2022 - Alex Strick van Linschoten - 5 mins read


We added two new orchestrators (Github Actions and Vertex AI), an Azure Secrets Manager integration, a Slack integration and a bunch of other smaller changes in the latest release.

Need an open-source data annotation tool? We've got you covered!

June 10, 2022 - Alex Strick van Linschoten - 3 mins read


We put together a list of 48 open-source annotation and labeling tools to support different kinds of machine-learning projects.

Podcast: ML Engineering with Ben Wilson

June 8, 2022 - Alex Strick van Linschoten - 1 min read


This week I spoke with Ben Wilson, author of 'Machine Learning Engineering in Action', a jam-backed guide to all the lessons that Ben has learned over his years working to help companies get models out into the world and run them in production.

How to get the most out of data annotation

June 2, 2022 - Alex Strick van Linschoten - 5 mins read


I explain why data labeling and annotation should be seen as a key part of any machine learning workflow, and how you probably don't want to label data only at the beginning of your process.

Will they stay or will they go? Building a Customer Loyalty Predictor

May 27, 2022 - Ayush Singh - 13 mins read


We built an end-to-end production-grade pipeline using ZenML for a customer churn model that can predict whether a customer will remain engaged with the company or not.

The Framework Way is the Best Way: the pitfalls of MLOps and how to avoid them

May 24, 2022 - Stefan Nica - 9 mins read


As our AI/ML projects evolve and mature, our processes and tooling also need to keep up with the growing demand for automation, quality and performance. But how can we possibly reconcile our need for flexibility with the overwhelming complexity of a continuously evolving ecosystem of tools and technologies? MLOps frameworks promise to deliver the ideal balance between flexibility, usability and maintainability, but not all MLOps frameworks are created equal. In this post, I take a critical look at what makes an MLOps framework worth using and what you should expect from one.

What's New in v0.8.0: Extend ZenML Any Way You Like!

May 18, 2022 - Alex Strick van Linschoten - 5 mins read


Release notes for the new version of ZenML. We've added a ton of extensibility improvements, documentation and guides that take away most of the hard work of figuring out how to add custom components. Our CLI also has been beautified and it should even run a bit faster too!

All Continuous, All The Time: Pipeline Deployment Patterns with ZenML

May 11, 2022 - Safoine El Khabich - 12 mins read


Connecting model training pipelines to deploying models in production is seen as a difficult milestone on the way to achieving MLOps maturity for an organization. ZenML rises to the challenge and introduces a novel approach to continuous model deployment that renders a smooth transition from experimentation to production.

What's New in v0.7.2 and v0.7.3: HuggingFace, Weights & Biases, LightGBM, XGBoost, and more!

April 28, 2022 - Hamza Tahir - 5 mins read


The release adds experiment tracking stack components, integrations with HuggingFace, Weights & Biases and also improvements to how the Seldon Core deployer handles secrets from the secrets manager component.

Predicting how a customer will feel about a product before they even ordered it

April 20, 2022 - Ayush Singh - 8 mins read


We built an end to end continuous deployment pipeline using ZenML for a customer satisfaction model that uses historical data of the customer predict the review score for the next order or purchase.

Podcast: Trustworthy ML with Kush Varshney

April 14, 2022 - Alex Strick van Linschoten - 1 min read


This week I spoke with Kush Varshney, author of 'Trustworthy Machine Learning', a fantastic guide and overview of all of the different ways machine learning can go wrong and an optimistic take on how to think about addressing those issues.

What's New in v0.7.1: Fetch data from your feature store and deploy models on Kubernetes

April 11, 2022 - Hamza Tahir - 3 mins read


The release introduces the Seldon Core ZenML integration, featuring the Seldon Core Model Deployer and a Seldon Core standard model deployer step. It also includes two new integrations with Feast as ZenML's first feature store addition and NeuralProphet adding to the growing list of training libraries supported.

'It's the data, silly!' How data-centric AI is driving MLOps

April 07, 2022 - Hamza Tahir - 9 mins read


ML practitioners today are embracing data-centric machine learning, because of its substantive effect on MLOps practices. In this article, we take a brief excursion into how data-centric machine learning is fuelling MLOps best practices, and why you should care about this change.

Podcast: Open-Source MLOps with Matt Squire

March 31, 2022 - Alex Strick van Linschoten - 2 mins read


This week I spoke with Matt Squire, the CTO and co-founder of Fuzzy Labs, where they help partner organizations think through how best to productionise their machine learning workflows.

What's New in v0.7.0: 🔡 User Profiles and Secret Storage 🤫

March 28, 2022 - James W. Browne - 4 mins read


Release notes for the new version of ZenML. We've added the ability to store your stacks in profiles accessible system-wide instead of just in individual project folders. We also added a way to store and retrieve passwords and secrets, including the possibility to integrate the AWS Secrets Manager. Also: run individual steps on Vertex AI.

Run your steps on the cloud with Sagemaker, Vertex AI, and AzureML

March 25, 2022 - Hamza Tahir - 6 mins read


With ZenML 0.6.3, you can now run your ZenML steps on Sagemaker, Vertex AI, and AzureML! It’s normal to have certain steps that require specific infrastructure (e.g. a GPU-enabled environment) on which to run model training, and Step Operators give you the power to switch out infrastructure for individual steps to support this.

Podcast: Practical Production ML with Emmanuel Ameisen

March 18, 2022 - Alex Strick van Linschoten - 1 min read


This week I spoke with Emmanuel Ameisen, a data scientist and ML engineer currently based at Stripe. Emmanuel also wrote an excellent O'Reilly book called 'Building Machine Learning Powered Applications', a book I find myself often returning to for inspiration and that I was pleased to get the chance to reread in preparation for our discussion.

Solving Atari Games with Reinforcement Learning (AI)

March 17, 2022 - Ayush Singh - 10 mins read


We trained a model to solve Atari games using reinforcement learning. We used the Deep Q algorithm as the basis of our implementation. It allowed us to get a working solution fairly quickly.

What's New in v0.6.3: Run Steps on Sagemaker and AzureML ☁️

March 14, 2022 - Alex Strick van Linschoten - 3 mins read


Release notes for the new version of ZenML. We've added the ability to run steps on AWS Sagemaker and AzureML. We added a new Tensorboard visualization that runs when using the Kubeflow orchestrator. You'll also find a lot of smaller improvements, documentation additions and bug fixes in this release.

How we made our integration tests delightful by optimizing the way our GitHub Actions run our test suite

March 9, 2022 - Alexej Penner - 16 mins read


As we outgrew our initial template Github Action workflow, here's the five things we added to our Github Action arsenal to fit our growing needs: Caching, Reusable Workflows, Composite Actions, Comment Triggers and Concurrency Management.

Everything you ever wanted to know about MLOps maturity models

March 7, 2022 - Alex Strick van Linschoten - 6 mins read


An exploration of some frameworks created by Google and Microsoft that can help think through improvements to how machine learning models get developed and deployed in production.

Podcast: From Academia to Industry with Johnny Greco

March 3, 2022 - Alex Strick van Linschoten - 1 min read


This week I spoke with Johnny Greco, a data scientist working at Radiology Partners. Johnny transitioned into his current work from a career as an academic — working in astronomy — where also worked in the open-source space to build a really interesting synthetic image data project.

How to painlessly deploy your ML models with ZenML

March 2, 2022 - Stefan Nica - 11 mins read


Connecting model training pipelines to deploying models in production is regarded as a difficult milestone on the way to achieving Machine Learning operations maturity for an organization. ZenML rises to the challenge and introduces a novel approach to continuous model deployment that renders a smooth transition from experimentation to production.

Richify that CLI!

February 28, 2022 - Alex Strick van Linschoten - 8 mins read


We recently reworked a number of parts of our CLI interface. Here are some quick wins we implemented along the way that can help you improve how users interact with your CLI via the popular open-source library, rich.

What's New in v0.6.2: ♻️ Continuous Deployment and a fresh CLI 👩‍💻

February 23, 2022 - Alex Strick van Linschoten - 4 mins read


Release notes for the new version of ZenML. We've added the ability to serve models using MLflow deployments, and we've refreshed how our CLI looks using the popular 'rich' library. You'll also find a lot of smaller improvements, documentation additions and bug fixes in this release.

Aggregating and Reporting ZenML Company Metrics on a Schedule

February 15, 2022 - James W. Browne - 6 mins read


We built a low barrier of entry reporting pipeline tool that collects, stores, and displays key performance indicators without a data lake.

Podcast: The Modern Data Stack with Tristan Zajonc

February 10, 2022 - Alex Strick van Linschoten - 1 min read


Tristan and Alex discuss where machine learning and AI are headed in terms of the tooling landscape. Tristan outlined a vision of a higher abstraction level, something he's working on making a reality as CEO at Continual.

How to improve your experimentation workflows with MLflow Tracking and ZenML

February 9, 2022 - Alex Strick van Linschoten - 7 mins read


Use MLflow Tracking to automatically ensure that you're capturing data, metadata and hyperparameters that contribute to how you are training your models. Use the UI interface to compare experiments, and let ZenML handle the boring setup details.

What's New in v0.6.1: Reach for the AWS and Azure Cloud! ☁️

February 7, 2022 - Alex Strick van Linschoten - 2 mins read


Release notes for the new version of ZenML. We now support AWS S3 and Azure Blob Storage as artifact stores. You'll also find a lot of smaller improvements and bug fixes in this release.

How to build a three-pointer prediction pipeline

February 2, 2022 - Alexej Penner - 10 mins read


We challenged ourselves to put our own tool to the test and set up a few pipelines to answer two questions: Did Steph Curry change the game of basketball? And how many three-pointers will be in the next NBA game?

Type hints are good for the soul, or how we use mypy at ZenML

January 31, 2022 - Alex Strick van Linschoten - 7 mins read


A dive into Python type hinting, how implementing them makes your codebase more robust, and some suggestions on how you might approach adding them into a large legacy codebase.

Podcast: Neurosymbolic AI with Mohan Mahadevan

January 27, 2022 - Alex Strick van Linschoten - 1 min read


Mohan and Alex discuss neurosymbolic AI and the implications of a shift towards that as a core paradigm for production AI systems. In particular, we discuss the practical consequences of such a shift, both in terms of team composition as well as infrastructure requirements.

What's New in v0.6.0: whylogs integration and some big core architecture changes

January 26, 2022 - Alex Strick van Linschoten - 3 mins read


Release notes for the new version of ZenML. ZenML now supports whylogs (from whylabs) that logs data from your ML pipelines in production. We also made some sizeable (breaking 😢) core architecture changes

10 Reasons ZenML ❤️ Evidently AI's Monitoring Tool

January 21, 2022 - Alex Strick van Linschoten - 7 mins read


ZenML recently added an integration with Evidently, an open-source tool that allows you to monitor your data for drift (among other things). This post showcases the integration alongside some of the other parts of Evidently that we like.

What's New in v0.5.7

January 17, 2022 - Alex Strick van Linschoten - 3 mins read


Release notes for the new version of ZenML. ZenML now supports MLFlow for tracking pipelines as experiments and Evidently for detecting drift in your ML pipelines in production.

Get to know ZenML through examples

January 6, 2022 - Alex Strick van Linschoten - 2 mins read


ZenML has a treasure-trove of examples available for users to get to know specific features. Using these examples, running them and pulling refreshed versions is easy with our CLI that takes on the heavy work for you.

What's New in v0.5.6

December 23, 2021 - Hamza Tahir - 1 min read


Release notes for the new version of ZenML.

Podcast: Monitoring Your Way to ML Production Nirvana with Danny Leybzon

December 16, 2021 - Alex Strick van Linschoten - 1 min read


We discuss how to monitor models in production, and how it helps you in the long-run.

ZenML - Why we built it

December 14, 2021 - Hamza Tahir & Adam Probst - 5 mins read


All the advantages that ZenML will bring you if you choose to use it to productionize your model development workflows.

What's New in v0.5.5

December 13, 2021 - Alexej Penner - 1 min read


Release notes for the new version of ZenML.

Why you should be using caching in your machine learning pipelines

December 7, 2021 - Alex Strick van Linschoten - 4 mins read


Use caches to save time in your training cycles, and potentially to save some money as well!

What's New in v0.5.4

December 6, 2021 - Alex Strick van Linschoten - 1 min read


Release notes for the new version of ZenML.

Podcast: Practical MLOps with Noah Gift

December 2, 2021 - Alex Strick van Linschoten - 1 min read


We discuss the role of MLOps in an organization, some deployment war stories from his career as well as what he considers to be 'best practices' in production machine learning.

How we track our todo comments using GitHub Actions

December 01, 2021 - Michael Schuster - 3 mins read


A programmatic means of ensuring #TODO comments made in code also end up in our Jira ticketing system.

Lazy Loading Integrations in ZenML

November 26, 2021 - Baris Can Durak - 5 mins read


How integrations work under the hood to connect you to the tools you know and love.

What's New in v0.5.3

November 24, 2021 - Alex Strick van Linschoten - 1 min read


Release notes for the new version of ZenML.

Pipeline Conversations: Our New Podcast

November 19, 2021 - Adam Probst - 1 min read


We launched a podcast to have conversations with people working to productionize their machine learning models and to learn from their experience.

Introducing the revamped ZenML 0.5.x

November 16, 2021 - Michael Schuster - 5 mins read


We rewrote much of our code base to make it more robust and also to enable new features.

10 Ways To Level Up Your Testing with Python

November 4, 2021 - Alex Strick van Linschoten - 9 mins read


A mix of mental and technical skills you can develop to get better at testing your Python code.

Taking on the ML pipeline challenge

October 27, 2021 - Hamza Tahir - 7 mins read


Why data scientists need to own their ML workflows in production.

Why ML should be written as pipelines from the get-go

March 31, 2021 - Hamza Tahir - 7 mins read


Eliminate technical debt with iterative, reproducible pipelines.

Spot the difference in ML costs

January 28th, 2021 - Hamza Tahir - 5 mins read


Spot instances are a great option for anyone training machine learning models; they're much cheaper than other on-demand options, albeit with some drawbacks.

Is your Machine Learning Reproducible?

January 19th, 2021 - Hamza Tahir - 5 mins read


Short answer: not really, but it can become better!

A most unusual year

December 26th, 2020 - ZenML Team - 5 mins read


A retrospective of some of the decisions that caused us to pivot from a focused machine learning consultancy to deciding to build an open-source MLOps tool.

ZenML will be open source

November 11th, 2020 - ZenML Team - 4 mins read


An overview of some of the capabilities that ZenML will unlock for our users.

MLOps: Learning from history

November 9th, 2020 - ZenML Team - 6 mins read


MLOps isn't just about new technologies and coding practices. Getting better at productionizing your models also likely requires some institutional and/or organisational shifts.

12 Factors of Reproducible Machine Learning in Production

September 28th, 2020 - ZenML Team - 10 mins read


A set of guiding principles to help you better productionize your machine learning models.

Why ML in production is (still) broken - [#MLOps2020]

June 26th, 2020 - Hamza Tahir - 5 mins read


The MLOps movement and associated new tooling is starting to help tackle the very real technical debt problems associated with machine learning in production.

Can you do the splits?

June 11th, 2020 - Hamza Tahir - 6 mins read


Splitting up datasets is part of the daily work of a data scientist, but there's more complexity and art to it than first meets the eye.

Avoiding technical debt with ML pipelines

June 6th, 2020 - Hamza Tahir - 10 mins read


Pipelines help you think and act better when it comes to how you execute your machine learning training workflows.

A case for declarative configurations for ML training

May 17th, 2020 - ZenML Team - 5 mins read


Using config files to specify infrastructure for training isn't widely practiced in the machine learning community, but it helps a lot with reproducibility.

Predicting the winner of a DotA 2 match using distributed deep learning pipelines

May 7th, 2020 - Baris Can Durak - 12 mins read


ZenML makes it easy to setup training pipelines that give you all the benefits of cached steps.

Deep Learning on 33,000,000 data points using a few lines of YAML

May 4th, 2020 - Hamza Tahir - 11 mins read


Use YAML files to help configure pipelines that can run complex deep learning training.

Why deep learning development in production is (still) broken

May 1st, 2020 - Hamza Tahir - 3 mins read


Software engineering best practices have not been brought into the machine learning space, with the side-effect that there is a great deal of technical debt in these code bases.

Distributed PCA using TFX

February 27, 2020 - Hamza Tahir - Crossposted on Tensorflow Blog - 5 mins read


We use PCA to reduce the dimension of input vectors while retaining maximal variance.

About ZenML

  • Imprint & Data privacy

Links

  • ZenML Home
  • ZenML Docs
  • ZenML Terms of service

Contact Us

  • info@zenml.io