Tecton
timezone
+00:00 GMT
SIGN IN
  • Home
  • Events
  • Content
  • Help
Sign In
Sign in or Join the community to continue

Hamilton: a Micro Framework for Creating Dataframes

Posted Mar 28, 2021 | Views 312
# Data engineering
# Open Source
Share
SPEAKER
Stefan Krawczyk
Stefan Krawczyk
Stefan Krawczyk
Manager Data Platform @ Stitch Fix

Stefan loves the stimulus of working at the intersection of design, engineering, and data. He grew up in New Zealand, speaks Polish, and spent formative years at Stanford, LinkedIn, Nextdoor & Idibon. He currently leads the Model Lifecycle Team at Stitch Fix. Outside of work in a pre-covid world, Stefan liked to swim, eat tacos, drink beer, and travel; for the past year, he has instead _biked, ate tacos and baked sourdough.

+ Read More

Stefan loves the stimulus of working at the intersection of design, engineering, and data. He grew up in New Zealand, speaks Polish, and spent formative years at Stanford, LinkedIn, Nextdoor & Idibon. He currently leads the Model Lifecycle Team at Stitch Fix. Outside of work in a pre-covid world, Stefan liked to swim, eat tacos, drink beer, and travel; for the past year, he has instead _biked, ate tacos and baked sourdough.

+ Read More
SUMMARY

At Stitch Fix we have 130+ ÒFull Stack Data ScientistsÓ who in addition to doing data science work, are also expected to engineer and own data pipelines for their production models. One data science team, the Forecasting, Estimation, and Demand team was in a bind. Their data generation process was causing them iteration & operational frustrations in delivering time-series forecasts for the business. In this talk IÕll present Hamilton, a novel python micro framework, that solved their pain points by changing their working paradigm.

Specifically, Hamilton enables a simpler paradigm for a Data Science team to create, maintain, and execute code for generating wide dataframes, especially when there are lots of intercolumn dependencies. Hamilton does this by building a DAG of dependencies directly from python functions defined in a special manner, which also makes unit testing and documentation easy; tune into the talk to find out how. IÕll also cover our experience migrating to it and using it in production for over a year, along with possible future directions.

+ Read More

Watch More

30
Posted Jan 26, 2022 | Views 311
# Data engineering
# Open Source
# Systems and Architecture
10
Posted May 12, 2022 | Views 1.1K
# Feature Stores
# Production Use Case
# Systems and Architecture