Building Your Data Governance Toolbox

Photo by Cesar Carlevarino Aragon on Unsplash When you first start learning about data governance, it often seems like a hairball of tightly knit ideas where you can’t understand any one piece until you’ve studied and learned the whole thing. I’m not an expert by any stretch, but I’ve wrestled with learning about data governanceContinue reading “Building Your Data Governance Toolbox”

Layers of Data Infrastructure 3: Storage

Photo by Cobro on Unsplash In my last two posts I’ve explored the high-level design decisions related to two of the three layers that define each pipeline stage of each category of data use cases: Control and Compute. The Control layer defines how the user interacts with the system, while the Compute layer defines howContinue reading “Layers of Data Infrastructure 3: Storage”

Data Infrastructure Layers 2: Compute

Photo by Noah Negishi on Unsplash In my last post I described how you can think of your organization’s data infrastructure as a grid of blocks defined by category of use case and stage of the pipeline. Each block can be further broken down into three layers: Control, Compute and Storage. Last time I brieflyContinue reading “Data Infrastructure Layers 2: Compute”

Data Infrastructure Layers 1: Control

Photo by CHUTTERSNAP on Unsplash In my last two posts, I started to break down the types of areas where an organization might need to deploy data tools/infrastructure along two axes: the categories of common use cases and the stages that you’ll encounter in most of these use cases. You can think of these asContinue reading “Data Infrastructure Layers 1: Control”

Common Stages of Data Workflows

Photo by tian kuan on Unsplash I want to start going into more details of the categories of data use cases that I introduced in my last post. When you think of each use case, it’s easy to focus on a fairly narrow piece of it – typically the most interesting parts. But within eachContinue reading “Common Stages of Data Workflows”

Categories of Data Use Cases

Photo by Martin Woortman on Unsplash As the head of software engineering at a small startup with ambitions to grow much larger, I think a lot about how to design data infrastructure that will both address our immediate needs and adapt to future needs. I’ve seen what happens at large companies when each team hasContinue reading “Categories of Data Use Cases”