Dataset productization systems
Raw data is turned into standardized, consumable products with defined schemas, documentation, and delivery mechanisms.
Data Engineering
Data Engineering is the design and construction of systems for collecting, storing, and processing data.
Data Engineering is the design and construction of systems for collecting, storing, and processing data. On this site, it matters because it transfers across technical, operational, and venture work instead of staying trapped in one narrow context.
Learn more: https://en.wikipedia.org/wiki/Data_engineering
Raw data is turned into standardized, consumable products with defined schemas, documentation, and delivery mechanisms.
Pipelines extract, transform, and load data across systems while enforcing schema, quality, and timing constraints.
Using SafeGraph location observation data and Databricks/Spark, built a geospatial model to quantify human risk around utility infrastructure — helping...
Google BigQuery is a serverless data warehouse for large-scale analytics.
A data brokerage and value-add business focused on transforming large raw datasets into usable commercial products.
Founded and operated a data company that processed hundreds of terabytes of raw, real-time data streams into clean, compliant, valuable datasets for a variety...
MongoDB is a noSQL database for flexible document storage.
MySQL is a relational database management system.
A utility-infrastructure company where geospatial risk modeling and data engineering were applied to physical-world asset prioritization.
Applying data engineering and systems thinking to personal life — automated financial infrastructure, multi-entity accounting, and the quantified self.