Data Centre expansion, Databricks copying Clickhouse (?) and Human-in-the-loop workflows #119 w/e 23 Jan 2026
Join the 6,500-strong data herd getting all you need to know about Data for your Friday roundup
If we needed any reminding, this week showcased AI = data centre expansion. Big money, big builds, and governments trying to keep up with the power + planning reality.
Infra constraints become product constraints: where your warehouse runs, how fast you can scale, and what “always-on” reliability actually looks like. APAC and EMEA are trying to join the race (Google investing $15bn in a 100MW data centre India, Vantage expanding), so expect more regional options and more variance in latency, cost, and quotas.
Separately Databricks employees were very happy and excited about their 2027 Sales Kick-off. We’ll be monitoring that announcement surprise closely, but after Clickhouse announced postgres to Clickhouse replication in Clickhouse cloud, you have to feel there is a Neon angle at play.
Orchestra Product Updates
Want to find out the latest live? Join us on January 28th at 4pm UKT! LINK HERE
Separately..
Alteryx is often one of the more common tools in the analytics stack.
With Orchestra’s new Alteryx integration, you can now trigger and monitor Alteryx jobs directly from your pipelines. This makes it easy to coordinate Alteryx with upstream data ingestion, downstream transformations, and notifications, while keeping everything observable in one place. Alteryx becomes a composable building block in your data platform, not a silo.
January Feature Log
Pipeline Context Propagation — Triggered pipelines can now inherit the parent run context as an input.
Persistent pipeline table sorting — Your pipeline table sort order is now saved per user between sessions. Read more.
Alteryx integration — You can trigger and monitor Alteryx jobs directly from Orchestra pipelines. Docs here.
Richer Slack notifications — Slack Tasks now support Slack Blocks for structured, richer alerts. Check out Slack’s Block Kit Builder.
State-aware savings + programmatic pipeline runs — New scripts help quantify state-aware savings and run pipelines in bulk programmatically. This is huge! Read more.
Control upstream → downstream triggers — Choose which upstream run statuses should trigger downstream pipelines. Docs here.
dbt → Slack alerts: user groups + multi-channel routing — dbt Slack alerts now support @user groups and multi-channel routing. Read more.
Sensor Alerts — Add alerts on failing sensors to catch freshness/dependency issues early. Docs here.
NEW
Easier Git-Connections. You can now just “click a button” and if you’re signed in to your github, it will “just-work”
Usage Page. Head over to Settings —> Usage to see an overview of usage and limits
Approvals / Human-in-the-Loop - Pipelines can now be paused until a human approves the next step. Docs here.
Medium 🧠
Editors Pick
🧠 Best Practices for Adopting the Snowflake Internal Marketplace (link)
🧠 If You Want to Become a Data Scientist in 2026, Do This (link)
🧠 The Trillion-Dollar Curly Brace: Why JSON is Eating Your CPU (link)
🧠 Get started with Python on Azure | Orchestra and Azure Container Apps (link)
🧠 7 Things Nobody Tells You About Owning Data Pipelines (link)
Editors Pick
🧠 The Ghost in the Shell | How Nebius is taking on Databricks, Snowflake, and Everyone else (link)
🧠 Google Trends is Misleading You: How to Do Machine Learning with Google Trends Data (link)
🧠 Just the Gist: Data ShareBack from Snowflake Consumer to Snowflake Provider with Native Apps (link)
🧠 Read External Data Sources Directly in Snowpark Python with the New JDBC API (Public Preview) (link)
LinkedIn🕴
🕴 Is the AI Hype Bubble Finally About to Burst? (link)
🕴 When Looking Smart Gets in the Way of Selling (link)
🕴 LEIT DATA Joins Snowflake Intelligence as a Launch Partner (link)
Editors Pick
🕴 Why ClickHouse May Become the Default Cloud for Developers (link)
🕴 Who Actually Has the Freedom to Speak Up at Work? (link)
🕴 Why Does This Databricks AUTO CDC Example Fail with a Syntax Error? (link)
🕴 Upcoming Data & AI Events You Won’t Want to Miss (link)
News 📰
📰 Snowflake Drops $1B on Observe—Datadog’s Nightmare (link)
📰 Databricks raises a Series K investment (link)
📰 Datarails Raises $70M Series C Led by One Peak to Make AI the Foundation of the CFO’s Office (link)
📰 Databricks 2027 Sales kick-off’s surprise announcement (link)
YouTube and Podcast 🎥
Editor’s Pick
🎥 OneLake Explained in less than 10 Minutes (link)
🎥 Power BI | Build Dynamic Matrix Visuals with Month Selection Filtering (link)
🎥 Getting started with the Orchestra CLI (link)
🎥 Streamlining Code Reviews with DBT and ChatGPT 🚀 (link)
🎥 No More Writing SQL for Quick Analysis (link)
🎥 Building the Agentic Lakehouse Experience (link)
Special 💫
💫 Data Engineering Weekly #253 (link)
💫 Data Contracts: A Missed Opportunity (link)
Jobs
💼 dbt consultant(s) at Uncommon Schools (link)
💼 Sr. Analytics Engineer at Kard (link)
💼 Senior Data Engineer (IC4) at Roo Vet (link)
💼 Lead Analytics Engineer (Nuuly) at URBN (link)
💼 Senior Data Platform Engineer at Inspiren (link)
Run dbt models cheaply and easily?
If you’re looking for an easy way to run your dbt core models, look no further than Orchestra.
dbt, dbt core and dbt labs are all trademarks of dbt labs inc




