UNCOVERED: data vendor price increases, Le Chat, Orchestra deployment envs #71 w/e 14 Feb 2025
Join the 5,000-strong data herd getting all you need to know about Data for your Friday roundup
There are going to be some big big things over the coming weeks. Please share and subscribe if the below are of interest
Streaming Wars V2 - the streaming wars are heating up. is this 2015 all over again? We’ll be bringing you an exclusive write-up with inside info from across the data space
PRICING INCREASES - we have uncovered that there is a concerted effort across multiple vendors to increase costs, and we think we know who is next. That’s why we’ll be putting out a GUIDE To saving csots and protecting your P&L
Orchestra deployment environments- if you’ve ever had to manage multiple airflow instances or kubernetes clusters you know how painful it is. Fortunately, Orchestra now offers a way to get around this issue; powerful, elegant.
Want to get more news like this delivered into your inbox? Subscribe now
Save literally hundreds of hours with environments
This is a low-key feature with a big impact.
We’ve spoken to countless Data Engineering teams that are trying to become “Hub and Spoke” instead of “Centralised”. Both legacy frameworks like Airflow and more modern frameworks suffer from one crucial design flaw - size.
The services required to orchestrate are ENORMOUS. Think of Airflow - you have a UI, a scheduler, a webserver, a database. Spinning this up in Dev, UAT, and PROD means THREE TIMES the infrastructure.
And sure, you can go for a managed version. But all these managed providers are doing is spinning up multiple instances and charging you a premium for it.
We’re excited to announce Orchestra now supports multiple environments. This is a super powerful and lightweight way to re-use environment variables that are pointers to things like values, IDs and connections.
By leveraging Orchestra’s serverless infrastructure, there’s no need to worry about ensuring staging is a direct replica of prod or that the settings in dev marry up to prod as well - Orchestra takes care of all this for you.
This means data engineers can now get the benefits of software best practices and build more robust data pipelines by testing orchestration logic in a staging area first. We’re super excited to see what you build.
Winter Data Conference
Excited to share that anyone using our special code HUGO50 can get a 50% discount to the Winter Data Conference in Zell Am See - check it out here.
Meme Drop
we will have more memes this week so follow Hugo on Linkedin for the latest memes
Medium 🧠
🧠 Building Multi-Tool Agents on Snowflake: A Hands-On Guide (link)
🧠 Learnings from a Machine Learning Engineer — Part 1: The Data (link)
🧠 7 Reasons Why Orchestra is an ideal choice for lean Data Teams (link)
🧠 The Architecture of Apache Iceberg: Unlocking the Modern Data Lakes (link)
🧠 Should Data Scientists Care About Quantum Computing? (link)
LinkedIn🕴
🕴 Airbyte Unveils Capacity-Based Pricing: Predictable Costs, Maximum Value! (link)
🕴 Accenture Wins $1.2B in GenAI Deals—But Do They Really Get AI? (link)
🕴 Mistral’s ‘Le Chat’ Takes on ChatGPT—A Game-Changer for European AI! (link)
🕴 Elon Musk’s $97.4B Bid for OpenAI—Genius Move or Power Play? (link)
🕴 Winter Data Conference 🇦🇹⛷️ Learn, Network & Ski in Zell Am See! (link)
🕴 Snowflake’s Cortex.Complete()—Smarter LLM Calls, Lower Costs! 🚀❄️ (link)
News 📰
Editor’s Pick
📰 Snowflake Pipe Syntax comes to spark (link)
📰 Orion Deepens Integrations With Snowflake, Pontera
Announced separately this week, the updated technology will provide firms with more choice in cloud data management and access to workplace retirement plans from within Orion Trading… Read More
📰 Melbourne startup Factor House raises $5 million to expand real-time data tools
Melbourne-based enterprise technology startup Factor House has raised $5 million in seed funding to accelerate the launch of its flagship product… Read More
📰 AI platform secures six-figure funding to boost hospitality data
Distil.ai has secured £350,000 from the South West Investment Fund, part of a £1m funding round that also includes investment from Waterspring Ventures… Read More
YouTube and Podcast 🎥
Editor’s Pick
🎥 Orchestra vs. Airflow | End-to-end Data/AI pipeline example (link)
🎥 Introduction to UV - fastest package manager built in Rust (link)
🎥 Your Connections are a MESS! Let's clean it up... (link)
🎥 Where Data Science Meets Shrek: How BuzzFeed uses AI (link)
Special 💫
💫 Cutter Associate’s 2025 trends for Data Platforms (link)
💫 The Cloud Judgement newsletter (link)
Jobs 💼
For the UK contingent - Accurx are looking for a Data Architect. They are a great team, highly recommend to reach out if you’re interested in the Healthcare / Healthtech space.
💼 Manager, Data Analytics & Insights at The Philadelphia Inquirer (link)
💼 Software Engineer at Blue Onion (link)
💼 Lead Analytics Engineer at Rewind Software (link)
💼 Senior Data Engineer at Spond (link)
💼 Senior Business Intelligence Analyst at Mood Media (link)
Want to save on your ingestion bills? You’ll love this
You can leverage Python for lightweight ELT integrations. Here you’re only paying for compute and not being penalised by row-based pricing models. Pretty neat right? Check it out below / head to Orchestra and start today.
The best place to run dbt?
Don’t believe us? Watch the video below.