Databricks acquire Neon for $1bn #84 w/e 16 May 2025
Join the 5,300-strong data herd getting all you need to know about Data for your Friday roundup
Yet. More. Acquisitions. This time with Databricks dominating the headlines once again.
Connect to AWS using IAM Roles
You can now connect to AWS using IAM Roles instead of IAM Users! Orchestra supports both IAM User authentication and IAM Role authentication for AWS services.
To connect using an AWS IAM Role, you need to configure the correct 'Trust relationships' on that role so that Orchestra can assume it.
When using this authentication method, Orchestra creates one IAM role per account. This role is used to assume any roles you have configured in your Orchestra connections. For security reasons, Orchestra sends an 'External ID' when assuming roles, so that your roles cannot be assumed by other Orchestra accounts. This is a best practice for cross-account access in AWS.
A trust policy is mandatory for any IAM Role in your AWS account that you wish to assume from Orchestra. Your Orchestra Account ID is available in account settings. The policy is shown in the setup guides for each AWS integration in Orchestra, where it is pre-filled with the correct account ID so you can copy it directly into AWS.
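To give a feel for the shape of such a policy, here is a minimal Python sketch of a generic AWS cross-account trust policy gated on an External ID. It is not Orchestra's actual policy: the helper `build_trust_policy`, the role ARN and the External ID value are all placeholders assumed for illustration, so always copy the real policy from the setup guide in Orchestra.

```python
import json


def build_trust_policy(trusted_role_arn: str, external_id: str) -> dict:
    """Build a generic AWS trust policy (hypothetical helper, not an Orchestra API).

    It lets a single trusted principal call sts:AssumeRole, and only when
    the caller supplies the expected External ID.
    """
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                # The principal allowed to assume your role (placeholder ARN below).
                "Principal": {"AWS": trusted_role_arn},
                "Action": "sts:AssumeRole",
                # The assume-role call is rejected unless the External ID matches.
                "Condition": {"StringEquals": {"sts:ExternalId": external_id}},
            }
        ],
    }


# Placeholder values, assumed for illustration only.
policy = build_trust_policy(
    trusted_role_arn="arn:aws:iam::111122223333:role/example-orchestra-role",
    external_id="example-external-id",
)

# Print the JSON you would paste into the role's 'Trust relationships' tab.
print(json.dumps(policy, indent=2))
```

The `Condition` block is what enforces the External ID check described above: a caller that knows the role ARN but not the External ID cannot assume the role.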
You can read the docs here
Gorgeous!
Medium 🧠
🧠 Why I’m Betting on Snowflake for AI Startups (link)
🧠 How To Build a Benchmark for Your Models (link)
🧠 How to Learn the Math Needed for Machine Learning (link)
🧠 KNN Algorithm in Data Mining with Example (link)
🧠 100 Days of Data Engineering Day 81: Creating Customer 360 View Using SAP Master Data (link)
LinkedIn🕴
🕴 Meet Me at Current 2025 in London – Get 50% Off with My Code! (link)
🕴 Stop Listening, Start Stagnating: Feed Your Brain Like You’d Feed Your Model (link)
🕴 Optimizing Snowflake Python Just Got Easier (link)
🕴 Ephemeral Infra Is the Single-Use Plastic of Data Engineering (link)
🕴 Some incredible benchmarking on total cost of ownership for Databricks Photon vs. Snowflake vs. AWS EMR (link)
News 📰
Editor’s Pick
📰 KPMG acquires Metaphor (link)
📰 Fintech IPOs Give Hope To a Comeback Despite Market Volatility (link)
📰 Databricks announces deal to acquire database startup Neon for roughly $1B (link)
📰 A really interesting take on why Databricks acquired Neon (link) that I think wasn’t written with AI but let me know if you disagree??
📰 OroraTech secures €37m Series B to expand global wildfire satellite data system - OroraTech has confirmed it has extended its Series B funding round to €37 million, with participation from BNP Paribas Solar Impulse Venture Fund, Rabo Ventures, Bayern Kapital, Edaphon and the European Circular Bioeconomy Fund…
YouTube and Podcast 🎥
Editor’s Pick
🎥 From Query to Conversation: Microsoft Fabric’s Data Agents Explained (link)
🎥 Faster Data Pipelines development with MCP and DuckDB (link)
🎥 Getting Started with Dremio's MCP Server (link)
Special 💫
💫 Data Engineering Weekly #220 (link)
Jobs 💼
💼 Director of Data & Analytics at Trade Coffee (link)
💼 Senior Data Analyst at Super (link)
💼 Senior Analytics Engineer at Thrive Global (link)
💼 BI Engineer at Talkiatry (link)
💼 Analytics Manager at Paytient (link)
Want to save on your ingestion bills? You’ll love this
You can leverage Python for lightweight ELT integrations. Here you’re only paying for compute and not being penalised by row-based pricing models. Pretty neat right? Check it out below / head to Orchestra and start today.
The best place to run dbt?
Don’t believe us? Watch the video below.