4 Trends Shaping Data Engineering in 2023

4 Trends Shaping Data Engineering in 2023

Hi There,

 If 2022 was a watershed year for data organizations, 2023 is bound to be even more transformative. 

 As companies ingest greater volumes of data and possible use cases grow (I’m looking at you, ChatGPT), the pressure on data organizations to deliver faster, more reliable insights will increase, too. 

 So, how can teams stay ahead of the curve and as close to business impact as possible? In my opinion, it starts by working directly with data consumers.

 Here are a few of my predictions for how we get there: 

  • Treating Data Like a Product Goes from Concept to Reality. In 2021 and 2022, treating data like a product was little more than a buzzword. In 2023, more and more companies will seek to integrate ways to track and monetize data generated by these data-driven products as part of their core offerings to drive competitive advantage. As a result, teams must start treating data platforms, analytics, and other “data products” like iterative and revenue-generating software systems.
  • Business Intelligence Will Integrate with More of the Stack. Data teams will look for solutions that narrow the gap between data producers and consumers, for instance, collaborative data notebooks that merge analytics with data science modeling, and spreadsheets generated directly in the warehouse. This will help bring data pipeline maintenance (and troubleshooting) closer to the insights these pipelines generate, leading to faster time to value for critical data assets.
  • Data Teams Will Spend More Time On FinOps and Cost Optimization. It’s no secret that the macro-economic environment has caused many organizations to start focusing on optimizing operations and profitability. This makes it even more important for data teams to add value to the business by acting as a force multiplier on the efficiency of other teams as well as generating new revenue through data products. Newly introduced, cost optimization will become an increasingly important third avenue.
  • Data Trust Becomes a First-Class Citizen. You’re investing in Snowflake, Redshift, Databricks, and a menagerie of other tools to expedite analysis and make better decisions and products with data. But - can you actually trust it? As budgets tighten and teams seek to justify the cost of their infrastructure, it’ll be no-brainer to invest in solutions and processes that help realize the potential of the cloud-based data stack, like data observability.

Any I missed?  

Here’s wishing you no data downtime in 2023 - and beyond.

Happy New Year,

Barr Moses 

CEO, Monte Carlo

Recommended Reading

10 Data Trends & Predictions for 2023

If you’re still not satisfied, check out this post I wrote featuring commentary from Tomasz Tunguz , an early backer of Looker and other leading data companies, about our predictions in data engineering and analytics for the New Year, including the maturation of data contracts, the rise of cloud-prem, and the promise of a metrics layer.

Types of Data Products

Speaking of data products... data mesh—and nearly every other modern approach to data management—dictates that teams treat data like a product. But what does that even mean? In his latest, Luke Lin , Director of Product Management, Data, at SoFi distills this beloved and oft-misunderstood concept, diving into the three main types of data products: data platforms, insights, and activation. 

Data Quality Camp

Launched in November, Chad Sanderson , former data leader at Convoy and Data LinkedIn influencer, launched a new Slack group, Data Quality Camp, to help data engineering teams grok data quality best practices and other topics related to scaling data trust. dbt Cloud’s new pricing model? The warehouse / lakehouse wars? CI/CD for your data pipelines? You name it, there’s probably a thread for it.

Testing & Monitoring the Data Platform at Scale

The data team at Checkout.com is behind one of the most advanced data observability strategies today, with a multi-layered approach to testing, monitoring, incident detection, and resolution. Leveraging Datadog , dbt Labs , Airflow, and other tooling, they’ve built and scaled an incident management workflow to impress even the best DevOps teams. Alexandre Carvalho Serge Bouschet Martynas Matimaitis Jacob Holland

The Build vs. Buy Guide for Your Modern Data Stack

To build your data platform or buy your data platform - that is the question. And for good reason. In this guest contribution to the Data Downtime Blog, Nishith Agarwal , Head of Data & ML at Lyra Health and creator of Apache Hudi, shares some of his top-line cost, resource, and competitive considerations when deciding which tools to build or buy for your modern data stack. 

Upcoming Webinars & Events

Cameron Price

Founder | Senior Data Executive | 30 Years of Leadership in Data Strategy & Innovation | Executive Director | Mentor | Strategy | Analytics | AI | Transformation | ESG

2w

Thanks for sharing, Barr! How do you see data teams adapting to cost optimization and data partnerships in 2023? Would love your thoughts on integrating FinOps practices and collaborating with external data sources.

Like
Reply
Todd Scofield

Catalyst for Creativity in Big Data & High Performance Computing

1y

Hi Barr, Full Fidelity Metadata (f2md) is going to change all things DATA. Knowing before first move ALL details of ALL the DATA, simply and FAST, will change how AI/ML/Analytics is delivered well. -Todd

Like
Reply
Andrei Svirida

Delivering outcomes with Data & AI | Senior Director at Slalom

1y

Thanks a lot for these useful insights Barr Moses - very helpful!

Like
Reply

Interesting to see the trendlines on the chart, in my opinion I suspect there would be some overlaps/blurred lines between roles (e.g. Data Analyst / Data Scientist) as it is something I have experienced in the past. In terms of predictions, I think that data partnerships will be important as companies overlay their data with second and third party data to enhance and evolve their insights. The concept and demand of data marketplaces are becoming popular so there needs to be consideration around data sharing and collaboration (ties into the data trust point).

Like
Reply

To view or add a comment, sign in

Insights from the community

Others also viewed

Explore topics