Why Data Gravity Will Grow Stronger
The term “data gravity” describes the tendency of data and applications on a network to attract more applications and data. The idea borrows from Newtonian gravity: the larger the mass, the stronger the attraction. The term was coined by Dave McCrory in 2010.
One of the first things you discover with large data sets is that they are hard to move. The need for low latency and high throughput drives data gravity. AWS famously rolled out its Snowmobile service to help customers move up to 100 petabytes of data per truck; it is literally a storage data center in a box, delivered by a semitrailer truck. Even with a dedicated 10 Gbps connection to the cloud running at full throughput, transferring that much data would take nearly three years. That’s the throughput problem in a nutshell.
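To see where that figure comes from, here is a minimal back-of-the-envelope sketch in Python. It assumes an idealized, fully saturated 10 Gbps link and decimal petabytes; the idealized result is roughly two and a half years, and real-world protocol overhead and less-than-perfect utilization push it toward the three-year mark.

```python
# Back-of-the-envelope transfer time for 100 PB over a 10 Gbps link.
# Assumes the link is fully saturated the entire time (an idealization).

DATA_BYTES = 100 * 10**15          # 100 petabytes (decimal)
LINK_BITS_PER_SEC = 10 * 10**9     # 10 Gbps

seconds = (DATA_BYTES * 8) / LINK_BITS_PER_SEC
days = seconds / 86_400
years = days / 365

print(f"{days:,.0f} days (~{years:.1f} years at the idealized rate)")
# -> 926 days (~2.5 years at the idealized rate)
```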
Additionally, applications want fast access to their data. If an application runs in a data center in Chicago and needs data stored in a data center in Ashburn, Virginia, every request has to wait for data to travel back and forth. If the application and the data are both in Ashburn, each access can complete in roughly 5% of the time, which can make the application more than 20 times faster in some cases. That’s the latency component of data gravity.
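The effect compounds quickly for “chatty” applications that make many sequential requests. The sketch below uses illustrative latency figures chosen to match the ratios above (roughly 15 ms for a cross-country round trip versus 0.75 ms within one facility); they are assumptions, not measurements.

```python
# Illustration of how round-trip latency compounds for chatty applications.
# Latency values are illustrative assumptions, not measured figures.

REMOTE_RTT_MS = 15.0     # Chicago <-> Ashburn round trip (assumed)
LOCAL_RTT_MS = 0.75      # same-data-center round trip (assumed)
SEQUENTIAL_REQUESTS = 1_000

remote_total_s = SEQUENTIAL_REQUESTS * REMOTE_RTT_MS / 1000
local_total_s = SEQUENTIAL_REQUESTS * LOCAL_RTT_MS / 1000

print(f"remote: {remote_total_s:.2f} s, local: {local_total_s:.2f} s, "
      f"speedup: {remote_total_s / local_total_s:.0f}x")
# -> remote: 15.00 s, local: 0.75 s, speedup: 20x
```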
Big data is a rapidly growing market, and the enormous demand for big data solutions is driven by the value they can bring to enterprises. McKinsey, for example, estimated a potential $100 billion in annual value for the U.S. health care system. If you look at big data success stories, however, you will notice that many projects rely only on data from within a single organization. But there are efforts to unearth unstructured data that is not yet electronically readable, and a wide variety of efforts to share data safely between organizations.