Why Multidimensional Scaling Fails?

Rahul Kumar

Lead Data Scientist @ZF | Ex PayPal | IITH

Published Feb 15, 2022

Why MDS fails to give us meaningful embeddings:

MDS arranges points in 2D (or another low-dimensional space) so that their pairwise distances match the pairwise distances in the original high-dimensional space as closely as possible.

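The objective (loss) it minimizes, in its standard metric form, is:

$$L(y) = \sum_{i<j} \left( d_{ij} - \lVert y_i - y_j \rVert \right)^2$$

where d_ij is the distance between points i and j in the original high-dimensional space and ||y_i - y_j|| is the distance between the same pair in the low-dimensional embedding, preferably 2D.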
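To make the loss concrete, here is a minimal sketch that evaluates it for a given embedding. It assumes the standard metric-MDS form, the sum over all pairs of the squared difference between the original distance and the embedded distance. The function name `mds_stress` and the toy points are made up for illustration only.

```python
import math

def mds_stress(high_dim_points, low_dim_points):
    """Sum over pairs of (original distance - embedded distance)^2."""
    n = len(high_dim_points)
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            d_ij = math.dist(high_dim_points[i], high_dim_points[j])  # original distance
            y_ij = math.dist(low_dim_points[i], low_dim_points[j])    # embedded distance
            total += (d_ij - y_ij) ** 2
    return total

# Three points in 3D and a hypothetical 2D "embedding" that just drops the last coordinate.
X = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0), (0.0, 0.0, 5.0)]
Y = [(0.0, 0.0), (3.0, 4.0), (0.0, 0.0)]
print(mds_stress(X, Y))
```

The first and third points collapse onto each other in 2D even though they are 5 units apart in 3D, and that pair contributes most of the loss.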

Trying to preserve high-dimensional distances exactly is usually a bad idea, because in general they cannot all be reproduced in 2D. This is a consequence of the curse of dimensionality.

What is the curse of dimensionality?

The curse of dimensionality covers several related problems. Their common theme is that when the dimensionality increases, the volume of the space increases so fast that the available data become sparse. In order to obtain a reliable result, the amount of data needed often grows exponentially with the dimensionality. Also, organizing and searching data often relies on detecting areas where objects form groups with similar properties; in high-dimensional data, however, all objects appear to be sparse and dissimilar in many ways, which prevents common data organization strategies from being efficient.

Now, coming back to our topic.

Let us look at an example: randomly generated data points drawn from a Gaussian with unit variance, and the distribution of their pairwise distances.

As we increase the number of dimensions, the distribution of pairwise distances shifts further and further away from zero, meaning the distances end up at the higher end and almost no pair of points is close together.
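This experiment can be reproduced with a few lines of code. The sketch below uses only the Python standard library; the sample size of 200 points, the seed, and the chosen dimensions are arbitrary assumptions, not the settings behind the original plot.

```python
import math
import random
import statistics

def pairwise_distance_stats(dim, n_points=200, seed=0):
    """Return (mean, std, min) of pairwise Euclidean distances for Gaussian points."""
    rng = random.Random(seed)
    # Each coordinate is drawn independently from a unit-variance Gaussian.
    points = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_points)]
    dists = [math.dist(points[i], points[j])
             for i in range(n_points) for j in range(i + 1, n_points)]
    return statistics.mean(dists), statistics.stdev(dists), min(dists)

for dim in (2, 10, 100):
    mean, std, smallest = pairwise_distance_stats(dim)
    print(f"dim={dim:4d}  mean={mean:6.2f}  std={std:5.2f}  min={smallest:6.2f}")
```

For unit-variance Gaussian data the mean squared distance between two points is 2 x dim, so the typical distance grows with the square root of the dimension. The smallest distance grows too: in 2D some pairs are almost on top of each other, while in 100 dimensions no pair is close.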

So suppose you want to keep two points separated by a large distance in 2D. In doing so, some other points will end up close together, with pairwise distances near zero, which never happens in the original dimension. The point is that you are trying to fit the green distribution with the blue one, and that is why MDS fails to produce a meaningful embedding.

The embedding of the MNIST data looks pretty much inseparable: the classes are not pulled apart.

Another issue with MDS is its quadratic memory and time complexity in the number of points, since it works with all pairwise distances.

The way this has been dealt with is: instead of preserving distances, preserve nearest neighbours.

To read more about this idea, please take a look at the paper below:
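A quick back-of-the-envelope calculation shows how fast the quadratic cost grows. The sketch below only counts the distinct pairs and the memory needed to store one 8-byte float per pair; the helper name `distance_matrix_cost` is made up for illustration, and real implementations may store a full square matrix and need even more.

```python
def distance_matrix_cost(n_points, bytes_per_value=8):
    """Number of distinct pairs and bytes needed to store one value per pair."""
    n_pairs = n_points * (n_points - 1) // 2
    return n_pairs, n_pairs * bytes_per_value

for n in (1_000, 10_000, 70_000):  # 70,000 is the size of the full MNIST set
    pairs, n_bytes = distance_matrix_cost(n)
    print(f"n={n:6d}  pairs={pairs:13,d}  memory={n_bytes / 1e9:6.2f} GB")
```

Multiplying the number of points by 10 multiplies the number of pairs by roughly 100, so full MNIST already needs billions of stored distances.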
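As a rough illustration of what "preserving neighbours" can mean, the sketch below turns distances from one point into neighbour probabilities with a Gaussian kernel, in the spirit of the SNE paper linked in this article. It is a simplification under stated assumptions: a single fixed `sigma` is used for every point, whereas the paper chooses a separate value per point from a target perplexity, and the function name and toy points are hypothetical.

```python
import math

def neighbour_probabilities(points, i, sigma=1.0):
    """Probability that point i picks each other point j as its neighbour."""
    weights = {}
    for j, x_j in enumerate(points):
        if j == i:
            continue  # a point is never its own neighbour
        sq_dist = math.dist(points[i], x_j) ** 2
        weights[j] = math.exp(-sq_dist / (2.0 * sigma ** 2))  # Gaussian kernel
    total = sum(weights.values())
    return {j: w / total for j, w in weights.items()}

# Point 0 has one close neighbour and two far-away points.
points = [(0.0, 0.0), (0.5, 0.0), (4.0, 0.0), (10.0, 0.0)]
probs = neighbour_probabilities(points, i=0)
print(probs)
```

Almost all of the probability goes to the close neighbour, and the two distant points get nearly the same (tiny) probability even though their distances differ a lot. An embedding that matches these probabilities only has to keep neighbours close; it does not have to reproduce the large distances exactly.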

https://www.cs.toronto.edu/~hinton/absps/sne.pdf

Hope you like it!

Thanks
