Building with Patterns in MongoDB

Venkat Bobbili

Cloud-Agnostic Data Architect|Solution Architect|Data Architecture Director|Cloud Solutions Technology Director|Data Governance & Observability Architect |CDO | CTO | Global Talent Visa Holder | No Sponsorship Required

Published Feb 16, 2024

Building with Patterns in MongoDB refers to the strategic and thoughtful application of design patterns to enhance the efficiency, scalability, and maintainability of database structures. MongoDB, being a flexible and schema-less NoSQL database, encourages the adoption of patterns tailored to specific use cases. The Building with Patterns approach involves understanding the data access patterns of an application and crafting MongoDB schemas that align with these patterns. Common patterns include the use of Embedded Data Models for one-to-many relationships, the Bucket Pattern for time-series data, and the Outlier Pattern for handling outliers in datasets. By leveraging these patterns, developers can optimize query performance, simplify data retrieval, and ensure the scalability of MongoDB databases. This approach facilitates the creation of robust, adaptable, and high-performance database solutions tailored to the unique requirements of each application.

Approximation

The Approximation Pattern is useful when expensive calculations are frequently done and when the precision of those calculations is not the highest priority.

Pros

Fewer writes to the database.
Maintain statistically valid numbers.

Cons

Exact numbers aren’t being represented.
Implementation must be done in the application.

Attribute

The Attribute Pattern is useful for problems that are based around having big documents with many similar fields but there is a subset of fields that share common characteristics and we want to sort or query on that subset of fields. When the fields we need to sort on are only found in a small subset of documents. Or when both of those conditions are met within the documents.

Pros

Fewer indexes are needed.
Queries become simpler to write and are generally faster.

Bucket

The Bucket Pattern is a great solution for when needing to manage streaming data, such as time-series, real-time analytics, or Internet of Things (IoT) applications.

Pros

Reduces the overall number of documents in a collection.
Improves index performance.
Can simplify data access by leveraging pre-aggregation.

Computed

When there are very read intensive data access patterns and that data needs to be repeatedly computed by the application, the Computed Pattern is a great option to explore.

Pros

Reduction in CPU workload for frequent computations.
Queries become simpler to write and are generally faster.

Cons

It may be difficult to identify the need for this pattern.
Applying or overusing the pattern should be avoided unless needed.

Document Versioning

When you are faced with the need to maintain previous versions of documents in MongoDB, the Document Versioning pattern is a possible solution.

Pros

Easy to implement, even on existing systems.
No performance impact on queries on the latest revision.

Cons

Doubles the number of writes.
Queries need to target the correct collection.

Extended Reference

You will find the Extended Reference pattern most useful when your application is experiencing lots of JOIN operations to bring together frequently accessed data.

Pros

Improves performance when there are a lot of JOIN operations.
Faster reads and a reduction in the overall number of JOINs.

Cons

Data duplication.

Outlier

Do you find that there are a few queries or documents that don’t fit into the rest of your typical data patterns? Are these exceptions driving your application solution? If so, the Outlier Pattern is a wonderful solution to this situation.

Recommended by LinkedIn

MongoDB

ARNAB MUKHERJEE 🇮🇳 1 year ago

Starting with YugabyteDB or MongoDB?

Franck Pachot 3 months ago

Billion Dollar Unicorns: MongoDB Rises High on NoSQL…

Sramana Mitra 9 years ago

Pros

Prevents a few documents or queries from determining an application’s solution.
Queries are tailored for “typical” use cases, but outliers are still addressed.

Cons

Often tailored for specific queries, therefore ad hoc queries may not perform well.
Much of this pattern is done with application code.

Pre-allocation

When you know your document structure and your application simply needs to fill it with data, the Pre-Allocation Pattern is the right choice.

Pros

Design simplification when the document structure is known in advance.

Cons

Simplicity versus performance.

Polymorphic

The Polymorphic Pattern is the solution when there are a variety of documents that have more similarities than differences and the documents need to be kept in a single collection.

Pros

Easy to implement.
Queries can run across a single collection.

Schema Versioning

Just about every application can benefit from the Schema Versioning Pattern as changes to the data schema frequently occur in an application’s lifetime. This pattern allows for previous and current versions of documents to exist side by side in a collection.

Pros

No downtime needed.
Control of schema migration.
Reduced future technical debt.

Cons

Might need two indexes for the same field during migration.

Subset

The Subset Pattern solves the problem of having the working set exceed the capacity of RAM due to large documents that have much of the data in the document not being used by the application.

Pros

Reduction in the overall size of the working set.
Shorter disk access time for the most frequently used data.

Cons

We must manage the subset.
Pulling in additional data requires additional trips to the database.

Tree

When data is of a hierarchical structure and is frequently queried, the Tree Pattern is the design pattern to implement.

Pros

Increased performance by avoiding multiple JOIN operations.

Cons

Updates to the graph need to be managed in the application.

Conclusion

The MongoDB document model offers significant flexibility in data modeling, providing a powerful tool for developers. However, this flexibility must be carefully channeled to align with the specific data access patterns of your application. It's crucial to recognize that the schema design in MongoDB significantly impacts the performance of your application, and performance issues often stem from inadequate schema design. To maximize the potential of the document model, consider utilizing schema design patterns in conjunction with each other, as appropriate. For instance, Schema Versioning can complement other patterns as your application evolves. This article covers twelve schema design patterns, equipping you with the tools and knowledge necessary to effectively leverage the flexibility inherent in the MongoDB document model.

Building with Patterns in MongoDB

Venkat Bobbili

Cloud-Agnostic Data Architect|Solution Architect|Data Architecture Director|Cloud Solutions Technology Director|Data Governance & Observability Architect |CDO | CTO | Global Talent Visa Holder | No Sponsorship Required

Pros

Cons

Pros

Bucket

Pros

Pros

Cons

Pros

Cons

Pros

Cons

Recommended by LinkedIn

Pros

Cons

Pros

Cons

Pros

Pros

Cons

Pros

Cons

Pros

Cons

More articles by this author

Insights from the community

Others also viewed

MongoDB: A Robust Solution for Transactional Use Cases

MongoDB In 500 Words

Mastering MongoDB CRUD Operations: A Step-by-Step Guide

Case Study on How Industries are using MongoDB.

How Companies Using MongoDB?

Database Design 2.0: It's Not Your Grandpa's Data Storage Anymore

MongoDB Real World Use Cases: Advantages & Top Companies [2022]

What is MongoDB and how it Work?

What is MongoDB? What are the different data types in MongoDB?

Exploring MongoDB: A Comprehensive Guide for Modern Data Management

Explore topics

Pros

Cons

Pros

Bucket

Pros

Pros

Cons

Pros

Cons

Pros

Cons

Recommended by LinkedIn

Pros

Cons

Pros

Cons

Pros

Pros

Cons

Pros

Cons

Pros

Cons

Zachman Framework

Sep 1, 2024

Power of Data Observability

Mar 17, 2024

Delving into Collective IntelligenceHarnessing the Strength of Collaborative Wisdom

Mar 16, 2024

From Data to AI to Collective Intelligence

Mar 16, 2024

Lambda Architecture: Unifying Data Processing Potential on AWS

Mar 16, 2024

Digital Transformation: The Power of Data Centralisation

Mar 16, 2024

Bridging the Gap: Unleashing the Power of Data Architecture to Unify IT and Data Teams

Mar 10, 2024

Navigating the World of Embeddings and Embedding Models

Mar 7, 2024

Unlocking Intelligent Transformation: AI and Gen AI in Public and Private Sectors

Feb 29, 2024

Unlocking Global Scalability with Amazon RDS Global Database

Feb 27, 2024

Insights from the community

Others also viewed

MongoDB: A Robust Solution for Transactional Use Cases

MongoDB In 500 Words

Mastering MongoDB CRUD Operations: A Step-by-Step Guide

Case Study on How Industries are using MongoDB.

How Companies Using MongoDB?

Database Design 2.0: It's Not Your Grandpa's Data Storage Anymore

MongoDB Real World Use Cases: Advantages & Top Companies [2022]

What is MongoDB and how it Work?

What is MongoDB? What are the different data types in MongoDB?

Exploring MongoDB: A Comprehensive Guide for Modern Data Management

Explore topics