Modern, Cloud Native Application Design

Modern, Cloud Native Application Design

Understanding the value of microservices over monoliths, containers and Kubernetes, and SQL vs NoSQL can be complex. Here, I'll simplify it for you.

What Is a Distributed Application?

A distributed application is a software system that runs on multiple interconnected devices or servers. These applications are designed to handle large-scale data processing, high availability, and fault tolerance.

There’s a lot to unpack here, but first let's talk about why you’d want to consider a distributed design. In today’s modern era of wifi, cellular, mobile apps, social media and an expectation of instant gratification, clients or prospects expect a certain interaction and response. Thirty years ago developers didn’t have to build for this type of always online, always available, global solution. Client/Server environments consisted of hundreds, or maybe thousands of concurrent users. In those days we could have a single database running on a single host able to handle this level of concurrency. But when we consider the new world, everyone running their own mobile devices, and able to hit any application at any time of the day, we need to design and scale for many more. The most demanding apps such as amazon.com or instagram, uber or waze are designed for millions and tens of millions of concurrent users. You just can’t achieve this type of scale with a traditional database or a single cluster of ten or 20 web servers and application servers like we used to.

Designing a modern application to scale requires stepping out of our legacy comfort zone and learning to try new ideas. The focus of this paper will be to introduce you to these ideas and why they’ve been so critical in changing the world.

Microservices

Traditional monolithic applications architecture:

In the traditional web application architecture, the public cloud was not a prevalent option. Applications were developed and deployed within private data centers, typically leveraging VMware for virtualizing compute resources. Companies constructed web portals to manage the presentation layer for consumers, with an application layer handling business logic and a database layer for persistent storage and transactional consistency.

Traditional Monolithic Application Architecture

Components weren’t usually separated into granular areas, and everything was packaged and pre-deployed as a single monolithic unit. The web cluster was pre-provisioned to handle the worst case anticipated load, as was the app server cluster. In some cases each separate datacenter would be pre-provisioned to handle the worst case scenario, allowing the company resilience in a full datacenter outage.

SQL databases were scaled vertically, ensuring sufficient CPU, memory, and storage to handle potential loads, with additional resources allocated for running ad hoc queries and reports. Replication was utilized to maintain a backup copy in secondary data centers for failover during major outages.

While there are numerous variations of this architecture, many organizations continue to operate within it today, particularly those leveraging on-premises services and VMware as their runtime virtualization platform.

Legacy environments lacked autoscaling capabilities, and even if developers attempted to implement it themselves, limitations in dynamic capacity would have impeded progress. Traditional environments did not offer dynamic chargeback mechanisms, and every line of business (LOB) and application was closely scrutinized and sized based on budget constraints. If an organization purchased 20 CPUs, they aimed to utilize all 20 CPUs rather than leaving any idle, as there were no on-demand pricing or consumption models available.

Public Cloud

The advent of the public cloud brought about a paradigm shift, offering pre-built, fully managed services accessible with a single API call, alongside on-demand pricing models that enabled companies to pay only for what they utilized, without upfront costs or extensive commitments. This revolutionary concept laid the groundwork for the emergence of numerous new capabilities and concepts.

While the notion of a service-oriented architecture, containers, agile development, CI/CD, and DevOps were not novel, the introduction of the public cloud transformed their implementation. The availability of on-demand pricing, coupled with the ability to swiftly provision and deprovision resources through a single API call, along with a plethora of readily available managed services, empowered developers to embrace a novel approach to designing, building, and managing complex applications tailored to meet the demands of an ever-evolving global landscape.

Microservices Architecture

The concept behind microservices is to decompose a monolithic application into numerous smaller components, each operating independently of the others. This approach facilitates separate coding, testing, and debugging of each component, allowing different developers to focus on specific sections. Each microservice defines a clear interface for communication with other services, typically through network APIs, specifying the required data format for communication.

Modern Microservices Architecture

Although the concept of microservices wasn't novel, its practical implementation presented significant challenges. Breaking down an application into discrete parts offered numerous benefits but also posed several challenges. Managing the proliferation of virtual machines, each hosting a single microservice, exponentially increased the infrastructure requirements. Additionally, network complexities and routing issues emerged, areas that developers previously didn't delve into deeply. Furthermore, the introduction of numerous microservices necessitated the use of databases to store their respective data, leading to potential overload on traditional databases, which were accustomed to handling only hundreds or thousands of connections but now faced tens of thousands or even millions of connections.

Therefore, the adoption of microservices alone wasn't sufficient; it required a new platform capable of addressing these complexities comprehensively.

Kubernetes (k8s) and Containers

Even before Kubernetes (often abbreviated as k8s) gained widespread popularity, containers presented a compelling alternative to traditional virtualization. While virtualization allowed for the partitioning of hardware into smaller components, each virtual machine (VM) still represented a complete computer with its own operating system and underlying infrastructure. In contrast, containers offered additional partitioning within a shared VM and operating system, enabling the execution of multiple isolated workloads. This lightweight approach facilitated the deployment of hundreds or even thousands of microservices on a single VM, significantly improving the cost-effectiveness of microservice deployment.

While containers alone offer considerable advantages, managing a large number of them efficiently requires an orchestration platform. Several contenders emerged in this space, but Kubernetes emerged as the leading solution, offering advanced features such as networking and routing, health checking, auto-scaling, auto-recovery, canary testing, and version management. The seamless integration of containers with the Kubernetes orchestration platform empowered organizations to deploy and manage microservices at scale with unparalleled ease and flexibility.

Yet, it was the combination of containers and Kubernetes with the capabilities of the public cloud that truly unleashed their potential for scalability. On-premises deployments were hampered by hardware limitations, requiring the procurement, installation, and configuration of hardware tied closely to specific budgets and allocations. By leveraging Kubernetes on the public cloud, organizations gained access to virtually limitless scalability and flexibility. Cloud service providers (CSPs) responded by offering managed Kubernetes services, liberating developers from the complexities of managing large clusters while enabling operations teams to focus on governance and risk management without the constraints of physical infrastructure.

Horizontal Scale

Once the monolithic application was broken down into smaller components and paired with an efficient platform for running each part, organizations gained the ability to scale each component independently. Instead of having to over-provision capacity for the entire monolith, individual tasks could be provisioned with precisely the resources they required, precisely when they needed them. This ability lies at the core of why, when implemented effectively, the public cloud can be more cost-effective than on-premises solutions. Moreover, enhanced scalability brings about improved resilience. Horizontal scaling not only involves scaling across multiple hosts but also distributing them across different data centers. Cloud Service Providers (CSPs) offer availability zones (sometimes referred to as availability domains), which comprise distinct data centers or even separate data center campuses with latency typically under 10ms. Deploying a platform like Kubernetes (K8s) across multiple AZs significantly enhances resilience against host or facility failures.

However, independent scaling of each microservice introduces additional strain on traditional databases, which were not designed to handle potentially hundreds of thousands or even millions of concurrent users. Consequently, as the microservices pattern has evolved, so has the concept of each microservice managing its own data. This evolution has led to an increasing number of microservices adopting NoSQL solutions.

Databases

The industry has undergone significant evolution. Thirty years ago, SQL databases running on single hosts dominated the landscape, spanning platforms from IBM Mainframes with DB2 and Oracle to Open Systems with DB2, Oracle, Informix, Sybase, and Ingres, to Microsoft Windows platforms hosting Microsoft SQL Server and various alternatives. In the early days of SQL, storage was among the costliest infrastructure components, and SQL databases offering data normalization, such as 3rd normal form, served as a method for deduplicating data, thereby enhancing efficiency and accessibility.

It's only been in the past 10-15 years that we've witnessed a shift towards numerous new technologies, including solutions like NoSQL, Key/Value stores, Graph Databases, Caching and Distributed Caches, Clustering, Sharding, Columnar, Document, In-Memory, Distributed SQL, and many more.

This departure from traditional SQL and ACID-compliant databases has also precipitated a paradigm shift from Strong Consistency to Eventual Consistency.

Strong Consistency

A strong consistency model such as ACID (Atomicity, Consistency, Isolation, Durability) ensures that within a transaction spanning many tables, it can keep track of what changed and properly revert the change if a rollback is required. It also ensures a lock across all table rows for the duration of the transaction, so no one else can modify the affected records throughout the duration of the transaction.

This approach ideally requires everything to happen within a very short duration, due to the lock being held, and provides immediate gratification to the user knowing their request is either completed or failed.

In a traditional single host SQL database, if the database fails for any reason, upon restart everything is frozen until it completes crash recovery which means it performs a rollback of any open transactions prior to the crash, before making the database available again.

When operating a traditional SQL database, all applications have access to all tables, and the business logic is contained within that one app. Different applications might operate on the same database, and perhaps even the same tables, thereby relying on the database to maintain consistency.

Eventual Consistency

There are different types of eventual consistency. In some cases, it might refer to a single database, with storage devices each persisting the same data. In another case, it could be a single distributed database with each host needing to persist the data. In our case, we're talking about eventual consistency across different microservices performing complex transactions, with each microservice communicating with a different database. The database(s) in question could be SQL, NoSQL, Distributed, or any variation of the previously mentioned database types.

Eventual consistency allows data to be inconsistent across microservices (or tables) temporarily. Updates propagate asynchronously, and eventually, all systems converge to a consistent state. Don't confuse eventual consistency with data corruption. Databases and architectures leveraging an eventually consistent model still ensure data correctness and consistent request order processing, thereby ensuring data consistency.

Eventual consistency is ideal for microservices transactions, where each service owns its own data and is assured that no other services would have direct access to the underlying data. If any external applications want or need access to the data, they must do so via an API request to the microservice that owns the data. This approach ensures that no data can be manipulated out of order and relies on the microservice itself to ensure consistency across a complex transaction, not the database.

Several patterns exist, from different forms of locking to hashing requests, all to ensure the proper order of requests is maintained to eliminate the risk of transaction collisions.

Eventual Consistency solutions are ideal for highly distributed environments with potentially tens of thousands or millions of concurrent requests. To prevent overloading any single database, all requests are typically queued or streamed through a service and processed in the appropriate order, both asynchronously and in parallel. This pattern not only ensures optimal performance and resilience but also supports distributed databases, enabling each microservice to scale effectively.

SQL Databases for Scalability and Concurrency

Traditional databases aren’t ideal for high concurrency workloads such as modern web and mobile applications. Most SQL solutions offer optimized reading via distributed nodes, but still require all writes go back to a single host. This approach improves performance, but doesn’t deliver the linear scale necessary for some of the most demanding workloads.

There are some SQL databases that have been specially engineered to handle high concurrency and global distribution. For example Google’s Spanner or Cockroachdb are among some more popular SQL solutions that have improved on SQL concurrency with a distributed write model.

While it’s compelling to want to continue to use SQL databases for modern, distributed, cloud native workloads, there’s a few things to keep in mind. With each microservice owning its own data, the complexity of the schema for the microservice is greatly reduced from that of a traditional database schema supporting tens or hundreds of applications, sometimes even more. Therefore the need for many of the advanced features of a SQL database just aren’t needed, so other than comfort, there’s very little need to continue to use a SQL database. The idea of ACID consistency must change to an eventually consistent model to properly handle the decoupling that microservices offer, along with the scale they’re capable of.

NoSQL Databases for Scalability and Concurrency

For organizations wanting to develop the next Instagram, SnapChat or Uber, or any cloud based SaaS solution that needs to scale, learning how to properly leverage a NoSQL solution can be ideal. With a NoSQL database the schema is often flattened reducing the amount of IO requests in exchange for denormalization which results in larger storage requirements. But since storage has been greatly commoditized in the cloud, it’s inexpensive. And, with a flattened model, most microservices are a single IO for reads/writes so the performance is highly consistent and dependable. When we add the distributed nature of NoSQL solutions, their ability to scale near linear is ideal when combined with a microservices architecture that also scales horizontally.

With NoSQL databases such as MongoDB, DynamoDB, Cassandra, Redis we gain improved flexibility, scalability, and better performance for many cloud native workloads.

Conclusion

When designing distributed applications, consider the trade-offs between eventual consistency and strong consistency. NoSQL databases provide powerful tools for scalability and concurrency, but choose the right database based on your application’s requirements.

A distributed SQL solution can still be a good option for many workloads, including smaller environments that don’t have the demand for scale, smaller applications without the complexity, or smaller development teams can benefit from fewer microservices in place of larger services. Consider a banking app that needs to move money between accounts. Instead of separate services for taking from one account and putting into another account, you could conceivably wrap that into a single service. That means the one service needs to own all tables involved in the transaction, but it can still be a perfectly acceptable way to go. It can also be ok to mix SQL and NoSQL for different requirements.

It can be very hard for organizations to break away from decades of SQL experience and the confidence that Strong Consistency delivered. And for many organizations, they don’t have applications that need the kind of scale and resilience provided by NoSQL. When SQL has served an organization fine for years, there may not be a compelling reason to change, and that’s fine.

Remember, there’s no one-size-fits-all solution. Each system design depends on factors like workload, data model, and user expectations. Happy coding!


It's great to see discussions around cloud-native application design gaining traction. Your insights on the balance between modern NoSQL solutions and traditional SQL databases are essential as organizations navigate their technology choices. The evolution of microservices and containerization truly reshapes our approach to application architecture. What are some key factors you think should influence the decision between these technologies in a security-sensitive environment?

Like
Reply

To view or add a comment, sign in

Insights from the community

Others also viewed

Explore topics