Architecture

Scalability is the ability of a software system to handle increasing workloads — more users, data, transactions, or requests — by adding resources without requiring fundamental redesign of the architecture.

Scalability is measured in two dimensions: vertical scaling (scaling up) adds more CPU, RAM, or storage to an existing server, while horizontal scaling (scaling out) adds more server instances behind a load balancer and is the approach used by internet-scale systems. A scalable architecture distributes state carefully — shared databases and in-memory session state are common bottlenecks — and typically moves session data to distributed caches (Redis), uses async message queues (Kafka, RabbitMQ) to decouple components, and applies database read replicas or sharding to spread query load. Scalability must be designed into the architecture from an early stage; retrofitting scalability into a tightly coupled monolith with shared mutable state is one of the most expensive engineering efforts a growing company can face. Load testing under realistic traffic patterns — using tools like k6, Locust, or JMeter — is essential for validating that scaling assumptions hold before production traffic exposes bottlenecks.

Example

A food delivery startup designs its order service to be stateless from day one, storing session data in Redis, so that the team can simply add more application server instances during peak dinner-hour traffic without changing any code.

Related terms

Load Balancing

Load balancing is the process of distributing incoming network traffic across multiple servers or instances to prevent any single server from becoming a bottleneck, improving application availability, throughput, and fault tolerance.

Microservices

Microservices is an architectural style in which a large application is built as a collection of small, independently deployable services, each responsible for a specific business capability and communicating via APIs.

Cloud Computing

Cloud computing is the delivery of computing services — including servers, storage, databases, networking, software, and analytics — over the internet, enabling on-demand access, rapid scaling, and pay-per-use pricing.

Database

A database is an organised collection of structured or semi-structured data stored electronically and managed by a database management system (DBMS) that enables efficient querying, insertion, update, and deletion of records.

Ready to grow your business?

Tell us what you're building. We'll reply within one business day with a clear next step.

Talk to us

What is Scalability?