System Design Interview Fundamentals

Approach

Following Hello Interview approach we have

graph LR
    %% Define nodes with labels
    1[Requirements]
    2[Core<br/>Entities]
    3[API or<br/>Interface]
    4[Data Flow]
    5[High-level<br/>Design]
    6[Deep Dives]
    
    %% Define connections
    1 --> 2
    2 --> 3
    3 --> 4
    4 -.-> 5
    5 --> 6
    
    %% Add primary goal annotations
    5 -.-> |Primary Goal: Satisfy<br/>Non-functional Requirements| 1
    6 -.-> |Primary Goal: Satisfy<br/>Non-functional Requirements| 1
    
    %% Style the nodes
    style 1 fill:#d4e6e6,stroke:#5a8a8a,stroke-width:2px,color:#000000
    style 2 fill:#d4e6e6,stroke:#5a8a8a,stroke-width:2px,color:#000000
    style 3 fill:#d4e6e6,stroke:#5a8a8a,stroke-width:2px,color:#000000
    style 4 fill:#ffffff,stroke:#5a8a8a,stroke-width:2px,stroke-dasharray: 5 5,color:#000000
    style 5 fill:#d4e6e6,stroke:#5a8a8a,stroke-width:2px,color:#000000
    style 6 fill:#d4e6e6,stroke:#5a8a8a,stroke-width:2px,color:#000000

1. Requirements:

1.1 Functional Requirements:

What is the application suppose to do? For example, user can post a tweet
user can see all tweets of another user
Keep it concise (2, 3 things). Ask interviewer if that look reasonable

1.2 Non-Functional Requirements:

When defining functional requirements add why is it relevant to the application. Remember the CAP theorem, there is always a tradeoff between Consistency and Availability.

Availability: how much time is the app up a year. Measured in the percentage of up time
- 99% (two nines): 3.65 days (Moderate reliability)
- 99.9% (three nines): 8.77 hours (Good uptime)
- 99.99% (four nines): 52.6 minutes (Very high reliability)
- 99.999% (five nines): 5.26 minutes (Highly available (standard)) 99.9999% (six nines): 31.56 seconds (Exceptional, very rare)
Scalability: normally measured in the amount of DAU the system handles. For example, the system should be able to handle 100M DAU
Consistency: how distributed systems guarantee that multiple copies (replicas) of data remain in sync
- strong consistency: guarantees that every read returns the most recent write across all nodes. Common in financial systems
- eventual consistency: if no new updates are made, all data replicas will eventually agree on the same value. There is some delay before every part of the system displays the updated state. Useful in non-critical apps like social media
Performance: how efficiently and effectively a system consumes resources to achieve its goals. Measured in:
- latency:The time it takes for a request to travel from the client to the server and back. Normal REST request are ~100 ms. gRPC ~ 10ms. In order to achieve real time request must be >= 500 ms
- Throughput: Throughput: The number of operations, transactions, or requests the system completes within a certain timeframe. For example, Request Per Second (RPS), Queries Per Second (QPS)

The majority of system design interviews focus on this one, however, the following are also valid.

Security: How secure does the system need to be? Think about authentication vs authorization. Encryption.
Compliance: Are there legal or regulatory requirements the system needs to meet? Consider industry standards, data protection laws, and other regulations.

2. Capacity Estimation

Following approach: Let's simplify the process:

Determine What to Estimate: Figure out which quantities are load-bearing for your design.
Break It Down: Start with a big problem. Slice it into smaller pieces.
Use What You Know: Apply basic principles and facts you're confident about.
Keep It Simple: Stick to round numbers. Precision isn't the goal; ballpark is.
Check Yourself: Does your answer make sense in the real world?

I will focus on Use What You Know and then provide an example

Storage

Power of 1000 (1000^x)	Number	Prefix
0	Unit
1	Thousand	Kilo
2	Million	Mega
3	Billion	Giga
4	Trillion	Tera
5	Quadrillion	Peta

Latency

Action	Time	Comparison
Reading 1mb sequentially from memory	0.25ms
Reading 1mb sequentially from SSD	1ms	4x memory
Reading 1mb sequentially from spinning disk	20ms	20x SSD
Round trip network latency CA to Netherlands	150 ms

Item	Size
A two-hour movie	1 GB
A small book of plain text	1 MB
A high-resolution photo	1 MB
A medium-resolution image (or a site layout graphic)	100 KB

Business

Metric	Order of Magnitude
Daily active users of major social networks	O(1b)
Hours of video streamed on Netflix per day	O(100m)
Google searches per second	O(100k)
Size of Wikipedia	O(100gb)

3. Core Entities ( 2 min):

Define the core entities of your system

Who are the actors in the system? Are they overlapping?
What are the nouns or resources necessary to satisfy the functional requirements?

API or System Interface (5 minutes)

Maps to satisfy functional requirements. You should define which API protocol to follow

REST: Uses HTTP verbs (GET, POST, PUT, DELETE) to perform CRUD operations on resources. This should be your default choice for most interviews.
GraphQL: Allows clients to specify exactly what data they want to receive, avoiding over-fetching and under-fetching. Choose this when you have diverse clients with different data needs.
RPC (Remote Procedure Call): Action-oriented protocol (like gRPC) that's faster than REST for service-to-service communication. Use for internal APIs when performance is critical.

[Optional] Data Flow (~5 minutes)

For some backend systems, especially data-processing systems, it can be helpful to describe the high level sequence of actions or processes that the system performs on the inputs to produce the desired outputs. If your system doesn't involve a long sequence of actions, skip this! We usually define the data flow via a simple list. You'll use this flow to inform your high-level design in the next section. For a web crawler, this might look like:

Fetch seed URLs
Parse HTML
Extract URLs
Store data
Repeat

High Level Design (~10-15 minutes)

Diagram to fullfill your functional requirements. One good way to solve this is to follow the API endpoints you have defined.

Deep Dive (~10 Mins)

Improve the High Level Design by:

Making sure all non-functional requirements are met
Addressing edge cases
Identifying bottlenecks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

System Design Interview Fundamentals

Approach

1. Requirements:

1.1 Functional Requirements:

1.2 Non-Functional Requirements:

2. Capacity Estimation

Storage

Latency

Business

3. Core Entities ( 2 min):

API or System Interface (5 minutes)

[Optional] Data Flow (~5 minutes)

High Level Design (~10-15 minutes)

Deep Dive (~10 Mins)

About

Uh oh!

Releases

Packages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Availability.md		Availability.md
Availability.pdf		Availability.pdf
Design of Google Maps.md		Design of Google Maps.md
Design of Google Maps.pdf		Design of Google Maps.pdf
Design of a Key-value Store.md		Design of a Key-value Store.md
Design of a Key-value Store.pdf		Design of a Key-value Store.pdf
Detailed Design of WhatsApp.md		Detailed Design of WhatsApp.md
Detailed Design of WhatsApp.pdf		Detailed Design of WhatsApp.pdf
Examples of Resource Estimation.md		Examples of Resource Estimation.md
Examples of Resource Estimation.pdf		Examples of Resource Estimation.pdf
High-level Design of WhatsApp.md		High-level Design of WhatsApp.md
High-level Design of WhatsApp.pdf		High-level Design of WhatsApp.pdf
How the Domain Name System Works.md		How the Domain Name System Works.md
How the Domain Name System Works.pdf		How the Domain Name System Works.pdf
Introduction to Domain Name System (DNS).md		Introduction to Domain Name System (DNS).md
Introduction to Domain Name System (DNS).pdf		Introduction to Domain Name System (DNS).pdf
Key Concepts to Prepare for the System Design Interview.md		Key Concepts to Prepare for the System Design Interview.md
Key Concepts to Prepare for the System Design Interview.pdf		Key Concepts to Prepare for the System Design Interview.pdf
Put Back-of-the-envelope Numbers in Perspective.md		Put Back-of-the-envelope Numbers in Perspective.md
Put Back-of-the-envelope Numbers in Perspective.pdf		Put Back-of-the-envelope Numbers in Perspective.pdf
README.md		README.md
Resources to Prepare for a System Design Interview.md		Resources to Prepare for a System Design Interview.md
Resources to Prepare for a System Design Interview.pdf		Resources to Prepare for a System Design Interview.pdf
Retries Have an Evil Twin_ Duplicates - by Raul Junco.md		Retries Have an Evil Twin_ Duplicates - by Raul Junco.md
Retries Have an Evil Twin_ Duplicates - by Raul Junco.pdf		Retries Have an Evil Twin_ Duplicates - by Raul Junco.pdf
The Architecture That Gets You Here Won’t Take You There.pdf		The Architecture That Gets You Here Won’t Take You There.pdf

marianar97/system-design

Folders and files

Latest commit

History

Repository files navigation

System Design Interview Fundamentals

Approach

1. Requirements:

1.1 Functional Requirements:

1.2 Non-Functional Requirements:

2. Capacity Estimation

Storage

Latency

Business

3. Core Entities ( 2 min):

API or System Interface (5 minutes)

[Optional] Data Flow (~5 minutes)

High Level Design (~10-15 minutes)

Deep Dive (~10 Mins)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages