Design a News Feed System

A news feed system displays a personalized, constantly updating list of posts from friends, pages, and groups. Examples include Facebook News Feed, Twitter Timeline, and Instagram Feed.

Requirements
Back of the Envelope Estimation
System APIs
High-Level Design
Feed Generation Strategies
Database Design
Deep Dive
Ranking Algorithm
Key Takeaways
Interview Tips

Requirements

Functional Requirements

Post creation: Users can create posts with text, images, videos
Feed generation: Generate personalized feed from friends/followees
Feed viewing: Users can view their news feed with infinite scroll
Interactions: Like, comment, share posts

Non-Functional Requirements

Requirement	Description
Low latency	Feed should load in < 500ms
High availability	99.99% uptime
Scalability	Support billions of users
Consistency	Eventual consistency acceptable

Extended Requirements

Push notifications for new posts
Support for different media types
Content moderation
Ad integration

Back of the Envelope Estimation

Traffic Estimates

Daily Active Users (DAU): 500 million
Avg friends per user: 500
Avg posts per user per day: 2

Feed refreshes per user per day: 10
Total feed requests: 500M × 10 = 5 billion/day
                   = ~58,000 requests/second

New posts: 500M × 2 = 1 billion posts/day
         = ~12,000 posts/second

Storage Estimates

Post metadata: ~1 KB per post
Posts per day: 1 billion
Posts per year: 365 billion

Storage per year: 365B × 1KB = 365 TB (metadata only)
Media storage: 10× metadata = 3.65 PB/year

Memory Estimates (Cache)

Hot posts to cache: 20% of daily posts
Cache size: 200M posts × 1KB = 200 GB
Feed cache per user: 500 posts × 100 bytes = 50 KB
Active user feed cache: 100M × 50KB = 5 TB

Summary

Metric	Value
Feed requests	58,000/second
New posts	12,000/second
Storage (metadata)	365 TB/year
Feed cache	5 TB

System APIs

Create Post

POST /v1/posts

Request:

{
  "user_id": "user123",
  "content": "Hello world!",
  "media_ids": ["img1", "img2"],
  "privacy": "friends",
  "location": "San Francisco"
}

Response:

{
  "post_id": "post456",
  "created_at": "2024-01-15T10:30:00Z",
  "status": "published"
}

Get News Feed

GET /v1/feed?user_id={user_id}&page_token={token}&limit={limit}

Response:

{
  "posts": [
    {
      "post_id": "post789",
      "author": {
        "user_id": "user456",
        "name": "John Doe",
        "avatar_url": "..."
      },
      "content": "Great day!",
      "media": [...],
      "likes_count": 150,
      "comments_count": 23,
      "created_at": "2024-01-15T09:00:00Z"
    }
  ],
  "next_page_token": "abc123"
}

High-Level Design

flowchart TB
    LB["Load Balancer"]

    LB --> PostSvc["Post Service"]
    LB --> FeedSvc["Feed Service"]
    LB --> UserSvc["User Service"]

    PostSvc --> PostCache["Post Cache (Redis)"]
    FeedSvc --> FeedCache["Feed Cache (Redis)"]
    UserSvc --> UserCache["User Cache (Redis)"]

    PostCache --> PostDB["Post DB (Cassandra)"]
    FeedCache --> FeedDB["Feed DB (Redis)"]
    UserCache --> UserDB["User DB (PostgreSQL)"]

    PostDB --> Media["Media Storage (S3)"]

Components

Component	Responsibility
Post Service	Create, update, delete posts
Feed Service	Generate and serve news feeds
User Service	Manage user profiles and relationships
Notification Service	Push notifications for new posts
Ranking Service	Score and rank posts

Feed Generation Strategies

Interview context: This is the CORE question in news feed design: “How do you generate the feed?” The answer reveals your understanding of the fundamental trade-off.

The Challenge

When User A opens their feed, we need to show recent posts from hundreds of friends. Two extremes:

Compute on demand: Slow for users with many friends (high read latency)
Pre-compute everything: Expensive when someone with 10M followers posts (high write cost)

Option 1: Pull Model (Fan-out on Read)

Generate feed when user requests it.

User Request → Get Friends List → Fetch Recent Posts → Rank → Return

Pros	Cons
No precomputation needed	High latency (many DB queries)
Always fresh data	Heavy load on read path
No wasted work for inactive users	Doesn’t scale for users with many friends

Option 2: Push Model (Fan-out on Write)

Pre-generate feeds when posts are created.

New Post → Get Followers List → Write to Each Follower's Feed

Pros	Cons
Fast read (O(1) from cache)	High write amplification
Pre-computed, ready to serve	Wasteful for inactive users
Consistent read latency	Celebrity problem

Interviewer might ask: “What’s the celebrity problem?”

When a celebrity with 10 million followers posts, you need 10 million writes. This takes time and resources, delaying feed updates for everyone.

Option 3: Hybrid Model (Recommended)

This is the answer interviewers want to hear.

Combine both approaches based on user type:

flowchart TB
    Post["New Post"]
    Post --> Check{"Is user famous?<br/>(>10K followers)"}
    Check -->|YES| Pull["Pull Model"]
    Check -->|NO| Push["Push Model"]

How it works:

Regular users (< 10K followers): Push to followers’ feeds
Celebrities (> 10K followers): Don’t push; pull at read time
On read: Merge pre-computed feed + fresh celebrity posts + rank

Why this works: Most users have few followers (push is cheap). Celebrities are few but have many followers (pull is cheaper than 10M writes).

Database Design

User Table (PostgreSQL)

CREATE TABLE users (
    user_id         BIGINT PRIMARY KEY,
    username        VARCHAR(50) UNIQUE,
    email           VARCHAR(100) UNIQUE,
    created_at      TIMESTAMP,
    follower_count  INT DEFAULT 0,
    following_count INT DEFAULT 0
);

Post Table (Cassandra)

CREATE TABLE posts (
    post_id     UUID,
    user_id     BIGINT,
    content     TEXT,
    media_ids   LIST<UUID>,
    created_at  TIMESTAMP,
    likes_count INT,
    PRIMARY KEY (user_id, created_at)
) WITH CLUSTERING ORDER BY (created_at DESC);

Feed Table (Redis)

Key: feed:{user_id}
Value: Sorted Set of (post_id, timestamp)

ZADD feed:user123 1705312200 post456
ZADD feed:user123 1705312100 post789

ZREVRANGE feed:user123 0 49  # Get top 50 posts

Friendship Table (Cassandra)

CREATE TABLE followers (
    user_id     BIGINT,
    follower_id BIGINT,
    created_at  TIMESTAMP,
    PRIMARY KEY (user_id, follower_id)
);

CREATE TABLE following (
    user_id      BIGINT,
    following_id BIGINT,
    created_at   TIMESTAMP,
    PRIMARY KEY (user_id, following_id)
);

Deep Dive

Interview context: “Let’s dive deeper into some specific challenges…”

1. Feed Publishing Flow

Interviewer might ask: “Walk me through what happens when a user creates a post.”

flowchart TB
    User["User creates post"]
    User --> PostSvc["Post Service saves post"]

    PostSvc --> MQ["Message Queue (Kafka)"]
    PostSvc --> Media["Media Service upload to S3"]

    MQ --> Fanout["Fanout Workers"]

    Fanout --> FeedA["User A's Feed"]
    Fanout --> FeedB["User B's Feed"]
    Fanout --> FeedC["User C's Feed"]

2. Feed Reading Flow

flowchart TB
    User["User requests feed"]
    User --> Cache{"Feed Cache (Redis)"}

    Cache -->|Cache Hit| Return["Return Feed"]
    Cache -->|Cache Miss| FeedSvc["Feed Service"]

    subgraph FeedSvc["Feed Service"]
        F1["1. Get pre-computed feed"]
        F2["2. Fetch celebrity posts"]
        F3["3. Merge & rank"]
        F1 --> F2 --> F3
    end

    FeedSvc --> Return

3. Handling the Celebrity Problem

Interviewer might ask: “What specific strategies can you use for celebrities?”

When a celebrity with millions of followers posts, you have several options:

Strategy	How It Works	Trade-off
Selective Push	Only push to active followers (last 7 days)	Inactive users see stale feed on return
Tiered Fanout	Push to top 10K engaged first, rest in background	Some followers see post late
Pure Pull	Never push celebrity posts, always fetch on read	Slightly higher read latency

Recommended: Pure Pull for celebrities (>10K followers) + aggressive caching. Celebrity posts are hot and cached anyway.

4. Cache Strategy

flowchart TB
    subgraph CacheLayers["Cache Layers"]
        subgraph L1["L1: CDN (Edge Cache)"]
            L1A["Static media"]
            L1B["TTL: 24 hours"]
        end
        subgraph L2["L2: Application Cache (Redis Cluster)"]
            L2A["User feeds"]
            L2B["Hot posts"]
            L2C["TTL: 1 hour"]
        end
        subgraph L3["L3: Database Query Cache"]
            L3A["Frequently accessed data"]
            L3B["TTL: 5 minutes"]
        end
    end
    L1 --> L2 --> L3

Ranking Algorithm

Interview context: After discussing feed generation, interviewers often ask: “How do you decide which posts to show first?”

The Challenge

A user might have 1000+ posts from friends. Which 50 do you show first? Chronological order is simple but doesn’t maximize engagement.

Ranking Factors

Factor	Weight	Why
Recency	High	Users expect recent content
Engagement	High	Popular posts are likely interesting
Relationship	Medium	Close friends’ posts matter more
Content type	Medium	User preferences (photos vs text)
Past interactions	Low	Posts from people you engage with

Simple Ranking Formula

Interviewer might ask: “How would you implement a basic ranking algorithm?”

score = (time_decay × 0.4) + (engagement_score × 0.3) + (relationship × 0.3)

where:
  time_decay = 1 / (1 + hours_old / 6)     # Half-life of 6 hours
  engagement = log(1 + likes + comments×2 + shares×3)
  relationship = interaction_frequency with author

Production: ML-Based Ranking

For real systems, use machine learning:

Features → Model (XGBoost/Neural Net) → Ranked Feed

Feature categories:
- User: age, location, interests, activity patterns
- Post: type, length, media, author popularity
- Context: time of day, device, session depth
- Interaction: past clicks, dwell time, hides

Key insight: The ranking model optimizes for a business metric (engagement, time spent, or a combination).

Key Takeaways

Design Decisions Summary

Decision	Choice	Why
Feed generation	Hybrid push/pull	Balance read latency and write cost
Post storage	Cassandra	High write throughput, time-series optimized
Feed storage	Redis Sorted Set	O(log n) insert, O(1) range query
Celebrity handling	Pure pull + cache	Avoids 10M write amplification
Media storage	S3 + CDN	Cost-effective, edge caching

Trade-offs to Discuss

Decision	Option A	Option B
Feed generation	Push (fast read)	Pull (fresh data)
Ranking	Chronological (simple)	ML-based (engaging)
Celebrity posts	Push (consistent)	Pull (efficient)
Feed size	Large (comprehensive)	Small (focused)

Scalability Phases

flowchart TB
    subgraph P1["Phase 1: Small scale (~1M users)"]
        S1["Single server, simple push model, PostgreSQL"]
    end
    subgraph P2["Phase 2: Medium scale (~100M users)"]
        S2["Hybrid push/pull, Redis for feeds, Cassandra for posts"]
    end
    subgraph P3["Phase 3: Large scale (~1B users)"]
        S3["Sharded everything, ML ranking, CDN for media"]
    end
    P1 --> P2 --> P3

Interview Tips

How to Approach (45 minutes)

1. CLARIFY (3-5 min)
   "What's the scale? Just friends or also pages/groups?
    Do we need ranking or just chronological?"

2. HIGH-LEVEL DESIGN (5-7 min)
   Draw: Post Service → Fanout → Feed Cache → Feed Service

3. DEEP DIVE (25-30 min)
   - Push vs Pull vs Hybrid (THE key discussion)
   - Celebrity problem and solutions
   - Feed storage (Redis sorted set)
   - Ranking algorithm basics

4. WRAP UP (5 min)
   - Caching strategy (CDN, Redis, DB)
   - Real-time updates (WebSocket for new posts)

Key Phrases That Show Depth

Instead of…	Say…
“We push to followers”	“For users with <10K followers, we push to their followers’ feeds. For celebrities, we pull at read time to avoid write amplification.”
“Store feed in Redis”	“Redis Sorted Set is perfect for feeds - we store (post_id, timestamp) with O(log n) insert and O(1) top-N retrieval.”
“Rank by time”	“Chronological is simple but engagement-based ranking keeps users scrolling longer. We use a score combining recency, engagement, and relationship strength.”

Common Follow-up Questions

Question	Key Points
“How handle celebrity problem?”	Hybrid model - don’t push for >10K followers, pull + cache
“How ensure freshness?”	Short TTL on cache, invalidate on new post
“How rank posts?”	Time decay + engagement + relationship (or ML model)
“How handle real-time?”	WebSocket push for new posts from close friends
“How scale fanout?”	Async via Kafka, batch writes, shard by user_id

Consistent Hashing - Data distribution
Design Key-Value Store - Feed storage
Design Chat System - Similar fanout patterns

Design a News Feed System

Table of Contents

Requirements

Functional Requirements

Non-Functional Requirements

Extended Requirements

Back of the Envelope Estimation

Traffic Estimates

Storage Estimates

Memory Estimates (Cache)

Summary

System APIs

Create Post

Get News Feed

High-Level Design

Components

Feed Generation Strategies

The Challenge

Option 1: Pull Model (Fan-out on Read)

Option 2: Push Model (Fan-out on Write)

Option 3: Hybrid Model (Recommended)

Database Design

User Table (PostgreSQL)

Post Table (Cassandra)

Feed Table (Redis)

Friendship Table (Cassandra)

Deep Dive

1. Feed Publishing Flow

2. Feed Reading Flow

3. Handling the Celebrity Problem

4. Cache Strategy

Ranking Algorithm

The Challenge

Ranking Factors

Simple Ranking Formula

Production: ML-Based Ranking

Key Takeaways

Design Decisions Summary

Trade-offs to Discuss

Scalability Phases

Interview Tips

How to Approach (45 minutes)

Key Phrases That Show Depth

Common Follow-up Questions

Related Topics