Cache

A cache is a specialized memory that sits between a processor and its actual memory.

Types

Direct
Fully Associative
Set Associative

Pre-requisites

Physical Address

Lets assume our physical address in N bits.

Block

Lets assume our block address is B bits (block address, not word address, also address, not size).
A block of data is a number of consequitive words from memory. For example if our physical address is 16 bits and we seperate the high 14 bits, we will have a block of 4 bytes (if our memory is byte addressable) with the block address of 14 bits.

Word ID

Lets assume our word address is W bits.
This is the local address to find the word in a data block (in the example above, the word ID address is 2 bits long).

Cache Line

Each entry in a cache is called a line.

The Idea

We want our data blocks to be stored in a temporary location closer to our processor that is called cache.
Now, every data block has a block address.

Direct Cache

Cache Line Count

Lets assume we have 2^L cache lines. So we need L bits to address them.
The number of cache lines in a direct mapped cache. For example 8 cache lines need 3 line bits.

How it works

TAG: (N-L-W) bits

Flexibility

A physical address can only be mapped to a cache entry where the line address bits match. Not very flexible. Causes a lot of cache misses (conflict-misses).

Fully Associative Cache

We don’t use line address to select the entry anymore. So we need to keep those bits inside the TAG.

How it works

TAG: (N-W) bits

Flexibility

Super flexible. Any physical address can be mapped into any cache entry. There are no hardwired line address bits to force the mapping, so, no more cache misses (conflict-misses).

Set Associative Cache

An n-way set associative cache means that we have sets that contain n entries. We have to calculate the number of sets. Lets assume we need S bits to address the sets.

How it works

TAG: (N-S-W) bits

Flexibility

Better than direct cache, worse than fully associative cache. The hardwired bits are only the set address bits (S).

Cache Misses

Size Miss

The cache size is too small and leads to a cache miss.

Compulsory Miss (cold miss)

The cache does not have the data and it needs to be fetched from the memory.

Conflict Miss

Two physical memory addresses are being mapped into a single cache entry because of the hardwired address bits. Fully associative caches have none, so they will not have any conflict misses. The direct caches are the worst.

Complexity

Direct Cache: Super fast and cheap; Too much conflict misses
Fully Associative Cache: No conflict misses; Too slow and complex, it needs a lot of comparators implemented in a combinatory fashion.
Set Associative Cache: Something in between. Less conflict misses than direct cache; A bit faster than fully associative cache.

Another way to see things

Direct caches are 1-way set associative.
Fully associative caches are L-way set associative. Remember that we called L as the number of cache lines.

Eviction Algorithms

FIFO

The oldest entry gets evicted. This is cheap to implement, but it’s bad because it the entry might be old but used frequently.

LRU

The least recently used eviction algorithm.

LFU

The least frequently used eviction algorithm.

Random

Random eviction!

Cache Policies

Read Miss

Write Miss

Write Through: Write straight to the memory as well.
Write Back: Write to the memory on eviction.

پپ
پ

References

Direct Caches
Fully Associative Caches
Set Associative Caches

Cache

Types

Pre-requisites

Physical Address

Block

Tags

Word ID

Cache Line

The Idea

Direct Cache

Cache Line Count

How it works

Flexibility

Fully Associative Cache

How it works

Flexibility

Set Associative Cache

How it works

Flexibility

Cache Misses

Size Miss

Compulsory Miss (cold miss)

Conflict Miss

Complexity

Another way to see things

Eviction Algorithms

FIFO

LRU

LFU

Random

Cache Policies

Read Miss

Write Miss

References