Memory Paging and Page Tables

Right now, you have a deep understanding of how memory works within a process. You understand stack memory, heap memory, and you are working to build a memory allocator yourself in MP3! However, how does the memory work within the operating system?

Pages and Page Size

In a modern system, we have many types of storage (processor caches, RAM, hard drive, etc). To help us organize moving data between different locations, ALL of the storage inside of our system will be organized into pages:

Just like a page in a book, a page in a system is going to contain a small amount of data.
- On a Linux system, you can run getconf PAGESIZE to find the size (in bytes) of your page, or
- On a Windows system, you can run write a program that calls the GetSystemInfo system call and check the dwPageSize value
On most systems, the page size will be 4096. This means that every page will contain exactly 4,096 bytes of data – or 2¹² bytes. However, a page can be any power of two – 256, 512, 1024, 2048, 4096, 8192, and so on are all reasonable values for a page size.

Memory Addresses and Pages

Once we know the size of a page, we can look at every memory address and divide the address into two sections:

The page table index or page number (ex: where in the book do I find this specific page), and
The page offset (ex: where on the page do I find the specific letter I’m looking for)

For a page size of 4,096 B (2¹²), the least significant 12 bits will describe where on the page the data is located. Therefore, the last 12 bits is the page offset and all of the other bits describe the page number.

Using the page size of 4,096 B (2¹²), consider the memory address 0x 2c3af == 0b 0010 1100 0011 1010 1111:

`0x`	`2`	`c`	`3`	`a`	`f`
`0b`	`0010`	`1100`	`0011`	`1010`	`1111`
	Page Number		Page Offset (w/ pagesize=4096)

Using a smaller page size of just 256 B (2⁸), consider the same memory address:

`0x`	`2`	`c`	`3`	`a`	`f`
`0b`	`0010`	`1100`	`0011`	`1010`	`1111`
	Page Number			Page Offset (w/ pagesize=256)

Page Table

You can now identify the page number and page offset of any memory address. However, the critical thing that an operating system does is help us manage the pages within our system.

The operating system maintains a page table for every process on the system. The page table will serve many purposes:

The page table will include a pointer to where the data is located in storage (ex: in RAM, on disk, etc)
The page table will provide the translation form a virtual memory address to physical memory address
The page table will maintain bits associated with the use of memory on the page (ex: has the page had any write operations done to the memory?)

To help us understand this, consider a simple system with a small amount of RAM (4 pages), a process P1 (with 16 virtual pages), and a hard drive. The hard drive has the executable that we will run in (programCode) and a hiddenImage.png that contains a hidden GIF inside of the PNG:

RAM:

[0]	(empty)
[1]	(empty)
[2]	(empty)
[3]	(empty)

P1 Page Table:

[0]	(empty)
[1]	(empty)
[2]	(empty)
[3]	(empty)
[4]	(empty)
[5]	(empty)
[6]	(empty)
[7]	(empty)
[8]	(empty)
[9]	(empty)
[10]	(empty)
[11]	(empty)
[12]	(empty)
[13]	(empty)
[14]	(empty)
[15]	(empty)

Hard Drive:

...	(empty)
[41]	(empty)
[42]	programCode
[43]	programCode
[44]	programCode
[45]	programCode
[46]	programCode
[47]	(empty)
...	(empty)
[81]	(empty)
[82]	hiddenImage.png
[83]	hiddenImage.png
[84]	hiddenImage.png
[85]	(empty)
...	(empty)

Operations:

File Sizes and Pages

In the memory layout above, each page is 4 KiB (4,096 bytes). We can learn a lot about the files on disk:

We know that programCode requires five pages to store the data. Therefore, we know that:
- Four pages was not enough to store the file. Therefore, the file must be at least (16 KiB + 1 byte) in size.
- Six pages was not required. Therefore, the file must be no larger than 20 KiB in size (a full five pages).
Similarly, the same logic can be applied to hiddenImage.png. The file requires 3 pages of storage and must have the size in the range: (8 KiB, 12 KiB].

Step 1: Starting Our Program

The first step in starting a process is to map the program’s executable code into memory. It is not necessary to load the entire program into RAM (that would take forever for a large program).

When a process is created, we simply map the executable code to the process’ virtual page table and point it to disk:

RAM:

[0]	(empty)
[1]	(empty)
[2]	(empty)
[3]	(empty)

P1 Page Table:

[0]	(empty)
[1]	==> disk[42]
[2]	==> disk[43]
[3]	==> disk[44]
[4]	==> disk[45]
[5]	==> disk[46]
[6]	(empty)
[7]	(empty)
[8]	(empty)
[9]	(empty)
[10]	(empty)
[11]	(empty)
[12]	(empty)
[13]	(empty)
[14]	(empty)
[15]	(empty)

Hard Drive:

...	(empty)
[41]	(empty)
[42]	programCode
[43]	programCode
[44]	programCode
[45]	programCode
[46]	programCode
[47]	(empty)
...	(empty)
[81]	(empty)
[82]	hiddenImage.png
[83]	hiddenImage.png
[84]	hiddenImage.png
[85]	(empty)
...	(empty)

Operations:
1. Load Program

Step 2: Running Our Program

Our program begins to execute. We can only work with data that is in RAM, so a request to the process’s code (located in the process’s first page) will need to load the data into RAM.

Anytime we need to load new data into RAM we call this a page fault.
A page fault results in the operating system fetching data from external storage, loading into RAM, and continuing only once the load is completed.

As we begin to run our program, three things take place:

We check if the data for programCode (1/5) is already in RAM. It is not, so a page fault occurs.
As part of resolving the page fault, programCode (1/5) is read form disk and written to RAM,
Additionally, the page table is updated to note the mapping into RAM

RAM:

[0]	programCode (1/5)
[1]	(empty)
[2]	(empty)
[3]	(empty)

P1 Page Table:

[0]	(empty)
[1]	==> ram[0], disk[42]
[2]	==> disk[43]
[3]	==> disk[44]
[4]	==> disk[45]
[5]	==> disk[46]
[6]	(empty)
[7]	(empty)
[8]	(empty)
[9]	(empty)
[10]	(empty)
[11]	(empty)
[12]	(empty)
[13]	(empty)
[14]	(empty)
[15]	(empty)

Hard Drive:

...	(empty)
[41]	(empty)
[42]	programCode (1/5)
[43]	programCode (2/5)
[44]	programCode (3/5)
[45]	programCode (4/5)
[46]	programCode (5/5)
[47]	(empty)
...	(empty)
[81]	(empty)
[82]	hiddenImage.png
[83]	hiddenImage.png
[84]	hiddenImage.png
[85]	(empty)
...	(empty)

Operations:
1. Load Program
2. Run programCode (1/5)
- malloc(4000) + use
3. Run programCode (2/5)
- malloc(10000) + use
- open `hiddenImage`
- read `hiddenImage`
4. Run programCode (3/5)
- Access 4KB from #2
5. Program Exits

Step 2(b): Running malloc

As part of running the first part of our program, the program requests a memory allocation via malloc(4000)!

We can satisfy this by allocating this memory to the process’s heap
Since the program also uses this memory, a page fault will occur and that data is loaded into RAM

RAM:

[0]	programCode (1/5)
[1]	(heap memory data)
[2]	(empty)
[3]	(empty)

P1 Page Table:

[0]	(empty)
[1]	==> ram[0], disk[42]
[2]	==> disk[43]
[3]	==> disk[44]
[4]	==> disk[45]
[5]	==> disk[46]
[6]	(empty)
[7]	(empty)
[8]	(heap memory) ==> ram[1]
[9]	(empty)
[10]	(empty)
[11]	(empty)
[12]	(empty)
[13]	(empty)
[14]	(empty)
[15]	(empty)

Hard Drive:

...	(empty)
[41]	(empty)
[42]	programCode (1/5)
[43]	programCode (2/5)
[44]	programCode (3/5)
[45]	programCode (4/5)
[46]	programCode (5/5)
[47]	(empty)
...	(empty)
[81]	(empty)
[82]	hiddenImage.png
[83]	hiddenImage.png
[84]	hiddenImage.png
[85]	(empty)
...	(empty)

Operations:
1. Load Program
2. Run programCode (1/5)
- malloc(4000) + use
3. Run programCode (2/5)
- malloc(10000) + use
- open `hiddenImage`
- read `hiddenImage`
4. Run programCode (3/5)
- Access 4KB from #2
5. Program Exits

Step 3: Running Our Program (but running out of RAM)

The next step is to continue to run our program, by running the second page of our program’s data.

programCode (2/5) is not in RAM, so a page fault occurs to load the data into RAM
Just like before, we will update our page table to ensure the page table maps where the data is at during all times

RAM:

[0]	programCode (1/5)
[1]	(heap memory data)
[2]	programCode (2/5)
[3]	(empty)

P1 Page Table:

[0]	(empty)
[1]	==> ram[0], disk[42]
[2]	==> ram[2], disk[43]
[3]	==> disk[44]
[4]	==> disk[45]
[5]	==> disk[46]
[6]	(empty)
[7]	(empty)
[8]	(heap memory) ==> ram[1]
[9]	(empty)
[10]	(empty)
[11]	(empty)
[12]	(empty)
[13]	(empty)
[14]	(empty)
[15]	(empty)

Hard Drive:

...	(empty)
[41]	(empty)
[42]	programCode (1/5)
[43]	programCode (2/5)
[44]	programCode (3/5)
[45]	programCode (4/5)
[46]	programCode (5/5)
[47]	(empty)
...	(empty)
[81]	(empty)
[82]	hiddenImage.png
[83]	hiddenImage.png
[84]	hiddenImage.png
[85]	(empty)
...	(empty)

Operations:
1. Load Program
2. Run programCode (1/5)
- malloc(4000) + use
3. Run programCode (2/5)
- malloc(10000) + use
- open `hiddenImage`
- read `hiddenImage`
4. Run programCode (3/5)
- Access 4KB from #2
5. Program Exits

Step 3(b): A large allocation

The next command to be run is scary – malloc(10000). A request for 10,000 bytes requires the use of 3 pages as each page can only contain 4 KiB (4,096 bytes).

Initially, as malloc expands the heap, the heap grows on the virtual page table. The heap now includes virtual page numbers 8, 9, 10, and 11:

RAM:

[0]	programCode (1/5)
[1]	(heap memory data)
[2]	programCode (2/5)
[3]	(empty)

P1 Page Table:

[0]	(empty)
[1]	==> ram[0], disk[42]
[2]	==> ram[2], disk[43]
[3]	==> disk[44]
[4]	==> disk[45]
[5]	==> disk[46]
[6]	(empty)
[7]	(empty)
[8]	(heap memory) ==> ram[1]
[9]	(heap memory)
[10]	(heap memory)
[11]	(heap memory)
[12]	(empty)
[13]	(empty)
[14]	(empty)
[15]	(empty)

Hard Drive:

...	(empty)
[41]	(empty)
[42]	programCode (1/5)
[43]	programCode (2/5)
[44]	programCode (3/5)
[45]	programCode (4/5)
[46]	programCode (5/5)
[47]	(empty)
...	(empty)
[81]	(empty)
[82]	hiddenImage.png
[83]	hiddenImage.png
[84]	hiddenImage.png
[85]	(empty)
...	(empty)

Operations:
1. Load Program
2. Run programCode (1/5)
- malloc(4000) + use
3. Run programCode (2/5)
- malloc(10000) + use
- open `hiddenImage`
- read `hiddenImage`
4. Run programCode (3/5)
- Access 4KB from #2
5. Program Exits

As we make use of the first page of the heap data, the first page fault occurs loading virtual page [9] into RAM:

RAM:

[0]	programCode (1/5)
[1]	(heap memory data)
[2]	programCode (2/5)
[3]	(heap memory data)

P1 Page Table:

[0]	(empty)
[1]	==> ram[0], disk[42]
[2]	==> ram[2], disk[43]
[3]	==> disk[44]
[4]	==> disk[45]
[5]	==> disk[46]
[6]	(empty)
[7]	(empty)
[8]	(heap memory) ==> ram[1]
[9]	(heap memory) ==> ram[3]
[10]	(heap memory)
[11]	(heap memory)
[12]	(empty)
[13]	(empty)
[14]	(empty)
[15]	(empty)

Hard Drive:

...	(empty)
[41]	(empty)
[42]	programCode (1/5)
[43]	programCode (2/5)
[44]	programCode (3/5)
[45]	programCode (4/5)
[46]	programCode (5/5)
[47]	(empty)
...	(empty)
[81]	(empty)
[82]	hiddenImage.png
[83]	hiddenImage.png
[84]	hiddenImage.png
[85]	(empty)
...	(empty)

Now, we need to load our second page of heap memory in RAM. However – we are out of empty RAM to use! :(

Page Eviction

When a page fault occurs (data that is not in RAM needs to be loaded into RAM) and our RAM is full, we must decide on a page to evict from RAM to make room for the new page. Let’s dive into page eviction policies and algorithms before we continue our example…

Next: Page Eviction Algorithms >>

Archived Content

Memory Paging and Page Tables

Pages and Page Size

Memory Addresses and Pages

Page Table

File Sizes and Pages

Step 1: Starting Our Program

Step 2: Running Our Program

Step 2(b): Running malloc

Step 3: Running Our Program (but running out of RAM)

Step 3(b): A large allocation

Page Eviction