Skip to content
UoL CS Notes

Searching & Inserting in B+ Trees

COMP207 Lectures

There is a visualisation tool for B+ trees available at https://www.cs.usfca.edu/~galles/visualization/BPlusTree.html.

Single vs Multi-Level Indexes

  • Single Level Index - Stores in a single list.
  • Multi-Level Index - Distributes across different layers.

B+ Tree Leaves (Idea)

a1 a2 an  
Points to tuples with value a1. Points to tuples with value a2   Points to tuples with value an. Next Leaf

n is chosen such that a node fits into a single disk block:

  • Disk Block Size - 512 byte
  • Values - 4 byte integers
  • Pointers - 8 bytes

With the above example n=42 as:

51242(8+2)+8512<43(8+4)+8

B+ Tree Inner Nodes (Idea)

a1 a2 an  
Points to a nodes for values <a1. Points to nodes for values a1 and <a2   Points to nodes for values an1 and <an Points to node for values an

Pointers point to B+ tree nodes at the level below. n is chosen as before.

B+ Tree Leaves (Actually)

  • Not all the fields have to be used.
  • Fields are filled from left to right.
a1 a2 ai Unused Unused
Points to tuples with value a1. Points to tuples with value a2   Points to tuples with value au. Points Nowhere Next Leaf

Ensure that at least n+12 pointers are used (unless this is the only leaf).

To follow with the online tool, we count the next leaf pointer as a pointer, even if there is none.

B+ Tree Nodes (Actually)

  • Not all the fields have to be used.
  • Fields are filled from left to right.
a1 a2 ai Unused Unused
Points to a nodes for values <a1. Points to nodes for values a1 and <a2   Points to nodes for values ai1 and <ai Points to node for values ai Points Nowhere

Where:

  • ai - Number is smallest number in child-sub-tree slightly to the right.

Ensure that at least n+12 pointers are used (the root must use 2 pointers).

B+ Tree Index

A B+ tree index of the prime numbers through 47 looks like the following:

B+ tree with max degree of 4.

Searching Values

To find the pointer to the rows with value v:

  1. Start at the root of the B+ tree.
  2. While the current node is a non-leaf node:
    • f v<a1, proceed to the first child of the node.
    • Otherwise find the largest i with aiv and proceed to the associated child node.
  3. If the current node is a leaf:
    • If v occurs in the leaf, follow the associated pointer.
    • If v does not occur in the leaf, return “v doesn’t exist in the index”.

The running time is:

O(h×log2n)

where h is the height of the B+ tree.

The real running time is:

O(h×D)

where D is the time for a disk operation.

Inserting Values

To insert a new value/pointer pair:

  1. Find the leaf that should contain the value.
  2. If the leaf is not full, insert the key value pair at a suitable location.
  3. If the leaf is full:
    1. Split the leaf to make space for the new value/pointer pair and move half of the pointers to the new node.
    2. Insert the value/pointer pair
    3. Connect the leaf to a suitable parent node (which might incur the creation of a new node).

The running time is:

O(h×log2n)

where h is the height of the B+ tree.

The real running time is:

O(h×D)

where D is the time for a disk operation.