View
227
Download
0
Category
Preview:
Citation preview
Binary Trees
Tree Example
Tree Structures
A tree is a hierarchical structure that places elements in nodes along branches that originate from a root.
Nodes in a tree are subdivided into levels in which the topmost level holds the root node.
Any node in a tree can have multiple successors at the next level Therefore a tree is a nonlinear structure.
Tree Structures (continued) Operating systems use a general tree to
maintain file structures.
Tree Terminology Tree structure
Collection of nodes that originate from a unique starting node called the root. Each node consists of a value and a set of zero or more
links to successor nodes. The terms parent and child describe the relationship
between a node and any of its successor nodes.
Tree Terminology (continued)
A
JI
HGFE
DCB
ro o t
parent
ch ild
subtree
leaf no de
sib ling
in ter io r (in ternal) no de
s
Tree Terminology (continued) Consists of nodes connected by edges
A tree is an instance of a more general category called a graph (a later slideshow discusses graphs)
Nodes
Edges
Nodes connected by edges(a graph – but not a tree)
Tree Terminology Tree – recursively defined as empty or a root node with
zero or more sub-trees Node – a holder for data plus edges to children Edge – connects a parent node to a child node Root – a pointer to the first node, if it exists, or NULL Leaf node – a node with no children Internal node – a node with one or two children Path – sequence of edges between two nodes Height – longest path in a tree from root to any other node Depth – number of edges from root to a node
Tree Terminology (continued) A path between a parent node P and any node N in its subtree is a sequence of nodes P=X0, X1, . . ., Xk = N
k is the length of the path Each node Xi in the sequence is the parent of Xi+1 for 0 i k-1.
Depth (also called Level) of a node Length of the path from root to the node. Equivalent - number of edges from root to the node.
Viewing a node as a root of its subtree, the height of a node is the length of the longest path from the node to a leaf in the subtree.
The height of a tree is the maximum level in the tree.
Tree Terminology (continued)Height = 3
Sorted Binary Trees In a binary tree, each parent has no more than two children
each node (item in the tree) has a value a total order (linear order) is defined on these values left subtree of a node contains only values less than the node's value right subtree contains only values greater than or equal to the node's value.
21
10 34
5
72
25 39
33
Binary Trees A compiler builds unsorted binary trees while
parsing expressions in a program's source code.
Binary Trees (continued) Each node of a binary tree defines a left and
a right subtree. Each subtree is itself a tree.
Left child of T
Right child of T
Binary Trees (continued) A recursive definition of a binary tree:
T is a binary tree if T has no node (T is an empty tree)
or has at most two subtrees.
Height of a Binary Tree The height of a binary tree is the length of the longest path
from the root to a leaf node Let TN be the subtree with root N and TL and TR be the roots
of the left and right subtrees of N. Then
-1 if TN is empty1+max( height(TL), height(TR)) if TN not empty
height(N) = height(TN) = {
leaf node will always have a height of 0
Height of a Binary Tree (concluded)
Degenerate binary tree (least dense)
Density of a Binary Tree In a binary trees, the number of nodes at each level falls within a range of values. At level 0, there is 1 node, the root; at level 1 there
can be 1 or 2 nodes. At any level k, the number of nodes is in the range
from 1 to 2k. The number of nodes per level contributes to the
density of the tree. Intuitively, density is a measure of the size of a tree
(number of nodes) relative to the height of the tree.
Density of a Binary Tree (continued)
Density of Binary Tree
If degenerate trees allowed: Problem – search in basic binary tree is O(N)
Value could be anywhere in tree No better than a list
Density of a Binary Tree (continued) A complete binary tree of height h has all
possible nodes through level h-1, and the nodes on depth h exist left to right with no gaps
Complete binary trees are excellent storage structures due to packing a large number of nodes near the root
Density of a Binary Tree (continued)
Density of a Binary Tree (continued) Determine the minimum height of a complete
tree that holds n elements. Through first h - 1 levels, total number of nodes is
1 + 2 + 4 + ... + 2h-1 = 2h - 1 At depth h, the number of additional nodes ranges
from a minimum of 1 to a maximum of 2h. Hence the number of nodes n in a complete binary
tree of height h ranges between2h - 1 + 1 = 2h n 2h - 1 + 2h = 2h+1 - 1 < 2h+1
Density of a Complete Binary Tree After applying the logarithm base 2 to all terms
in the inequality, we have
h log2 n < h+1
and conclude that a completebinary tree with n nodes must have height
h = int(log2n)search in complete binary tree is O(log2(n))
Binary Tree Nodes Define a binary tree a node as an instance of the generic TNode class. A node contains three fields.
The data value, called nodeValue. The reference variables, left and right that identify the
left child and the right child of the node respectively.
Binary Tree Nodes (continued) The TNode class allows us to construct
a binary tree as a collection of TNode objects.
TNode Class// declared as an inner class within the class building the tree
Building a Binary Tree
Using previous slide’sTNode class
Scanning a Binary Tree Next issue:
How will you retrieve the data stored in the tree?
Iterative Level-Order Scan A level-order scan visits the root, then nodes on
level 1, then nodes on level 2, etc.
Iterative Level-Order Scan A level-order scan is an iterative process that uses
a queue as an intermediate storage collection. Initially, the root enters the queue. Start a loop ending when the queue is empty
Remove a node from the queue Perform some action with the node Add its children onto the queue
Because siblings enter the queue during a visit of their parent, the siblings (on the same level) will exit the queue in successive iterations.
Notice in the next example how I'm using Java's Queue interface
Iterative Level-Order Scan (continued)
Iterative Level-Order Scan (continued)
Step 1: remove A then add B and C into queue
Step 3: remove C then add E into the queue
Step 4: remove D
Step 5: remove E
Step 2: remove B then add D into the queue
Iterative Level-Order Scan (continued) A
B C
C D
D E F
E F G H
F G H
G H I
H I
I J
J
remove A
remove B
remove C
remove D
remove E
remove F
remove G
remove H
remove I
remove J
Notice the order removed from queue and appended to the output s
Recursive Binary Tree-Scan Algorithms If current node == null is stopping condition To scan a tree recursively
Visit and display the node (D) scan the left subtree (L) and scan the right subtree (R) The order in which you perform the D, L, R tasks determines the
order in which nodes are retrieved
In the following code, t is initially the reference to the root node:
Inorder Scan L D R Scan is in order of visits to the left subtree,
the node's own value, and visits to the right subtree
Inorder Scan: G D J H B A E C F I
inorderDisplay method call stack: inorderDisplay(a) inorderDisplay(b) inorderDisplay(d) inorderDisplay(g) inorderDisplay(null) append g to s inorderDisplay(null) append d to s inorderDisplay(h) inorderDisplay(j) inorderDisplay(null) append j to s inorderDisplay(null) append h to s inorderDisplay(null) append b to s inorderDisplay(null) append a to s inorderDisplay(c) inorderDisplay(e) inorderDisplay(null) append e to s inorderDisplay(null) append c to s inorderDisplay(f) inorderDisplay(null) append f to s inorderDisplay(i) inorderDisplay(null) append i to s inorderDisplay(null)
Postorder Scan L R D
Scan order: G J H D B E I F C A
Scan is in order of visits to the left subtree, visits to the right subtree, and the node's own value
Can you write the method call stack for this?
Method Call Stack for Postorder
inorderDisplay method call stack: inorderDisplay(a) inorderDisplay(b) inorderDisplay(d) inorderDisplay(g) inorderDisplay(null) inorderDisplay(null) append g to s inorderDisplay(h); inorderDisplay(j); inorderDisplay(null); inorderDisplay(null); append j to s inorderDisplay(null); append h to s// rest left to you…
Scan order: G J H D B E I F C A
Preorder Scan D L R
Scan order: A B D G H J C E F I
Scan is in order of the node's own value, visits to the left subtree, and visits to the right subtree
More Recursive Scanning ExamplesPreorder (NLR): A B D G C E H I FInorder (LNR): D G B A H E I C FPostorder (LRN): G D B H I E F C A
Visitor Design Pattern When an action is needed on each element of a collection
Don't know in advance what the type of each element will be
Defines the visit() method which denotes what a visitor does For a specific visitor pattern
1. Create a Visitor interface
2. Create a class that implements the Visitor interface
3. In class scanning a tree Create an object of the class implementing the Visitor interface During traversal, call Visitor object's visit() and pass the current value as an
argument
Visitor Design Pattern (continued)
1.
2.
2.
Another possibility (requires T to be "Comparable"):
3. scanInorder()
This recursive method scanInorder()provides a generalized inorder traversal of a tree that performs an action specified by a visitor pattern.
Uses Visitor object's visitinstead of System.out.println
scanInOrder()
If Visitor parameter is VisitorOutput Prints output to the console
If Visitor parameter is VisitMax VisitMax parameter stores the tree element with
the max value after scanInOrder finishes
Program B_Tree
Illustrates use of all the scanning methods
Computing Tree Height Recall that the height of a binary tree can be
computed recursively.
height(T) =
-1 if T is empty
1 + max(height(TL), height(TR)) if T is nonempty{
F
Computing Tree Height (continued)
Copying a Binary Tree Simple case – exact duplicate Duplicate with additional information
Contain nodes with additional field Possibility - references the parent - this allows a scan up the
tree along the path of parents
Copying a Binary Tree (continued) Copy a tree using a postorder scan
Build the duplicate tree from the bottom up.
Clearing a Binary Tree Clear a tree with a postorder scan
Remove the left and right subtrees before removing the node.
Binary Search (Sorted) Trees
Binary Search Trees
Assume each data element has some key value For every node
Key is greater than all keys found in the left subtree Key less than all keys found in the right subtree
All nodes can be considered ordered (i.e., sorted)
BSTs - More Average depth of a balanced tree is log2N Function Definitions
Make_Empty O(N) Find O(log2N)
Find_Min / Find_Max O(log2N)
Insert O(log2N)
Remove O(log2N)
Most operations on a binary search tree take time directly proportional to the tree's height, so it is desirable to keep the height small. Ordinary binary search (unbalanced) trees have the primary disadvantage that they can attain very large heights in rather ordinary situations, such as when the keys are inserted in order. The result is a data structure similar to a linked list, making all operations on the tree expensive.
Binary Search (sorted) Trees
21
10 34
5
72
25 39
33
a total order (linear order) is defined on node or key values left subtree of a node contains only values less than node or key value right subtree contains only values greater than or equal to node or key value.
Inserting into a Sorted Binary Tree Create the new node (set children to null) If first node in tree
Make new node be the root
Else – determine where to insert the new node Set current node to root Loop until done
If new node greater than current node If current.right node is null – end of a branch
Set current.right to the new node Set done to true
Else Set current to current.right
Similar for less than on the left side…
Deleting from a Sorted Binary Tree Several cases to consider
If delete leaf node Remove node from tree
If delete node with one child Delete node and replace with its child
If delete node with two children Find the successor of the node Copy the successor value into the deletion position Delete the successor node (cases 1 and 2 above)
Successor will not have a left branch See following slides
Deleting from a Sorted Binary Tree
50
25
3515
205 30 40
Delete 25
32
31 33
Delete node with two children example
50
30
3515
205 30 40
Copy successor
32
31 33
Delete this node
Deleting from a Sorted Binary Tree How to find successor (node replacing deleted node)
50
25
3515
205 30 40
Delete 25
32
31 33
Start with deleted node's right child Then follow path of left children to
the end – this is the successor
30 has no left child – this is the successorThe successor will never have a left child
Searching a Sorted Binary Tree for a Value Set current to root
Loop while current's value isn't the item's value If item sought value is less than current's value
Set current to current.right Else
Set current to current.left If current is null didn't find the item
Heaps
Chapter 22 Ford and Top Array based binary trees
Array-Based Binary Trees A complete binary tree of depth d Contains all possible nodes through level d-1 Nodes at level d in the leftmost positions in the tree. An array a can be viewed as a complete binary tree
root is a[0] first-level children are a[1] and a[2] second-level children are a[3], a[4], a[5], a[6] and so forth.
Array-Based Binary Trees (continued)
Integer[] arr = {5, 1, 3, 9, 6, 2, 4, 7, 0, 8};
Array-Based Binary Trees (concluded) For element a[i] in an n-element array‑based
binary tree:
Left child of a[i] is a[2*i + 1]undefined if 2*i + 1 n
Right child of a[i] is a[2*i + 2]undefined if 2*i + 2 n
Parent of a[i] is a[(i-1)/2]undefined if i = 0
Heaps
Array based tree structure Max heap:
If B is a child node of A, then key(A) >= key(B) Min heap:
If relationship is reversed: key(A) <= key(B)
Root is max if max heap, min if min heap
Heaps A maximum heap is an array‑based
tree in which the value of a parent is ≥ the value of its children. A minimum heap uses the relation ≤.
Inserting into a Heap Assume that an array with elements inthe index range 0 i < last < n forms a heap. The new element will enter the array at index last with
the heap expanding by one element.
Inserting into a Heap (continued) Move nodes on the path of parents down one
level until the item is assigned as a parent that has heap ordering.
Path of parents for insertion of item = 50
Inserting into a Heap (continued) The static method pushHeap() of the class Heaps inserts
a new value in the heap. The parameter list includes the array arr, the index last, the new value item, and a Comparator object of type Greater or Less indicating whether the heap is a maximum or minimum heap.
Inserting into a Heap (continued) The algorithm uses an iterative scan with variable
currPos initially set to last. At each step, compare the value item with the value of the parent and if item is larger, copy the parent value to the element at index currPos and assign the parent index as the new value for currPos. The effect is to move the parent down one level. Stop when the parent is larger and assign item to the position currPos.
Inserting into a Heap (continued)
Deleting from a Heap
Deletion from a heap is normally restricted to the root only. Hence, the operation removes the maximum (or minimum) element.
To erase the root of an n‑element heap, exchange the element at index n‑1 and the root and filter the root down into its correct position in the tree.
Deleting from a Heap (continued)
Deleting from a Heap (continued)
1 8
35
3 882 51 0
4 03 0
a r r [ 0 ]
a r r [ 1 ] a r r [ 2 ]
a r r [ 4 ]
a r r [ 8 ]a r r [ 7 ]
a r r [ 5 ]a r r [ 3 ] a r r [ 6 ]
F ilte r d o w n 1 8
adjustHeap() The implementation of adjustHeap()
uses the integer variables currPos and childPos to scan the path of children. Let currPos = first and target = arr[first]. The iterative scan
proceeds until we reach a leaf node or target is ≥ to the values of the children at the current position.
Move currPos and childPos down the path of children in tandem. Set childPos = index of the largest (smallest) of arr[2*currPos + 1] and arr[2*currPos + 2].
Implementing popHeap() The implementation first captures the root
and then exchanges it with the last value in the heap (arr[last-1]). A call to adjustHeap() reestablishes heap order in a heap which now has index range[0, last-1). Method popHeap() concludes by returning the original root value.
Complexity of Heap Operations A heap stores elements in anarray-based tree that is a complete tree. The pushHeap() and adjustHeap() operations reorder elements in the tree by move up the path of parents for push() and down the path of largest (smallest) children for pop(). Assuming the heap has n elements, the maximum length for a path between a leaf node and the root is log2n, so the runtime efficiency of the algorithms is O(log2 n)
Sorting with a Heap If the original array is a maximum heap, an
efficient sorting algorithm can be devised. For each iteration i, the largest element is arr[0].
Exchange arr[0] with arr[i] and then reorder the array so that elements in the index range [0, i) are a heap. This is precisely the action of popHeap(), which is an O(log2n) algorithm.
By transforming an arbitrary array into a heap, this algorithm will sort the array.
Building a Heap Transforming an arbitrary array into a heap is
called "heapifying" the array. The method makeHeap() in the Heaps class
performs this transformation. Turn an n‑element array into a heap by filtering
down each parent in the tree beginning with the last parent at index(n-2)/2 and ending with the root node at index 0
Building a Heap (continued)Integer[] arr = {9, 12, 17, 30, 50, 20, 60, 65, 4, 19};
Building a Heap (continued)
4
9
1 96 5
6 02 05 03 0
1 71 2
a d ju s tH e a p () a t 4N o c h a n g e s
(a )
4
9
1 93 0
6 02 05 065
171 2
a d ju s tH e a p () a t 3M o v e 30 d o w n 1 le v e l
(b )
5 0
4
9
1 93 0
1 72 06 5
6 01 2
a d ju s tH e a p () a t 2M o v e 17 d o w n 1 le v e l
(c )
Heapsort
The heap sort is a modified version of the selection sort for an array arr that is a heap. For each i = n, n-1, ..., 2, call popHeap() which pops
arr[0] from the heap and assigns it at index i-1. With a maximum heap, the array is assorted in
ascending order. A minimum heap sorts the array in descending order.
Heapsort (continued)public static <T> void
heapSort(T[] arr, Comparator<? super T> comp){ // "heapify" the array arr Heaps.makeHeap(arr, comp);
int i, n = arr.length;
// iteration that determines elements // arr[n-1] ... arr[1] for (i = n; i > 1; i--) { // call popHeap() to move next // largest to arr[n-1] Heaps.popHeap(arr, i, comp); }}
Heapsort (concluded) A mathematical analysis shows that the worst
case running time of makeHeap() is O(n). During the second phase of the heap sort,
popHeap() executes n - 1 times. Each operation has efficiency O(log2 n).
The worst-case complexity of the heap sort is O(n) + O(n log2 n) = O(n log2 n).
Implementing a Priority Queue Recall that the HeapPQueue class implements
the PQueue interface. The class uses a heap as the underlying storage
structure. The user is free to specify either a Less or Greater
comparator which dictates whether a deletion removes the minimum or the maximum element from the collection.
Implementing a Priority Queue (continued)
The HeapPQueue Classpublic class HeapPQueue<T> implements PQueue<T>
{ // heapElt holds the priority queue elements private T[] heapElt;
// number of elements in the priority queue private int numElts;
// Comparator used for comparisons private Comparator<T> comp;
// create an empty maximum priority queue public HeapPQueue() { comp = new Less<T>(); numElts = 0; heapElt = (T[]) new Object[10]; } . . .}
HeapPQueue Class peek()
// return the highest priority item// Precondition: the priority queue is not empty;// if it is empty, throws NoSuchElementExceptionpublic T peek(){ // check for an empty heap if (numElts == 0) throw new NoSuchElementException( "HeapPQueue peek(): empty queue"); // return the root of the heap return heapElt[0];}
HeapPQueue Class pop()// erase the highest priority item and return it
// Precondition: the priority queue is not empty;// if it is empty, throws NoSuchElementExceptionpublic T pop(){ // check for an empty priority queue if (numElts == 0) throw new NoSuchElementException( "HeapPQueue pop(): empty queue"); // pop the heap and save the return value in top T top = Heaps.popHeap(heapElt, numElts, comp);
// heap has one less element numElts--;
return top;}
HeapPQueue Class push()
// insert item into the priority queuepublic void push(T item){ // if the current capacity is used up, reallocate // with double the capacity if (numElts == heapElt.length) enlargeCapacity();
// insert item into the heap Heaps.pushHeap(heapElt, numElts, item, comp); numElts++;}
Recommended