DS&A - Data Structures & Algorithms

Exercises: Algorithm Correctness

May 142018

Before attempting these exercises, you should read the posts on algorithm correctness, invariants and variants.

Tags:
exercises, correctness, assertions, invariants, variants, algorithms

Variants

May 142018

An algorithm is correct if it always returns the correct result.

Assertions are our main tool for proving algorithms are correct; they state that some condition will be true whenever a particular line of code is reached. The goal is to assert that, when the algorithm finishes running, the result is correct. If this assertion never fails, then the algorithm will never return a wrong result.

Tags:
variants, correctness, algorithms

Invariants

May 142018

An assertion is a claim about the state of a running program — it says that some logical condition should be true whenever a particular line of code is reached. In the previous post, we only considered assertions in code where each line is executed once, in sequence.

Of course, real algorithms are more complicated, because loops and recursion cause some code to be executed multiple times, not in a simple sequence. In this post we’ll explore invariants, which are needed to prove correctness of non-trivial algorithms.

Tags:
invariants, correctness, algorithms

Is Your Algorithm Correct?

May 142018

When we write an algorithm to solve a problem, the algorithm is only correct if it does actually solve that problem. That should be obvious! But how do we know whether an algorithm is correct?

This is a genuine concern because algorithms often aren’t correct, and we need to debug them. We need a way to analyse an algorithm to decide if it really does solve the problem or not.

Tags:
assertions, correctness, algorithms

Exercises: Algorithm Efficiency

May 072018

Before attempting these exercises, you should read the posts on dynamic analysis and static analysis.

Tags:
exercises, efficiency, dynamic analysis, algorithms

Static Analysis

May 072018

In the previous post we analysed how efficient the insertion sort algorithm is. We did dynamic analysis — we analysed the algorithm by actually running it.

Technically, we analysed an implementation of the algorithm; perhaps a cleverer implementation could be more efficient. From one perspective, the implementation is what we really care about:

The implementation is what will actually be run in the real world.
A more efficient implementation will make a real piece of software faster.

On the other hand, it’s hard to do an accurate dynamic analysis:

There will always be some measurement error.
It might take a long time to do the analysis.
Running time depends on external factors like how fast the computer is, or what other programs are running. Results from different computers, or even the same computer on different days, may be incomparable.

In this post, we’ll get around these problems by making a series of simplifications. We’ll do a kind of static analysis called asymptotic analysis — this means we’ll analyse algorithms without running them, and we’ll assume the inputs are large.

Tags:
efficiency, complexity, static analysis, algorithms

‘Big O’ Notation

May 072018

By making a series of assumptions and considering only “large” inputs, we can analyse how efficient an algorithm is without actually running it. The result of this analysis is a mathematical formula called the complexity (or time complexity) of the algorithm. For example, we derived the formula n² for the selection sort algorithm.

Using standard notation, we would say that selection sort’s complexity is O(n²), or that selection sort is an O(n²) algorithm.

This formula says, very roughly, how much “work” the algorithm has to do as a function of n, which represents the “input size”. In this post, we’ll see how to make sense of this formula, and how to derive it with as little algebra as possible.

Tags:
big O notation, efficiency, complexity, static analysis, algorithms

How Fast is Your Algorithm?

May 072018

Efficiency is important; people don’t like waiting for computers to slowly do things that ought to be fast.

(This interactive feature requires Javascript to be enabled in your browser.)

So far we’ve been thinking about this in vague terms; we know, for example:

Getting from an array-list is faster than from a linked list,
Binary search is faster than linear search,
Merge sort is faster than selection sort.

Now it’s time to go into detail: how do we know how fast an algorithm is?

Tags:
efficiency, dynamic analysis, algorithms

Exercises: Algorithm Design

Apr 302018

Before attempting these exercises, you should read the posts on algorithm strategies and choosing data types.

Tags:
algorithms, algorithm strategies, exercises

Choose the Right Data Structures

May 072018

In an earlier post we saw the importance of choosing the right data types when designing an algorithm. But we don’t just choose a type — whether we’re aware of it or not, we are also choosing a data structure which implements that type.

Both choices can have a big impact on how efficient an algorithm is; let’s consider some examples.

Tags:
efficiency, data structures, algorithms

Algorithm Strategies

Apr 302018

Perhaps the most difficult part of our process for writing algorithms is splitting a problem into subproblems. This is more an art than a science — there’s no systematic way to identify subproblems, and each problem might be split into subproblems in many different ways.(1)

Tags:
brute-force, greedy algorithms, divide and conquer, algorithm strategies, algorithms

Polygons

Apr 232018

A polygon is any shape made of straight-line segments. Polygons are especially important in computer graphics and games, since they’re used in vector image formats and 3D models.

Tags:
geometry, data structures, algorithms

Simple Shapes

Apr 232018

The most primitive geometric objects are points and vectors. But real geometric problems involve shapes like rectangles, triangles, circles, and polygons — or more complex collections of shapes, such as 3D models made of many polygons. In this post we’ll consider how computers can represent some simple kinds of shapes, and solve problems using these representations.

Tags:
geometry, data structures, algorithms

Exercises: Sorting

Apr 162018

Before attempting these exercises, you should read the posts on comparisons and swaps, iterative sorting and recursive sorting.

Tags:
exercises, sorting, algorithms

Choose the Right Data Types

Apr 302018

One of the most important and overlooked parts of designing algorithms is choosing the right data types. We often assume that the types of data an algorithm will use are determined by the inputs and the output — but it might help to use a temporary collection of some data, if that collection has useful operations. Those operations are determined by the collection’s type.

In this post we’ll look at a couple of examples where using the right data types makes the problem simpler.

Tags:
algorithms, collections, plans, abstract data types

Recursive Sorting

Apr 162018

In the previous post we saw two iterative sorting algorithms: selection sort and insertion sort both work using a loop to put the right values in the right places, one at a time. They are two of the simplest sorting algorithms. But they’re not very efficient — for long lists, they’re too slow.(1)

In this post we’ll see one of the simplest efficient sorting algorithms, called merge sort. Rather than sorting items one-by-one, it works by recursively sorting shorter lists.

Tags:
recursion, divide and conquer, sorting, algorithms

Iterative Sorting

Apr 162018

Before writing an algorithm, it’s always worth thinking about how you would solve the same problem “by hand”. After all, you can’t write instructions for how to do something if you don’t know how to do it yourself.

The problem we’re currently interested in is sorting — given a list, put its values in order. There are hundreds of algorithms which solve this problem, but ultimately every sorting algorithm does one of two things:

Tags:
sorting, lists, algorithms

Exercises: Searching Trees

Apr 092018

Before attempting these exercises, you should read the posts on depth-first search (DFS) and breadth-first search (BFS).

Tags:
exercises, BFS, DFS, trees, search, algorithms

Exercises: Searching Lists

Apr 092018

Before attempting these exercises, you should read the post on linear and binary search.

Tags:
exercises, linear search, binary search, lists, search, algorithms

Comparisons and Swaps

Apr 162018

We have seen two search algorithms on lists — linear search and binary search. Both algorithms find the index of a given target value in a list. But they make different assumptions about the data in the list: linear search works on any list, whereas binary search only works if the list is in order.

Tags:
sorting, algorithms, problems

Breadth-First Search

Apr 092018

Depth-first search (DFS) is a fundamental algorithm for working with trees, because it gives us a way to “visit” every node in the tree, once each.(1) That’s important, because visiting every node is a subproblem of many other problems on trees.

DFS explores the full depth of one subtree before moving onto the next one. If that’s the order we want to visit the nodes in, or the order doesn’t matter, then DFS is fine; otherwise, we need a different algorithm. We’ll see one in this post.

Tags:
BFS, queues, trees, search, algorithms

Depth-First Search

Apr 092018

Search problems vary in two ways: what do we want to search for, and what type of collection are we searching in?(1)

When we need a search algorithm, the data structure we want to search is more important than what we want to find. We could even say that linear search and binary search work on different data structures: linear search uses a list, and binary search uses a list which must be kept in order.(2)

Tags:
DFS, stacks, trees, recursion, search, algorithms

Search Algorithms

Apr 092018

Search problems are problems which ask us to find something, given a description of it. Typical search problems are like these:

Given a list, find the index of a target value.
Given a list of numbers, find the first negative number.
Find the first non-empty line in a text file.
Find all rows in an SQL database table matching a given WHERE clause.
Given a tree, find a node which holds a target value.
Given an HTML document, find the first <h1> tag.
Find all tags in an HTML document matching a given CSS selector.
Starting at a given node in a graph, find all nodes reachable via edges.

A search algorithm is one which solves a search problem. In this post, we’ll see two algorithms which solve search problems on lists.(1)

Tags:
linear search, binary search, search, algorithms

Exercises: Recognising “Plans”

Mar 192018

Before attempting these exercises, you should read the posts on specifying problems, problem-solving and algorithmic “plans”.

Tags:
exercises, problems, plans, algorithms

Exercises: Problem-Solving with “Plans”

Mar 192018

Before attempting these exercises, you should read the posts on specifying problems, problem-solving and algorithmic “plans”.

Tags:
exercises, problems, plans, algorithms

Algorithmic “Plans”

Mar 192018

“Plans” are the basic “blueprints” or “building blocks” for algorithms — they are canned solutions to common programming problems which are simple but appear in many variations. Thinking about plans makes it easier to understand code, because we can see the intentions rather than thinking about one line at a time. Each plan usually only solves part of a problem, so a given piece of code may use many plans, and some plans always use other plans.

Tags:
plans, algorithms

Specifying Problems

Mar 192018

We want to write algorithms, because algorithms solve computational problems. Before writing an algorithm, we need to make the problem specific enough — we need to understand exactly what our algorithm is required to do.

Tags:
plans, problems, algorithms

A Problem-Solving Process

Mar 192018

A computer programmer is somebody who converts computational problems into computational solutions. This is a “meta-problem”:

Given a problem, write a computer program which solves it.
- Input: a problem statement.
- Output: a computer program.

A computer can’t do this — writing programs requires insight and ingenuity.(1) But there are some systematic processes we can follow when writing programs, so most of the time we don’t have to hope for a “eureka!” moment.

Tags:
plans, problems, algorithms

Recursion on Linked Lists

Mar 122018

A linked list is a recursive structure — a linked list node has a value, and a pointer to the next linked list node. This means we can solve problems on linked lists using structural recursion. In this post we’ll see three examples, to demonstrate the process of writing a structurally recursive algorithm.

Tags:
recursion, structural recursion, linked lists, algorithms

Recursion on Trees

Mar 122018

Recursion is one of the most difficult ideas in programming, but sometimes it can be simple. A recursive function is a function which can call itself. Some recursive functions are very puzzling, but others are so natural that you can understand them even without knowing what recursion is.

Tags:
recursion, structural recursion, trees, algorithms

Graph Problems

Mar 052018

Graphs and networks are useful because they’re very general — graphs show up almost everywhere. Given that there are so many different examples of graph structures, then, it’s perhaps surprising that there aren’t actually very many different kinds of problems involving graphs.

That is, there are many computing problems which graphs can help with, but not many kinds of problem. Usually, after translating from the problem domain into the language of graphs, the problem turns out to be one of a relatively small number of “classical graph problems”.

Tags:
graphs, problems, algorithms