Back to all chapters
# Artificial Neural Networks

A quick dive into a cutting-edge computational method for learning.

Dogs and cats are both furry animals with four legs and many other shared traits. Why, then, is it easy to distinguish between them?

As young people, we’re told which animals we observe are dogs and which are cats. Fairly quickly, we stop needing new examples. Our learning is powerful enough to classify a new animal as a dog or a cat, even when it doesn’t look particularly similar to one we’ve seen before. It turns out that computers can learn similarly.

A **supervised learning** algorithm attempts to model a function to relate inputs to outputs. It uses known examples to learn this relationship.

When building a supervised learning model to distinguish whether an image is of a dog or a cat, what should the inputs for the examples be?

After using images as examples to **train** (or teach) the model, you’ll want to use images to **test** the model; that is, to determine if the model is actually successful at identifying if the image is of a dog or a cat.

What would be a reasonable way to select your images for training and testing?

A good supervised learning model predicts the outputs of unobserved inputs using knowledge of the outputs of observed inputs. The ability to make successful predictions on unobserved inputs from observed data is called **generalization**.

For any observed data, there are an infinite number of functions that pass through all input-output pairs. The “best” function is *not* necessarily one which fits all of the observed data, but instead is one that generalizes well.

In the task of distinguishing between dogs and cats, we wanted to classify an image into discrete categories with no numerical relationship; i.e., we cannot say \(\text {dog}\) is 2 times \(\text {cat}.\) This type of problem is called a **classification** problem.

On the other hand, in the previous question, we found a function to relate an input to a numerical output (height). These outputs have a clear numerical relationship; e.g., the output of 6 feet is twice the output of 3 feet. This type of problem is known as a **regression** problem.

Artificial neural networks (ANNs) are flexible enough to be used in both classification and regression problems.

Of the following three learning problems, how many should be treated as **regression** problems?

- Identifying which zip-code digits have been written on an envelope
- Predicting the total number of points scored by two teams in a basketball game
- Determining the adult height of a child given his/her age and current height

This quiz has focused on supervised learning problems, as many of the basic applications of ANNs involve supervised learning. However, there is another type of learning: **unsupervised learning**.

As the name might imply, unsupervised learning attempts to determine relationships between inputs without using any example outputs (such as “dog” or “cat” in our earlier example). Which of the following problems would **not** be a good fit for unsupervised learning?

We now have a big-picture sense of what learning problems are all about. There are two main types of learning: unsupervised and supervised, and problems within supervised learning can be categorized as classification or regression problems. This Exploration illustrates how to construct ANNs, a computational model that performs well in a wide variety of learning problems such as the ones discussed in this quiz.

In the next quiz, we’ll explore the basics of how the human brain works and how these ideas inspire the mechanisms within ANNs.

×

Problem Loading...

Note Loading...

Set Loading...