What is a Neural Network?
A Neural Network is a multi-layer parametric function with learned parameters.
This is the definition given by Professor Sergey Levine at 25:50 in the second part of Lecture 1.
In a Neural Network, the parameters of every layer are usually (though not always) trained with respect to the overall task objective (e.g., accuracy, loss, or cumulative reward).
This is also called end-to-end learning.
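A minimal end-to-end training sketch in plain NumPy, assuming illustrative layer sizes, a squared-error objective on one toy example, and a hand-rolled backward pass (none of these choices come from the lecture):

```python
import numpy as np

rng = np.random.default_rng(0)

# Two-layer parametric function f(x) = W2 @ relu(W1 @ x + b1) + b2.
# All sizes and the learning rate are illustrative choices.
W1, b1 = rng.normal(size=(4, 3)) * 0.1, np.zeros(4)
W2, b2 = rng.normal(size=(1, 4)) * 0.1, np.zeros(1)

def forward(x):
    h = np.maximum(0.0, W1 @ x + b1)   # layer 1: affine + ReLU
    return h, W2 @ h + b2              # layer 2: affine

# End-to-end: every layer's parameters receive a gradient from the
# single overall objective (here, squared error on one toy input).
x, target = rng.normal(size=3), np.array([1.0])
for _ in range(200):
    h, y = forward(x)
    dy = 2.0 * (y - target)            # d loss / d y
    dW2, db2 = np.outer(dy, h), dy
    dh = (W2.T @ dy) * (h > 0)         # backprop through the ReLU
    dW1, db1 = np.outer(dh, x), dh
    for p, g in ((W1, dW1), (b1, db1), (W2, dW2), (b2, db2)):
        p -= 0.1 * g                   # gradient step on every layer
```

After training, `forward(x)[1]` sits very close to the target; the point is that no layer has a hand-designed role — every parameter chases the one objective.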
Exceptions to this include freezing the parameters of certain layers and training only part of the network.
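A minimal sketch of that exception, assuming a made-up two-layer model in which the first layer is treated as pretrained and frozen while only the second layer is trained:

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative sizes; pretend W1 came from a pretrained model.
W1 = rng.normal(size=(4, 3))            # frozen: never updated below
W2 = rng.normal(size=(1, 4)) * 0.1      # trainable head

x, target = np.array([1.0, -1.0, 0.5]), np.array([2.0])
for _ in range(300):
    h = np.tanh(W1 @ x)                 # frozen feature extractor
    y = W2 @ h
    dy = 2.0 * (y - target)             # gradient of squared error
    W2 -= 0.05 * np.outer(dy, h)        # only the head gets updates
```

Only `W2` ever changes; the frozen layer just supplies fixed features, so this is no longer fully end-to-end.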
Neural Networks can acquire representations by using high-capacity models and a lot of data without requiring manual engineering of features or representations.
Model capacity here means how many different functions a particular model class can represent.
This means that we do not need to know what good features are, and we expect the model to figure it out from data.
When representations are learned in an end-to-end fashion, they are better tailored to the current task.
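As a toy illustration of the capacity point above (my example, not from the lecture): no linear model can represent XOR, however it is fit, while a one-hidden-layer network with hand-picked weights represents it exactly:

```python
import numpy as np

# XOR truth table.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0, 1, 1, 0], dtype=float)

# Best least-squares linear fit (with bias) still cannot separate XOR:
# it predicts ~0.5 on every input.
A = np.hstack([X, np.ones((4, 1))])
w, *_ = np.linalg.lstsq(A, y, rcond=None)
linear_pred = A @ w

# A hand-picked 2-unit hidden layer represents XOR exactly:
# h1 = relu(x1 + x2), h2 = relu(x1 + x2 - 1), out = h1 - 2*h2.
H = np.maximum(0, X @ np.array([[1, 1], [1, 1]]).T + np.array([0, -1]))
mlp_pred = H @ np.array([1, -2])
```

The linear model class simply does not contain any function that matches the data; the slightly larger class does.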
Pros and cons of Neural Networks
They need to be huge and require large amounts of data and compute.
As we add more layers, data, and compute, they become more and more powerful.
But they do plateau.
Learning (nurture) and Inductive Bias (nature)
Learning-dominant models are those that get most of their performance from data rather than from a designer's insight.
Scalability
Scalability is the property that performance keeps improving as we add more data, representational capacity, and compute.
Why do we call them Neural Networks?
Neural Networks were proposed as a rudimentary model of neurons in the brain.
In our brain, dendrites receive signals from other neurons.
The neuron decides whether to fire based on incoming signals.
The axon transmits the signal further to downstream neurons.
An artificial neuron sums up signals from upstream neurons (units).
The unit decides how much to fire based on incoming signals.
Its activation is transmitted to downstream units.
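The lines above describe a single artificial unit; a minimal sketch, with the weights, inputs, and sigmoid activation all chosen arbitrarily for illustration:

```python
import numpy as np

# A single artificial unit: sum the weighted signals from upstream
# units, then pass the sum through an activation to decide how much
# to "fire".
def unit(inputs, weights, bias):
    z = np.dot(weights, inputs) + bias   # summed incoming signal
    return 1.0 / (1.0 + np.exp(-z))      # sigmoid activation in (0, 1)

# Example: three upstream signals with arbitrary weights.
a = unit(np.array([0.5, -1.0, 2.0]), np.array([1.0, 0.5, 0.25]), bias=0.0)
```

Unlike the biological neuron's all-or-nothing spike, the artificial unit outputs a graded activation that is passed downstream.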
The function that represents the transformation from input to internal representation to output is usually a deep Neural Network.