Machine Learning: BITS Pilani

Machine Learning
(BITS F464)
Dr.N.L.Bhanu Murthy
BITS Pilani
Hyderabad Campus
What is Learning?
“Gain knowledge or understanding of or skill in by study,

instruction or experience” - Webster
BITS Pilani, Hyderabad Campus

What is Learning?
“Learning is any process by which a system improves

performance from experience.” - Herbert Simon
Researcher in Professor @
 Artificial Intelligence  Carnegie Mellon University
 Cognitive psychology  University of California,
 Computer science Berkeley
 Economics  Illinois Institute of Technology
 Political science
Awards:
 Turing Award, 1975
 Nobel Prize in Economics1978
 National Medal of Science1986 1916 - 2001
 von Neumann Theory Prize1988

What is Machine Learning?
Machine Learning is study of

algorithms that
 improve their performance P
 at some task T
 with experience E
Tom Mitchell (1990)
Well-defined learning task: <P,T,E>

Example - Machine Learning
Handwritten Digit Recognition
T: Recognizing hand-written words
P: Percentage of words correctly classified
E: Database of human-labeled images of handwritten words

T: Driving on four-lane highways using vision sensors
P: Average distance traveled before a human-judged error
E: A sequence of images and steering commands recorded while
observing a human driver.



Learning to drive an autonomous

vehicle (Pomerleau, 1989).



Examples of Successful Applications of Machine Learning
 Learning to recognize spoken words

(Lee, 1989; Waibel, 1989).
 Learning to classify new astronomical structures

(Fayyad et al., 1995).
 Learning to play world-class backgammon

(Tesauro 1992, 1995).
 Categorize email messages as spam or legitimate.

Gary Kasparov on loss to Deep Blue
• Human uses 1% calculation, 99% understanding

– based on patterns, drawing information from experience
• Machine opposite: 99% calculation 1% understanding
– though this understanding is growing

Machine Learning, a Magic?
No, more like gardening

 Seeds = Algorithms
 Nutrients = Data
 Gardener = You
 Plants = Programs

Machine Learning in Computer Science
Speech/Au Robotics
dio
Processing Planning
Natural
Language
Processing
Machine Vision/Image
Processing
Biomedical/Chemed
ical Learning
Informatics
Human Financial Modeling

Computer Analytics
Interaction

They said it!!
 “A breakthrough in machine learning would be worth ten Microsofts”
- Bill Gates, Chairman, Microsoft
 Machine learning is the hot new thing”

- John Hennessy, President, Stanford
 “Web rankings today are mostly a matter of machine learning”

- Prabhakar Raghavan, Dir. Research, Yahoo
 “Machine learning is going to result in a real revolution”

- Greg Papadopoulos, CTO, Sun
 “Machine learning is today’s discontinuity” - Jerry Yang, CEO, Yahoo
 “Machine learning is the next Internet”

- Tony Tether, Director, DARPA
Future prospects..

History of Technology

12 IT skills that employers can't say no to
1) Machine learning
2) Mobilizing applications
3) Wireless networking
4) Human-computer interface
5) Project management
6) General networking skills
7) Network convergence technicians
8) Open-source programming
9) Business intelligence systems
10) Embedded security
11) Digital home technology integration
12) .Net, C #, C ++, Java -- with an edge

History of Machine Learning
• 1950s
– Samuel’s checker player
– Selfridge’s Pandemonium
• 1960s:
– Neural networks: Perceptron
– Pattern recognition
– Learning in the limit theory
– Minsky and Papert prove limitations of Perceptron
• 1970s:
– Symbolic concept induction
– Winston’s arch learner
– Expert systems and the knowledge acquisition bottleneck
– Quinlan’s ID3
– Michalski’s AQ and soybean diagnosis
– Scientific discovery with BACON
– Mathematical discovery with AM

History of Machine Learning (cont.)
• 1980s:
– Advanced decision tree and rule learning
– Explanation-based Learning (EBL)
– Learning and planning and problem solving
– Utility problem
– Analogy
– Cognitive architectures
– Resurgence of neural networks (connectionism,
backpropagation)
– Valiant’s PAC Learning Theory
– Focus on experimental methodology
• 1990s
– Data mining
– Adaptive software agents and web applications
– Text learning
– Reinforcement learning (RL)
– Inductive Logic Programming (ILP)
– Ensembles: Bagging, Boosting, and Stacking
– Bayes Net learning
History of Machine Learning (cont.)
• 2000s
– Support vector machines
– Kernel methods
– Graphical models
– Statistical relational learning
– Transfer learning
– Sequence labeling
– Collective classification and structured outputs
– Computer Systems Applications
• Compilers
• Debugging
• Graphics
• Security (intrusion, virus, and worm detection)
– Email management
– Personalized assistants that learn
– Learning in robotics and vision

Why does Machine Learning need math?
 Calculus
– We need to identify the maximum likelihood, or minimum risk.
Optimization
– Integration allows the marginalization of continuous probability
density functions
 Linear Algebra
– Many features leads to high dimensional spaces
– Vectors and matrices allow us to compactly describe and manipulate
high dimensional feature spaces.
 Vector Calculus
– All of the optimization needs to be performed in high dimensional
spaces
– Optimization of multiple variables simultaneously – Gradient Descent
– Want to take a marginal over high dimensional distributions like
Gaussians.
Teaching & Evaluation (BITS C464 – L P U – 3 0 3)
Evaluation Components & Criteria

Component Weightage Duration Date Mode
(out of 200)
Mid Test 50 90 minutes As per Closed Book
Timetable
Quiz(2) 30 (each quiz 30 minutes Closed Book
TBD
for 15)
Assignments 40 Open Book
Comprehensive 80 3 hours As per Closed Book

Timetable
Make-up Policy: Make-up for other tests will be granted on prior permission and on
justifiable grounds only.
Course Notices: All notices pertaining to this course will be displayed on the LTC
Notice Board as well as the CS & IS Notice Board.
Chamber Consultation: Friday 1600 Hrs – 1700 Hrs

Text Book
T1. Christopher Bishop: Pattern Recognition and Machine Learning,

Springer International Edition.

Reference Book
R1. Tom M. Mitchell: Machine Learning, The McGraw-Hill Companies, Inc.

Reference Book

Reference Book

Reference Book

Supervised Learning

Supervised Learning

Supervised Learning

Supervised Learning

Supervised Learning

Supervised Learning

Supervised Learning

Supervised Learning

Supervised Learning

Supervised Learning
Decision Tree Learning

 Target Concept
“Days on which my friend, yar, enjoys his favorite water sport”
(you may find it more intuitive to think of
“Days on which the beach will be crowded” concept)
 Task
Learn to predict the value of EnjoySport/Crowded for an arbitrary day
 Training Examples for the Target Concept
Example Sky Air Humidity Wind Water Forecast Enjoy

Temp Sport
0 Sunny Warm Normal Strong Warm Same Yes
1 Sunny Warm High Strong Warm Same Yes
2 Rainy Cold High Strong Warm Change No
3 Sunny Warm High Strong Cool Change Yes

Supervised Learning
Decision Tree Learning
 Hypothesis space search
 Occam’s razor
 Overfitting
 Measure for Selecting attributes –
Entropy, Gini Index etc.
 Issues in DT learning
Outlook
Sunny Overcast Rain
Humidity Yes Wind
High Normal Strong Weak
No Yes No Yes
Generative and Discriminative Models: An analogy
 Generative approach is to learn each language and determine

as to which language the speech belongs to
 Discriminative approach is determine the linguistic differences

without learning any language– a much easier task!

Taxonomy of ML Models

Supervised Learning
denotes +1
Support Vector Machine (SVM)
denotes -1
x2
V. Vapnik
x1

Supervised Learning
denotes +1
denotes -1
x2
V. Vapnik
x1

Supervised Learning denotes +1
denotes -1
x2
V. Vapnik
x1

denotes -1
x2
V. Vapnik
x1

denotes -1
x2
Support Vector Machine (SVM) Margin
“safe zone”
V. Vapnik
x1

Supervised Learning
Artificial Neural Networks (ANN)
 Networks of processing units (neurons) with connections

(synapses) between them
 Large number of neurons: 1014
 Large connectitivity: 104
 Parallel processing
 Distributed computation/memory
 Robust to noise, failures

Supervised Learning
Regression
 Polynomial Curve Fitting

 Model Selection
 Overfitting & Regularization
 Probabilistic interpretation
 Bayesian curve fitting
 Linear Basis Function Models
 Bias – Variance Decomposition
 Bayesian Linear Regression

Unsupervised Learning
 Clustering Algorithms

Model / Hypothesis Evaluation
t Tests
Precision
Recall
F-Measure
AuC for ROC curves
R2
Spearman / Pearson Correlation

Thank You!!

Machine Learning: BITS Pilani

Hochgeladen von

Dokumentinformationen

Originaltitel

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Machine Learning: BITS Pilani

Hochgeladen von

Copyright:

Verfügbare Formate

Machine Learning

“Gain knowledge or understanding of or skill in by study,

BITS Pilani, Hyderabad Campus

“Learning is any process by which a system improves

BITS Pilani, Hyderabad Campus

Machine Learning is study of

Well-defined learning task: <P,T,E>

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

Learning to drive an autonomous

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

 Learning to recognize spoken words

 Learning to classify new astronomical structures

 Learning to play world-class backgammon

 Categorize email messages as spam or legitimate.

BITS Pilani, Hyderabad Campus

• Human uses 1% calculation, 99% understanding

BITS Pilani, Hyderabad Campus

No, more like gardening

BITS Pilani, Hyderabad Campus

Human Financial Modeling

BITS Pilani, Hyderabad Campus

 Machine learning is the hot new thing”

 “Web rankings today are mostly a matter of machine learning”

 “Machine learning is going to result in a real revolution”

 “Machine learning is today’s discontinuity” - Jerry Yang, CEO, Yahoo

 “Machine learning is the next Internet”

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

Evaluation Components & Criteria

Comprehensive 80 3 hours As per Closed Book

Chamber Consultation: Friday 1600 Hrs – 1700 Hrs

T1. Christopher Bishop: Pattern Recognition and Machine Learning,

BITS Pilani, Hyderabad Campus

R1. Tom M. Mitchell: Machine Learning, The McGraw-Hill Companies, Inc.

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

Decision Tree Learning

 Training Examples for the Target Concept

Example Sky Air Humidity Wind Water Forecast Enjoy

BITS Pilani, Hyderabad Campus

Sunny Overcast Rain

Humidity Yes Wind

High Normal Strong Weak