Topics covered in the Workshop
- Introduction: Definition of learning systems. Goals and applications of machine learning. Aspects of developing a learning system: training data, concept representation, function approximation.
- Inductive Classification: The concept learning task. Concept learning as search through a hypothesis space. General-to-specific ordering of hypotheses.
- Decision Tree Learning: Representing concepts as decision trees. Recursive induction of decision trees. Picking the best splitting attribute: entropy and information gain. Searching for simple trees and computational complexity. Occam’s razor. Overfitting, noisy data, and pruning.
- Ensemble Learning: Using committees of multiple hypotheses. Bagging, boosting, and DECORATE. Active learning with ensembles.
- Experimental Evaluation of Learning Algorithms: Measuring the accuracy of learned hypotheses. Comparing learning algorithms: cross-validation, learning curves, and statistical hypothesis testing.
- Computational Learning Theory: Models of learnability: learning in the limit; probably approximately correct (PAC) learning. Sample complexity: quantifying the number of examples needed to PAC learn.
- Rule Learning: Propositional and First-Order: Translating decision trees into rules. Heuristic rule induction using separate-and-conquer search and information gain.
- Artificial Neural Networks: Neurons and biological motivation. Linear threshold units. Perceptrons: representational limitation and gradient descent training.
- Support Vector Machines: Maximum margin linear separators. Quadratic programming solution to finding maximum margin separators. Kernels for learning non-linear functions.
- Bayesian Learning: Probability theory and Bayes rule. Naive Bayes learning algorithm.
- Instance-Based Learning: Constructing explicit generalizations versus comparing to past specific examples. k-Nearest-neighbor algorithm. Case-based learning.
- Text Classification: Bag of words representation. Vector space model and cosine similarity. Relevance feedback and Rocchio algorithm.
- Clustering and Unsupervised Learning: Learning from unclassified data. Clustering. Hierarchical Agglomerative Clustering. k-means partitional clustering.
- Language Learning: Classification problems in language: word-sense disambiguation, sequence labeling. Hidden Markov models (HMMs).
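For a taste of what the practice sessions build toward, the entropy and information-gain split criterion from the Decision Tree Learning topic can be sketched as follows. This is an illustrative Python sketch (the lab itself uses R); the function names and the dict-based example format are assumptions of the sketch, not part of any particular library:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a collection of class labels, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(examples, labels, attribute):
    """Expected reduction in entropy from splitting on `attribute`.

    `examples` is a list of dicts mapping attribute name -> value.
    """
    n = len(labels)
    # Partition the labels by the value each example takes on `attribute`.
    partitions = {}
    for example, label in zip(examples, labels):
        partitions.setdefault(example[attribute], []).append(label)
    # Weighted average entropy of the partitions (the "remainder").
    remainder = sum(len(part) / n * entropy(part) for part in partitions.values())
    return entropy(labels) - remainder
```

A perfectly mixed binary sample has entropy 1 bit, and an attribute that separates the two classes completely recovers all of it, so its information gain equals the full entropy.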
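The perceptron training rule mentioned under Artificial Neural Networks is a mistake-driven update over a linear threshold unit. A minimal Python sketch, assuming labels in {-1, +1}; the parameter names `lr` and `epochs` are my own:

```python
def perceptron_train(X, y, epochs=10, lr=1.0):
    """Train a linear threshold unit with the perceptron rule.

    X is a list of feature lists; y holds labels in {-1, +1}.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            activation = sum(wj * xj for wj, xj in zip(w, xi)) + b
            if yi * activation <= 0:  # misclassified: nudge toward the example
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b

def perceptron_predict(w, b, xi):
    """Classify with the learned linear separator."""
    return 1 if sum(wj * xj for wj, xj in zip(w, xi)) + b > 0 else -1
```

On linearly separable data (for instance, the AND function with -1/+1 labels) the rule converges to a separating hyperplane; the representational limitation listed in the topic shows up on XOR, which no single unit can fit.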
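The Naive Bayes learning algorithm under Bayesian Learning, in its multinomial bag-of-words form with add-one (Laplace) smoothing, might look like the sketch below; the two-function split and whitespace tokenization are simplifying assumptions of this illustration:

```python
import math
from collections import Counter, defaultdict

def train_naive_bayes(docs, labels):
    """Count class frequencies and per-class word frequencies."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)  # class -> word -> count
    vocab = set()
    for doc, y in zip(docs, labels):
        for w in doc.lower().split():
            word_counts[y][w] += 1
            vocab.add(w)
    return class_counts, word_counts, vocab

def predict_naive_bayes(model, doc):
    """Pick the class maximizing log P(class) + sum of log P(word | class)."""
    class_counts, word_counts, vocab = model
    n_docs = sum(class_counts.values())
    best, best_lp = None, -math.inf
    for c, nc in class_counts.items():
        lp = math.log(nc / n_docs)  # log prior
        total = sum(word_counts[c].values())
        for w in doc.lower().split():
            # Laplace-smoothed estimate of P(w | c)
            lp += math.log((word_counts[c][w] + 1) / (total + len(vocab)))
        if lp > best_lp:
            best, best_lp = c, lp
    return best
```

Working in log space avoids underflow from multiplying many small probabilities, and the add-one count keeps unseen words from zeroing out a class entirely.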
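The k-nearest-neighbor algorithm from Instance-Based Learning reduces to a sort and a majority vote, which is the point of the contrast drawn in that topic: no explicit generalization is constructed before query time. A minimal sketch, assuming numeric feature vectors and Euclidean distance:

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, query, k=3):
    """Majority vote among the k nearest training points (Euclidean distance)."""
    dists = sorted((math.dist(x, query), y) for x, y in zip(train_X, train_y))
    top = [y for _, y in dists[:k]]
    return Counter(top).most_common(1)[0][0]
```

All work happens at prediction time; "training" is just storing the examples.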
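Cosine similarity over bag-of-words vectors, as listed under Text Classification, can be computed directly from term-frequency counts; this sketch tokenizes by whitespace and lowercases, both simplifying assumptions:

```python
import math
from collections import Counter

def cosine_similarity(doc_a, doc_b):
    """Cosine of the angle between two bag-of-words term-frequency vectors."""
    a = Counter(doc_a.lower().split())
    b = Counter(doc_b.lower().split())
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0
```

Identical documents score 1.0 and documents with no shared terms score 0.0; a full vector-space retrieval system would weight the counts (e.g. TF-IDF) before taking the cosine.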
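k-means partitional clustering (from Clustering and Unsupervised Learning) is usually presented as Lloyd's algorithm: alternate between assigning points to their nearest centroid and recomputing each centroid as the mean of its cluster. A sketch with random initial centroids drawn from the data; the `seed` parameter is my own addition for reproducibility:

```python
import math
import random

def kmeans(points, k, iterations=100, seed=0):
    """Lloyd's algorithm over a list of numeric tuples."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initialize from the data
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid's cluster.
        clusters = [[] for _ in range(k)]
        for p in points:
            idx = min(range(k), key=lambda i: math.dist(p, centroids[i]))
            clusters[idx].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = [
            tuple(sum(c) / len(c) for c in zip(*cl)) if cl else centroids[i]
            for i, cl in enumerate(clusters)
        ]
        if new_centroids == centroids:  # converged
            break
        centroids = new_centroids
    return centroids, clusters
```

Because the objective only ever decreases, the algorithm converges, but only to a local optimum; in practice it is rerun from several random initializations.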
LAB
- We will use RStudio and R packages for the practice sessions.
- We will cover one real-world project using machine learning.