Machine Learning

Machine Learning

Machine Learning: Lecture 1 Overview of Machine Learning (Based on Chapter 1 of Mitchell T.., Machine Learning, 1997) 1 Machine Learning: A Definition Definition: A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.

2 Examples of Successful Applications of Machine Learning Learning to recognize spoken words (Lee, 1989; Waibel, 1989). Learning to drive an autonomous vehicle (Pomerleau, 1989). Learning to classify new astronomical structures (Fayyad et al., 1995). Learning to play world-class backgammon (Tesauro 1992, 1995). 3 Why is Machine Learning

Important? Some tasks cannot be defined well, except by examples (e.g., recognizing people). Relationships and correlations can be hidden within large amounts of data. Machine Learning/Data Mining may be able to find these relationships. Human designers often produce machines that do not work as well as desired in the environments in which they are used. 4 Why is Machine Learning Important (Contd)? The amount of knowledge available about

certain tasks might be too large for explicit encoding by humans (e.g., medical diagnostic). Environments change over time. New knowledge about tasks is constantly being discovered by humans. It may be difficult to continuously re-design systems by hand. 5 Areas of Influence for Machine Learning Statistics: How best to use samples drawn from unknown probability distributions to help decide from which distribution some new sample is drawn? Brain Models: Non-linear elements with weighted inputs (Artificial Neural Networks) have been suggested as simple

models of biological neurons. Adaptive Control Theory: How to deal with controlling a process having unknown parameters that must be estimated during operation? 6 Areas of Influence for Machine Learning (Contd) Psychology: How to model human performance on various learning tasks? Artificial Intelligence: How to write algorithms to acquire the knowledge humans are able to acquire, at least, as well as humans? Evolutionary Models: How to model certain aspects of biological evolution to improve the performance

of computer programs? 7 Designing a Learning System: An Example 1. Problem Description 2. Choosing the Training Experience 3. Choosing the Target Function 4. Choosing a Representation for the Target Function 5. Choosing a Function Approximation Algorithm 6. Final Design 8

1. Problem Description: A Checker Learning Problem Task T: Playing Checkers Performance Measure P: Percent of games won against opponents Training Experience E: To be selected ==> Games Played against itself 9 2. Choosing the Training Experience Direct versus Indirect Experience [Indirect Experience gives rise to the credit assignment problem and is thus more difficult] Teacher versus Learner Controlled Experience [the teacher might provide training examples; the learner might suggest interesting

examples and ask the teacher for their outcome; or the learner can be completely on its own with no access to correct outcomes] How Representative is the Experience? [Is the training experience representative of the task the system will actually have to solve? It is best if it is, but such a situation cannot systematically be achieved] 10 3. Choosing the Target Function Given a set of legal moves, we want to learn how to choose the best move [since the best move is not necessarily known, this is an optimization problem] ChooseMove: B --> M is called a Target Function [ChooseMove,

however, is difficult to learn. An easier and related target function to learn is V: B --> R, which assigns a numerical score to each board. The better the board, the higher the score.] Operational versus Non-Operational Description of a Target Function [An operational description must be given] Function Approximation [The actual function can often not be learned and must be approximated] 11 4. Choosing a Representation for the Target Function Expressiveness versus Training set size [The more

expressive the representation of the target function, the closer to the truth we can get. However, the more expressive the representation, the more training examples are necessary to choose among the large number of representable possibilities.] Example of a representation: x1/x2 = # of black/red pieces on the board wis are adjustable x3/x4 = # of black/red king on the board or learnable

x5/x6 = # of black/red pieces threatened by red/black coefficients V(b) = w0+w1.x1+w2.x2+w3.x3+w4.x4+w5.x5+w6.x6 ^ 12 5. Choosing a Function Approximation Algorithm Generating Training Examples of the form [e.g.

Training the System ^ Defining a criterion for success [What is the error that needs to be minimized?] Choose an algorithm capable of finding weights of a linear function that minimize that error [e.g. the Least Mean Square (LMS) training rule]. 13

6. Final Design for Checkers Learning The Performance Module: Takes as input a new board and outputs a trace of the game it played against itself. The Critic: Takes as input the trace of a game and outputs a set of training examples of the target function The Generalizer: Takes as input training examples and outputs a hypothesis which estimates the target function. Good generalization to new cases is crucial. The Experiment Generator: Takes as input the current hypothesis (currently learned function) and outputs a new problem (an initial board state) for the performance system to explore In this course, we are mostly concerned

with the generalizer 14 Issues in Machine Learning (i.e., Generalization) What algorithms are available for learning a concept? How well do they perform? How much training data is sufficient to learn a concept with high confidence? When is it useful to use prior knowledge? Are some training examples more useful than others? What are best tasks for a system to learn? What is the best way for a system to represent its knowledge? 15

Recently Viewed Presentations

  • Job satisfaction, Employee attitudes, Motivation/job design

    Job satisfaction, Employee attitudes, Motivation/job design

    Employees who report having friends at work have higher levels of productivity, retention, and job satisfaction, and are seven times more likely to be engaged in their work than their "friendless" counterparts . More friends at work, although increases exhaustion...
  • Republic of the Philippines PHILIPPINE STATISTICS AUTHORITY Philippines

    Republic of the Philippines PHILIPPINE STATISTICS AUTHORITY Philippines

    Opportunities Encountered To assess the civil registration system in the country at the sub-national level, the Technical Working Group (TWG) composed of representatives from the NSO, Department of Health, National Statistical Coordination Board and University of the Philippines-Population Institute selected...
  • Higher Modern Studies - St Andrew's and St Brides's high school

    Higher Modern Studies - St Andrew's and St Brides's high school

    evaluative language. You must synthesise between sources. You must also make an overall conclusion about the issue in the question (2 marks) All sources must be used for full marks - 8 marks. Examples of Evaluative . Language. Significantly. more....
  • Diapositiva 1

    Diapositiva 1

    Diapositiva 1 ... Vaccinazioni
  • 2nd Period Review Projects 2015 LO 3.13: The

    2nd Period Review Projects 2015 LO 3.13: The

    The correct answer is C. In the data provided, indirectly defines each of the following modes of natural selection . The student just need to analyze the figure given and then apply it to the question. In this case, the...
  • Response To Intervention and Early Intervening Services ...

    Response To Intervention and Early Intervening Services ...

    Implications for Implementing an RTI Model (cont.) This data-based documentation, reflecting formal assessment of student progress during instruction, is a component of RTI models; however, this regulatory requirement is mandated, whether or not a state chooses to implement an RTI...
  • What is classification? - Seneca Valley School District

    What is classification? - Seneca Valley School District

    You live, from broad to specific… The Cosmos/ Universe. The Milkyway Galaxy. The Sol System (solar system) Planet Earth. North American Continent. United States. Pennsylvania. Our County. Our Town. Address. Number on road. Apartment at this number.
  • Ig Polypeptides Are Encoded by LIGHT CHAIN Multiple

    Ig Polypeptides Are Encoded by LIGHT CHAIN Multiple

    Junctional diversity at CDR3 Deletion of bases at junctions N region additions at junctions P region additions at junctions 3. Activates transcription of the rearranged gene Juxtaposition of intronic enhancers with V region promoters. 4.