The Commons

Patent Title: Method and apparatus for finding the best splits in a decision tree for a language model for a speech recognizer

Assignee: IBM
Patent Number: US5263117
Issue Date: 11-16-1993
Application Number:
File Date: 10-26-1989


Abstract: A method and apparatus for finding the best or near-best binary classification of a set of observed events according to a predictor feature X, so as to minimize the uncertainty in the value of a category feature Y. Each feature has three or more possible values. First, the predictor feature value and the category feature value of each event are measured. The events are then split, arbitrarily, into two sets of predictor feature values. From the two sets of predictor feature values, an optimum pair of sets of category feature values is found having the lowest uncertainty in the value of the predictor feature. From the two optimum sets of category feature values, an optimum pair of sets of predictor feature values is found having the lowest uncertainty in the value of the category feature. An event is then classified according to whether its predictor feature value is a member of a set of optimal predictor feature values.
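
The alternating procedure in the abstract can be illustrated with a short Python sketch. This is not the patent's claimed implementation. It assumes that "uncertainty" is measured as conditional entropy, that each half-step sorts values by their conditional probability and searches every threshold, and that the two half-steps repeat until the predictor split stops changing. The names `binary_entropy`, `best_split`, `find_split`, and `max_iter` are hypothetical.

```python
import math
from collections import Counter

def binary_entropy(p):
    """Entropy in bits of a two-outcome variable with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))

def best_split(joint, fixed_set):
    """Split the values of feature `a` into two sets, given a fixed set of `b` values.

    joint maps (a, b) pairs to counts. The returned set of `a` values minimizes
    the conditional entropy of "b is in fixed_set" given the side of the split
    (assumed uncertainty measure). Needs at least two distinct `a` values.
    """
    inside, total = Counter(), Counter()
    for (a, b), n in joint.items():
        total[a] += n
        if b in fixed_set:
            inside[a] += n
    # Sort by P(b in fixed_set | a), then try every threshold in that order.
    order = sorted(total, key=lambda a: inside[a] / total[a])
    grand = sum(total.values())
    best, best_h = None, float("inf")
    for k in range(1, len(order)):
        h = 0.0
        for side in (order[:k], order[k:]):
            n_side = sum(total[a] for a in side)
            n_in = sum(inside[a] for a in side)
            h += n_side / grand * binary_entropy(n_in / n_side)
        if h < best_h:
            best, best_h = set(order[:k]), h
    return best

def find_split(events, initial_x_set, max_iter=20):
    """Alternate between splitting Y given the X split and X given the Y split."""
    xy = Counter(events)                       # counts keyed by (x, y)
    yx = Counter((y, x) for x, y in events)    # counts keyed by (y, x)
    x_set = set(initial_x_set)                 # arbitrary starting split of X
    y_set = set()
    for _ in range(max_iter):
        y_set = best_split(yx, x_set)          # best Y sets for the current X sets
        new_x_set = best_split(xy, y_set)      # best X sets for those Y sets
        if new_x_set == x_set:                 # assumed stopping rule
            break
        x_set = new_x_set
    return x_set, y_set

# Made-up events as (predictor value, category value) pairs.
events = ([("a", "p")] * 5 + [("b", "p")] * 4 + [("b", "q")] * 1
          + [("c", "q")] * 5 + [("d", "r")] * 5 + [("d", "p")] * 1)
x_set, y_set = find_split(events, initial_x_set={"a", "c"})
```

A new event would then be classified by testing whether its predictor value is in `x_set`. The result depends on the arbitrary starting split, which is consistent with the abstract's "best or near best" wording.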

Notes:

IBM Pledge dated 1/11/2005