Efficient Algorithms for Finding Multi-way Splits for Decision Trees

Truxton Fulton, Simon Kasif, Steven Salzberg

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A number of recent papers have pointed out that binary decision trees are not always the best model for some domains. In particular, for some distributions the best way to partition a set of examples might be to find a set of intervals for a given feature, and split the examples up into several groups based on those intervals. Binary decision tree induction methods pick a single split point, i.e., they consider only bi-partitions at a node in the tree. We have developed an efficient new algorithm that computes an optimal multi-split of an interval into k sub-intervals, for any fixed k less than the number of examples. The algorithm employs a penalty function for increasing values of k to prevent it from splitting the examples into trivial partitions. Our implementation demonstrates both the efficiency of this method and the kinds of distributions for which it can produce better decision trees.

Original languageEnglish (US)
Title of host publicationProceedings of the 12th International Conference on Machine Learning, ICML 1995
EditorsArmand Prieditis, Stuart Russell
PublisherMorgan Kaufmann Publishers, Inc.
Pages244-251
Number of pages8
ISBN (Electronic)1558603778, 9781558603776
StatePublished - 1995
Externally publishedYes
Event12th International Conference on Machine Learning, ICML 1995 - Tahoe City, United States
Duration: Jul 9 1995Jul 12 1995

Publication series

NameProceedings of the 12th International Conference on Machine Learning, ICML 1995

Conference

Conference12th International Conference on Machine Learning, ICML 1995
Country/TerritoryUnited States
CityTahoe City
Period7/9/957/12/95

ASJC Scopus subject areas

  • Artificial Intelligence
  • Human-Computer Interaction
  • Software
  • Theoretical Computer Science

Fingerprint

Dive into the research topics of 'Efficient Algorithms for Finding Multi-way Splits for Decision Trees'. Together they form a unique fingerprint.

Cite this