Togaware DATA MINING
Desktop Survival Guide
by Graham Williams
Google

Summary

Complexity: Generally C4.5 is quite efficient as the number of training instances increases, and for specific datasets has been found empirically to be between and . With rule generation the algorithm is somewhat more expensive at .
Availability: The Borgelt collection (See Chapter 25) contains dtree, a generic implementation of the decision tree divide and conqueror algorithm. A Gnome-based graphical front-end is available as part of the Gnome Data Mining Toolkit (See Chapter [*]). Weka (See Chapter 28) also provides a freely available implementation of a decision tree induction algorithm (J48) within its Java-based framework. Decision tree induction is a fundamental data mining tool and implementations of C4.5 or its variations are available in most commercial data mining toolkits, including Clementine (See Chapter 30) and STATISTICA (See Chapter 36).



Copyright © 2004-2005
Brought to you by Togaware.