COMPARISON


 

Figure 9 Comparison of Machine Learning Methods

Machine Learning
Method
Resistance
against
data outliers
Explains
output
Suitable for
small datasets
Reasoning
Capability
Adaptive
Neural Nets No No No No No
Fuzzy Logic Partially Yes Yes Yes No
Case-Based Reasoning Yes Yes Partially Yes Yes
Analogy Yes Yes Partially Yes Yes
Rule-Based Yes Yes Yes Partially No
Regression Trees Yes Yes Partially Partially No
Hybrid System Partially No Partially Partially No

 

Figure 9 is a comparison of the machine learning methods described in this web site. The headings in the table show advantageous modelling attributes.

Resistance against data outliers - A method's robustness at eliminating data outliers in dataset which are highly atypical projects.

Software datasets (metric data) is frequently contaminated with data outliers from a number of causes, including measurement errors and unusual projects. The existence of such data points can lead to inaccurate predictions. Methods that can prevent data outliers from exerting influences in the prediction.

Explains Output - The capability for a user to see how a model arrives at its predicted output.
Outputs usually fall into three categories:

White Box - The process is very transparent and well understood by the user.

Grey Box - The process is partially visible and reasonably understood by the user.

Black Box - The process is hidden from the user.

Suitable for small datasets - Small datasets are very problematic when predicting estimation. This heading assess whether the method is viable when using small datasets.

Reasoning Capability - This assess whether a model has viable reasoning capabilities. For example, in the analogy method you can look back at past projects and question whether a project is really a true analogous project.

Adaptive - This addresses whether the method can accept new data without regenerating the whole estimation process.

 


This page was last updated 12/3/97 Dan. Snell, Bournemouth University, Copyright. 1997