Empirical Modeling:Optimized Set Reduction |
|
Approach |
Our modeling approach, Optimized Set Reduction (OSR), is based on both statistical and machine learning principles.Given a historical data set, OSR automatically generates (through a search algorithm) a collection of logical expressions referred as patterns which characterize the trends observable in the data set. Patterns provide interpretable models where the impact of each predicate can be easily evaluated. For each pattern generated by OSR, a reliability of prediction and a statistical significance are calculated based on the learning set. |
Validation Strategy |
We demonstrate the effectiveness of the approach by applying OSR to
different problems, e.g., cost prediction, fault-prone components identification,
change management. Different modeling approaches have been compared and evaluated based on data from the Software Engineering Laboratory (SEL). |
Project Status |
Active - Contact Lionel Briand or Khaled El-Emam for current activities. |
Results |
ModelingAlgorithms to generate patterns and merge similar patterns according
to a user defined degree of similarity have been designed. ValidationFirst, we demonstrated the effectiveness of the approach by applying
OSR to the problem of cost estimation. The OSR predictions were compared
to predictions from two other effort estimation techniques to provide a
basis for evaluation. The data set for the study came from the COCOMO database
and the fifteen projects used by kemere for cost modeling evaluation. |