NASA MDP 데이터 집합의 결함도 모호성 분석
Ambiguity Analysis of Defectiveness in NASA MDP Data Sets
- 한국IT서비스학회
- 한국IT서비스학회지
- 한국IT서비스학회지 제12권 제2호
-
2013.06361 - 371 (10 pages)
- 21
Public domain defect data sets, such as NASA data sets which are available from the NASA MDP and PROMISE repositories, make it possible to compare the results of different defect prediction models by using the same data sets. This means that repeatable and general prediction models can be built. However, some recent studies have raised questions about the quality of two versions of NASA data set. and made new cleaned data sets by applying their data cleaning processes. We find that there are two ways in the NASA MDP versions to determine the defectiveness of a module, 0 or 1, and the two results are different in some cases. This serious problem. to our knowledge, has not been addressed in previous studies. To handle this ambiguity problem, we define two kinds of module defectiveness and two conditions that can be used to determine the ambiguous cases. We meticulously analyze 5 projects among the 13 NASA projects by using our ambiguity analysis method. The results show that JMl and PC4 are the best projects with few ambiguous cases.
Abstract
1. 서론
2. 관련 연구
3. 모호성 분석 방법
4. 모호성 분석 실험
5. 결론
참고문헌
저자소개
(0)
(0)