An approach to improving parametric estimation models in the case of violation of assumptions based upon risk analysis

IRIS

In this work, we show the mathematical reasons why parametric models fall short of providing correct estimates and define an approach that overcomes the causes of these shortfalls. The approach aims at improving parametric estimation models when any regression model assumption is violated for the data being analyzed. Violations can be that, the errors are x-correlated, the model is not linear, the sample is heteroscedastic, or the error probability distribution is not Gaussian. If data violates the regression assumptions and we do not deal with the consequences of these violations, we cannot improve the model and estimates will be incorrect forever. The novelty of this work is that we define and use a feed-forward multi-layer neural network for discrimination problems to calculate prediction intervals (i.e. evaluate uncertainty), make estimates, and detect improvement needs. The primary difference from traditional methodologies is that the proposed approach can deal with scope error, model error, and assumption error at the same time. The approach can be applied for prediction, inference, and model improvement over any situation and context without making specific assumptions. An important benefit of the approach is that, it can be completely automated as a stand-alone estimation methodology or used for supporting experts and organizations together with other estimation techniques (e.g., human judgment, parametric models). Unlike other methodologies, the proposed approach focuses on the model improvement by integrating the estimation activity into a wider process that we call the Estimation Improvement Process as an instantiation of the Quality Improvement Paradigm. This approach aids mature organizations in learning from their experience and improving their processes over time with respect to managing their estimation activities. To provide an exposition of the approach, we use an old NASA COCOMO data set to (1) build an evolvable neural network model and (2) show how a parametric model, e.g., a regression model, can be improved and evolved with the new project data.

Sarcia', S.a. (2009). An approach to improving parametric estimation models in the case of violation of assumptions based upon risk analysis [10.58015/sarcia-salvatore-alessandro_phd2009-08-27].

An approach to improving parametric estimation models in the case of violation of assumptions based upon risk analysis

SARCIA', SALVATORE ALESSANDRO

2009-08-27

Abstract

In this work, we show the mathematical reasons why parametric models fall short of providing correct estimates and define an approach that overcomes the causes of these shortfalls. The approach aims at improving parametric estimation models when any regression model assumption is violated for the data being analyzed. Violations can be that, the errors are x-correlated, the model is not linear, the sample is heteroscedastic, or the error probability distribution is not Gaussian. If data violates the regression assumptions and we do not deal with the consequences of these violations, we cannot improve the model and estimates will be incorrect forever. The novelty of this work is that we define and use a feed-forward multi-layer neural network for discrimination problems to calculate prediction intervals (i.e. evaluate uncertainty), make estimates, and detect improvement needs. The primary difference from traditional methodologies is that the proposed approach can deal with scope error, model error, and assumption error at the same time. The approach can be applied for prediction, inference, and model improvement over any situation and context without making specific assumptions. An important benefit of the approach is that, it can be completely automated as a stand-alone estimation methodology or used for supporting experts and organizations together with other estimation techniques (e.g., human judgment, parametric models). Unlike other methodologies, the proposed approach focuses on the model improvement by integrating the estimation activity into a wider process that we call the Estimation Improvement Process as an instantiation of the Quality Improvement Paradigm. This approach aids mature organizations in learning from their experience and improving their processes over time with respect to managing their estimation activities. To provide an exposition of the approach, we use an old NASA COCOMO data set to (1) build an evolvable neural network model and (2) show how a parametric model, e.g., a regression model, can be improved and evolved with the new project data.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di discussione
	
				27-ago-2009
			
	Anno accademico
	
				2008/2009
			
	Corso di dottorato
	
				Informatica e ingegneria dell'automazione
			
	Ciclo del dottorato
	
				21.
			
	Parole chiave
	
				multi-layer feed-forward neural networks; integrated software engineering environment; non-linear regression; curvilinear component analysis; Bayesian learning; prediction intervals for neural networks; risk analysis and management; learning organizations; software cost prediction; TAME system; Bayesian discrimination function; estimation improvement paradigm; quality improvement paradigm
			
	Settore disciplinare della tesi (valido fino a 24/06/2024)
	
				Settore ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI
			
	Settore disciplinare della tesi (valido dal 09/05/2024)
	
				Settore IINF-05/A - Sistemi di elaborazione delle informazioni
			
	Lingua del contenuto
	
				English
			
	Sponsor
	
				University of Maryland, Fraunhofer Center for Experimental Software Engineering
			
	Tipologia
	
				Tesi di dottorato
			
	Citazione
	
				Sarcia', S.a. (2009). An approach to improving parametric estimation models in the case of violation of assumptions based upon risk analysis [10.58015/sarcia-salvatore-alessandro_phd2009-08-27].
			
	Appare nelle tipologie:
	
				07 - Tesi di dottorato

File in questo prodotto:

File	Dimensione	Formato
SarciaPHDThesis.pdf accesso aperto Licenza: Copyright degli autori Dimensione 1.48 MB Formato Adobe PDF Visualizza/Apri	1.48 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/1048

Citazioni

ND

ND

ND

social impact