Michal Černý, Jaromír Antoch, and Milan Hladík. On the possibilistic approach to linear regression models involving uncertain, indeterminate or interval data. Inf. Sci., 244:26–47, 2013.
[PDF] [gzipped postscript] [postscript] [HTML]
We consider linear regression models where both input data (the observations of independent variables) and output data (the observations of the dependent variable) are affected by loss of information caused by uncertainty, indeterminacy, rounding or censoring. Instead of crisp data, only intervals are available. We study a possibilistic generalization of the least squares estimator, so called OLS-set for the interval model. Investigation of the OLS-set allows us to quantify whether the replacement of crisp values by interval values can have a significant impact on our knowledge of the value of the OLS estimator. We show that in the general case, very elementary questions about properties of the OLS-set are computationally intractable (assuming $P \neq NP$). We also focus on restricted versions of the general interval linear regression model to the crisp input case. Taking the advantage of the fact that in the crisp input -- interval output model the OLS-set is a zonotope, we design both exact and approximate methods for its description. We also discuss special cases of the regression model, e.g.~a model with repeated observations.
@article{CerAnt2013a, author = "Michal {\v{C}}ern\'{y} and Jarom\'{\i}r Antoch and Milan Hlad\'{\i}k", title = "On the possibilistic approach to linear regression models involving uncertain, indeterminate or interval data", journal = "Inf. Sci.", fjournal = "Information Sciences", volume = "244", pages = "26-47", year = "2013", doi = "10.1016/j.ins.2013.04.035", issn = "0020-0255", url = "https://doi.org/10.1016/j.ins.2013.04.035", bib2html_dl_html = "https://dx.doi.org/10.1016/j.ins.2013.04.035", abstract = "We consider linear regression models where both input data (the observations of independent variables) and output data (the observations of the dependent variable) are affected by loss of information caused by uncertainty, indeterminacy, rounding or censoring. Instead of crisp data, only intervals are available. We study a possibilistic generalization of the least squares estimator, so called OLS-set for the interval model. Investigation of the OLS-set allows us to quantify whether the replacement of crisp values by interval values can have a significant impact on our knowledge of the value of the OLS estimator. We show that in the general case, very elementary questions about properties of the OLS-set are computationally intractable (assuming $P \neq NP$). We also focus on restricted versions of the general interval linear regression model to the crisp input case. Taking the advantage of the fact that in the crisp input -- interval output model the OLS-set is a zonotope, we design both exact and approximate methods for its description. We also discuss special cases of the regression model, e.g.~a model with repeated observations.", keywords = "Interval data, Uncertain data, Possibilistic regression, Computational complexity", }
Generated by bib2html.pl (written by Patrick Riley ) on Wed Oct 23, 2024 08:16:44