Multivariate exploratory data analysis for large databases: An application to modelling firms’ innovation using CIS data

dc.contributor.authorBou, Juan Carlos
dc.contributor.authorSatorra, Albert
dc.date.accessioned2020-05-25T08:45:55Z
dc.date.available2020-05-25T08:45:55Z
dc.date.issued2019
dc.description.abstractThis paper argues that, when using a large database, organizational researchers would benefit from the use of specific multivariate exploratory data analysis (MEDA) before performing statistical modelling. Issues such as the representativeness of the database across domains (countries or sectors), assessment of confounding among categorical covariates, missing data, dimension reduction to produce performance indicators and/or remedy multicollinearity problems are addressed by specific MEDA. The proposed MEDA is applied to data from the Community Innovation Survey (CIS), a large database commonly used to analyse firms’ innovation activities, prior to fitting ordered logit and Tobit regression models. A set of recommended practices involving MEDA are proposed throughout the paper.en
dc.description.sponsorshipThis work was supported by the Spanish MEC Grants [Grant Number ECO2015-66671-P (MINECO/FEDER), and ECO2014-59885-P] and Generalitat Valenciana [Grant Number BEST/2018/209].
dc.format.mimetypeapplication/pdf
dc.identifier.citationBou JC, Satorra A. Multivariate exploratory data analysis for large databases: An application to modelling firms’ innovation using CIS data. Business Research Quarterly. 2019 Oct;22(4):275-93. DOI: 10.1016/j.brq.2018.10.001
dc.identifier.doihttp://dx.doi.org/10.1016/j.brq.2018.10.001
dc.identifier.issn2340-9436
dc.identifier.urihttp://hdl.handle.net/10230/44671
dc.language.isoeng
dc.publisherElsevier
dc.relation.ispartofBusiness Research Quarterly. 2019 Oct;22(4):275-93
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/1PE/ECO2015-66671-P
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/1PE/ECO2014-59885-P
dc.rightsThis is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.keywordCommunity Innovation Survey (CIS)en
dc.subject.keywordMEDAen
dc.subject.keywordInnovationen
dc.subject.keywordMissing dataen
dc.subject.keywordMAR and MCARen
dc.subject.keywordDimension reductionen
dc.subject.keywordMultivariate analysisen
dc.subject.keywordOLSen
dc.subject.keywordOrdered logistic and Tobit regressionen
dc.titleMultivariate exploratory data analysis for large databases: An application to modelling firms’ innovation using CIS dataen
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
bou_brq_multi.pdf
Size:
2.48 MB
Format:
Adobe Portable Document Format

License

Rights