dc.contributor.author | Dzemski, Andreas | |
dc.contributor.author | Okui, Ryo | |
dc.date.accessioned | 2018-03-06T15:16:38Z | |
dc.date.available | 2018-03-06T15:16:38Z | |
dc.date.issued | 2018-03 | |
dc.identifier.issn | 1403-2465 | |
dc.identifier.uri | http://hdl.handle.net/2077/55922 | |
dc.description | JEL: C23, C33, C38 | sv |
dc.description.abstract | We develop new procedures to quantify the statistical uncertainty from sorting units in panel data into groups using data-driven clustering algorithms. In our setting, each unit belongs to one of a finite number of latent groups and its regression curve is determined by which group it belongs to. Our main contribution is a new joint confidence set for group membership. Each element of the joint confidence set is a vector of possible group assignments for all units. The vector of true group memberships is contained in the confidence set with a pre-specified probability. The confidence set inverts a test for group membership. This test exploits a characterization of the true group memberships by a system of moment inequalities. Our procedure solves a high-dimensional one-sided testing problem and tests group membership simultaneously for all units. We also propose a procedure for identifying units for which group membership is obviously determined. These units can be ignored when computing critical values. We justify the joint confidence set under N, T → ∞ asymptotics where we allow T to be much smaller than N. Our arguments rely on the theory of self-normalized sums and high-dimensional central limit theorems. We contribute new theoretical results for testing problems with a large number of moment inequalities, including an anti-concentration inequality for the quasi-likelihood ratio (QLR) statistic. Monte Carlo results indicate that our confidence set has adequate coverage and is informative. We illustrate the practical relevance of our confidence set in two applications. | sv |
dc.format.extent | 85 | sv |
dc.language.iso | eng | sv |
dc.relation.ispartofseries | Working Papers in Economics | sv |
dc.relation.ispartofseries | 727 | sv |
dc.subject | Panel data | sv |
dc.subject | grouped heterogeneity | sv |
dc.subject | clustering | sv |
dc.subject | confidence set | sv |
dc.subject | machine learning | sv |
dc.subject | moment inequalities | sv |
dc.subject | joint one-sided tests | sv |
dc.subject | self-normalized sums | sv |
dc.subject | high-dimensional CLT | sv |
dc.subject | anti-concentration for QLR | sv |
dc.title | Confidence Set for Group Membership | sv |
dc.type | Text | sv |
dc.type.svep | report | sv |
dc.contributor.organization | Dept. of Economics, University of Gothenburg | sv |