JORDAN MILEV

Home Address:
   268 Humphrey Street, #3
   New Haven, CT 06511

Phone: (203) 809-2832
Office Address:
   Department of Economics
   Yale University
   P.O. Box 208268
   New Haven, CT 06520-8268
   Fax: (203) 432-5779

Citizenship: Bulgarian
Fields of Concentration

Econometrics
Financial Economics

Comprehensive Examinations Completed:

Econometrics and Financial Economics (oral) 2000
Microeconomics and Macroeconomics (written) 1999

Dissertation Title:

Genetic Programming Techniques for Identifying Structural Models Applied to the Earnings-Returns Relation

Expected Completion Date:

May 2004

Degrees:

Ph.D. (in progress), Economics, Yale University
B.A., Economics and Mathematics, Amherst College, 1998

Fellowships, Honors and Awards:

Economics Department Fellowship, Yale University, summer 2002
Yale University Dissertation Fellowship, fall 2002
Yale University Graduate Student Fellowship, 1998-2002
Dwight Hitchcock Memorial Fellowship, Amherst College, 1998/99, 2000/01, 2003/04
Bernstein Prize for Best Senior Thesis in Economics, Amherst College, 1998
Phi Beta Kappa, Amherst College, 1998
Nelson Summer Research Fellowship in Economics, Amherst College, summer 1997

Teaching Experience:

Teaching Assistant: Microeconomics (2000, 2001), Econometrics (2002)
Tutored a blind undergraduate student in Economics and Mathematics courses (1998-2002)

Professional Experience:

Summer Associate, National Economics Research Associates, New York, summers 2001, 2002

Papers:

Search for an Empirical Specification of the Returns-Earnings Relation, 2003 [job market paper]

Short-Term Employment Dynamics, Adjustment Costs, and Firm Exit, 2001

Institutional Parameters and Workforce Dynamics: A Simulation and Empirical Approach, 2000 (with Walter Nicholson, Amherst College)

Capital Creation and Destruction in an Embodied Growth Model with Heterogeneous Firm Size, 1999

Conference Presentations:

"Nonlinear Model Estimation using Genetic Programming" 11th Symposium of the Society for Nonlinear Dynamics and Econometrics, Florence, Italy, March 2003

References:

Professor Peter C.B. Phillips
Yale University
Department of Economics
P.O. Box 208281
New Haven, CT 06520-8281
Tel: (203) 432-3695
Fax: (203) 432-6167
Email: peter.phillips@yale.edu

Professor Ray C. Fair
Department of Economics
Yale University
Box 208281
New Haven, CT 06520-8281
Phone: (203) 432-3715
Fax: (203) 432-6167
Email: ray.fair@yale.edu

Professor Penny Goldberg
Yale University
Department of Economics
P.O. Box 208264
New Haven, CT 06520-8264
Tel: (203) 432-3569
Fax: (203) 432-6323
Email: penny.goldberg@yale.edu
Dissertation Abstract:

The task of model identification is present in most applied econometric exercises. In the simplest case, researchers are faced with empirical data for two economic variables and would like to formulate a structural model characterizing their relationship. When the true data generating process is nonlinear, the inadequacy of linear regression models is apparent as they suffer from the usual symptoms of low explanatory power and biased coefficients. Yet economic theory provides few directions for using a particular functional form when linear regression fails. The selection of appropriate functional form rests with the researcher and has profound implications for the consistency and significance of estimated model parameters and the predictions obtained from the model.

I advocate the use of a flexible parametric estimation approach based on genetic programming (GP). Genetic programming is a stochastically driven optimization algorithm. In contrast to traditional hill climbing algorithms that proceed from one point in the search space to another, GP operates simultaneously on a set of points in the parameter space. Each point in this set is equivalent to a function-a possible data generating process for the data. The algorithm calculates the degree of fit of each function to the data and uses it to rank this function with respect to other functions already explored by it. It then constructs a new set of points in the search space using the expressions contained in the current set of functions. Theory shows that the GP algorithm is intrinsically parallel, i.e., by calculating the fit of every function it implicitly accumulates information about the fit of every element comprising this function. The algorithm explores the search space in an optimal way. More precisely, as the GP algorithm constructs new sets of functions, the elements with high degree of fit occur at an exponentially increasing rate in the newly generated functions while the occurrence of elements with low fit to the data decays exponentially. This allows the algorithm to expend optimal computational effort by spending its calculation time increasingly on functions containing valuable elements.

In the first part of the dissertation I develop and code a genetic programming algorithm specially suited to analyzing time series data. It is specifically designed to incorporate lagged variables and errors in variables explicitly. This is an advance over GP work since errors in variables have been treated as an obstacle in past research, causing a reduction in its performance. This is the first attempt to model errors explicitly within the GP framework.

In the second part of the dissertation the performance of the GP algorithm is evaluated over several test cases and its out-of-sample forecasting ability is evaluated with respect to other nonlinear modeling frameworks. I show that the increased flexibility of GP over traditional parametric methods comes at a relatively small cost. The rate of convergence of the GP estimator is better than that of several nonparametric estimators. A simulation exercise is conducted in order to derive the distributions of model parameters for several particular classes of data generating processes under the null hypothesis that genetic programming uncovers the true functional form up to nuisance parameters.

The third part of the dissertation uses the algorithm to model how stock prices react to unanticipated accounting earnings. I suggest several nonlinear parametric specifications of the reaction of excess current-period stock price returns to the unanticipated component of quarterly earnings. My motivation is the well-known misspecification problem in studies that ignore nonlinearity, as it presents a source of systematic error in earnings response regressions (see Freeman and Tse 1992 and Beneish and Harvey 1998, for a discussion of nonlinearity in this empirical context). My major findings are that: 1) using the described model identification procedure I am able to confirm the existence of a nonlinear model that has superior in-sample and out-of-sample predictive power over the linear model; 2) even if transitory items from earnings are excluded, the unexpected component still has a nonlinear effect on price, but the dominance over the linear model diminishes out-of-sample; 3) the approach presents an advantage over alternative nonparametric techniques as it can be used to derive a parametric model suitable for further empirical work on the earnings-returns relation.