Factors Determining Callbacks to Job Applications by the Unemployed: An Audit Study

Henry S. Farber; Dan Silverman; Till M. Von Wachter

RSF: The Russell Sage Foundation Journal of the Social Sciences

Factors Determining Callbacks to Job Applications by the Unemployed:An Audit Study
Henry S. Farber (bio), Dan Silverman (bio), and Till M. Von Wachter (bio)

We use an audit study approach to investigate how unemployment duration, age, and holding a low- level interim job while applying for a better job affect the likelihood that experienced college-educated females applying for an administrative support job receive a callback from potential employers. First, the results show no relationship between callback rates and unemployment duration. Second, workers age fifty and older are significantly less likely to receive a callback. Third, taking an interim job significantly reduces the likelihood of receiving a callback. Finally, employers who have higher callback rates respond less to observable differences across workers in determining whom to call back.

Keywords

unemployment duration, job finding, interim jobs, age discrimination, audit study

In this project we use an audit study approach (Bertrand and Mullainathan 2004), where we send carefully constructed fictitious job applications to posted job openings, in order to investigate how several characteristics of workers affect the likelihood that they receive a callback after applying for a job. We focus on the recent employment history and age of applicants, paying special attention to the effects of unemployment duration and of taking a low-level interim job. The study is motivated in part by the persistently long duration of unemployment spells experienced by workers in the Great Recession and its aftermath. This pattern is illustrated in figure 1, which plots the mean and median duration of unemployment spells in progress by quarter from 1976 through 2014. Mean unemployment duration peaked in 2011 at almost thirty-seven weeks and has exceeded thirty weeks in all quarters between 2010 Q1 and 2014 Q2. Both mean and median duration remain well above their levels at any point prior to 2008.

This shift toward longer unemployment spells underscores the importance of understanding whether workers who have been unemployed [End Page 168] for a long period face more difficulty in finding a job. The labor force transition data suggest that this is the case. Figure 2 contains a plot of the monthly job finding rate (the probability of an U–E, unemployment to employment, transition) by unemployment duration in months based on matched Current Population Survey (CPS) data from 2008 to 2014. This figure shows a sharp decline in the monthly job finding rate from about 25 percent early in unemployment spells to about 10 percent after one year. In order to study the effect of unemployment duration on the likelihood of call-back, we randomly varied the duration of the current unemployment spell across applications in our audit study.

Click for larger view
View full resolution

Figure 1.

Mean and Median Duration of Unemployment Spells in Progress, by Quarter

Source: Authors’ calculations based on data from the Matched Monthly Files of Current Population Survey.

Click for larger view
View full resolution

Figure 2.

Monthly Job Finding Rate, by Duration of Unemployment, 2008–2014

Source: Authors’ calculations based on data from the Matched Monthly Files of Current Population Survey.

The study is also motivated by an interest in the obstacles that older unemployed workers face in job seeking. Figure 3 highlights the fact that the average duration of unemployment [End Page 169] spells in progress have historically been substantially longer for older workers. For example, from 2014Q1 to 2015Q2, the average duration of an in-progress unemployment spell was twenty-eight weeks for those age twenty-five to thirty-four, thirty-one weeks for those age thirty-five to forty-four, and thirty-six weeks for those age forty-five to sixty-four.

Click for larger view
View full resolution

Figure 3.

Average Number of Weeks Spent in Unemployment, by Age

Source: Authors’ calculations based on data from the Current Population Survey.

Click for larger view
View full resolution

Figure 4.

Unemployment Rate of Displaced Workers, by Age

Source: Authors’ calculations based on data from the Displaced Workers Supplement of the Current Population Survey.

The difficulty that older workers have in finding jobs is further illustrated using data from the Displaced Workers Survey (DWS) from 1984 to 2014. Figure 4 illustrates that older job losers have historically had higher post-displacement unemployment rates (measured at the DWS survey date). Since the Great Recession period (job loss from 2007 to 2013), job losers twenty-five to forty-four years old had a26.3 percent unemployment rate, whereas the unemployment rate was 29.9 percent for job losers forty-five to fifty-four years old and 35.1 percent for job losers fifty-five to sixty-four years of age. The difficulties faced by older unemployed individuals lead some to spend long stretches of time out of work, and some never return to employment (Song and von Wachter 2014). Given these patterns, it is important to [End Page 170] understand the role of age in hiring and its interaction with work history such as unemployment duration and interim jobs.

Our interest in age affected our study design in two ways. In contrast to several recent audit studies of the effect of employment history on callback rates, our sample consists of mature and older workers, for whom job loss and long-term unemployment may be particularly costly. In addition, to examine the question of how age itself affects the likelihood of callback, we randomly varied applicant’s age on a subset of applications, and measured differences in call-back rates.

Finally, we were interested in whether ending a recent spell of unemployment with a short-term, lower-level “interim” job, such as in retail sales, is an effective strategy for improving callback rates. It is well documented that in the aftermath of a job loss the degree of mismatch and nonstandard work histories increases, in particular during recessions (Farber 1999; Elsby, Hobijn, and Şahin 2010). How interim jobs can affect callback rates has direct practical relevance for unemployed workers seeking to obtain a good job while making ends meet. Additionally, it is important to understand the extent to which a rise in the incidence in interim employment during recessions affects callback and job finding and, hence, unemployment duration. Yet relatively little is known about the consequences of taking a low-level interim job. Simple theories suggest it could have countervailing effects on callbacks. It might be that holding a low-level interim job signals that the applicant is ambitious and hardworking, increasing the likelihood of call-back. Alternatively, it might be that holding a low-level interim job suggests to the employer than the applicant is not suitable for the job for which the application was submitted. This could be a conscious choice of employers or a mechanical reading of the résumé that rules out applicants whose most recent job was not related to the job for which the application was submitted. To investigate the role of a low-level interim job on the likelihood of a callback, we included such an interim job on a random subset of some applications, and then measured differences in callback rates.

In order to focus efficiently on the three variables of interest, we limit the range of variation in other dimensions as is common in studies of this type. Specifically, we limit our applications to administrative support jobs and we restrict the characteristics of applicants: all are female and have a four-year college education. Although this does limit any claims we might make regarding the workforce as a whole, the facts that motivated our analysis regarding the incidence of long-term unemployment and the relationship of age with long-term employment do hold for this subgroup of the labor force. Figure 5 shows average duration of unemployment spells in progress from the CPS since 2003 for college-educated females in administrative support occupations. While the samples are considerably smaller than for those for the entire unemployed sample from the CPS (figure 3), one can clearly see a sharp increase in the average duration of unemployment for these women since the Great Recession, and the average duration of unemployment is significantly longer for older women.¹ Thus, the facts we presented in the introduction to motivate our analysis are important for the particular jobs we study.

Our findings are clear with regard to the three variables of interest. First, we find no relationship between unemployment duration and the callback rate. This is different from the results obtained by Kory Kroft, Fabian Lange, and Matthew J. Notowidigdo (2013)—henceforth this study is referred to as KLN—and Rand Ghayad (2014). Those papers find a negative relationship between callback rates and duration of unemployment that is concentrated in the first six or seven months of an unemployment spell.

For longer spells, those papers estimate that the relationship between unemployment duration and the callback rate is flat. Our findings are closest to those by John M. Nunley and his colleagues (forthcoming), who find no effect [End Page 171] of unemployment duration, either past or present, on callbacks for relatively recent college graduates in the United States.² Stefan Eriksson and Dan-Olof Rooth (2), whose study of the Swedish market also found no effect of unemployment duration on callback rates for jobs that require a university degree, additionally found no effects before six months for lower-skill jobs. As we discuss in detail later in this article, there are many potential reasons for the differences across studies in results with regard to unemployment duration and call-backs. We can explore some of them with existing data, but more data collection is necessary to understand fully what drives the differences.

Click for larger view
View full resolution

Figure 5.

Average Weeks Unemployed, by Age, of College-Educated Females in Administrative Support Occupations

Source: Authors’ calculations based on data from the Current Population Survey.

Second, we find that older workers, those in their fifties, are significantly less likely to receive a callback than workers in their thirties and forties. This is consistent with the results in Joanna Lahey (2008), who finds large negative effects of age on callbacks for women seeking entry-level positions in the United States.

Third, we find that taking a low-level interim job significantly reduces the likelihood of receiving a callback. This last result is similar to that in Nunley et al. (forthcoming). That paper found that relatively recent college graduates in the United States had substantially fewer callbacks if they were currently employed in jobs that did not require a college education and were not suited to the job for which they were applying.

Our results have some important implications. First, our findings help to underscore that the effect of unemployment duration on callback rates found for younger workers in KLN do not hold universally in the labor market. For the more seasoned female clerical workers we focus on, long-term unemployment has no causal effect on callback rates. Together with the other mixed findings in the literature, our finding calls into question whether the well-known decline in the probability of job finding with unemployment duration is primarily driven by a causal effect of unemployment duration due to employer behavior rather than arising from some other source, such as negative selection or changes in workers’ search behavior. Future work should seek to understand better the heterogeneity in treatment effects between studies and demographic groups.

Second, our results strengthen Lahey’s (2008) finding and underscore that age discrimination may be a relevant phenomenon in the U.S. labor market. Since we focus on workers with longer labor force histories, our findings suggest that even substantial relevant labor market experience on the résumés we use does not diminish the negative effect of age on call-backs. [End Page 172]

Third, at a practical level, the fact that interim jobs negatively affect the incidence of callback implies that unemployed workers may be better advised remaining unemployed rather than compromising on job quality—or at least they should not list an interim job on their résumés. Finally, our findings on interim jobs imply that employers do use information on the résumés to make inferences even about mature and older workers. Standard employer learning theory would suggest that the availability of many signals for these workers reduces the effect of any given signal (Farber and Gibbons 1996). This could rationalize our zero result on the effect of unemployment duration, but not the significant effects of interim jobs that we find. It is an open question whether this latter finding implies presence of employer learning in the sense of the theory even for older workers, or whether it is due to mechanical screening of CVs by human resource departments that may, for example, eliminate “bad matches” on the basis of the last entry on the CV.

An additional finding is that, for job listings to which we sent four applications, the negative effect of age and interim job on the incidence of callback is substantially weaker (the effect of unemployment duration remains zero) for those employers with high callback rates (for example, three of four applications received a callback as opposed to one of four applications received a callback). This finding can be interpreted as an indication that employers with a high demand for workers become less selective in deciding whether or not to call back. This is consistent with the idea that particular signals on the résumé may matter less for the incidence of callback in a tighter labor market than in a weaker labor market.

In the remainder of this paper we describe and motivate many details of the experimental design; develop a model of employer learning to guide interpretation of results; present the results of simple, univariate analyses of the experimental treatments on duration of unemployment, age, and interim job; present a multivariate analysis to gain additional precision of the estimates; offer some analysis of the disparate findings in the literature; and present our conclusions.

research design

The design of our audit study reflects several considerations and constraints that have implications for interpreting the results. Since, as with any experiment in the social sciences, our design choices affect the internal and external validity of our results, we describe the design and setting of our study in detail.

An audit study consists in sending fake résumés to actual job postings and measuring the incidence of callback rates. The main estimates consists in differences in callback rates based on randomly assigned differences in résumé characteristics, such as age, characteristics of previously held jobs, or employment dates. It is therefore paramount that the fake résumés and the variation in the informational content be constructed to be as realistic as possible.

To facilitate the tailoring of résumés and reduce idiosyncratic variation in callback rates by job type, we restricted both the type of jobs to which we sent our résumés and the demographic characteristics of the applicants. Applications were limited to white-collar office jobs such as administrative or executive assistants, receptionists, secretaries, office associates, and the like. Because these jobs are disproportionately held by women, and gender differences are not our focus, all applicants had female names. Each applicant had a four-year bachelor’s degree from a non-elite public university or college with a current admission rate higher than 65 percent. In contrast to previous studies, our fictitious applicants also had substantial work histories. The work histories consisted of three to six white-collar office jobs, depending on age. Prior to the current spell, these work histories had no spells of unemployment longer than a month in the previous five years. Age or birth year were not listed in the résumés but could be inferred from year of college completion and work experience. No information was included on the résumés regarding race, marital status, or number of children.

The context of our audit study is nationwide in that we submitted job applications to openings in selected cities across the United States. To further be able to tailor our fictitious résumés to jobs and the local labor market, we selected eight cities. Because we also wanted to [End Page 173] allow for differences in treatment effects by local unemployment rates, four of the cities we chose had relatively low unemployment rates at the start of our study (Dallas, Omaha, Pittsburgh, and Portland, Maine) and four of which had relatively high unemployment rates in 2012 (Chicago, Sacramento, Tampa, and Charlotte, North Carolina). Table 1 contains city-level unemployment rates for the eight cities in 2012 (early in our study period) and 2014 (late in our study period). The table illustrates the general improvement in the labor market during the extended recovery from the Great Recession. Unemployment rates fell in both the low-and the high-unemployment cities, and the relative ordering of cities by unemployment rate was preserved across groups.

Click for larger view
View full resolution

Table 1.

Unemployment Rates by City and Year

Source: Bureau of Labor Statistics data.

To further enhance the external validity of the experiment, the résumés were crafted to be plausible and tailored to prospective employers in each of the eight cities we studied. Plausibility was created, as in Marianne Bertrand and Sendhil Mullainanthan (2004), by crafting the fictitious résumés from actual résumés posted on a site we did not use for submissions. These actual source résumés were posted for job openings in the occupations we study, but in a city that was not in the experiment. Each element of each source résumé was migrated to each of the eight target cities in which the experiment was conducted. This migration was performed by finding residential addresses, employers, and institutions of postsecondary education in the target city that are similar to those listed on the source résumé.³ Names were not migrated but instead were selected to be common, according to the Social Security Administration, among people of the relevant age cohort, but not Hispanic in origin. The names selected are neutral with regard to race and ethnicity—not obviously Asian, African American, or Hispanic. The appendix presents a sample of four résumés that vary with regard to the characteristics of interest: unemployment duration, age, and interim job.

The basic structure of the actual experiment follows now standard methods for “correspondence studies” (see, for example, Bertrand and Mullainathan 2004; Lahey 2008; and KLN). Specifically, we sent our crafted fictitious résumés in matched pairs or quadruples to openings posted on two online job boards. The experiment proceeded in four rounds. Round 1 only randomly assigns unemployment duration to one of two résumés sent to the same job posting. Round 2 differs from round 1 in that both résumés sent to the same job posting receive a random unemployment duration. Round 3 differed from round 2 in that also the presence of an interim job is randomly assigned (independently of unemployment duration). Round 4 differs from round 3 in that also the implied age of the résumé is randomly assigned. Details of each round are as follows: [End Page 174]

Round 1: 2,054 applications for 1,027 jobs. Conducted between March and May 2012, the first round involved submitting two applications (treatment and control) to each of 1,027 job openings spread across the eight cities. In this and all other rounds, the number of applications was roughly proportional to city size. The control applicant to each job had always just entered unemployment, whereas the treatment applicant had been unemployed for a number of weeks drawn at random from the set {4, 12, 24, 52}. The beginning of the unemployment spell was indicated on the résumé by the end date of the applicant’s most recent job. Thus the control applicant’s résumé indicated that her most recent job had ended in the month just prior to the month the application was made. The applicant’s age varied (thirty-five, forty, fifty-five, or fifty-six) across applications, but age did not vary within the applicant pair for specific job postings. Age was identified by year of graduation from college and reinforced by the employment history. Formatting of résumés was randomly varied to avoid detection of the experiment.

Round 2: 2,430 applications for 1,215 jobs. In the second round, conducted between July and September 2012, the experimental design was identical to the first round with one exception: each applicant had been unemployed for a number of weeks drawn at random, without replacement, from the set{0, 4, 12, 24, 52}. This change in design allowed us to account for the possibility that the two applicants in a pair were being directly compared by an employer and the control applicant, newly unemployed, was being mistaken for someone currently employed.

Round 3: 1,668 applications for 834 jobs. The third round of the experiment, conducted between November 2013 and April 2014 used the same methods as in round 2 to submit applications in matched pairs.⁴ In this round, however, we introduced the possibility that the applicant held an interim job. Applicants holding an interim job had just started work the month prior to the month of the application, in a relatively low-skilled position at a chain restaurant, a big-box retail store, or a grocery store. These interim jobs involved serving food, stocking shelves, or assisting customers at a register or on a retail floor, and were thus quite different from the career work on the rest of the résumé. The randomization with respect to interim job was conducted at the application level, within matched pair. Thus, both the control and the treatment could be employed in an interim job with some unemployment spell or unemployed with some other unemployment duration. We did not update the start dates of the résumés in this round, with the result that the applicants “aged.” Applicant’s age varied across job postings from the set {36, 37, 41, 42, 56, 57, 58}.

Round 4: 6,072 applications for 1,518 jobs. In the fourth and final round, conducted between April and August 2014, we submitted four (rather than two) applications to each of 1,581 openings spread across the eight cities. This increase in the number of applications per job was motivated by two interests. First, we wanted to speed data collection, which experience indicated could be done without risking detection of the experiment by doubling the number of applications per job. Second, we wanted to produce experimental variation in age, within job. Thus, the four applications per job consisted of two each from two different groups. One pair consisted of younger applicants (thirty-seven or forty-two), and the other consisted of older applicants (fifty-seven or fifty-eight). Randomization with respect to holding an interim job and variation in unemployment duration was as in round 3. [End Page 175]

The fact that the experiment occurred in four stages provided additional sources of variation while not affecting our results. In the empirical work, we begin by analyzing the four rounds separately. We then show that the results that are comparable between the four rounds are sufficiently similar that we can analyze them together.

a model of learning about applicant quality

When employers evaluate an applicant for a job, they have incomplete information about the quality of the worker. Employers use observable information available in the worker’s application to form an expectation about the worker’s quality. This information includes, among other things, worker demographics, education, work history, including characteristics on prior jobs, and unemployment experience. In this section we develop a very simple model of employer learning about applicant quality in order to motivate the analysis and to provide clear predictions and a clear framework for interpreting the results of the audit study.

We assume a profit-maximizing, risk-neutral firm with a single worker. The output (Y) of the firm is equal to the quality of the worker (µ). We assume all potential workers will be paid the same wage so that the firm is interested in hiring the most able worker among applicants for its job opening.⁵ Our model captures the employer’s process of integrating available information to form an expectation of applicant quality.⁶

Consider applicant i. The firm has incomplete information about µ_i and makes an inference based on a set of k noisy signals. For the purposes of our study, these signals include, among other background information, the applicant’s unemployment experience, age, and whether the applicant holds an interim job. Let s_ij represent the noisy j^th signal of µ_i. We assume this j^th signal satisfies

where γ_ij is a normally distributed random variable with zero mean and variance inline graphic The parameters α_j are normalizations that account for the fact that some signals are positive and some are negative as well as for differential scaling of the signals. For example, unemployment duration would have α_j < 0, but interim job might have α_j > 0. The employer’s inference problem is to combine the available information on s_ij, j = 1, … , k optimally in order to derive an expected value for applicant quality (E(μ_i| s_i₁, … , s_ik)).

Think of s_ij as prior information on applicant quality so that the posterior beliefs about applicant quality can be derived using a standard Bayesian procedure. Given the distributional assumption regarding the γ_ij, each signal s_ij about applicant quality is normally distributed with mean μ _i|α _j and variance inline graphic . In describing how information about s_ij is combined to form the employer’s posterior distribution on applicant quality, it is convenient to use the precisions of the random variables rather than the variances. The precision (h) of a random variable is the inverse of the variance, so that s_ij with variance inline graphic has precision . In this normal Bayesian updating model, the posterior distribution of the employer’s beliefs about µ_i is normal with a mean that is a precision-weighted average of the k signals. The posterior expectation is

Consider the implication of the model for the effect of signal m on the likelihood of call-back. The marginal effect of a change in s_im is

[End Page 176] which takes the sign of α_m. If signal m is unemployment duration, then, presumably, α_m ≤ 0, and the marginal effect of unemployment duration is negative. Thus, workers with longer unemployment duration have lower posterior mean worker quality. This makes their posterior expected quality less likely to exceed the necessary threshold and reduces the likelihood of callback. Analogously, if signal m is age and age is a negative signal of worker quality, then α_m ≤ 0 and older workers have lower posterior mean worker quality. Again, this makes their posterior expected quality less likely to exceed the necessary threshold and reduces the likelihood of callback. Given the opposing predictions regarding the value of holding a low-level interim job, the sign of α_m in this case is unknown, and we have no clear prediction on how the likelihood of callback varies with the holding of a low-level interim job.

There are at least two second-order predictions of the model. First, related to unemployment duration, it is likely that there is more information about applicant quality in the duration of unemployment when the labor market is tighter (lower unemployment rate). In terms of the model, the precision associated with the unemployment duration signal is higher where the local unemployment rate is lower so that there is relatively more updating based on unemployment duration. Formally,

which has the sign of α_m. Because α_m ≤ 0 where s_m represents unemployment duration, the negative marginal effect of unemployment duration on the likelihood of callback (equation 3) is larger in absolute value in tighter labor markets (equation 4). In other words, the negative marginal effect of unemployment duration on the callback rate will be more substantial in stronger labor markets.⁷

The other second-order prediction of the model is that where there are more signals of worker quality, the marginal effect of any one signal will be smaller in absolute value. This is relevant when thinking about the role of applicant age. An older worker has more prior work experience. This comes in the form of more and perhaps longer prior jobs. In the context of the model, longer experience and more information increase the number of signals (k). The marginal effect of a particular signal is given in equation 3. On inspection of this relationship, an increase in k simply increases the denominator in the term in brackets. The result is a reduction in the absolute value of the marginal effect any particular existing signal. This predicts, for example, that the marginal effect of unemployment duration will be smaller for older workers. Intuitively, older workers have a longer employment history that will dilute the effect of recent unemployment on the likelihood of callback.

A final prediction is not based strictly on the updating model. If an employer has a great need for workers as indicated by a higher call-back rate for applicants to the particular job, then the employer may not be as selective. The result will be that the threshold posterior mean worker quality necessary for a callback will be lower where demand is high. A clear implication of this is that the marginal effect of particular worker attributes (unemployment duration, age, and the holding of a low-level interim job in case) on the likelihood of callback will be lower for less selective employers.

The foregoing model presents only one way in which employers may use résumé information to draw inferences about applicant suitability for the job. Other approaches may include mechanical screening of résumés to filter out workers that are an obvious mismatch. Another approach would be screening based on tastes for particular worker attributes, such as age. We will not be able to test between alternative approaches, but keep those in mind when interpreting our findings.

descriptive analysis

We begin by separately analyzing the effect of our three main factors—duration of unemployment, worker age, and presence of interim job—separately. In the next section we analyze the effect of these characteristics jointly. To set [End Page 177] the stage, note that our mean callback rate across all rounds is 10.4 percent. One plausibility check that our résumés work as intended is that the callback rate was significantly higher(12.2 percent) in our low-unemployment cities than in our high-unemployment cities (8.9 percent), with a p-value of the difference smaller than 0.0005.

Click for larger view
View full resolution

Table 2.

Average Callback Rate, by Base Unemployment and Round

Source: Authors’ calculations

Note: Numbers in parentheses are standard errors clustered at the job level.

Duration of Unemployment

A primary focus of this study is to examine the effect of unemployment duration on the likelihood of an employer callback to a job application. All four rounds incorporated variation in weeks of unemployment including base values of zero weeks, four weeks, twelve weeks, twenty-four weeks, and fifty-two weeks.⁸ Table 2 contains mean callback rates overall and by round for each of the five baseline values for unemployment duration. There is no systematic relationship, positive or negative, between the probability of callback and the duration of unemployment. The hypothesis that the callback rates are equal across unemployment duration treatments cannot be rejected (p-value = 0.53 overall).⁹

The variation in unemployment duration treatment within job posting in each round offers the opportunity to examine within-posting variation in callback rates by unemployment treatment. The fixed-effect conditional logit analysis due to Gary Chamberlain (1980) is a natural way to estimate this within-posting effect. Intuitively, the fixed-effect conditional logit conditions on the number of successes (callbacks) within each job posting and asks whether the applicants with longer unemployment durations were less likely to be among those who received the fixed number of call-backs. This approach ignores the job postings for which there was no variation in the outcome. In the 3,076 job postings in rounds 1 to 3, for which there were 2 applications per job posting, 2,591 postings had no callbacks and [End Page 178] 229 postings had 2 callbacks. This leaves 256 postings with 1 callback. In the 1,518 job postings in round 4, where there were 4 applications per job posting, 1,215 postings had no callbacks and 30 postings had 4 callbacks. This leaves 150 postings with 1 callback, 85 postings with 2 callbacks, and 38 postings with 3 callbacks.

Click for larger view
View full resolution

Table 3.

Average Callback Rate, by Unemployment and Number of Callbacks

Source: Authors’ calculations.

Notes: By construction, the average callback rate is 0.5 for postings with one callback in rounds 1 to 3. In round 4, the callback rate is 0.25 for postings with one callback, 0.5 for postings with two callbacks, and 0.75 for postings with three callbacks. Numbers in parentheses are standard errors clustered at the job level.

We now present estimates of the average callback rates by unemployment treatment conditional on the number of callbacks received for the job posting (we discuss estimation of the full Chamberlain fixed effects logit model in a later section, “Multivarite Analysis”). Table 3 contains these callback rates conditional on the number of callbacks received. Column 1 of the table contains average callback rates by unemployment treatment for job postings in rounds 1 to 3 with a single callback. There is no obvious relationship between the callback rate and the unemployment treatment, and the hypothesis that callback rates are equal across treatments cannot be rejected (p-value = 0.85). Column 2 shows average callback rates in round 4 for job postings with one to three callbacks for each treatment. These appear to show, counter to expectations, that callback rates are higher where a longer unemployment spell is indicated on the application. However, once again the hypothesis that callback rates are equal across treatments cannot be rejected (p-value = 0.46).

The last three columns of table 3 shows average callback rates in round 4 for job postings with one, two, and three callbacks, respectively, for each treatment. In no case can the hypothesis that callback rates are equal across treatments be rejected (p-values = 0.78, 0.32, and 0.91 respectively).

The theory outlined in “A Model of Learning about Applicant Quality” implied that the marginal effect of unemployment duration will be larger in tighter labor markets. This suggests that there might be a relationship between unemployment duration and the probability of callback in the low-unemployment cities but not in the high-unemployment cities. We do not show the results here, but we repeated our analysis separately in the low-and high-unemployment cities. No perceptible relationship between unemployment duration and the callback rate was found in either group of cities. [End Page 179]

Click for larger view
View full resolution

Table 4.

Average Callback Rate, by Age and Round

Source: Authors’ calculations.

Note: Numbers in parentheses are standard errors clustered at the job level.

Age

Figure 4 showed that older job losers are more likely to be unemployed at a fixed date subsequent to a job loss. It has been a long-standing question in labor economics whether the stark differences by age shown in the figure may partly reflect a reluctance by employers to hire older job applicants. More generally, age may be an important factor for employers when selecting new employees. This motivated the random variation of age of applicant in the résumés we submitted as part of our audit study, and, in this section we present our estimates of callback rates as a function of applicant age.

Two applications were submitted to each of 3,076 job postings in rounds 1 to 3, and each job posting was randomly assigned to an age category. Both applications to each job posting listed the same birth date as implied by the year of graduation from college.¹⁰ Approximately one-third of the job postings were randomly assigned in each age category (32.5 percent aged thirty-five to thirty-seven, 33.5 percent aged forty to forty-two, and 34.0 percent aged fifty-five to fifty-eight). Four applications were submitted to each of 1,518 job postings in round4. Two applications per posting were randomly assigned to be in the oldest age category (fifty-five to fifty-eight) and the remaining two applications were assigned to be in a younger category. The result is that in round 4, roughly one-quarter of the applicants are thirty-five to thirty-seven years of age, one-quarter of the applicants are forty to forty-two years of age, and half of the applicants are fifty-five to fifty-eight years old.

The first column of table 4 contains the call-back rates for all four rounds, both overall (last row) and by age group. The overall callback rate is 10.4 percent. There is not a significant difference between the callback rates for applicants aged thirty-five to thirty-seven and applicants aged forty to forty-two (p-value of difference = 0.97). However, the callback rate for applicants aged fifty-five to fifty-eight is substantially and significantly lower (by about two percentage points) than the callback rate for younger workers (p-values of differences < 0.01).

The remaining columns of table 4 contain the callback rates separately by round. While mean callback rates for workers age fifty-five to fifty-eight are lower than the average callback rates for those thirty-five to forty-two, these differences are not statistically significant from zero in the first three rounds. However, there is a substantial difference by age in round 4. In round 4, applicants aged fifty-five to fifty-eight have a 7.6 percent callback rate compared with callback rates in the 10 to 11 percent range [End Page 180] for younger applicants (p-values of differences < 0.005).

Click for larger view
View full resolution

Table 5.

Average Callback Rate, Round 4, by Age and Number of Callbacks

Source: Authors’ calculations.

Notes: By construction, the average callback rate is 0.25 for postings with one callback, 0.5 for postings with two callbacks, and 0.75 for postings with three callbacks. Numbers in parentheses are standard errors clustered at the job level.

The variation in age of applicant within job posting in round 4 offers the opportunity to examine within-posting variation in call-back rates by age. As we did earlier with respect to the unemployment treatment, we focus on the job postings for which there was variation in the outcome. We ignore the job postings for which there was no variation in the outcome (the 1,215 of 1,518 postings with no callbacks and the 30 of 1,518 postings with 4 callbacks). This leaves 150 postings with 1 callback, 85 postings with 2 callbacks, and 38 postings with 3 callbacks). We do not estimate Chamberlain fixed effects logit model directly at this point; we do present estimates of the average callback rates by age group conditional on the number of callbacks received for the job posting

Table 5 contains mean callback rates in round 4 for postings that received one to three callbacks. The evidence is clear. Applicants in the oldest age groups received callbacks at a significantly lower rate than applicants in either of the two younger groups. For the 150 postings in which one of four applications received callbacks (for an aggregate callback rate of 25 percent), applicants in their fifties received callbacks at a rate sixteen percentage points less than applicants in their thirties or forties (about a 50 percent lower callback rate). For the 85 postings, postings in which two of four applications received callbacks (for an aggregate callback rate of 50 percent), applicants in their fifties received callbacks at a rate that is sixteen percentage points less than applicants in their thirties (about a 30 percent lower callback rate) and 30.3 percentage points less than applicants in their forties (about a 47 percent lower callback rate). There is no difference in callback rates by age for the 38 postings in which three of the four applications received callbacks. Applicants in each of the three age groups had callback rates very close to the 75 percent overall rate.

Overall, table 5 confirms the negative effect of age on callback, even holding the job-specific callback rate constant. In addition, the finding of no difference in callback rates by age category for job postings with three callbacks is consistent with our hypothesis that worker characteristics are less important when employers are less selective, as indicated in this case by callbacks to three of four applicants. The high callback rate may reflect a need by the employers to fill a large number of jobs quickly. In this case the employer would accept most of the applicants and be less sensitive to individual characteristics. This implies that these employers should be less sensitive to other worker characteristics as well, and we examine this directly below. However, the overall pattern is clear. Employers are generally substantially less likely to call back older job applicants. [End Page 181]

Interim Jobs

An important decision facing an unemployed worker is whether to take an interim job at a lower level than, and not directly relevant to, the job the worker is seeking. The obvious positive aspect of taking such a job is that it provides income to the unemployed worker, particularly if the worker is not receiving unemployment compensation. Another possible advantage is that potential employers may infer from the fact that the worker has taken such a job that he or she is hardworking and strongly motivated to stay employed. However, it is possible that potential employers will infer that the worker is not of appropriate quality precisely because he or she has been working in a lower-level job. In some cases this may be the result of the employer’s using some kind of automated or cursory screening of job applications that rejects applications if their most recent job is not relevant to the job for which the applicant is applying.

Which of these potential mechanisms is at work or which dominates is an empirical question that we address. Beginning in round 3, we introduced a treatment to interrupt a spell of unemployment with work at a low-level interim job. We defined an interim job as one with low wages and for which the candidate appeared ill matched in terms of education and previous experience. For example, the interim jobs included sales associate or cashier at a big box or grocery store, and restaurant server. The résumés with such jobs indicate that the job was currently held by the new applicants and started in the month just prior to the application. These jobs interrupted an unemployment spell of varying duration identical to those unemployment spells we investigate directly (zero, four, twelve, twenty-four, or fifty-two weeks). The randomization with respect to interim job was conducted at the application level, within job posting. Interim jobs appeared on an application with probability 0.5. In round 3, with two applications per job posting, there could be zero, one, or two applications with an interim job. In round 4, with four applications per job posting, there could be zero, one, two, three, or four applications with an interim job.

Of the 834 job postings analyzed in round 3, for 219 (26.3 percent) neither of the applications indicated an interim job, for 391 (46.9 percent) one of the two indicated an interim job, and for 224 (26.9 percent) both applications indicated an interim job. Of the 1,518 job postings analyzed in round 4, for 77 (5.1 percent) none of the applications included an interim job, for 438 (28.9 percent) one of the applications included an interim job, for 516 (34.0 percent) two of the applications included an interim job, for 419 (27.6 percent) three of the applications included an interim job, and for 68 (4.5 percent) all four applications included an interim job.

The applications in rounds 3 and 4 varied randomly in unemployment duration and age, and this variation is independent of the variation in interim job. We account for these other dimensions of variation in the multivariate analysis in a later section.

Table 6 contains mean callback rates for rounds 3 and 4 by whether or not an interim job was indicated on the application. The overall callback rate in rounds 3 and 4 was 9.2 percent. The callback rate was 9.8 percent where there was no interim job versus 8.5 percent where there was an interim job. This difference of 1.3 percentage points (15 percent) is statistically significant (p-value = 0.038). When analyzed separately by round, there is no difference in round 3 and a larger statistically significant difference in round 4 (9.9 percent with no interim job versus 8.4 percent with an interim job).

Given the within-job randomization of the existence of an interim job, we once again examine how callbacks vary with an interim job within job posting. Again, this analysis is restricted to applications to job postings for which there was variation in callback. Table 7 contains mean callback rates for postings in round 3 that received one callback and in round 4 for postings that received one to three call-backs. Although the point estimate of the difference in callback rates for single-callback postings in round 3 is negative and substantial in magnitude, this difference is not statistically significant, given the small number of postings (fifty-nine) that meet the sample criteria. The difference in callback rates for postings with one to three callbacks in round 4 is a statistically significant 7.1 percentage points [End Page 182] (p-value = 0.015). This difference is driven by a large negative difference in callbacks by interim job status (13.0 percentage points) for the 150 postings that received a one call-back (p-value < 0.0005). The differences in callback rates by interim job status for postings with two or three callbacks are not statistically significant.

Click for larger view
View full resolution

Table 6.

Average Callback Rate, by Interim Job and Round

Source: Authors’ calculations.

Note: Numbers in parentheses are standard errors clustered at the job level.

Click for larger view
View full resolution

Table 7.

Average Callback Rate, Rounds 3 and 4, by Interim Job and Number of Callbacks

Source: Authors’ calculations.

Notes: By construction, the average callback rate in round 3 is 0.5 for postings with one callback. Similarly, the average callback rate in round 4 is 0.25 for postings with one callback, 0.5 for postings with two callbacks, and 0.75 for postings with three callbacks. Numbers in parentheses are standard errors clustered at the job level

The overall pattern of results suggests that holding a job that is lower skill and irrelevant to the job for which the individual is applying reduces the likelihood of a callback, at least for selective employers. It appears that an unemployed worker is better off remaining unemployed and searching for work rather than being employed in a low-level job while searching. Alternatively, if an applicant has taken a low-level interim job, she may be better off not listing this job on her résumé.

In addition, again the finding of a significant difference in callback rates by interim job status in round 4 only for jobs with one callback and not for jobs with more callbacks is (as with age) consistent with our hypothesis that worker characteristics are more important when employers are more selective, as indicated in this case by callbacks to a single applicant.

multivariate analysis

We now turn to a multivariate analysis that models the probability of callback as a function [End Page 183] of unemployment duration, age, and interim job. This analysis first uses both within-and between-posting variation in application characteristics. We choose the logit model for several reasons. In principle it should provide a better approximation of the functional form for binary choice probabilities with a relative low incidence.¹¹ Given the canonical sample design of recent audit studies that provide random variation within, a particular advantage of the logit model is that it provides a consistent approach that allows us to obtain estimates that rely on within-posting variation via the Chamberlain fixed effects logit model. Finally, the logit model allow us to contrast the fixed effects estimate with a random effects logit, our preferred specification.

The random effects model accounts for the fact that job postings are randomly drawn from the underlying population and may differ in their mean callback rate. This model is appropriate (yields consistent estimates) where the baseline variation across job postings in their callback rates is uncorrelated with the observed applicant characteristics of interest. Given our approach in sending résumés to job listings with key characteristics varying randomly, we would not expect the job-specific callback rate to be correlated with résumé characteristics so that estimates derived using the random effects model should be consistent. More generally, since the three treatments were assigned independently to résumés, there is no reason to expect that the multivariate analysis in general, and the conditional logit in particular, will affect our main results.

Table 8 presents the main results of our multivariate analysis. We report our findings in terms of odds ratios, which for small probabilities are approximately the ratio of probabilities of callback given a treatment versus no treatment.¹² Age enters as a dummy variable for whether a worker is fifty-five to fifty-eight years of age (rather than thirty-five to forty-two). The first three columns present results for the logit, random effects logit, and fixed effects logit, respectively, pooling four rounds.

Recall that in all four rounds unemployment durations differ among applications sent to the same job posting; in addition, in rounds 3 and 4 there is also variation in incidence of interim jobs among applications sent to the same job posting; in addition, in round 4 age differs among applications sent to the same job posting as well. The simple logit and random effects logit models (columns 1 and 2) use all available variation for all factors, even if they were not randomly assigned within jobs. The between-job variation yields valid estimates, since the pairing of résumés with jobs was effectively random with respect to job and résumé characteristics. To make sure our results are not affected by the inclusion of variation between jobs, we then implement the fixed effects logit model, which relies only on within-posting variation. The within variation for unemployment duration is coming from all four rounds; it is coming from round 4 for age; it is coming from rounds 3 and 4 for interim job. To examine a specification where all three factors are treated symmetrically, we then restrict the analysis to round 4, where there are four applications per job posting and within-job posting variation in all three factors. The logit, random effects logit, and fixed effects logit for data from round 4 only is shown in columns 4 to 6 of table 8.

Given that we have purposefully chosen to work with a homogeneous groups of workers, the only control variable, other than dummies [End Page 184] for rounds in columns 1 and 2, is a dummy for whether the city was initially classified as one of our low-unemployment cities (Dallas, Omaha, Pittsburgh, and Portland, Maine) or as one of our high-unemployment cities (Chicago, Sacramento, Tampa, and Charlotte, North Carolina). This effect is identified only from between-job-opening variation.

Click for larger view
View full resolution

Table 8.

Logit, Random Effects Logit, and Conditional Logit Estimates: Odds Ratios

Source: Authors’ calculations.

Notes: The χ2 and p-value refer to the test statistic for the null hypothesis that the four coefficients on unemployment duration are jointly zero. Columns 1 and 2 include indicators for the round of the experiment. Numbers in parentheses are standard errors. Standard errors in columns 1 and 4 are clustered at the job level.

Overall and as expected, the results in table 8 confirm our three main findings from the previous section. There is no detectable effect of unemployment duration on callback rates. The χ² test statistic and corresponding p-value we present are for the null hypothesis that the four coefficients on the unemployment duration dummies are jointly equal to zero. In none of our models can we reject this null hypothesis. Again, we find there is a precisely estimated negative effect (an estimated odds ratio less than 1) of age on the callback rate. Finally, there is a substantial negative effect of reporting holding an interim job on the callback rate.

The first column of table 8 shows basic logit estimates pooling all four rounds, clustering standard errors at the job level. The second column adds random effects. As expected, controlling for random variation in the callback rates across openings improves [End Page 185] the fit of the model substantially, as in dicated by the improvement in the log-likelihood value, and reduces standard errors slightly. The odds ratio on age drops from 0.8 to 0.57. We have no economic explanation for this, since the random effects are not correlated with the independent variable. However, their presence changes the interpretation of the coefficient. Whereas the coefficients of the logit model can be interpreted as the average effect in the population, coefficients of the random effects model are the effects holding constant the within-opening call-back propensity of a given job posting. The third column contains estimates of the Chamberlain fixed effects logit model, which uses only those job postings for which there was variation in callback rates (one callback in rounds 1 to 3 and one to three callbacks in round 4). As expected, given the random assignment of characteristics to résumés, the fixed effects estimates are virtually identical to the random effects estimates in column 2. In order to formally compare the random and fixed effects models, we performed a standard Hausman test comparing the random and fixed effect specifications. The value of the χ² -test statistic (six degrees of freedom) is 2.69 with p-value of 0.85, implying we cannot reject the hypothesis that the fixed effects are uncorrelated with the factors included in the model.

Columns 4 to 6 show the results of repeating the analysis using only data from round 4, where there are four applications per opening and within-opening random variation in all three factors. The results are very similar compared to the model pooling all rounds. The only notable difference is that the coefficient on the dummy for a low local unemployment rate in columns 4 and 5 is no longer statistically significant (odds ratio not significantly different from 1). Note, however, that round 4 was fielded substantially later than the earlier rounds, and, while differences in unemployment rates across labor markets persisted, they were smaller in 2014 (when round 4 was fielded) than earlier.

Column 6 then presents findings for the fixed effects logit model for round 4. As we noted, the model is identified only from quadruplets of job applications in which callback varies (one to three callbacks to four applications). Dropping the 1,215 job postings for which we received no callbacks and the 30 job postings for which all four applications received callbacks leaves 1,092 observations for 273 job postings, a reduction of over 80 percent with respect to the full round 4 model in columns 4 and 5 (6,072 observations for 1,518 job postings). Nevertheless, the results in column 6 are very similar to those from the random effects logit in column 5, particularly with regard to the effect of age and interim job. Once again we performed a Hausman test of the hypothesis that the fixed effects are uncorrelated with the factors included in the model. The value of the χ²-test statistic is 1.63 with p-value of 0.95, implying, as with the estimates for all four rounds, that we cannot reject the hypothesis that the fixed effects are uncorrelated with the factors included in the model.

Overall, the results in table 8 confirm our main findings using the full power of the pooled sample. We tried various alternative specifications, none of which yielded additional statistically meaningful findings. In particular, we tried to assess whether the effects of unemployment duration, age, and interim jobs vary with the local unemployment rate. This is particularly interesting, because a key result of KLN’s analysis was that the effect of unemployment duration on callback rates is lower in markets with higher unemployment rates. Not surprisingly, our finding, that unemployment duration on the résumé does not affect the call-back rate, does not vary with the local unemployment rate. We also do not find that the effect of age or interim jobs varies by the state of the local labor market.

Again mirroring our univariate analysis, in table 9 we replicate the main logit model using observations only from round 4 separately for jobs with different numbers of callbacks. Column 1 from the table simply replicates column 4 from table 8. Column 2 then shows the results when we drop jobs for which either all or none of the résumés we sent received a callback. Our results on age and interim jobs are unchanged, with older applicants and applicants who report [End Page 186] holding an interim job substantially less likely to receive a callback.¹³ Columns 3 to 5 in table 9 then show the results for different number of callbacks per application. Consistent with the findings in our univariate analysis, the effect of age is present for only applications to jobs with one or two callbacks. There is no significant difference in callback rates by age for jobs with three callbacks. The effect of reporting the holding of an interim job is present only for applications to jobs with one callback. There is no significant difference in callback rates by interim jobs for jobs with two or three callbacks. Consistent with the earlier results, there is no relationship between the likelihood of callback and unemployment duration for any group we study.

Click for larger view
View full resolution

Table 9.

Logit Estimates for Round 4 by Number of Callbacks: Odds Ratios

Source: Authors’ calculations.

Notes: The χ2 and p-value refer to the test statistic for the null hypothesis that the four coefficients on unemployment duration are jointly zero. Numbers in parentheses are standard errors clustered at the job level.

The pattern of results in table 9 confirms our finding from the descriptive analysis presented earlier that employers who are eager to hire—and hence have a higher callback rate for their job posting—are less choosy—in other words, résumé characteristics appear to matter less in determining callback than for employers that have a lower callback rate for their job posting. When employers are “hungry” for workers, they are less selective than when they are not so needy. This supports the view that a strong labor market can play an important role in reducing the disadvantage of particular types of applicants, such as older applicants, who otherwise would be at a disadvantage when searching for jobs. [End Page 187]

reconciliation with earlier work

Our finding of no relationship between the duration of unemployment and the likelihood of a callback for mature and older workers is consistent with some prior audit studies and at odds with others. The closest parallel studies that find important effects of unemployment duration is that of Kroft, Lange, and Notowidigdo (2013) and Ghayad (2014). Those studies finds that in the United States in the period 2011–2012 longer unemployment spells reduced callback significantly for younger workers. In contrast, Nunley and his colleagues (forthcoming) find that for relatively recent U.S. college graduates, unemployment duration has no effect on callbacks. The results of a Swedish audit study by Stefan Eriksson and Dan-Olof Rooth (2014) also pertain to younger workers, and imply no effect of shorter ongoing unemployment spells or past unemployment spells on the callback rate, but a negative effect of long current unemployment spells on callback for less-educated workers.

These studies all follow a comparable basic blueprint, but it is important to recognize that there are subtle and not-so-subtle differences in the implementation that could affect the results. In particular, our study is narrowly targeted at one type of worker in one type of job. By focusing on female administrative support workers with a four-year college education, we have a relatively clean design without having to control for confounding variables. But this is at the cost of potentially limited external validity. Additionally, we cover a fairly wide age range and do not include the very young workers who are the focus of some of the earlier studies.¹⁴

In this section we explore differences among the studies that could account for the difference in results. We focus particularly on the KLN (Kroft, Lange, and Notowidigdo 2013) analysis because (1) like the study we report here (which we refer to as FSvW), it is U.S. based in the post–Great Recession period and encompasses most of our cities; (2) many of the jobs in their analysis are of the same type as ours, allowing for a direct comparison in callback rates; (3) the data are publicly available, allowing us to comparable models on their data and our data; (4) the paper has already been highly influential. All of this provides strong motivation to carefully assess the extent to which their approach is comparable to ours.¹⁵

In the following we focus on five key differences in the design and implementation of the KLN and FSvW studies that could account for the difference in results: (1) outcome measure,(2) type of job for which applications are submitted, (3) time period, (4) choice of cities,(5) education level, and (6) age range of the applicants. We consider each of these in turn.

The Outcome Measure

The KLN analysis focuses on callbacks that include a request for an interview while our study and those of Ghayad (2014) and Eriksson and Rooth (2013) focus on all callbacks, regardless of whether or not there was an interview request. This is reflected in a difference in reported callback rates. Our callback rate was 10.4 percent whereas the KLN callback-with-interview rate was 4.7 percent. Using data supplied by KLN, we calculate that the overall call-back rate in KLN was 12.1 percent, comparable in magnitude to the callback rate we found.

The key question here is whether the KLN overall callback rate is negatively related to the length of unemployment spell. In order to address this question, we obtained a copy of the data KLN used. Using both these data and the data from our study, we estimate a simple [End Page 188] model of the effect of unemployment duration on the probability of callback. The model we use is a simple logit model with only a constant and the duration of unemployment in months. Table 10 contains the results of this analysis. The first row of this table contains the estimate of the marginal effect of unemployment duration on the callback rate for the overall FSvW sample, and it confirms the finding of no significant relationship in our sample. The second and third rows of this table contain estimates of the marginal effect of unemployment duration on the callback rate for the overall KLN sample for the two definitions of callback. The estimate in row 2 uses KLN’s preferred callback-with-interview measure and confirms their finding of a significant negative effect of unemployment duration. Our reanalysis of the KLN overall callback measure in row 3 shows an even stronger negative relationship between the duration of unemployment and the probability of callback. Thus the difference in outcome measure is not a factor that can explain the difference in findings. The point estimate in row 3 of the table implies a reduction in the probability of callback of about 0.8 percentage points per month of unemployment. This is a reduction of about 7 percent at the mean of12.05 percent, a substantial effect.

Click for larger view
View full resolution

Table 10.

Reanalysis of the Data on Applications from Kroft, Notowidigdo, and Lange (2013) [KLN] Restricted to the Cities Included in Farber, Silverman, and von Wachter (2017) [FSvW]

Source: Authors’ calculations.

Notes: Marginal effects on the probability of callback calculated from logit model of callback. Robust standard errors clustered by job id in parentheses.

In order to maintain comparability with our analysis, our reanalysis of the KLN data continues using the measure of the overall callback rate rather than the callback-with-interview measure. [End Page 189]

Variation in Job Type

All jobs applied for in the FSvW analysis were white-collar office support jobs and all applicants were female. The KLN analysis included applications for three types of jobs: administrative support and clerical, customer service, and sales. The first KLN occupational group, administrative support and clerical, is comparable to the office support jobs in the FSvW analysis, and 96.4 percent of the 2,690 applicants for these jobs in the KLN sample were female.

Row 4 of table 10 contains results of the analysis for the KLN administrative support and clerical jobs. The first thing to note is that the overall callback rate for these jobs in the KLN data is extremely low at 3.61 percent. There were only ninety-seven callbacks to 2,690 applications for this type of job. Still, there is a statistically significant negative relationship between unemployment duration and the probability of a callback. However, it is only about 56 percent of the estimated effect in the overall KLN sample. The point estimate in row 4 of the table implies a reduction in the probability of callback of about 0.5 percentage points. This is a reduction of about 13 percent at the mean callback rate of 3.61 percent, comparable to the implied effect for the full sample in row 3. We conclude that variation in the type of job does not account for the qualitative difference between our results and those of KLN.

Education Level

Related to job type is the skill level of the applicants. All applicants in the FSvW analysis were graduates of four-year colleges. In contrast, the KLN analysis included applicants who had completed high school (20 percent), community college (42 percent), and four-year college (38 percent). There is no difference in call-back rates by education level in the KLN analysis, but it is worth investigating whether the relationship of the likelihood of callback with unemployment duration holds up for the KLN four-year college graduates.¹⁶

Row 5 of table 10 contains results of the analysis for the KLN applicants who have a degree from a four-year college. The callback rate for these applicants is very close to the overall callback rate in the KLN data. The marginal effect of unemployment duration on the probability of callback is significantly negative for the KLN four-year-college graduates and larger in magnitude than for the overall sample (compare rows 6 and 3 of table 10). We conclude that variation in the education level of applicants does not account for the qualitative difference between our results and those of KLN.¹⁷

Time Period

The KLN analysis is based on job applications submitted between June 2011 and July 2012 whereas the FSvW analysis is based on applications submitted between March 2012 and August 2014. Clearly, the KLN analysis is much earlier in the period of recovery from the Great Recession. This may be part of the explanation for the fact that KLN find a much lower call-back rate to their applications for comparable jobs (3.61 percent for administrative and clerical jobs) than we find (10.4 percent).¹⁸ However, the information-based theory highlighted by both KLN and FSvW suggests that, to the extent employers infer worker quality partly from unemployment duration, the negative effect of unemployment duration on the callback rate should grow as the recovery proceeds and the labor market strengthens. In fact, even for our data from round 1 in 2012 we find a zero [End Page 190] effect, in contrast with the basic updating model.¹⁹

A potential source of reconciliation between the disparate findings of KLN and FSvW is suggested by our within-posting analysis. The results in table 9 suggest that observable characteristics are more important when callback rates are lower (say, one callback from four applications as opposed to three callbacks from four applications). The generally lower overall callback rates found by KLN are consistent with employers exercising more discretion in call-backs so that unemployment duration could play a more important role in the time period covered by their sample.

Geographic Variation

As explained earlier, in “Research Design,” our analysis was designed to cover eight metropolitan areas, four with relatively low unemployment rates (Dallas, Omaha, Pittsburgh, and Portland, Maine) and four with relatively high unemployment rates (Chicago, Sacramento, Tampa, and Charlotte, North Carolina). In contrast, the KLN analysis covers one hundred large American metropolitan areas.²⁰ Their analysis includes observations on seven of the eight cities used by FSvW, the exception being Portland, Maine. We investigate the extent to which differences in geographic coverage can account for the difference in findings across the two studies by using the seven-city subset of the KLN data to estimate our simple model of the effect of unemployment duration on the probability of callback.

Rows 6 and 7 of table 10 contain the results of this analysis. Row 6 of the table contains estimates of the marginal effect of a month of unemployment on the probability of callback for the KLN subsample for the seven FSvW cities. There are only 1,130 applications in these cities, so it is not surprising that the marginal effect of unemployment is estimated less precisely. However, the estimate is negative and significantly different from zero (p-value = 0.042). The estimated marginal effect for the 8,106 applications from the remaining ninety-two cities in the KLN sample, presented in row 7 of the table, is comparable in magnitude and significantly negative at conventional levels. These results imply that differences in the geographic composition of the KLN and FSvW samples are not likely to account for the differences in results.

Variation in Age

The differences in the implied age range of the résumé is the most striking contrast between our and other audit studies of the effect of unemployment duration on callback. The distributions of age of applicants in the KLN and FSvW samples are largely nonoverlapping. Applicants in the KLN sample range in age from nineteen to thirty-nine, with 99 percent between twenty-one and thirty-three, whereas applicants in the FSvW sample range in age from thirty-five to fifty-eight. As explained earlier, in “A Model of Learning About Applicant Quality,” this contrast has the potential to account for the different findings with regard to the relationship between unemployment duration and the probability of callback. KLN note themselves in their conclusion that it is important to assess whether their findings hold for older workers.

Rows 8 to 11 of table 10 contain analyses of the callback rate separately for four age groups in the KLN sample. Callback rates are similar across all four age groups, ranging from 10.7 percent to 12.8 percent.²¹ The marginal effect of unemployment duration on the callback rate is estimated to be negative for all age groups. There are significant differences in the marginal effect across age groups (p-value of test that all marginal effects equal = 0.047), but the absolute magnitude of the effect does not decline monotonically with age. The effect is largest by far in absolute magnitude for the youngest applicants (nineteen to twenty-two years [End Page 191] old) then declines for applicants aged twenty-three to twenty-six before rising somewhat for applicants twenty-seven to thirty and for applicants thirty-one to thirty-nine (97.5 percent of whom are thirty-one to thirty-four).

Given the substantial difference in the age ranges covered by KLN and our analyses, it is difficult to conclude anything from the age variation in the effect of unemployment duration within KLN’s sample. However, age may be an important factor in accounting for the difference in findings. The older applicants used by FSvW have significant longer work histories that may outweigh any recent unemployment experience when résumés are evaluated by potential employers. The younger applicants used by KLN do not have nearly as extensive a history and so recent unemployment experience may get higher weight in the evaluation of applicants. We also note that the applicants in the Eriksson and Rooth (2014) Swedish study and the Ghayad (2014) U.S. study are all in their twenties with no more than about five or six years of experience, which may account for their findings of significant effects of unemployment duration on callback.

To summarize the comparison with KLN regarding the effect of unemployment duration on the callback rate, the differences in the outcome measure and the choice of cities do not appear to be important factors in understanding the difference in findings. The differences in job type and time period have the potential to explain some but not all of difference in findings. The differences between the studies in applicants’ age is a strong candidate to explain the difference. However, the lack of overlap in the ages of applicants in the FSvW and KLN studies make it difficult to draw a definitive conclusion in this regard. Without a single study that includes a full range of ages, our conjecture that the importance of unemployment duration in determining callbacks declines with age remains suggestive rather than conclusive.

final comments

Based on our audit study of the determinants of the likelihood of callbacks to job applications, we find clear evidence that employers are less likely to call back older applicants (those in their fifties) than younger workers (those in their thirties and forties). This is consistent with work based on the Displaced Workers Survey and administrative data showing that older displaced workers are less likely to be employed subsequent to job loss (Farber 2015) and to suffer long-term nonemployment (Song and von Wachter 2014), and it has potentially important implications for the employment prospects of older job losers. We also find clear evidence that holding a relatively low-level interim job at the time of job application significantly reduces the likelihood of a callback. This suggests that employers may, either mechanically or by rule-of-thumb, overweight the most recent employment spell in screening applications and suggests that those individuals who do take a lower-level interim job should not report such jobs on their applications.

Recent work reports contrasting findings between unemployment duration and the likelihood of callback for younger workers. While prominent papers find a negative relationship between short unemployment durations and callback for the United States (Kroft, Lange, and Notowidigdo 2013; Ghayad 2014), another study finds no such relationship (Nunley et al. forthcoming). Again focusing on younger workers, a related paper for Sweden finds no effect of short unemployment spells but negative effects of long unemployment spells on callbacks for less-educated workers (Ericksson and Rooth 2014). In our work we unambiguously find no relationship between unemployment and call-back for mature and older workers. We attempt to reconcile our finding in this dimension with the work of KLN using their data and definitions comparable to ours, but cannot completely resolve the issue. Part of the difference may be the time period, since all of the earlier studies were fielded much earlier in the recovery period from the Great Recession when the labor market was weaker. Another difference, and one we think worthy of further exploration, is that all of the earlier studies focus on younger job applications (mostly in their twenties) whereas our study focuses on job applicants from their mid-thirties to mid-fifties. While there are good theoretical reasons to suspect that unemployment duration could be less important for older job applicants, a single study [End Page 192] that covers the full age spectrum is needed to draw a definitive conclusion on this issue.

Finally, our analysis of variation in callbacks within job postings suggests that the effect of observable résumés characteristics (age, interim job) is reduced when employers are calling back a higher fraction of their applicants. Our interpretation of this finding is that when employers are hungry for workers, they are less selective in who they call back. This suggests the power of stimulating aggregate demand as a strategy to improve the employment prospects of applicants who otherwise would not “make the cut” of receiving any positive response to a job application.

Henry S. Farber

Henry S. Farber is Hughes- Rogers Professor of Economics at Princeton University.

Dan Silverman

Dan Silverman is Rondthaler Professor of Economics at Arizona State University.

Till M. Von Wachter

Till M. von Wachter is associate professor in the Department of Economics at the University of California, Los Angeles.

APPENDIX–SAMPLE RéSUMéS

This appendix contains a set of four sample résumés:

1. Linda Carter, Sacramento, zero weeks’ unemployment, older worker, no interim job.
2. Jennifer Smith, Pittsburgh, twenty-four weeks’ unemployment, medium-age worker, no interim job.
3. Heather Adams, Dallas, fifty-two weeks’ unemployment, younger worker, interim job.
4. Linda Carter, Dallas, twelve weeks’ unemployment, older worker, interim job.

inline graphic
[End Page 193]

inline graphic
[End Page 194]

inline graphic
[End Page 195]

inline graphic
[End Page 196]

inline graphic
[End Page 197]

inline graphic
[End Page 198]

inline graphic
[End Page 199]

inline graphic
[End Page 200]

REFERENCES

Bertrand, Marianne, and Sendhil Mullainathan. 2004. “Are Emily and Greg More Employable Than Lakisha and Jamal? A Field Experiment on Labor Market Discrimination.” American Economic Review 94(4): 991–1013.

Google Scholar

Chamberlain, Gary. 1980. “Analysis of Variance with Qualitative Data.” Review of Economic Studies XLVII 47(1): 22–38.

Google Scholar

Elsby, Michael, Bart Hobijn, and Ayşegül Şahin. 2010. “The Labor Market in the Great Recession.” Brookings Papers on Economic Activity 41(1): 1–69.

Google Scholar

Eriksson, Stefan, and Dan-Olof Rooth. 2014. “Do Employers Use Unemployment as a Sorting Criterion When Hiring? Evidence from a Field Experiment.” American Economic Review 104(3): 1014–39.

Google Scholar

Farber, Henry S. 1999. “Alternative and Part-Time Employment Arrangements as a Response to Job Loss.” Journal of Labor Economics 17(S4): S142–69.

Google Scholar

———. 2015. “Job Loss in the Great Recession and Its Aftermath: U.S. Evidence from the Displaced Workers Survey.” IRL Working Paper No. 589. Princeton, N.J.: Princeton University, Industrial Relations Section, May.

Google Scholar

Farber, Henry S., and Robert Gibbons. 1996. “Learning and Wage Dynamics.” Quarterly Journal of Economics 111(4): 1007–47.

Google Scholar

Farber, Henry S., Dan Silverman, and Till von Wachter. 2016. “Determinants of Callbacks to Job Applications: An Audit Study.” American Economic Review 106(5): 314–18.

Google Scholar

Ghayad, Rand. 2014. “The Jobless Trap.” Available at: www.lexissecuritiesmosaic.com/gateway/FEDRES/SPEECHES/ugd_576e9a_f6cf3b6661e44621ad26547112f66691.pdf; accessed August 10, 2016.

Kroft, Kory, Fabian Lange, and Matthew J. Notowidigdo. 2013. “Duration Dependence and Labor Market Conditions: Evidence from a Field Experiment.” Quarterly Journal of Economics 129(2): 1123–67.

Google Scholar

Lahey, Joanna N. 2008. “Age, Women, and Hiring: An Experimental Study.” Journal of Human Resources 43(2008): 30–56.

Google Scholar

Nunley, John M., Adam Pugh, Nicholas Romero, and Richard Alan Seals, Jr. Forthcoming. “Unemployment, Underemployment, and Employment Opportunities: Results from a Correspondence Audit Study of the Labor Market for College Graduates.” Industrial and Labor Relations Review.

Google Scholar

Song, Jae, and Till von Wachter. 2014. “Long-Term Nonemployment and Job Displacement.” Evaluating Labor Market Dynamics, Proceedings from the Jackson Hole Economic Policy Symposium Sponsored by the Federal Reserve Board of Kansas City, Jackson Hole, Wyo. (August 2014). Available at: www.kansascityfed.org/publicat/sympos/2014/2014vonWachter.pdf; accessed September 30, 2016. [End Page 201]

Footnotes

1. Mean unemployment duration for college-educated females in administrative support occupations over the 2008-to-2014 period is 10.2 weeks longer for women aged forty-five to sixty-four than for women aged twenty-five to forty-four (the difference is statistically significant).

2. All of the fictitious applicants in our study had completed a four-year degree.

3. Similarity for the address was defined by the (minimum) Mahalanobis distance between the source address and the target by census tract age, race, education, and income level. Similarity for employers was, for large businesses, achieved by replacing the source employer with its chief competitor in the target city. For small businesses, similarity was achieved by simple search for a target business in the same industry with approximately the same age and number of employees. For government work, the source employer was simply switched to that of the target jurisdiction. Similarity of the postsecondary schools was identified by simple search using national ranking, public or private status, size, and distance to the target city.

4. The delay between rounds 2 and 3 was unintentional—it resulted from two of the authors (Silverman and von Wachter) moving their primary appointments to different universities. Additionally, data were inadvertently collected in Portland, Oregon, rather than Portland, Maine, in round 3. Since the relevant résumés were tailored to Portland, Maine, we do not include the Portland, Oregon, applications in the analysis. Thus, there are only seven cities in round 3.

5. Note that the quality of applicants will likely depend on the offered wage.

6. Although we do not include sequential search in our model, such a model would clearly have the property that the employer will set a reservation worker quality level as part of the search process and call back those applicants whose expected quality exceeds this threshold. Thus, applicants with higher expected quality will be more likely to receive a callback.

7. This is a result found by Kroft, Lange, and Notowidigdo (2013).

8. These are the weeks of unemployment implicit in the applications at fixed dates. Since the applications were submitted over a period of time following that date, the actual durations seen by potential employers are somewhat longer. Actual unemployment duration exceeds each base value by about 4 weeks on average (standard deviation of about 1.1 weeks for each base value).

9. The hypothesis of equality of callback rates across unemployment duration treatments cannot be rejected within any of the four rounds, with p-values ranging from 0.23 in round 1 to 0.71 in round 3.

10. In fact, the actual ages of the two applications for a posting could differ by one year, given that age is determined by birth date and the applications were sometimes submitted on different dates.

11. We have reproduced these findings with linear probability and probit models, and the results are not affected by the choice of functional form.

12. Let p(1) ≡ Pr{Callback = 1|X, D = 1} and p(0) ≡ Pr{Callback = 1|X, D = 0}, where D represents one of our right hand side dummy variables and X represents the remaining variables in the model. Then the odds ratio R is defined as

where β_D is the coefficient on D. Where the probabilities involved are small, the odds ratio is approximately the ratio of probabilities.

13. Note that column 2 in table 9 uses the same sample as column 6 in table 8, and the results are very similar.

14. For further discussion of the implications of the variation in findings for the external validity of audit studies, see our summary paper (Farber, Silverman, and von Wachter 2016).

15. Ghayad (2014) collected data in the United States in 2012 for three broad occupations (administrative, sales, and professional) in four broad industries. Eriksson and Rooth (2014) collected data in Sweden in 2007 for seven occupations (business sales assistant, cleaner, construction worker, machine operator, motor-vehicle driver, restaurant worker, and sales assistant). In contrast, our data were collected from 2012 to 2014 for a single broad occupation (white-collar office jobs such as administrative or executive assistants, receptionists, secretaries, and office associates).

16. The p-value for test of independence of callback and education in a two-way table is 0.497.

17. We also examined the smaller subset of the KLN sample that consisted of four-year college graduates applying for administrative/clerical jobs. The marginal effect of unemployment duration on the probability of callback is negative but not significantly different from zero in this smaller sample (p-value = 0.14). Given the small size of the sample (936 applicants), we do not draw any conclusion from this result.

18. Eriksson and Rooth (2014) find a callback rate of 25 percent in their 2007 Swedish study. Ghayad (2014) finds a callback rate of 8.3 percent in his 2012 (post–Great Recession) U.S. study. Note that these aggregate statistics refer to broader distributions of worker type.

19. Indeed, KLN investigate cross-sectional variation in the marginal effect on callback rates of unemployment duration by local unemployment rates (a second-order effect) and find that the marginal effect of unemployment duration on callback becomes more negative as the unemployment rate falls.

20. Ghayad (2014) covers the twenty-five largest metropolitan areas in the United States.

21. A χ2 test of independence of age and callback fails to reject independence (p-value = 0.28).

This work is licensed under a Creative Commons Attribution 4.0 International License.

Labor Unions and the Great Recession

The Changing Consequences of Unemployment for Household Finances

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

RSF: The Russell Sage Foundation Journal of the Social Sciences

research design

a model of learning about applicant quality

descriptive analysis

Duration of Unemployment

Age

Interim Jobs

multivariate analysis

reconciliation with earlier work

The Outcome Measure

Variation in Job Type

Education Level

Time Period

Geographic Variation

Variation in Age

final comments

APPENDIX–SAMPLE RéSUMéS

REFERENCES

Footnotes

Previous Article

Next Article

Share

Additional Information

Project MUSE Mission