Exact Matching and Nonparametrics in a Large Population of Data
2025-07-30
Important
Thank you to Royce, Ryan, Carolina, Alex, and everyone involved in honoring Harold with this series; it is a testament to him, his contributions to Essex, the Summer School, his discipline and social science, and to what great people y’all are. 🙏🙏🙏
Harold was my friend, my teacher, one of my mentors, my co-teacher, my collaborator, and the reason, quite literally, that I became a political scientist. He introduced me to Essex last century, 1997. He made an indelible mark on my life. I am honored to give this. I hope he would approve…
My co-author, Tim Johnson, is responsible for all the brilliance; I own the errors including not having published this years ago.
Managers fear that the preferential hiring of military veterans leads to the selection of employees who score worse on merit criteria. Using all non-classified records in the U.S. federal government’s Central Personnel Data File from 1973-1997, we compare the educational backgrounds of employees who received veterans’ preference benefits with the educational backgrounds of employees who did not receive veterans’ preference.
We eliminate uncertainty due to sampling variation with the population of unclassified data. This allows us to focus on the uncertainty surrounding which employees to compare in our analysis.
Replicate prior evidence and media accounts that preference beneficiaries possess weaker educational credentials than non-recipients.
Make this relationship disappear—and, in some instances, invert—when we account for employees’ occupations and work circumstances, their jobs, using exact stratification.
Takeaway: The assessment of perceptions of hiring quality requires careful attention to the appropriate basis for comparing employees. Ending veterans preference because of differences in career trajectories or educational attainment is empirically unjustifiable.
van Riper’s History of the United States Civil Service notes it is one of the oldest personnel policies in the federal government.
Veterans’ preference predates the competitive civil service exam according to the Merit Systems Protection Board.
Lewis (2012): Under the policy, eligible military veterans—as well as the spouses and mothers of veterans who were disabled or killed in combat—receive either added points on their federal service applications or hiring priority over candidates categorized in the same quality tier.
Quite frankly, I have both positive and normative motivations that I must lay bare. The topic is of both public and academic interest.
In various forms for almost a century, Miller (1935), Ordway (1945), Lewis (2012) maintain that these procedures force managers to distribute jobs to preference recipients, even if other candidates appear to be more deserving of a position.
Lewis (2012) analysed 1% samples of the U.S. Office of Personnel Management (OPM) Central Personnel Data File (CPDF) and found that military veterans enter federal service with less education than nonveterans. When considering new hires that perform ``white collar’’ jobs, Lewis found that veterans were less likely than nonveterans to have completed college or to have earned a graduate degree. Furthermore, veterans averaged one year less education than their nonveteran peers.
As an aside, 1% samples are pretty dangerous for extreme imbalance.
Lewis 2012: 247: ``Federal personnel data for the past decade show that veteran new hires are older and less educated than nonveteran new hires, and that they do not advance as far in the first 15 years of their careers as nonveterans hired into the same grades at the same time, suggesting that veterans’ preference may be lowering the performance of the federal civil service.’’
Johnson and Walker (2018) use pay grade and pay increase data to show equivalent advancement tendencies for preference recipients and non-recipients when entry positions are rendered comparable.
Unlike the 1% samples, in the summer of 2009, the National Archives and Records Administration (NARA) released a copy of the U.S. Office of Personnel Management’s (OPM) Central Personnel Data File (CPDF) under a FOIA filed by Tim Johnson. This copy of the CPDF contained all non-sensitive employee records from 1973 to 1997. The data are a person-year panel with more than 3.8 million individual persons.
Past scholarship has claimed that veterans’ preference lowers the educational attainment of the federal service by preventing the acquisition of better-educated personnel.
This yields 3.8 million employees in their first year of service.1
Figure 1
The data are ordered; metric statistics are less than ideal. The Wilcoxon rank-sum simply ranks the data and assesses the sums of the ranks for preference recipients and non-recipients. There are three ways to obtain a probability measure for the null hypothesis of equality of medians: - exact permutations, - simulation, and - a normal approximation.
Table 2
Table 1 Excerpt
Our counterfactual of interest is defined by the job as opposed to the person.
With caveats, when we retain all data, the conventional wisdom mostly holds though 10 point preference recipients are much higher in 1979 and 1980.
But… year doesn’t seem like the best descriptor of a job.
And year is the only variable that retains ALL data
Wilcoxon statistics for Occupations
The top two results have switched signs: 5-point and 10-point Veterans’ Preference recipients have higher levels of education; the evidence for spouse/mother is a bit mixed.
Education by Preference Status for Clerk
Note the abundance of white on the right side of the figures. Higher education levels for preference recipients.
Most stringent job and education
Notes: The y-axis is censored at 200. The result from this is the top row of the next table.
Prior evidence has led to what Pearl (2012, 176) calls a distorted causal interpretation. The broad universal association is quite different than the conditional associations when the job becomes the counterfactual. Though it is claimed that any deviations from merit screening permit less-qualified individuals to enter the federal service, merit screening is only one of many filters.
On balance, we find very little evidence that veterans’ preference worsens the educational attainment of job incumbents.
How could we account for the finding that veterans receiving preference appear to possess equal or greater education in many cases? The GI Bill comes to mind.
Ala Pearl’s discussion of Simpson’s Paradox, given that veterans’ preference accrues to veterans that have unique access to educational benefits, it is quite easy to see how a relationship in the aggregate showing lower levels of education among preference recipients should fail to replicate for at least some jobs if veterans distribute themselves across occupations differently than nonveterans and veteran status correlates with potential educational benefits.
A DEI policy that has no discernible impacts on key elements of expected job performance. Combined with evidence about career trajectories, the belief that DEI policies lead to less qualified candidates is, in this case, quite mistaken.
Distributions
Prompt
Results
Veterans’ Preference and Educational Attainment