
How can experiments play a greater role in public policy? Twelve proposals from an economic model of scaling

Published online by Cambridge University Press: 24 July 2020

OMAR AL-UBAYDLI
Affiliation:
Bahrain Center for Strategic, International and Energy Studies, Manama, Bahrain Department of Economics and the Mercatus Center, George Mason University, Fairfax, VA, USA College of Industrial Management, King Fahad University of Petroleum and Minerals, Dhahran, Saudi Arabia
MIN SOK LEE*
Affiliation:
Kenneth C. Griffin Department of Economics, University of Chicago, Chicago, IL, USA
JOHN A. LIST
Affiliation:
Kenneth C. Griffin Department of Economics, University of Chicago, Chicago, IL, USA The Australian National University, Canberra, Australia NBER, Cambridge, MA, USA
CLAIRE L. MACKEVICIUS
Affiliation:
School of Education and Social Policy, Northwestern University, Evanston, IL, USA
DANA SUSKIND
Affiliation:
Professor of Surgery and Pediatrics, University of Chicago, Chicago, IL, USA Co-Director, TMW Center for Early Learning + Public Health, University of Chicago, Chicago, IL, USA
*Correspondence to: Kenneth C. Griffin Department of Economics, University of Chicago, 1126 E. 59th Street, Chicago, IL 60637, USA. E-mail: mslee@uchicago.edu

Abstract

Policymakers are increasingly turning to insights gained from the experimental method as a means to inform large-scale public policies. Critics view this increased usage as premature, pointing to the fact that many experimentally tested programs fail to deliver their promise at scale. Under this view, the experimental approach drives too much public policy. Yet, if policymakers could be more confident that the original research findings would be delivered at scale, even the staunchest critics would carve out a larger role for experiments to inform policy. Leveraging the economic framework of Al-Ubaydli et al. (2019), we put forward 12 simple proposals, spanning researchers, policymakers, funders and stakeholders, which together tackle the most vexing scalability threats. The framework highlights that only after we deepen our understanding of the scale-up problem will we be on solid ground to argue that scientific experiments should hold a more prominent place in the policymaker's quiver.

Type
Articles
Copyright
Copyright © The Author(s) 2020. Published by Cambridge University Press


References

Achilles, C. M. (1993), The Lasting Benefits Study (LBS) in Grades 4 and 5 (1990-1991): A Legacy from Tennessee's Four-Year (K-3) Class-Size Study (1985-1989), Project STAR. Paper #7.
Akram, A. A., Chowdhury, S. and Mobarak, A. M. (2017), Effects of Emigration on Rural Labor Markets (Working Paper No. 23929). https://doi.org/10.3386/w23929
Aldashev, G., Kirchsteiger, G. and Sebald, A. (2017), 'Assignment procedure biases in randomised policy experiments', The Economic Journal, 127(602): 873–895.
Allcott, H. (2015), 'Site selection bias in program evaluation', The Quarterly Journal of Economics, 130(3): 1117–1165.
Al-Ubaydli, O. and List, J. A. (2013), 'On the Generalizability of Experimental Results in Economics', in Frechette, G. and Schotter, A. (eds), Methods of Modern Experimental Economics, Oxford University Press.
Al-Ubaydli, O., List, J. A., LoRe, D. and Suskind, D. (2017), 'Scaling for economists: lessons from the non-adherence problem in the medical literature', Journal of Economic Perspectives, 31(4): 125–144. https://doi.org/10.1257/jep.31.4.125
Al-Ubaydli, O., List, J. A. and Suskind, D. (2019), The Science of Using Science: Towards an Understanding of the Threats to Scaling Experiments (Working Paper No. 25848).
Al-Ubaydli, O., List, J. A. and Suskind, D. L. (2017), 'What can we learn from experiments? Understanding the threats to the scalability of experimental results', American Economic Review, 107(5): 282–286. https://doi.org/10.1257/aer.p20171115
Andrews, I. and Kasy, M. (2017), Identification of and Correction for Publication Bias (Working Paper No. 23298).
Angrist, J. D., Dynarski, S. M., Kane, T. J., Pathak, P. A. and Walters, C. R. (2012), 'Who benefits from KIPP?', Journal of Policy Analysis and Management, 31(4): 837–860.
Ashraf, N., Bandiera, O. and Lee, S. S. (2018), Losing Prosociality in the Quest for Talent? Sorting, Selection, and Productivity in the Delivery of Public Services (Working Paper).
August, G. J., Bloomquist, M. L., Lee, S. S., Realmuto, G. M. and Hektner, J. M. (2006), 'Can evidence-based prevention programs be sustained in community practice settings? The early risers' advanced-stage effectiveness trial', Prevention Science, 7(2): 151–165.
Banerjee, A., Banerji, R., Berry, J., Duflo, E., Kannan, H., Mukerji, S. and Walton, M. (2017), 'From proof of concept to scalable policies: challenges and solutions, with an application', Journal of Economic Perspectives, 31(4): 73–102.
Banerjee, A., Barnhardt, S. and Duflo, E. (2015a), Movies, Margins and Marketing: Encouraging the Adoption of Iron-Fortified Salt (Working Paper No. 21616).
Banerjee, A., Karlan, D. and Zinman, J. (2015b), 'Six randomized evaluations of microcredit: introduction and further steps', American Economic Journal: Applied Economics, 7(1): 1–21.
Bauer, M. S., Damschroder, L., Hagedorn, H., Smith, J. and Kilbourne, A. M. (2015), 'An introduction to implementation science for the non-specialist', BMC Psychology, 3: 32.
Bell, S. H. and Stuart, E. A. (2016), 'On the "where" of social experiments: the nature and extent of the generalizability problem', New Directions for Evaluation, 2016(152): 47–59.
Bettis, R. A. (2012), 'The search for asterisks: compromised statistical tests and flawed theories', Strategic Management Journal, 33(1): 108–113.
Bold, T., Kimenyi, M., Mwabu, G., Ng'ang'a, A. and Sandefur, J. (2013), 'Scaling up what works: experimental evidence on external validity in Kenyan education', Center for Global Development Working Paper No. 321.
Buera, F. J., Kaboski, J. P. and Shin, Y. (2012), The Macroeconomics of Microfinance (Working Paper No. 17905).
Butera, L. and List, J. A. (2017), An Economic Approach to Alleviate the Crises of Confidence in Science: With an Application to the Public Goods Game (Working Paper No. 23335). National Bureau of Economic Research.
Button, K. S., Ioannidis, J. P. A., Mokrysz, C., Nosek, B. A., Flint, J., Robinson, E. S. J. and Munafò, M. R. (2013), 'Power failure: why small sample size undermines the reliability of neuroscience', Nature Reviews Neuroscience, 14(5): 365–376.
Camerer, C. F., Dreber, A., Forsell, E., Ho, T.-H., Huber, J., Johannesson, M., Kirchler, M., Almenberg, J., Altmejd, A., Chan, T., Heikensten, E., Holzmeister, F., Imai, T., Isaksson, S., Nave, G., Pfeiffer, T., Razen, M. and Wu, H. (2016), 'Evaluating replicability of laboratory experiments in economics', Science, 351(6280): 1433–1436.
Chambers, D. A., Glasgow, R. E. and Stange, K. (2013), 'The dynamic sustainability framework: addressing the paradox of sustainment amid ongoing change', Implementation Science, 8: 117.
Chen, Y. and Yang, D. Y. (2019), 'The impact of media censorship: 1984 or brave new world?', American Economic Review, 109(6): 2294–2332.
Cheng, S., McDonald, E. J., Cheung, M. C., Arciero, V. S., Qureshi, M., Jiang, D., … Chan, K. K. W. (2017), 'Do the American Society of Clinical Oncology value framework and the European Society of Medical Oncology magnitude of clinical benefit scale measure the same construct of clinical benefit?', Journal of Clinical Oncology, 35(24): 2764–2771.
Christensen, G. and Miguel, E. (2018), 'Transparency, reproducibility, and the credibility of economics research', Journal of Economic Literature, 56(3): 920–980.
Cook, T. and Campbell, D. (1979), Quasi-Experimentation: Design and Analysis Issues for Field Settings, Boston, MA: Houghton Mifflin.
Cooper, C. L., Hind, D., Duncan, R., Walters, S., Lartey, A., Lee, E. and Bradburn, M. (2015), 'A rapid review indicated higher recruitment rates in treatment trials than in prevention trials', Journal of Clinical Epidemiology, 68(3): 347–354.
Crépon, B., Duflo, E., Gurgand, M., Rathelot, R. and Zamora, P. (2013), 'Do labor market policies have displacement effects? Evidence from a clustered randomized experiment', The Quarterly Journal of Economics, 128(2): 531–580.
Crosse, S., Williams, B., Hagen, C. A., Harmon, M., Ristow, L., DiGaetano, R., … Derzon, J. H. (2011), Prevalence and Implementation Fidelity of Research-Based Prevention Programs in Public Schools: Final Report. Office of Planning, Evaluation and Policy Development, US Department of Education.
Czibor, E., Jimenez-Gomez, D. and List, J. A. (2019), The Dozen Things Experimental Economists Should Do (More Of) (SSRN Scholarly Paper No. 3313734).
Davies, P. (2012), 'The state of evidence-based policy evaluation and its role in policy formation', National Institute Economic Review, 219(1): R41–R52.
Davis, J. M. V., Guryan, J., Hallberg, K. and Ludwig, J. (2017), The Economics of Scale-Up (Working Paper No. 23925).
Deaton, A. and Cartwright, N. (2018), 'Understanding and misunderstanding randomized controlled trials', Social Science & Medicine, 210: 2–21.
Deke, J. and Finucane, M. (2019), Moving Beyond Statistical Significance: The BASIE (BAyeSian Interpretation of Estimates) Framework for Interpreting Findings from Impact Evaluations (OPRE Report 2019-35). Washington, DC: Office of Planning, Research, and Evaluation, Administration for Children and Families, U.S. Department of Health and Human Services.
Duflo, E., Dupas, P. and Kremer, M. (2017), The Impact of Free Secondary Education: Experimental Evidence from Ghana. Massachusetts Institute of Technology Working Paper, Cambridge, MA.
Ferracci, M., Jolivet, G. and van den Berg, G. J. (2013), 'Evidence of treatment spillovers within markets', The Review of Economics and Statistics, 96(5): 812–823.
Fiorina, M. P. and Plott, C. R. (1978), 'Committee decisions under majority rule: an experimental study', American Political Science Review, 72: 575–598.
Freedman, S., Friedlander, D., Lin, W. and Schweder, A. (1996), The GAIN Evaluation: Five-Year Impacts on Employment, Earnings and AFDC Receipt, New York: MDRC.
Freedman, S., Knab, J. T., Gennetian, L. A. and Navarro, D. (2000), The Los Angeles Jobs-First GAIN Evaluation: Final Report on a Work First Program in a Major Urban Center.
Friedlander, D., Hoetz, G., Long, D. and Quint, J. (1985), Maryland: Final Report on the Employment Initiatives Evaluation, New York, NY: MDRC.
Fryer, R. G., Levitt, S. D. and List, J. A. (2015), Parental Incentives and Early Childhood Achievement: A Field Experiment in Chicago Heights (Working Paper No. 21477). National Bureau of Economic Research.
Gelman, A. and Carlin, J. (2014), 'Beyond power calculations: assessing type S (sign) and type M (magnitude) errors', Perspectives on Psychological Science, 9(6): 641–651.
Gilraine, M., Macartney, H. and McMillan, R. (2018), Education Reform in General Equilibrium: Evidence from California's Class Size Reduction (Working Paper No. 24191).
Glennerster, R. (2017), 'Chapter 5 – The Practicalities of Running Randomized Evaluations: Partnerships, Measurement, Ethics, and Transparency', in Banerjee, A. V. and Duflo, E. (eds), Handbook of Economic Field Experiments, Volume 1, 175–243.
Gottfredson, D. C., Cook, T. D., Gardner, F. E. M., Gorman-Smith, D., Howe, G. W., Sandler, I. N. and Zafft, K. M. (2015), 'Standards of evidence for efficacy, effectiveness, and scale-up research in prevention science: next generation', Prevention Science, 16(7): 893–926.
Greenland, S., Senn, S. J., Rothman, K. J., Carlin, J. B., Poole, C., Goodman, S. N. and Altman, D. G. (2016), 'Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations', European Journal of Epidemiology, 31(4): 337–350.
Hamarsland, G. D. (2012), Cost-benefit Analysis in Norway. Retrieved from https://www.ntnu.edu/documents/1261865083/1263461278/6_4_Hamarsland.pdf
Harrison, G. W. and List, J. A. (2004), 'Field experiments', Journal of Economic Literature, 42(4): 1009–1055.
Heckman, J. J. (2010), 'Building bridges between structural and program evaluation approaches to evaluating policy', Journal of Economic Literature, 48(2): 356–398.
Heckman, J. J., Ichimura, H., Smith, J. and Todd, P. (1998a), 'Characterizing Selection Bias Using Experimental Data', Econometrica, 66(5): 1017–1098.
Heckman, J. J., Lalonde, R. J. and Smith, J. A. (1999), 'Chapter 31 – The Economics and Econometrics of Active Labor Market Programs', in Ashenfelter, O. C. and Card, D. (eds), Handbook of Labor Economics, Volume 3, 1865–2097.
Heckman, J. J., Lochner, L. and Taber, C. (1998b), 'Explaining rising wage inequality: explorations with a dynamic general equilibrium model of labor earnings with heterogeneous agents', Review of Economic Dynamics, 1(1): 1–58.
Hippel, P. and Wagner, C. (2018), Does a Successful Randomized Experiment Lead to Successful Policy? Project Challenge and What Happened in Tennessee After Project STAR (SSRN Scholarly Paper No. 3153503).
Hitchcock, J., Dimino, J., Kurki, A., Wilkins, C. and Gersten, R. (2011), The Impact of Collaborative Strategic Reading on the Reading Comprehension of Grade 5 Students in Linguistically Diverse Schools: Final Report (NCEE 2011-4001). National Center for Education Evaluation and Regional Assistance.
Horner, R. H., Kinkaid, D., Sugai, G., Lewis, T., Eber, L., Barrett, S., Dickey, C. R., Richter, M., Sullivan, E., Boezio, C., Algozzine, B., Reynolds, H. and Johnson, N. (2014), 'Scaling up school-wide positive behavioral interventions and supports: experiences of seven states with documented success', Journal of Positive Behavior Interventions, 16(4): 197–208.
Horsfall, S. and Santa, C. (1985), Project CRISS: Validation Report for the Joint Review and Dissemination Panel. Unpublished manuscript.
Horton, J. J., Rand, D. G. and Zeckhauser, R. J. (2011), 'The online laboratory: conducting experiments in a real labor market', Experimental Economics, 14(3): 399–425.
Ioannidis, J. P. A. (2005), 'Contradicted and initially stronger effects in highly cited clinical research', JAMA, 294(2): 218–228.
Jennions, M. D. and Møller, A. P. (2001), 'Relationships fade with time: a meta-analysis of temporal trends in publication in ecology and evolution', Proceedings of the Royal Society of London, 269(1486): 43–48.
Jepsen, C. and Rivkin, S. (2009), 'Class size reduction and student achievement: the potential tradeoff between teacher quality and class size', Journal of Human Resources, 44(1): 223–250.
Karlan, D. and List, J. A. (2007), 'Does price matter in charitable giving? Evidence from a large-scale natural field experiment', American Economic Review, 97(5): 1774–1793.
Kerwin, J. and Thornton, R. L. (2018), Making the Grade: The Sensitivity of Education Program Effectiveness to Input Choices and Outcome Measures (SSRN Scholarly Paper No. 3002723). Retrieved from Social Science Research Network website.
Kilbourne, A. M., Neumann, M. S., Pincus, H. A., Bauer, M. S. and Stall, R. (2007), 'Implementing evidence-based interventions in health care: application of the replicating effective programs framework', Implementation Science, 2: 42.
Kline, P. and Walters, C. R. (2016), 'Evaluating public programs with close substitutes: the case of Head Start', Quarterly Journal of Economics, 131(4): 1795–1848.
Knechtel, V., Coen, T., Caronongan, P., Fung, N. and Goble, L. (2017), Pre-kindergarten Impacts over Time: An Analysis of KIPP Charter Schools, Washington, DC: Mathematica Policy Research.
Komro, K. A., Flay, B. R., Biglan, A. and Wagenaar, A. C. (2016), 'Research design issues for evaluating complex multicomponent interventions in neighborhoods and communities', Translational Behavioral Medicine, 6(1): 153–159.
Kushman, J., Hanita, M. and Raphael, J. (2011), An Experimental Study of the Project CRISS Reading Program on Grade 9 Reading Achievement in Rural High Schools: Final Report (NCEE 2011-4007). National Center for Education Evaluation and Regional Assistance.
Levitt, S. D. and List, J. A. (2007), 'What do laboratory experiments measuring social preferences reveal about the real world?', Journal of Economic Perspectives, 21(2): 153–174.
Lin, W. and Green, D. P. (2016), 'Standard operating procedures: a safety net for pre-analysis plans', PS: Political Science & Politics, 49(3): 495–500.
Lipsey, M. W. (1999), 'Can rehabilitative programs reduce the recidivism of juvenile offenders? An inquiry into the effectiveness of practical programs', Virginia Journal of Social Policy & the Law, 6(3): 611–642.
List, J. A. (2004), 'Neoclassical theory versus prospect theory: evidence from the marketplace', Econometrica, 72(2): 615–625.
List, J. A. (2006), 'The behavioralist meets the market: measuring social preferences and reputation effects in actual transactions', Journal of Political Economy, 114(1): 1–37.
List, J. A. (2007a), 'Field experiments: a bridge between lab and naturally occurring data', The B.E. Journal of Economic Analysis & Policy, 6(2): 1–47.
List, J. A. (2007b), 'On the interpretation of giving in dictator games', Journal of Political Economy, 115(3): 482–494.
List, J. A. (2011a), 'The market for charitable giving', Journal of Economic Perspectives, 25(2): 157–180.
List, J. A. (2011b), 'Why economists should conduct field experiments and 14 tips for pulling one off', Journal of Economic Perspectives, 25(3): 3–16.
List, J. A., Momeni, F. and Zenou, Y. (2019), Are Measures of Early Education Programs Too Pessimistic? Evidence from a Large-Scale Field Experiment. Working Paper.
List, J. A., Shaikh, A. M. and Xu, Y. (2016), 'Multiple hypothesis testing in experimental economics', Experimental Economics.
Maniadis, Z., Tufano, F. and List, J. A. (2014), 'One swallow doesn't make a summer: new evidence on anchoring effects', American Economic Review, 104(1): 277–290.
Miguel, E. and Kremer, M. (2004), 'Worms: identifying impacts on education and health in the presence of treatment externalities', Econometrica, 72(1): 159–217.
Muralidharan, K. and Niehaus, P. (2017), 'Experimentation at Scale', Journal of Economic Perspectives, 31(4): 103–124.
Muralidharan, K. and Sundararaman, V. (2015), 'The aggregate effect of school choice: evidence from a two-stage experiment in India', The Quarterly Journal of Economics, 130(3): 1011–1066.
Nosek, B. A., Spies, J. R. and Motyl, M. (2012), 'Scientific utopia II: restructuring incentives and practices to promote truth over publishability', Perspectives on Psychological Science, 7(6): 615–631.
Obama, B. (2013), The Budget Message of the President. Retrieved from https://www.govinfo.gov/content/pkg/BUDGET-2014-BUD/pdf/BUDGET-2014-BUD.pdf
Ogutu, S. O., Fongar, A., Gödecke, T., Jäckering, L., Mwololo, H., Njuguna, M., … Qaim, M. (2018), 'How to make farming and agricultural extension more nutrition-sensitive: evidence from a randomised controlled trial in Kenya', European Review of Agricultural Economics, 1–24.
Raudenbush, S. W. and Bloom, H. S. (2015), 'Learning about and from a distribution of program impacts using multisite trials', American Journal of Evaluation, 36(4): 475–499.
Riccio, J. (1994), GAIN: Benefits, Costs, and Three-Year Impacts of a Welfare-to-Work Program. California's Greater Avenues for Independence Program.
Rudd, K. (2008), Address to Heads of Agencies and Members of Senior Executive Service, Great Hall, Parliament House, Canberra. Retrieved from https://pmtranscripts.pmc.gov.au/release/transcript-15893
Schumacher, J., Milby, J., Raczynski, J., Engle, M., Caldwell, E. and Carr, J. (1994), 'Demoralization and Threats to Validity in Birmingham's Homeless Project', in Conrad, K. (ed), Critically Evaluating the Role of Experiments, Volume 1, San Francisco, CA: Jossey-Bass, 41–44.
Smith, V. (1962), 'An Experimental Study of Competitive Market Behavior', Economics Faculty Articles and Research.
Stanley, T. D., Doucouliagos, H., Giles, M., Heckemeyer, J. H., Johnston, R. J., Laroche, P., … Rosenberger, R. S. (2013), 'Meta-analysis of economics research reporting guidelines', Journal of Economic Surveys, 27(2): 390–394.
Stuart, E. A., Ackerman, B. and Westreich, D. (2018), 'Generalizability of randomized trial results to target populations: design and analysis possibilities', Research on Social Work Practice, 28(5): 532–537.
Stuart, E. A., Bell, S. H., Ebnesajjad, C., Olsen, R. B. and Orr, L. L. (2017), 'Characteristics of school districts that participate in rigorous national educational evaluations', Journal of Research on Educational Effectiveness, 10(1): 168–206.
Supplee, L. H. and Meyer, A. L. (2015), 'The intersection between prevention science and evidence-based policy: how the SPR evidence standards support human services prevention programs', Prevention Science, 16(7): 938–942.
Supplee, L. H., Kelly, B. C., MacKinnon, D. M. and Barofsky, M. Y. (2013), 'Introduction to the special issue: subgroup analysis in prevention and intervention research', Prevention Science, 14(2): 107–110.
Supplee, L. and Metz, A. (2015), Opportunities and Challenges in Evidence-Based Social Policy (No. V27, 4).
Tuttle, C. C., Gill, B., Gleason, P., Knechtel, V., Nichols-Barrer, I. and Resch, A. (2013), KIPP Middle Schools: Impacts on Achievement and Other Outcomes. Final Report. Mathematica Policy Research, Inc.
Tuttle, C. C., Gleason, P., Knechtel, V., Nichols-Barrer, I., Booker, K., Chojnacki, G., … Goble, L. (2015), Understanding the Effect of KIPP as It Scales: Volume I, Impacts on Achievement and Other Outcomes. Final Report of KIPP's "Investing in Innovation Grant Evaluation". Mathematica Policy Research, Inc.
Vivalt, E. (2016), How Much Can We Generalize from Impact Evaluations? Working Paper.
Voelkl, B., Vogt, L., Sena, E. S. and Würbel, H. (2018), 'Reproducibility of preclinical animal research improves with heterogeneity of study samples', PLoS Biology, 16(2): e2003693.
Wacholder, S., Chanock, S., Garcia-Closas, M., El Ghormli, L. and Rothman, N. (2004), 'Assessing the probability that a positive report is false: an approach for molecular epidemiology studies', JNCI: Journal of the National Cancer Institute, 96(6): 434–442.
Walsh, E. and Sheridan, A. (2016), 'Factors affecting patient participation in clinical trials in Ireland: a narrative review', Contemporary Clinical Trials Communications, 3: 23–31.
Weiss, M. J., Bloom, H. S., Verbitsky-Savitz, N., Gupta, H., Vigil, A. E. and Cullinan, D. N. (2017), 'How much do the effects of education and training programs vary across sites? Evidence from past multisite randomized trials', Journal of Research on Educational Effectiveness, 10(4): 843–876. https://doi.org/10.1080/19345747.2017.1300719
Young, N. S., Ioannidis, J. P. A. and Al-Ubaydli, O. (2008), 'Why current publication practices may distort science', PLoS Medicine, 5(10): e201.
Zullig, L. L., Peterson, E. D. and Bosworth, H. B. (2013), 'Ingredients of successful interventions to improve medication adherence', JAMA, 310(24): 2611–2612.