Scientific production in the era of Large Language Models
Keigo Kusumegi, Xinyu Yang, Paul Ginsparg, Mathijs De Vaan, Toby Stuart and Yian Yin
Understanding the punctuated dynamics of scientific and technological frontiers
Yian Yin and Dashun Wang
Optimal limits of learning
Yian Yin and Dashun Wang
Cumulative advantage and delayed recognition in individual careers
Yangfanyu Yang, Jie Yang and Yian Yin
Synthesis of innovation and obsolescence
Edward D. Lee, Christopher P. Kempes, Manfred Laubichler, Marcus J. Hamilton, Jeffrey W. Lockhart, Frank Neffke, Hyejin Youn, Jose Ignacio Arroyo, Vito D.P. Servedio, Dashun Wang, Jessika Trancik, James Evans, Vicky Yang, Veronica R. Cappelli, Ernesto Ortega, Yian Yin and Geoffrey B. West arXiv:2505.05182
Survivors, complainers, and borderliners: Upward bias in online discussions of academic conference reviews
Hangxiao Zhu, Yian Yin and Yu Zhang arXiv:2509.16831
Featured publications
The piovt penalty in research
Ryan Hill*, Yian Yin*, Carolyn Stein, Xizhao Wang, Dashun Wang and Benjamin F. Jones (*: equal contributions) Nature, 642(8069): 999-1006, June 2025.
10.1038/s41586-025-09048-1
Scientists and inventors set the direction of their work amid evolving questions, opportunities and challenges, yet the understanding of pivots between research areas and their outcomes remains limited. Theories of creative search highlight the potential benefits of exploration but also emphasize difficulties in moving beyond one’s expertise. Here we introduce a measurement framework to quantify how far researchers move from their existing work, and apply it to millions of papers and patents.
We find a pervasive pivot penalty, in which the impact of new research steeply declines the further a researcher moves from their previous work. The pivot penalty applies nearly universally across science and patenting, and has been growing in magnitude over the past five decades. Larger pivots further exhibit weak engagement with established mixtures of prior knowledge, lower publication success rates and less market impact. Unexpected shocks to the research landscape, which may push researchers away from existing areas or pull them into new ones, further demonstrate substantial pivot penalties, including in the context of the COVID-19 pandemic. The pivot penalty generalizes across fields, career stage, productivity, collaboration and funding contexts, highlighting both the breadth and depth of the adaptive challenge. Overall, the findings point to large and increasing challenges in effectively adapting to new opportunities and threats, with implications for individual researchers, research organizations, science policy and the capacity of science and society as a whole to confront emergent demands.
Can Large Language Models provide useful feedback on research papers? A large-scale empirical analysis
Weixin Liang, Yuhui Zhang, Hancheng Cao, Binglu Wang, Daisy Yi Ding, Xinyu Yang, Kailas Vodrahalli, Siyu He, Daniel Scott Smith, Yian Yin, Daniel McFarland and James Zou NEJM AI, 1(8): AIoa2400196, July 2024.
10.1056/AIoa2400196
Expert feedback lays the foundation of rigorous research. However, the rapid growth of scholarly production challenges the conventional scientific feedback mechanisms. High-quality peer reviews are increasingly difficult to obtain. We created an automated pipeline using Generative Pretrained Transformer 4 (GPT-4) to provide comments on scientific papers, and evaluated the quality of GPT-4's feedback through two large-scale studies. We first quantitatively compared GPT-4’s generated feedback with human peer reviewers’ feedback in general scientific papers from 15 Nature family journals (3096 papers in total) and the International Conference on Learning Representations (ICLR) machine learning conference (1709 papers). Additionally, we conducted a prospective user study with 308 researchers from 110 institutions to understand how researchers perceive feedback generated by our system on their own papers.
Through both retrospective and prospective evaluation, we find substantial overlap between LLM and human feedback as well as positive user perceptions regarding the usefulness of LLM feedback. Tthe overlap in the points raised by GPT-4 and by human reviewers (average overlap of 30.85% for Nature journals and 39.23% for ICLR) is comparable with the overlap between two human reviewers (average overlap of 28.58% for Nature journals and 35.25% for ICLR). In our prospective user study, more than half (57.4%) of the users found GPT-4–generated feedback helpful/very helpful, and 82.4% found it more beneficial than feedback from at least some human reviewers. We also identify several limitations of large language model (LLM)–generated feedback. Although human expert review should continue to be the foundation of the scientific process, LLM feedback could benefit researchers, especially when timely expert feedback is not available and in earlier stages of manuscript preparation.
Public use and public funding of science Yian Yin, Yuxiao Dong, Kuansan Wang, Dashun Wang and Benjamin F. Jones Nature Human Behaviour 6(10): 1344–1350, July 2022.
10.1038/s41562-022-01397-5
NBER Working Paper: w28748
Knowledge of how science is consumed in public domains is essential for a deeper understanding of the role of science in human society. While science is heavily supported by public funding, common depictions suggest that scientific research remains an isolated or 'ivory tower' activity, with weak connectivity to public use, little relationship between the quality of research and its public use, and little correspondence between the funding of science and its public use. This paper introduces a measurement framework to examine public good features of science, allowing us to study public uses of science, the public funding of science, and how use and funding relate. Specifically, we integrate five large-scale datasets that link scientific publications from all scientific fields to their upstream funding support and downstream public uses across three public domains – government documents, the news media, and marketplace invention.
We find that the public uses of science are extremely diverse, with different public domains drawing distinctively across scientific fields. Yet amidst these differences, we find key forms of alignment in the interface between science and society. First, despite concerns that the public does not engage high-quality science, we find universal alignment, in each scientific field and public domain, between what the public consumes and what is highly impactful within science. Second, despite myriad factors underpinning the public funding of science, the resulting allocation across fields presents a striking alignment with the field's collective public use. Overall, public uses of science present a rich landscape of specialized consumption, yet collectively science and society interface with remarkable, quantifiable alignment between scientific use, public use, and funding.
Potentially long-lasting effects of the pandemic on scientists
Jian Gao, Yian Yin, Kyle R. Myers, Karim R. Lakhani and Dashun Wang Nature Communications, 12(1): 1-6, Oct 2021.
10.1038/s41467-021-26428-z
Extensive research has documented the immediate impacts of the COVID-19 pandemic on scientists, yet it remains unclear if and how such impacts have shifted over time. Here we compare results from two surveys of principal investigators, conducted between April 2020 and January 2021, along with analyses of large-scale publication data. We find that there has been a clear sign of recovery in some regards, as scientists' time spent on their work has almost returned to pre-pandemic levels. However, the latest data also reveals a new dimension in which the pandemic is affecting the scientific workforce: the rate of initiating new research projects. Except for the small fraction of scientists who directly engaged in COVID-related research, most scientists started significantly fewer new research projects in 2020. This decline is most pronounced amongst the same demographic groups of scientists who reported the largest initial disruptions: female scientists and those with young children. Yet in sharp contrast to the earlier phase of the pandemic, when there were large disparities across scientific fields, this loss of new projects appears remarkably homogeneous across fields. Analyses of large-scale publication data reveal a global decline in the rate of new collaborations, especially in non-COVID-related preprints, which is consistent with the reported decline in new projects. Overall, these findings highlight that, while the end of the pandemic may appear in sight in some countries, its large and unequal impact on the scientific workforce may be enduring, which may have broad implications for inequality and the long-term vitality of science.
Coevolution of policy and science during the pandemic Yian Yin*, Jian Gao*, Benjamin F. Jones and Dashun Wang (*: equal contributions) Science, 371(6525): 128-130, Jan 2021.
10.1126/science.abe3084
Disconnects between science and policy are a long-standing concern. Yet, our systematic understanding of the use of science in policy remains limited, partly because of the difficulty in reliably tracing the coevolution of policy and science at a large, global scale. Today, the world faces a common emergency in the COVID-19 pandemic, which presents a dynamic, uncertain, yet extraordinarily consequential policy environment across the globe. We combined two large-scale databases that capture policy and science and their interactions, allowing us to examine the coevolution of policy and science.
Our analysis suggests that many policy documents in the COVID-19 pandemic substantially access recent, peer-reviewed, and high-impact science. And policy documents that cite science are especially highly cited within the policy domain. At the same time, there is a heterogeneity in the use of science across policy-making institutions. The tendency for policy documents to cite science appears mostly concentrated within intergovernmental organizations (IGOs), such as the World Health Organization (WHO), and much less so in national governments, which consume science largely indirectly through the IGOs. This close coevolution between policy and science offers a useful indication that a key link is operating, but it has not been a sufficient condition for effectiveness in containing the pandemic.
Quantifying the dynamics of failure across science, startups, and security Yian Yin, Yang Wang, James A. Evans and Dashun Wang Nature, 575(7781): 190-194, Oct 2019.
10.1038/s41586-019-1725-y
Human achievements are often preceded by repeated attempts that fail, but little is known about the mechanisms that govern the dynamics of failure. Here, building on previous research relating to innovation, human dynamics and learning, we develop a simple one-parameter model that mimics how successful future attempts build on past efforts. Solving this model analytically suggests that a phase transition separates the dynamics of failure into regions of progression or stagnation and predicts that, near the critical threshold, agents who share similar characteristics and learning strategies may experience fundamentally different outcomes following failures. Above the critical point, agents exploit incremental refinements to systematically advance towards success, whereas below it, they explore disjoint opportunities without a pattern of improvement. The model makes several empirically testable predictions, demonstrating that those who eventually succeed and those who do not may initially appear similar, but can be characterized by fundamentally distinct failure dynamics in terms of the efficiency and quality associated with each subsequent attempt.
We collected large-scale data from three disparate domains and traced repeated attempts by investigators to obtain National Institutes of Health (NIH) grants to fund their research, innovators to successfully exit their startup ventures, and terrorist organizations to claim casualties in violent attacks. We find broadly consistent empirical support across all three domains, which systematically verifies each prediction of our model. Together, our findings unveil identifiable yet previously unknown early signals that enable us to identify failure dynamics that will lead ultimately to success or failure. Given the ubiquitous nature of failure and the paucity of quantitative approaches to understand it, these results represent an initial step towards the deeper understanding of the complex dynamics underlying failure.
Unequal effects of the COVID-19 pandemic on scientists
Kyle R. Myers, Wei Yang Tham, Yian Yin, Nina Cohodes, Jerry G. Thursby, Marie C. Thursby, Peter Schiffer, Joseph T. Walsh, Karim R. Lakhani and Dashun Wang Nature Human Behaviour, 4(9): 880-883, Jul 2020.
10.1038/s41562-020-0921-y Top 100 (#27 out of 3.4M) Altmetric papers in 2020
The COVID-19 pandemic has undoubtedly disrupted the scientific enterprise. Policymakers and institutional leaders have already begun to respond to mitigate the impacts of the pandemic on researchers. However, we lack evidence on the nature and magnitude of the disruptions scientists are experiencing. To gain some insight into the extent of disruptions scientists are experiencing, we conducted a preliminary survey, which was distributed on 13 April 2020, approximately 1 month after the World Health Organization declared COVID-19 a pandemic. We reached out to US- and Europe-based scientists across a wide range of institutions, career stages and demographic backgrounds. Within a week, we received full responses from 4,535 faculty or Principal Investigators.
Results of the survey highlight the sizable and heterogeneous ways the COVID-19 pandemic is affecting the scientific workforce. Scientists report a sharp decline in time spent on research on average, but there is substantial heterogeneity with a significant share reporting no changes or even increases. Some of this heterogeneity is due to field-specific differences, and some is due to gender. However, the largest disruptions are connected to a usually unobserved dimension: childcare, which can account for a significant fraction of gender differences. Amidst scarce evidence about the role of parenting in scientists' work, these results could have important short- and longer-term effects on their careers, which institution leaders and funders need to address carefully.
Nobel laureates are almost the same as us
Jichao Li, Yian Yin, Santo Fortunato and Dashun Wang Nature Reviews Physics, Apr 2019.
10.1038/s42254-019-0057-z
Quantitative studies of Nobel laureates' careers have predominantly focused on the prize-winning work alone. To test if there are indeed systematic differences between the careers of Nobel laureates and ordinary scientists, we studied a unique dataset of entire career histories for nearly all Nobel laureates in physics, chemistry and physiology or medicine from 1900 to 2016 (545 out of 590 laureates, 92.4%).
By testing the burst of most-cited papers and dynamics of team sizes, we find after removing the prize-winning papers, the career of Nobel laureates and ordinary scientists follow the same patterns. Together, our analysis show that apart from their prize-winning work, the careers of Nobel laureates follow the same patterns as those of the majority of scientists.
Other publications
SciSciNet: A large-scale open data lake for the science of science research
Zihang Lin, Yian Yin, Lu Liu and Dashun Wang Scientific Data, 10(1), June 2023.
10.1038/s41597-023-02198-9
Scientific elite revisited: patterns of productivity, collaboration, authorship and impact
Jichao Li, Yian Yin, Santo Fortunato and Dashun Wang Journal of the Royal Society Interface, 17(165): 20200135, Apr 2020.
10.1098/rsif.2020.0135
A dataset of publication records for Nobel laureates
Jichao Li, Yian Yin, Santo Fortunato and Dashun Wang Scientific Data, 6(33), Apr 2019.
10.1038/s41597-019-0033-6
The time dimension of science: Connecting the past to the future Yian Yin and Dashun Wang Journal of Informetrics, 11(2): 608-621, May 2017.
10.1016/j.joi.2017.04.002