
Department of Family Medicine, Kangdong Sacred Heart Hospital, Hallym University College of Medicine, Seoul, Korea
Copyright © 2026 Korean Council of Science Editors
This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Conflict of interest
No potential conflict of interest relevant to this article was reported.
Funding
No financial support was received for this work.
Data availability
Data sharing is not applicable as no new data were created or analyzed in this article.
Supplementary materials
No supplementary materials were provided for this article.
| Presentation | Key findings |
|---|---|
| September 3 | |
| Morning 1. Use of AI in manuscript preparation and peer review | |
| Isamme AlFayyad et al. | Only 5.7% of BMJ Group submissions voluntarily disclosed AI use, suggesting substantial under-reporting. |
| Roy Perlis et al. | Among manuscripts and peer-review reports submitted to JAMA Network journals, only 2.7% declared AI use. |
| Vincent Yuan et al. | Of 423 medical AI studies published in 2023, 35% overstated their performance and 56% selectively emphasized significant findings, indicating widespread “spin” and selective reporting. |
| Morning 2. Authorship and research integrity | |
| Ana-Catarina Pinho-Gomes et al. | A comparative analysis of retraction reasons for women vs. men authors in biomedicine suggested gender-linked differences in the causes. |
| Tim Kersjes | “Paper mills” manipulate peer review by deploying fabricated identities. |
| Coromoto Power Febres et al. | Authorship changes during submission may signal threats to research integrity. |
| Nicholas DeVito et al. | Notifications via “RetractoBot” to authors citing retracted work markedly reduced subsequent citations of those retracted articles. |
| Afternoon 1. Diversity and research environments | |
| Michael Mensah et al. | Analysis of equity, diversity, and inclusion issues raised by JAMA Network reviewers showed persistent concerns about bias and unequal opportunities. |
| Noémie Aubert Bonn et al. | Indicators derived from the UK Research Excellence Framework 2021 can differentiate the quality of institutional research environments. |
| Afternoon 2. Research misconduct and integrity | |
| João Phillipe Cardenuto et al. | Characteristics of problematic images in retracted articles revealed frequent patterns suggestive of manipulation. |
| Ahmad Sofi-Mahmudi et al. | Across 167 countries, retraction counts correlated with democracy indices, indicating an association between political freedom and integrity in scholarly publishing. |
| Renee Hoch et al. | Lessons from PLOS One highlighted sustainable practices for maintaining high ethical standards amid large-scale threats. |
| September 4 | |
| Morning 1. Bias and reporting guidelines | |
| Jae Il Shin et al. | “Immortal time bias” is common in systematic reviews and meta-analyses and can materially distort effect estimates. |
| Yiwen Jiang et al. | Designating the same endpoint as primary versus secondary can yield different effect estimates, underscoring the need for careful design and interpretation. |
| Yulin Yu et al. | Repurposing datasets beyond their original intent in AI research can compromise interpretability and reproducibility. |
| Nicola Di Girolamo et al. | Phrases such as “to our knowledge” are pervasive in biomedical literature and may overstate originality or uncertainty. |
| Morning 2. Peer-review models | |
| Charvi Rastogi et al. | Maintaining reviewers' anonymity may improve objectivity and candor in deliberations. |
| John Carpenter et al. | Applying a double-anonymous, distributed review model to ALMA telescope proposals improved fairness and transparency. |
| Elena Álvarez-García et al. | Open review altered review length and deepened content. |
| Afternoon 1. Editorial and publishing models | |
| Christos Kotanidis et al. | Submitted abstracts often diverge substantially from the final published abstracts. |
| Afternoon 2. Peer-review timelines and incentives | |
| Emilie Gunn et al. | Extending review deadlines beyond 2 weeks did not increase reviewer acceptance but did lengthen turnaround time; testing “reviewer deadline” prompts improved timeliness and compliance, with uncertain effects on quality. |
| Christopher Cotton et al. | A randomized experiment in medical journals indicated that monetary incentives modestly increased reviewer participation. |
| September 5 | |
| Morning 1. AI for quality and reporting assessment | |
| Lan Jiang et al. | Large language models can automate checks for adherence to reporting guidelines in RCTs. |
| Fangwen Zhou et al. | Using SHAP to interpret LLM classifiers clarified which features drive automated RCT quality assessments. |
| Xiangji Ying et al. | A GPT-based system can automatically detect outcome switching in trials registered on ClinicalTrials.gov. |
| Morning 2. Open Science I | |
| Ayu Putu Madri Dew et al. | A randomized trial showed that Open Science checklists significantly improved reproducibility. |
| Benjamin Speich et al. | High rates of non-registration, discontinuation, and non-publication persist in RCTs from Switzerland, the UK, Germany, and Canada, underscoring the need for greater transparency. |
| David Blanco et al. | Among manuscripts submitted to The BMJ, multiple factors influenced inadequate trial registration, registration defects, and publication outcomes. |
| Afternoon 1. Open Science II | |
| Aidan Tan et al. | Meta-research on compliance with journal data sharing policies showed low real-world adherence and ongoing deficits in data transparency. |
| Robert Thibault et al. | Funder mandates substantially increased sharing of data, code, protocols, and research materials. |
| Kyobin Hwang et al. | Meta-research confirmed very low compliance with journal data sharing policies. |
| Afternoon 2. AI for peer-review quality and problem detection | |
| M. Janina Sarol et al. | LLMs can effectively detect citation errors in biomedical literature. |
| Bojan Batalo et al. | An automated system can identify “hype” to mitigate exaggerated claims. |
| Vishisht Rao et al. | Methods to detect LLM-generated peer reviews were evaluated (reported false-positive rate, 0). |
| Fares Alahdab et al. | LLM-assisted reviews may be competitive with, or comparable to, human reviews in quality and comprehensiveness. |
AI, artificial intelligence; RCT, randomized controlled trial; SHAP, Shapley Additive Explanations; LLM, large language model; GPT, generative pretrained transformer.