Affiliate Positions
- Adjunct Assistant Professor, UW Human Centered Design & Engineering
- Adjunct Assistant Professor, UW Biomedical Informatics & Medical Education
- Adjunct Assistant Professor, UW Computer Science & Engineering
Specializations
- Natural Language Processing
- Health Informatics
- Machine Learning for Health
Research Areas
Biography
Lucy Lu Wang is an Assistant Professor at the University of Washington Information School. Her research focuses on how to build better AI and NLP systems for extracting and understanding information from scientific texts; for example, can we create systems that leverage up-to-date literature to help us make better and more data-driven healthcare decisions, or design document understanding models that can improve the readability of scientific texts for people who are blind and low vision. Lucyās work on supplement interaction detection, gender trends in academic publishing, COVID-19 datasets, and document understanding has been featured in Geekwire, Boing Boing, Axios, VentureBeat, and the New York Times. Prior to joining the UW, she was a Young Investigator at the Allen Institute for AI, and she received her PhD in Biomedical Informatics and Medical Education from the University of Washington.
Education
- Ph D, Biomedical Informatics and Medical Education, University of Washington, 2019
- MS, Applied Biomedical Engineering, The Johns Hopkins University, 2013
- BS, Physics, Massachusetts Institute of Technology, 2009
Awards
- Institute for Medical Data Science Pilot Award - Institute for Medical Data Science, 2023
Publications and Contributions
-
Conference Paper(2024)Proceedings of the Fourth Workshop on Scholarly Document Processing (SDP 2024), pp. 307-313
-
Conference PaperCHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support (2024)ACL Findings 2024
-
Conference PaperCharacterizing LLM Abstention Behavior in Science QA with Context Perturbations (2024)EMNLP Findings 2024, pp. 3437-3450
-
Conference Paper(2024)ACM IUI 2024, pp. 886ā906
-
Conference Paper(2024)ACM CHI 2024, pp. 1-15
-
Invited Essay(2024)Against the Grain, 36(3)
-
Book Editor, Scholarly(2024)Journal of Biomedical Informatics, 150
-
Conference Paper(2024)Proceedings of the 23rd Workshop on Biomedical Natural Language Processing (BioNLP 2024), pp. 390-397
-
Conference Workshop PaperMitigating Overconfidence in Large Language Models: A Behavioral Lens on Confidence Estimation and Calibration (2024)Workshop on Behavioral Machine Learning at the Conference on Neural Information Processing Systems (NeurIPS 2024)
-
Conference Paper(2024)ACM FAccT 2024, pp. 1446ā1463
-
Conference Paper(2024)NAACL 2024 (Volume 1: Long Papers), pp. 4535ā4550
-
Guest Editorial(2024)Journal of Biomedical Informatics
-
Conference Paper(2024)NAACL 2024 (Volume 3: System Demonstrations), pp. 1ā11
-
Journal Article, Academic JournalThe Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces (2024)Communications of the ACM
-
Conference PaperUncovering the New Accessibility Crisis in Scholarly PDFs: Publishing Model and Platform Changes Contribute to Declining Scholarly Document Accessibility in the Last Decade (2024)ACM ASSETS 2024
-
Conference Paper(2024)Proceedings of the Fourth Workshop on Scholarly Document Processing (SDP 2024), pp. 269-276
-
Conference Paper(2023)ACL 2023, pp. 9871ā9889
-
Magazine/Trade Publication(2023)AAMC Center For Health Justice
-
Conference Extended AbstractMeasuring the Prevalence and Downstream Impact of Data and Method Sharing in arXiv Preprints (2023)2nd Annual International Conference on the Science of Science and Innovation (ICSSI 2023)
-
Conference Paper(2023)EMNLP Findings 2023, pp. 8177ā8199
-
Journal Article, Academic Journal(2023)ACM Transactions on Computer-Human Interaction (TOCHI), 30(5), pp. 1-38
-
Preprint(2023)
-
Book, Chapter in Non-Scholarly Book-New(2023)Artificial Intelligence in Science: Challenges, Opportunities and the Future of Research, pp. 121-128
-
Overview of MSLR2022: A shared task on multi- document summarization for literature reviews (2022)SDP at COLING 2022
-
Conference PaperA Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientiļ¬c Papers (2022)ASSETS 2022
-
Journal Article, Academic Journal(2022)Scientiļ¬c Data, 9(1), pp. 1-11
-
Conference Paper(2022)ACL 2022 (Volume 1: Long Papers), pp. 2448ā2460
-
Journal Article, Academic Journal(2022)AI Magazine, 43(1), pp. 59-68
-
Conference Paper(2022)NAACL Findings 2022, pp. 438ā453
-
Conference Extended AbstractLiterature-Augmented Clinical Outcome Prediction (2022)Machine Learning for Health (ML4H) at NeurIPS 2022
-
Conference Paper(2022)NAACL Findings 2022, pp. 61ā76
-
Journal Article, Academic Journal(2022)ACM SIGACCESS Accessibility and Computing, Issue 134
-
Conference PaperSciFact-Open: Towards open-domain scientific claim verification (2022)EMNLP Findings 2022
-
Journal Article, Academic JournalVILA: Improving structured content extraction from scientiļ¬c PDF using visual layout groups (2022)Transactions of the ACL
-
Conference Extended AbstractA bibliometric analysis of citation diversity in accessibility and HCI research (2021)CHI Extended Abstracts 2021
-
Journal Article, Academic JournalGender trends in computer science authorship (2021)Communications of the ACM
-
Journal Article, Academic JournalHarnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World (2021)IEEE Internet of Things
-
PreprintImproving the accessibility of scientiļ¬c documents: current state, user needs, and a system solution to enhance scientiļ¬c PDF accessibility for blind and low vision users (2021)
-
Conference Paper(2021)EMNLP 2021, pp. 7494ā7513
-
Conference PaperSciA11y: Converting scientiļ¬c papers to accessible HTML (2021)ASSETS 2021
-
Journal Article, Academic JournalSearching for scientiļ¬c evidence in a pandemic: an overview of TREC-COVID (2021)Journal of Biomedical Informatics
-
Conference PaperWhat do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019 (2021)CHI 2021
-
Conference Workshop PaperCORD-19: the COVID-19 open research dataset (2020)NLP-COVID at ACL 2020
-
Conference PaperFact or ļ¬ction: verifying scientiļ¬c claims (2020)EMNLP 2020
-
Conference PaperMedICaT: a dataset of medical images, captions, and textual references (2020)EMNLP Findings 2020
-
Journal Article, Academic JournalMitigating biases in CORD-19 for analyzing COVID-19 literature (2020)Frontiers in Research Metrics and Analytics
-
Journal Article, Academic JournalModelling kidney disease using ontology: insights from the Kidney Precision Medicine Project (2020)Nature Reviews Nephrology
-
Conference PaperOverview of the 2020 Epidemic Question Answering Track (2020)TAC 2020
-
Conference PaperS2ORC: the Semantic Scholar open research corpus (2020)ACL 2020
-
Conference PaperSUPP.AI: ļ¬nding evidence for supplement-drug interactions (2020)ACL Demo 2020
-
Conference PaperTREC-COVID: Constructing a Pandemic Information Retrieval Test Collection (2020)SIGIR Forum
-
Journal Article, Academic JournalTREC-COVID: rationale and structure of an information retrieval shared task for COVID-19 (2020)Journal of the American Medical Informatics Association
-
Journal Article, Academic JournalText mining approaches for dealing with the rapidly expanding literature on COVID-19 (2020)Brieļ¬ngs in Bioinformatics
-
ThesisOntology-driven pathway data integration (2019)Department of Biomedical Informatics and Medical Education, University of Washington
-
Conference Extended AbstractExtracting evidence of supplement-drug interactions from literature (2019)ML4H at NeurIPS 2019
-
Journal Article, Academic JournalPredicting instances of Pathway Ontology classes for pathway integration (2019)Journal of Biomedical Semantics
-
Conference PaperConstruction of the literature graph in Semantic Scholar (2018)NAACL Industry 2018
-
Conference Workshop PaperOntology alignment in the biomedical domain using entity deļ¬nitions and context (2018)BioNLP at ACL 2018
-
PreprintPhenotypeXpression: sub-classiļ¬cation of disease states using public gene expression data and literature (2018)
-
Journal Article, Academic JournalFluctuation analysis of peak expiratory ļ¬ow and its associations with treatment failure in asthma (2017)American Journal of Respiratory and Critical Care Medicine
-
Conference PaperSimilarity metrics for determining overlap among biological pathways (2017)ICBO 2017
-
Conference PaperAn analysis of diļ¬erences in biological pathway resources (2016)ICBO and BioCreative 2016
-
Conference PaperDevelopment of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination (2016)AMIA 2016
-
Conference PaperBiological model development as an opportunity to provide content auditing for the Foundational Model of Anatomy ontology (2015)AMIA 2015
-
Journal Article, Academic JournalElectrical impedance myography in Duchenne muscular dystrophy and health controls: a multi-center study of reliability and validity (2015)Muscle & Nerve
-
Masters ThesisMatching Pursuit for Detecting Epileptic Response in EEG Following Photic Stimulation (2013)Department of Biomedical Engineering, The Johns Hopkins University
-
Journal Article, Academic JournalAssessment of alterations in the electrical impedance of muscle after experimental nerve injury via ļ¬nite-element analysis (2011)IEEE Transactions on Biomedical Engineering
-
Journal Article, Academic JournalElectrical impedance myography for monitoring motor neuron loss in the SOD1 G93A amyotrophic lateral sclerosis rat (2011)Clinical Neurophysiology
Presentations
-
Patient Information Rights and Access: An Exploration of Health Informatics and Population Health
(2025)
911±¬ĮĻĶų Alumni Event - Online
-
Advancing Science and Industry with Generative AI
(2024)
Seattle AWIS/WiDS Event - Seattle, WA
-
AI for Biomedicine
(2024)
Fred Hutch Translational Data Science Integrated Research Center (TDS IRC) Annual Retreat - Kirkland, WA
-
AI-assisted health information extraction and summarization
(2024)
American Association of Pediatric Hematology-Oncology (ASPHO) Informatics, Innovation & Entrepreneurship-Special Interest Group (IIE-SIG) - Seattle, WA, USA
-
AI-powered systems for scholarly search and content production
(2024)
AI Week for Researchers, Singapore Management University - Online
-
Challenges and Opportunities in Translational Science
(2024)
Semantic Scholar Research, AI2 - Online
-
FigurA11y: AI Assistance for Writing Scientific Alt Text
(2024)
ACM Conference on Intelligent User Interfaces (IUI 2024) - Greenville, South Carolina, USA
-
Navigating AI in Publishing: Best Practices and Use cases for IP Management, Equity, and Accessibility
(2024)
Zendy/AUPresses Webinar - Online
-
Paper Plain: Making medical research papers approachable to healthcare consumers with natural language processing
(2024)
ACM Transactions on Computer-Human Interaction - Honolulu, HI
-
Personalized Jargon Identification for Enhanced Interdisciplinary Communication
(2024)
NAACL - Online
-
Roundtable Discussion: The impact of the increasing use of AI on the research workflow - in particular, its effect on research quality, research evaluation and our skills
(2024)
AI Week for Researchers, Singapore Management University - Online
-
TOPICAL: TOPIC Pages Automagically
(2024)
NAACL System Demonstrations - Mexico City, Mexico
-
6 Years of FEVER Workshops - How Far Have We Come?
(2023)
The Sixth Workshop on Fact Extraction and Verification (FEVER) at EACL - Dubrovnik, Croatia
-
AI in Scholarly Communications: Where We Are and Where Weāre Going
(2023)
FORCE11 Scholarly Communication Institute (FSCI) Conference - Online
-
Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations
(2023)
Association for Computational Linguistics (ACLā23) - Toronto, Canada
-
Biomedical Evidence Extraction and Synthesis
(2023)
The Center for Informatics Research in Science and Scholarship (CIRSS) Seminar Series, School of Information Sciences, UIUC - Urbana-Champaign, IL
-
Can Scientific Claim Verification Help Us Do Better Science?
(2023)
The Sixth Workshop on Fact Extraction and Verification (FEVER) at EACL - Dubrovnik, Croatia
-
Generative AI for Translational Scholarly Communication
(2023)
SAUL-RSTF Webinar, National University of Singapore - Online
-
Generative AI for Translational Scholarly Communication.
(2023)
Hong Kong University of Science and Technology Library - Online
-
Improving the Accessibility of Scholarly Communication
(2023)
Universidade Estadual de Campinas (Unicamp) Computer Science Seminar - Online
-
Incorporating External Knowledge for Clinical Outcome Prediction
(2023)
Institute for Medical Data Science Seminar - Seattle, WA
-
Measuring the prevalence and downstream impact of data and method sharing in arXiv preprints
(2023)
International Conference on Computational Systems and Communication (ICSSI 2023) - Evanston, Illinois
-
Open domain multi-document summarization: A comprehensive study of model brittleness under retrieval
(2023)
2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) - Sentosa, Singapore
-
Taking and Giving Back: Open Access, Generative AI, and the Transformation of Scholarly Communication
(2023)
OA Week, Indiana University Bloomington - Bloomington, Illinois
-
AI and Scholarly Publishing
(2022)
Society for Scholarly Publishing āAsk the Expertsā Webinar - Online
-
Generating Scientific Claims for Zero-Shot Scientific Fact Checking
(2022)
ACL 2022 - Dublin, Ireland
-
How AI can make PDF useful again
(2022)
PageBreak - San Francisco, CA
-
Identifying and Mitigating Algorithmic Biases
(2022)
School of Law, Seattle University - Seattle, WA
-
Knowledge Representation and Semantics for Biomedical Knowledge Synthesis
(2022)
SeBiLAn Workshop at TheWebConf (WWW) - Online
-
Literature-Augmented Clinical Outcome Prediction
(2022)
NAACL - Seattle, WA, USA
-
MultiVerS: Improving scientiļ¬c claim veriļ¬cation with weak supervision and full-document context
(2022)
NAACL - Seattle, WA, USA
-
Ontology and NLP: Bridging the āStructural Chasm
(2022)
Department of Biomedical Informatics and Medical Education, University of Washington - Seattle, WA, USA
-
The Machine Element: Signals and Noise: How AI and ML Techniques are Being Deployed to Track a Global Pandemic
(2022)
Friends of the NLM Virtual Workshop - Online
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
Department of Informatics, Luddy School of Informatics, Indiana University-Bloomington - Online
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
Information School, University of Washington - Online
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
School of Data Science, University of Virginia - Charlottesville, VA, USA
-
Unlocking Biomedical Knowledge: NLP Systems for Synthesizing Biomedical Evidence
(2022)
Computer Science Research Seminar, Emory University - Online
-
VILA: Improving Structured Content Extraction from Scientiļ¬c PDFs Using Visual Layout Groups
(2022)
ACL - Dublin, Ireland
-
A bibliometric analysis of citation diversity in accessibility and HCI research
(2021)
CHI - Online
-
Fast-track Learning: Growing Insights from Text-mining COVID-19 Data
(2021)
1st GTM2021 Virtual Forum - Online
-
Mathematics in the Scholarly Literature
(2021)
Conference on Artiļ¬cial Intelligence and Theorem Proving (AITP) - Aussois, France and Online
-
MS^2: Multi-document summarization of medical studies
(2021)
EMNLP - Punta Cana, Dominican Republic
-
NLP and Text Mining Resources for COVID-19 and Beyond
(2021)
Machine Learning for Preventing and Combating Pandemics Workshop at ICLR 2021 - Online
-
Practical NLP for Biomedicine: Synthesizing Knowledge from Scientiļ¬c Literature
(2021)
CS Colloquium, Northwestern University - Online
-
Practical NLP for scientiļ¬c text mining: extracting and synthesizing knowledge from the literature
(2021)
Science of Science Summer School (S4) - Online
-
SciA11y: Converting scientiļ¬c papers to accessible HTML
(2021)
ASSETS - Online
-
Text Mining Insights from the COVID-19 Pandemic
(2021)
Bibliometric-enhanced Information Retrieval (BIR) Workshop at ECIR 2021 - Online
-
The Power of AI: A Discussion on COVID-19 & the Future of Industries
(2021)
Legalweek - Online
-
The Power of AI: A Discussion on COVID-19 & the Future of Industries
(2021)
Relativity Media Pandemic short ļ¬lm discussion panel - Online
-
Using Machine Learning to Verify Scientiļ¬c Claims
(2021)
OECD Workshop on AI and the Productivity of Science - Online
-
What do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019
(2021)
CHI - Online
-
Building Community and Data Ecosystem for Data Discovery and Reuse
(2020)
Artiļ¬cial Intelligence for Data Discovery and Reuse (AIDR) Symposium - Online
-
CORD-19 Search: Using Machine Learning to Explore COVID-19 Scientiļ¬c Literature
(2020)
AWS Education: Research Seminar Series - Online
-
CORD-19: the COVID-19 open research datase
(2020)
NLP-COVID Workshop at ACL - Online
-
CORD-19: The COVID-19 Open Research Dataset
(2020)
Global Tech Mining Conference - Online
-
CORD-19: The COVID-19 Open Research Dataset
(2020)
NLP Meetup (NY-NLP, A2D-NLP, DC-NLP, Hungarian NLP, London Text Analytics) - Online
-
Fact or ļ¬ction: verifying scientiļ¬c claims
(2020)
EMNLP - Online
-
Improving Access to Scientiļ¬c Literature for NLP
(2020)
Microsoft Research Hanover Group - Online
-
MedICaT: a dataset of medical images, captions, and textual references
(2020)
SDP Workshop at EMNLP - Online
-
Mining the COVID-19 Scientiļ¬c Literature with the CORD-19 Open Research Dataset.
(2020)
Artiļ¬cial Intelligence for Data Discovery and Reuse (AIDR) Symposium - Online
-
Open Publishing and Open Data
(2020)
Neuro-Gairdner Open Science in Action Symposium - Online
-
Rapid Fire Session: Showcasing What is Here!
(2020)
Gastroenterology and Artiļ¬cial Intelligence: 2nd Annual Artiļ¬cial Intelligence Summit - Online
-
S2ORC: the Semantic Scholar open research corpus
(2020)
ACL - Online
-
SUPP.AI: ļ¬nding evidence for supplement-drug interactions
(2020)
ACL Demo - Online
-
The COVID-19 Open Research Dataset
(2020)
Connected Health and COVID-19: Now and Beyond the Great Lockdown - Online
-
The COVID-19 Open Research Dataset
(2020)
Centre for Science and Technology Studies, Leiden University - Online
-
The COVID-19 Open Research Dataset
(2020)
Semantic Indexing and Information Retrieval for Health (SIIRH) Workshop at ECIR - Online
-
The Role of Scientiļ¬c NLP During an Epidemic
(2020)
1st SciNLP Workshop on Natural Language Processing and Data Mining for Scientiļ¬c Text - Online
-
TREC-COVID: information retrieval for supporting COVID-19 research
(2020)
AMIA Natural Language Processing Working Group Pre-Symposium - Online
-
Automated Identiļ¬cation of Noise Signal in Spinal DCE-MRI using Independent Component Analysis and Unsupervised Machine Learning
(2019)
ISMRM - MontrƩal, QC, Canada
-
Extracting evidence of supplement-drug interactions from literature
(2019)
ML4H Workshop at NeurIPS - Vancouver, BC, Canada
-
Ontology-based Integration of Biological Pathway Data
(2019)
Scientiļ¬c Literature Knowledge Bases Workshop at Automated Knowledge Base Construction (AKBC) - Amherst, MA, USA
-
A Brief Introduction to Ontology
(2018)
Kidney Precision Medicine Project Ontology Webinar - Seattle, WA, USA
-
A SPARQL Tutorial
(2018)
Department of Biomedical Informatics and Medical Education, University of Washington - Seattle, WA, USA
-
Learning from Biomedical Knowledge
(2018)
The Allen Institute for Artiļ¬cial Intelligence (AI2) - Seattle, WA, USA
-
Ontologies and Algorithms for Integrating Biological Pathway Data
(2018)
BIME 590 Seminar, Department of Biomedical Informatics and Medical Education, University of Washington - Seattle, WA, USA
-
Ontology alignment in the biomedical domain using entity deļ¬nitions and context
(2018)
Bio-NLP Workshop at ACL - Melbourne, Australia
-
Ontology- based integration of pathway databases using Pathway Ontology annotations
(2018)
Bio-Ontologies at ISMB - Chicago, IL, USA
-
Quantifying the eļ¬ects of gene entity disambiguation for GSEA
(2018)
AMIA Symposium - San Francisco, CA, USA
-
Semi-automated integration of pathway data for pathway analysis
(2018)
Knowledge Representation and Semantics Working Group Pre-Symposium Doctoral Consortium, AMIA Symposium - San Francisco, CA, USA
-
Detection and Functional Classiļ¬cation of Fusion Genes Using Pathway Expression Proļ¬les
(2017)
AMIA Joint Summits on Translational Science - San Francisco, CA, USA
-
Similarity metrics for determining overlap among biological pathways
(2017)
ICBO - Newcastle upon Tyne, United Kingdom
-
An analysis of diļ¬erences in biological pathway resources
(2016)
ICBO & BioCreative - Corvallis, OR, USA
-
Auditing tree-like organ systems in the FMA using network motifs
(2016)
AMIA Symposium - Chicago, IL, USA
-
Development of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination
(2016)
AMIA 2016 - Chicago, IL, USA
-
Discovering representational diļ¬erences between pathway knowledge bases for pathway resource merging
(2016)
AMIA Symposium - Chicago, IL, USA
-
Identifying and resolving inconsistencies in biological pathway resources
(2016)
NLM Informatics Training Conference - Columbus, OH, USA
-
Biological model development as an opportunity to provide content auditing for the foundational model of anatomy ontology
(2015)
AMIA Symposium - San Francisco, CA, USA
-
Development of a discharge ontology to support postanesthesia discharge decision making
(2015)
ICBO - Lisbon, Portugal
-
Ontological content auditing during model creation using the foundational model of anatomy
(2015)
NLM Informatics Training Conference - Bethesda, MD, USA
-
Detrended ļ¬uctuation analysis of peak expiratory ļ¬ow and its association with destabilization of asthma control
(2014)
International Conference of the American Thoracic Society (ATS) - San Diego, CA, USA
-
Electrical impedance myography in DMD: a multi-center study of reliability and relationships to strength and function
(2013)
The 18th International Congress of the World Muscle Society - Asilomar, CA, USA