Bibcode
Iyer, Kartheik G.; Yunus, Mikaeel; O'Neill, Charles; Ye, Christine; Hyk, Alina; McCormick, Kiera; Ciucă, Ioana; Wu, John F.; Accomazzi, Alberto; Astarita, Simone; Chakrabarty, Rishabh; Cranney, Jesse; Field, Anjalie; Ghosal, Tirthankar; Ginolfi, Michele; Huertas-Company, Marc; Jabłońska, Maja; Kruk, Sandor; Liu, Huiling; Marchidan, Gabriel; Mistry, Rohit; Naiman, J. P.; Peek, J. E. G.; Polimera, Mugdha; Rodríguez Méndez, Sergio J.; Schawinski, Kevin; Sharma, Sanjib; Smith, Michael J.; Ting, Yuan-Sen; Walmsley, Mike
Bibliographical reference
The Astrophysical Journal Supplement Series
Advertised on:
12
2024
Citations
0
Refereed citations
0
Description
The exponential growth of astronomical literature poses significant challenges for researchers navigating and synthesizing general insights or even domain-specific knowledge. We present pathfinder, a machine learning framework designed to enable literature review and knowledge discovery in astronomy, focusing on semantic searching with natural language instead of syntactic searches with keywords. Utilizing state-of-the-art large language models (LLMs) and a corpus of 385,166 peer-reviewed papers from the Astrophysics Data System, pathfinder offers an innovative approach to scientific inquiry and literature exploration. Our framework couples advanced retrieval techniques with LLM-based synthesis to search astronomical literature by semantic context as a complement to currently existing methods that use keywords or citation graphs. It addresses complexities of jargon, named entities, and temporal aspects through time-based and citation-based weighting schemes. We demonstrate the tool's versatility through case studies, showcasing its application in various research scenarios. The system's performance is evaluated using custom benchmarks, including single-paper and multipaper tasks. Beyond literature review, pathfinder offers unique capabilities for reformatting answers in ways that are accessible to various audiences (e.g., in a different language or as simplified text), visualizing research landscapes, and tracking the impact of observatories and methodologies. This tool represents a significant advancement in applying artificial intelligence to astronomical research, aiding researchers at all career stages in navigating modern astronomy literature.