Created
December 9, 2024 17:23
-
-
Save neubig/467572017ee032d878bd24dab3454fa4 to your computer and use it in GitHub Desktop.
openscholar_summary.txt
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
OpenScholar is a retrieval-augmented language model that assists researchers in synthesizing scientific literature. The system uses a database of 45 million open-access papers to provide citation-backed responses to queries, accurately identifying relevant passages and generating reliable answers across multiple scientific domains. This approach addresses the growing challenge of keeping up with rapidly expanding scientific literature. | |
The researchers developed ScholarQABench, a multi-domain benchmark for evaluating literature search capabilities, with 2,967 expert-written queries and 208 detailed answers across computer science, physics, neuroscience, and biomedicine. In testing, OpenScholar-8B outperformed GPT-4o by 5% and PaperQA2 by 7% in correctness metrics, despite being a smaller, open model. | |
Citation accuracy stands as a key strength of OpenScholar. While GPT-4o shows concerning citation hallucination rates of 78-90%, OpenScholar matches human expert-level accuracy in citation verification. The system's architecture, combining a specialized datastore, retriever, and self-feedback inference loop, enhances existing language models' performance. When integrated with GPT-4o (as OpenScholar-GPT4o), it improved correctness by 12%. | |
Human evaluations further confirmed OpenScholar's effectiveness. Expert reviewers preferred OpenScholar-8B's responses over human-written ones 51% of the time, while OpenScholar-GPT4o achieved a 70% preference rate, compared to GPT-4o's 32%. This shows OpenScholar's ability to generate quality scientific syntheses. | |
The project follows open science principles, with all code, models, datastore, and data being open-sourced, including a public demo. This accessibility ensures that the research community can build upon these advances. OpenScholar represents a significant step forward in AI-assisted research, providing a reliable tool for synthesizing scientific literature while maintaining high standards of accuracy and citation integrity |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment