Towards Diverse Scientific Hypothesis Search with Large Language Models
Abstract
Evolutionary framework for hypothesis generation that improves diversity and quality through multi-temperature sampling and information exchange across search levels.
Large language models (LLMs) are on the rise for accelerating scientific discovery, most recently in advanced tasks such as generating valid scientific hypotheses. Yet in many discovery settings, the goal is not to identify a single best hypothesis since validation can be noisy and expensive, and scientists benefit from a set of high-quality alternative hypotheses that hedge against downstream uncertainty for the best solutions. Nevertheless, commonly used evolutionary search recipes tend to prioritize optimization over exploration in hypothesis generation, and the resulting selection pressure during the search process leads to diversity collapse. Motivated by these limitations, we formulate hypothesis search as a sampling problem, where the objective is to efficiently produce diverse, high-quality hypotheses under a fixed validation budget. Building on this perspective, we propose \ours, an evolutionary framework inspired by the classical parallel tempering algorithm that searches hypotheses at multiple temperature levels and enables principled information exchange across temperatures to improve exploration without disrupting convergence. Across domains including molecular discovery, equation discovery, and algorithm discovery, our approach consistently improves both hypothesis quality and diversity under the same validation budget, and produces candidates that remain robust under more expensive downstream computational validations.
Community
In recursive self-evolving or discovery systems, diversity isn't just a nice-to-have, it's a key ingredient for unlocking sustained progress and continuous improvements. In our recent ICML2026 paper, "Towards Diverse Scientific Hypothesis Search with Large Language Models", we take a closer look at why diversity matters in discovery systems and introduce a simple but effective solution: EvoDiverse.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- EvoGens: A Population-Based Heuristic Search Framework for Scientific Idea Generation (2026)
- What Makes an LLM a Good Optimizer? A Trajectory Analysis of LLM-Guided Evolutionary Search (2026)
- SeaEvo: Advancing Algorithm Discovery with Strategy Space Evolution (2026)
- EvoGM: Learning to Merge LLMs via Evolutionary Generative Optimization (2026)
- SMCEvolve: Principled Scientific Discovery via Sequential Monte Carlo Evolution (2026)
- LEVI: Stronger Search Architectures Can Substitute for Larger LLMs in Evolutionary Search (2026)
- LABO: LLM-Accelerated Bayesian Optimization through Broad Exploration and Selective Experimentation (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2606.10587 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper