Spark: A System for Scientifically Creative Idea Generation

Proceedings of the 16th International Conference on Computational Creativity

paper
An idea generation system that couples retrieval-augmented idea generation with a reviewer model trained on 600K scientific reviews from OpenReview
Authors

Aishik Sanyal

Samuel Schapiro

Sumuk Shashidhar

Royce Moon

Lav R. Varshney

Dilek Hakkani-Tur

Published

April 18, 2025

Abstract

Recently, large language models (LLMs) have shown promising abilities to generate novel research ideas in science, a direction which coincides with many foundational principles in computational creativity (CC). In light of these developments, we present an idea generation system named Spark that couples retrieval-augmented idea generation using LLMs with a reviewer model named Judge trained on 600K scientific reviews from OpenReview. Our work is both a system demonstration and intended to inspire other CC researchers to explore grounding the generation and evaluation of scientific ideas within foundational CC principles. To this end, we release the annotated dataset used to train Judge, inviting other researchers to explore the use of LLMs for idea generation and creative evaluations.

Citation

BibTeX citation:
@article{sanyal2025,
  author = {Sanyal, Aishik and Schapiro, Samuel and Shashidhar, Sumuk
    and Moon, Royce and R. Varshney, Lav and Hakkani-Tur, Dilek},
  title = {Spark: {A} {System} for {Scientifically} {Creative} {Idea}
    {Generation}},
  journal = {Proceedings of the 16th International Conference on
    Computational Creativity},
  date = {2025-04-18},
  url = {https://arxiv.org/abs/2504.20090},
  langid = {en}
}
For attribution, please cite this work as:
Sanyal, Aishik, Samuel Schapiro, Sumuk Shashidhar, Royce Moon, Lav R. Varshney, and Dilek Hakkani-Tur. 2025. “Spark: A System for Scientifically Creative Idea Generation.” Proceedings of the 16th International Conference on Computational Creativity, April. https://arxiv.org/abs/2504.20090.