In-Context Symbolic Regression : Leveraging Large Language Models for Function Discovery

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

Abstract

State of the art Symbolic Regression (SR) methods currently build specialized models, while the application of Large Language Models (LLMs) remains largely unexplored. In this work, we introduce the first comprehensive framework that utilizes LLMs for the task of SR. We propose In-Context Symbolic Regression (ICSR), an SR method which iteratively refines a functional form with an LLM and determines its coefficients with an external optimizer. ICSR leverages LLMs’ strong mathematical prior both to propose an initial set of possible functions given the observations and to refine them based on their errors. Our findings reveal that LLMs are able to successfully find symbolic equations that fit the given data, matching or outperforming the overall performance of the best SR baselines on four popular benchmarks, while yielding simpler equations with better out of distribution generalization.

Original languageEnglish
Title of host publicationProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop)
EditorsXiyan Fu, Eve Fleisig
PublisherAssociation for Computational Linguistics
Pages589-606
Number of pages18
ISBN (Electronic)979-8-89176-097-4
Publication statusPublished - 2024
MoE publication typeA4 Conference publication
EventAnnual Meeting of the Association for Computational Linguistics - Bangkok, Thailand
Duration: 11 Aug 202416 Aug 2024
Conference number: 62

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
PublisherAssociation for Computational Linguistics
Volume4
ISSN (Print)0736-587X

Conference

ConferenceAnnual Meeting of the Association for Computational Linguistics
Abbreviated titleACL
Country/TerritoryThailand
CityBangkok
Period11/08/202416/08/2024

Fingerprint

Dive into the research topics of 'In-Context Symbolic Regression : Leveraging Large Language Models for Function Discovery'. Together they form a unique fingerprint.
  • Science-IT

    Hakala, M. (Manager)

    School of Science

    Facility/equipment: Facility

Cite this