Towards an AI Mathematician

My Session Status

What:

Talk

Part of:

Day 5 | Large Language Models: Applications, Ethics & Risks

When:

11:00 AM, Friday 7 Jun 2024 EDT (1 hour 30 minutes)

Theme:

Large Language Models: Applications, Ethics & Risks

Mathematics is a hallmark of human intelligence and a long-standing goal of AI. It involves analyzing complex information, identifying patterns, forming conjectures, and performing logical deduction. Many of these capabilities are beyond the reach of current AI, and unlocking them can revolutionize AI applications in scientific discovery, formal verification, and beyond. In this talk, I will present initial steps towards the grand vision of AI mathematicians, taking an approach that combines the generative power of large language models (LLMs) with the logical rigor of formal methods.

I will cover our work on using LLMs to (1) prove formal theorems in proof assistants such as Coq and Lean and (2) automatically translate human-written mathematics into formal theorems and proofs—a task called autoformalization. For theorem proving, we introduce the entire system for extracting data, training LLMs to generate proof steps, interacting with proof assistants to search for proofs, and deploying the model to assist human users. For autoformalization, using Euclidean geometry as an example domain, we introduce a neuro-symbolic framework that combines LLMs with SMT solvers and domain knowledge. Finally, we discuss future directions for AI mathematicians beyond theorem proving and autoformalization, including important problems such as automatic conjecturing and applications in natural language and program verification.

References

Yang, K., Swope, A., Gu, A., Chalamala, R., Song, P., Yu, S., … & Anandkumar, A. (2024). Leandojo: Theorem proving with retrieval-augmented language models. Advances in Neural Information Processing Systems, 36.

Shulman, M. (2024). Strange new universes: Proof assistants and synthetic foundations. Bulletin of the American Mathematical Society.

Kaiyu Yang

Speaker

My Session Status

Allow attendees to rate sessions with a "thumbs up" or "thumbs down".

Allow attendees to send feedback about sessions

Allows attendees to send short textual feedback to the organizer for a session. This is only sent to the organizer and not the speakers.

Enable list of attendees for sessions

When enabled, you can choose to display attendee lists for individual sessions. Only attendees who have chosen to share their profile will be listed.

Display the list of attendees for this session

Enable to display the attendee list on this session's detail page. This change applies only to this session.

Allow attendees to participate in a discussion thread for sessions

Changes here will affect all session detail pages unless otherwise noted

Towards an AI Mathematician

My Session Status

References

My Session Status

Session detail

We use cookies