Boolean Is Not Too Restrictive: Learning Optimal Decision Trees via MaxSAT

My Session Status

What:

Talk

Part of:

Day 9

When:

10:45 AM, Wednesday 10 Jun 2026 EDT (1 hour)

Theme:

Computers

Tag:

SAT approaches to reasoning and decision-making

A crucial trade-off in combinatorial optimization paradigms is between expressiveness and performance. SAT and its optimization variant, MaxSAT, are often regarded as having particularly restrictive syntaxes, since they are limited to boolean values and conjunctive clauses. However, solvers for these paradigms are comparatively time- and memory-efficient. This efficiency makes SAT an attractive candidate for solving a variety of combinatorial problems. SAT is also NP-complete, meaning that it can theoretically encode every other NP problem. However, the theoretical existence of a reduction does not guarantee a computationally viable approach in practice. The reduced instance may blow up in size or become particularly difficult to solve if the source problem lacks a natural and direct boolean translation. Thus, a dilemma arises: does encoding a seemingly non-boolean problem in SAT actually enable one to reap its performance benefits?

The goal of this talk is to argue that the answer to this dilemma is positive in more cases than one might expect. We use decision trees as a case study. They are highly accurate yet interpretable machine learning models and are commonly used in sensitive applications. The majority of decision tree learning problems are NP-hard and as such can theoretically be solved using SAT. However, decision tree learning typically requires support for non-binary data and objectives, which are challenging to encode in the boolean world of SAT.

We present a set of techniques and encodings that overcome these challenges, enabling us to solve certain variants of this problem for the first time and improve the state of the art for others. We further discuss the benefits of an exact approach to learning decision trees, including modularity between components, theoretical guarantees, and support for user-specified constraints. We conclude with an overview of experimental results demonstrating the practical effectiveness of the approach.

References

Pouya Shati, Eldan Cohen, and Sheila McIlraith. “Optimal decision trees for interpretable clustering with constraints”. In: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence. 2023, pp. 2022–2030.

Pouya Shati, Eldan Cohen, and Sheila McIlraith. “SAT-Based Approach for Learning Optimal Decision Trees with Non-Binary Features”. In: 27th International Conference on Principles and Practice of Constraint Programming (CP 2021). Schloss Dagstuhl-Leibniz-Zentrum f ̈ur Informatik. 2021.

Florent Avellaneda. “Efficient inference of optimal decision trees”. In: AAAI Conference on Artificial Intelligence (AAAI). 2020, pp. 3195–3202.

Hao Hu. “Interpretable Machine Learning Models via Maximum Boolean Satisfiability”. PhD thesis. INSA de Toulouse, 2022.

Pouya Shati

Presenter

My Session Status

Allow attendees to rate sessions with a "thumbs up" or "thumbs down".

Allow attendees to send feedback about sessions

Allows attendees to send short textual feedback to the organizer for a session. This is only sent to the organizer and not the speakers.

Enable list of attendees for sessions

When enabled, you can choose to display attendee lists for individual sessions. Only attendees who have chosen to share their profile will be listed.

Display the list of attendees for this session

Enable to display the attendee list on this session's detail page. This change applies only to this session.

Allow attendees to participate in a discussion thread for sessions

Changes here will affect all session detail pages unless otherwise noted

Boolean Is Not Too Restrictive: Learning Optimal Decision Trees via MaxSAT

My Session Status

My Session Status

Session detail