arXiv AI Papers

Draft-and-Prune: Improving the Reliability of Auto-formalization for Logical Reasoning

Back to overview

A new method called Draft-and-Prune improves automatic formalization (AF) for logical reasoning by translating natural language problems into executable programs. The approach generates multiple natural language plans, filters semantically incorrect formalizations, and uses majority voting to aggregate predictions. Tested on four benchmarks, D&P achieves 78.