LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
EMNLP, 2024 & Sys2Reasoning @ NeurIPS, 2024
TL;DR: Introducing RealInstruct to evaluate LLMs on real multi-constrained instructions, and DeCRIM self-correction that improves instruction following by decomposing requests and refining responses, enabling open LLMs to outperform GPT-4 with strong feedback.