Navigating the Alignment-Calibration Trade-off: A Pareto-SuperiorFrontier via Model Merging

11 Nov

When AI models are tuned to follow human instructions, they pay an alignment tax - losing both accuracy, diversity and causing it to halucinate confidence. Merging tuned and base models can recover both, creating smarter, more calibrated AI.