Semantic Calibration in Base LLMs: Tuning Breaks Confidence