Large vision-language models (LVMs) extend large language models (LLMs) with visual perception capabilities, enabling them to process and interpret visual information. A major challenge compromising their reliability is object hallucination, in which LVMs generate plausible but factually inaccurate responses about the visual content. We propose a novel visual adversarial perturbation (VAP) method to mitigate this hallucination issue. VAP alleviates LVM hallucination by applying strategically optimized visual noise, without altering the base model. Our method formulates hallucination suppression as an optimization problem, leveraging adversarial strategies to generate beneficial visual perturbations that enhance the model's factual grounding and reduce parametric knowledge bias. Extensive experimental results demonstrate that our method consistently reduces object hallucinations across 8 state-of-the-art LVMs under diverse evaluation settings.
The VAP method generates beneficial visual noise by leveraging adversarial knowledge to optimize three objectives: (1) maximizing the semantic alignment between the LVM's response and the visual content, preserving the semantic consistency of the image; (2) minimizing the similarity between responses to the original and the perturbed visual content, introducing noise-induced uncertainty; and (3) minimizing the similarity of representations between the original and perturbed inputs. Objectives (2) and (3) jointly mitigate parametric knowledge bias. The optimized visual noise effectively suppresses object hallucinations, as sketched below.
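The following is a minimal, hypothetical sketch of how such a perturbation could be optimized with a PGD-style loop. The helper modules `vision_encoder` and `response_embedder`, the loss weights, and the step sizes are assumptions for illustration, not the authors' actual implementation; in practice the objectives would be computed against the LVM itself.

```python
# Hedged sketch: PGD-style optimization of "beneficial" visual noise.
# vision_encoder and response_embedder are hypothetical differentiable proxies
# for the LVM's visual representation and its response embedding.
import torch
import torch.nn.functional as F


def cosine(a, b):
    # Mean cosine similarity between two batches of embeddings.
    return F.cosine_similarity(a, b, dim=-1).mean()


def optimize_vap_noise(image, vision_encoder, response_embedder,
                       epsilon=8 / 255, alpha=1 / 255, steps=20,
                       w_align=1.0, w_resp=1.0, w_feat=1.0):
    """Optimize additive visual noise delta inside an L_inf ball of radius epsilon."""
    delta = torch.zeros_like(image, requires_grad=True)

    with torch.no_grad():
        feat_orig = vision_encoder(image)        # representation of the clean image
        resp_orig = response_embedder(image)     # response embedding on the clean image

    for _ in range(steps):
        perturbed = (image + delta).clamp(0, 1)
        feat_pert = vision_encoder(perturbed)
        resp_pert = response_embedder(perturbed)

        # (1) maximize semantic alignment between the response and the visual content
        loss_align = -cosine(resp_pert, feat_pert)
        # (2) minimize response similarity between original and perturbed inputs
        loss_resp = cosine(resp_pert, resp_orig)
        # (3) minimize representation similarity between original and perturbed inputs
        loss_feat = cosine(feat_pert, feat_orig)

        loss = w_align * loss_align + w_resp * loss_resp + w_feat * loss_feat
        loss.backward()

        with torch.no_grad():
            delta -= alpha * delta.grad.sign()   # signed-gradient descent step
            delta.clamp_(-epsilon, epsilon)      # project back into the L_inf ball
            delta.grad.zero_()

    return (image + delta.detach()).clamp(0, 1)
```

The returned perturbed image would then be fed to the LVM in place of the original; the base model itself is never modified.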
@article{zhang2025poison,
title={Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs},
author={Kejia Zhang and Keda Tao and Jiasheng Tang and Huan Wang},
journal={arXiv preprint arXiv:2501.19164},
year={2025}
}