Publications

(2025). SPQR: A Standardized Benchmark for Modern Safety Alignment Methods in Text-to-Image Diffusion Models'.
(2025). Mitigating Watermark Forgery in Generative Models via Randomized Key Selection.
(2025). Improving LLM First-Token Predictions in\\Multiple-Choice Question Answering via Output Prefilling.
(2024). Unlearning Vision Transformers without Retaining Data via Low-Rank Decompositions. In ICPR.
(2024). Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks. In NAACL.
(2023). Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. In ECCV.
(2023). Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis. In ICIAP.
(2021). Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis. In CVPRW.