Hongfei Lin — Scifaro

Distinguishing Right from Wrong in Debates: Attribution Analysis of Chinese Harmful Memes

Research on harmful meme detection has garnered significant attention, resulting in the development of numerous datasets and methods. However, progress in detecting Chinese harmful memes lags considerably, primarily due to two challenges:…

Computation and Language · Computer Science 2026-05-26 Weiming Wang, Junyu Lu, Han Wang, Xiaokun Zhang, Zewen Bai, Bo Xu, Liang Yang, Hongfei Lin

Harder to Defend: Towards Chinese Toxicity Attacks via Implicit Enhancement and Obfuscation Rewriting

Large language models (LLMs) require robust toxicity evaluation beyond explicit wording. This setting remains underexplored in Chinese, where toxicity may combine semantic indirectness with surface obfuscation. We introduce Chinese Implicit…

Computation and Language · Computer Science 2026-05-22 Jingyi Kang, Junyu Lu, Bo Xu, Hongbo Wang, Linlin Zong, Roy Ka-Wei Lee, Hongfei Lin

Aligning LLM Uncertainty with Human Disagreement in Subjectivity Analysis

Large language models for subjectivity analysis are typically trained with aggregated labels, which compress variations in human judgment into a single supervision signal. This paradigm overlooks the intrinsic uncertainty of low-agreement…

Computation and Language · Computer Science 2026-05-14 Junyu Lu, Deyi Ji, Xuanyi Liu, Lanyun Zhu, Bo Xu, Liang Yang, Xian-Sheng Hua, Hongfei Lin

E2E-GMNER: End-to-End Generative Grounded Multimodal Named Entity Recognition

Grounded Multimodal Named Entity Recognition (GMNER) aims to jointly identify named entity mentions in text, predict their semantic types, and ground each entity to a corresponding visual region in an associated image. Existing approaches…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Meng Zhang, Jinzhong Ning, Xiaolong Wu, Hongfei Lin, Yijia Zhang

RexDrug: Reliable Multi-Drug Combination Extraction through Reasoning-Enhanced LLMs

Automated Drug Combination Extraction (DCE) from large-scale biomedical literature is crucial for advancing precision medicine and pharmacological research. However, existing relation extraction methods primarily focus on binary…

Computation and Language · Computer Science 2026-03-10 Zhijun Wang, Ling Luo, Dinghao Pan, Huan Zhuang, Lejing Yu, Yuanyuan Sun, Hongfei Lin

Multimodal Recommendation (MMR) systems are crucial for modern platforms but are often hampered by inherent noise and uncertainty in modal features, such as blurry images, diverse visual appearances, or ambiguous text. Existing methods…

Information Retrieval · Computer Science 2026-01-28 Xinzhuo Wu, Hongbo Wang, Yuan Lin, Kan Xu, Liang Yang, Hongfei Lin

The Straight and Narrow: Do LLMs Possess an Internal Moral Path?

Enhancing the moral alignment of Large Language Models (LLMs) is a critical challenge in AI safety. Current alignment techniques often act as superficial guardrails, leaving the intrinsic moral representations of LLMs largely untouched. In…

Computation and Language · Computer Science 2026-01-16 Luoming Hu, Jingjie Zeng, Liang Yang, Hongfei Lin

VisualQuest: A Benchmark for Abstract Visual Reasoning in MLLMs

We introduce VisualQuest, a novel dataset designed to rigorously evaluate multimodal large language models (MLLMs) on abstract visual reasoning tasks that require the integration of symbolic, cultural, and linguistic knowledge. Unlike…

Computer Vision and Pattern Recognition · Computer Science 2026-01-05 Kelaiti Xiao, Liang Yang, Dongyu Zhang, Paerhati Tulajiang, Hongfei Lin

Chinese Discharge Drug Recommendation in Metabolic Diseases with Large Language Models

Intelligent drug recommendation based on Electronic Health Records (EHRs) is critical for improving the quality and efficiency of clinical decision-making. By leveraging large-scale patient data, drug recommendation systems can assist…

Computation and Language · Computer Science 2025-12-08 Juntao Li, Haobin Yuan, Ling Luo, Yan Jiang, Fan Wang, Ping Zhang, Huiyi Lv, Jian Wang, Yuanyuan Sun, Hongfei Lin

Visual Puns from Idioms: An Iterative LLM-T2IM-MLLM Framework

We study idiom-based visual puns--images that align an idiom's literal and figurative meanings--and present an iterative framework that coordinates a large language model (LLM), a text-to-image model (T2IM), and a multimodal LLM (MLLM) for…

Computation and Language · Computer Science 2025-12-01 Kelaiti Xiao, Liang Yang, Dongyu Zhang, Paerhati Tulajiang, Hongfei Lin

CommonVoice-SpeechRE and RPG-MoGe: Advancing Speech Relation Extraction with a New Dataset and Multi-Order Generative Framework

Speech Relation Extraction (SpeechRE) aims to extract relation triplets directly from speech. However, existing benchmark datasets rely heavily on synthetic data, lacking sufficient quantity and diversity of real human speech. Moreover,…

Computation and Language · Computer Science 2025-11-25 Jinzhong Ning, Paerhati Tulajiang, Yingying Le, Yijia Zhang, Yuanyuan Sun, Hongfei Lin, Haifeng Liu

Overview of CHIP 2025 Shared Task 2: Discharge Medication Recommendation for Metabolic Diseases Based on Chinese Electronic Health Records

Discharge medication recommendation plays a critical role in ensuring treatment continuity, preventing readmission, and improving long-term management for patients with chronic metabolic diseases. This paper presents an overview of the CHIP…

Computation and Language · Computer Science 2025-11-11 Juntao Li, Haobin Yuan, Ling Luo, Tengxiao Lv, Yan Jiang, Fan Wang, Ping Zhang, Huiyi Lv, Jian Wang, Yuanyuan Sun, Hongfei Lin

MedTrust-RAG: Evidence Verification and Trust Alignment for Biomedical Question Answering

Biomedical question answering (QA) requires accurate interpretation of complex medical knowledge. Large language models (LLMs) have shown promising capabilities in this domain, with retrieval-augmented generation (RAG) systems enhancing…

Computation and Language · Computer Science 2025-10-21 Yingpeng Ning, Yuanyuan Sun, Ling Luo, Yanhua Wang, Yuchen Pan, Hongfei Lin

A Unified Biomedical Named Entity Recognition Framework with Large Language Models

Accurate recognition of biomedical named entities is critical for medical information extraction and knowledge discovery. However, existing methods often struggle with nested entities, entity boundary ambiguity, and cross-lingual…

Computation and Language · Computer Science 2025-10-13 Tengxiao Lv, Ling Luo, Juntao Li, Yanhua Wang, Yuchen Pan, Chao Liu, Yanan Wang, Yan Jiang, Huiyi Lv, Yuanyuan Sun, Jian Wang, Hongfei Lin

FocusMed: A Large Language Model-based Framework for Enhancing Medical Question Summarization with Focus Identification

With the rapid development of online medical platforms, consumer health questions (CHQs) are inefficient in diagnosis due to redundant information and frequent non-professional terms. The medical question summary (MQS) task aims to…

Computation and Language · Computer Science 2025-10-07 Chao Liu, Ling Luo, Tengxiao Lv, Huan Zhuang, Lejing Yu, Jian Wang, Hongfei Lin

Enhancing Textual Personality Detection toward Social Media: Integrating Long-term and Short-term Perspectives

Textual personality detection aims to identify personality characteristics by analyzing user-generated content on social media platforms. Extensive psychological literature highlights that personality encompasses both long-term stable…

Computation and Language · Computer Science 2025-09-30 Haohao Zhu, Xiaokun Zhang, Junyu Lu, Youlin Wu, Zewen Bai, Changrong Min, Liang Yang, Bo Xu, Dongyu Zhang, Hongfei Lin

IP2: Entity-Guided Interest Probing for Personalized News Recommendation

News recommender systems aim to provide personalized news reading experiences for users based on their reading history. Behavioral science studies suggest that screen-based news reading contains three successive steps: scanning, title…

Information Retrieval · Computer Science 2025-07-21 Youlin Wu, Yuanyuan Sun, Xiaokun Zhang, Haoxi Zhan, Bo Xu, Liang Yang, Hongfei Lin

Fine-Grained Chinese Hate Speech Understanding: Span-Level Resources, Coded Term Lexicon, and Enhanced Detection Frameworks

The proliferation of hate speech has inflicted significant societal harm, with its intensity and directionality closely tied to specific targets and arguments. In recent years, numerous machine learning-based methods have been developed to…

Computation and Language · Computer Science 2025-07-16 Zewen Bai, Liang Yang, Shengdi Yin, Yuanyuan Sun, Hongfei Lin

Cultural Bias Matters: A Cross-Cultural Benchmark Dataset and Sentiment-Enriched Model for Understanding Multimodal Metaphors

Metaphors are pervasive in communication, making them crucial for natural language processing (NLP). Previous research on automatic metaphor processing predominantly relies on training data consisting of English samples, which often reflect…

Computation and Language · Computer Science 2025-06-10 Senqi Yang, Dongyu Zhang, Jing Ren, Ziqi Xu, Xiuzhen Zhang, Yiliao Song, Hongfei Lin, Feng Xia

Rethinking Contrastive Learning in Session-based Recommendation

Session-based recommendation aims to predict intents of anonymous users based on limited behaviors. With its ability to alleviate data sparsity, contrastive learning is prevailing in the task. However, we spot that existing contrastive…

Information Retrieval · Computer Science 2025-06-06 Xiaokun Zhang, Bo Xu, Fenglong Ma, Zhizheng Wang, Liang Yang, Hongfei Lin