Instant research discovery

Search and browse ingested papers with intelligence signals and fast filtering.

PaperIntel
"Dark Triad" Model Organisms of Misalignment: Narrow Fine-Tuning Mirrors Human Antisocial Behavior

Fiona Collins, Thilo Hagendorff, Sanaya Parekh, Jonas Kaplan

Year: 2026Area: cs.CLCitations: -

Tags: cscl, alignment-training, ai-safety, preprint

E5 / R4 (95%)
A Causal Graph Approach to Oppositional Narrative Analysis

Miguel Fernandez-de-Retana, Diego Revilla, Lingfeng Chen, Martin Fernandez-de-Retana

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E5 / R3 (95%)
A Closer Look into LLMs for Table Understanding

Jia Wang, Chuanyu Qin, Zheng Lin, Peize Li

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
A Family of LLMs Liberated from Static Vocabularies

Max Meuer, Aleph Alpha, Jan Hendrik Metzen, Dylan Rodriquez

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
A Joint Neural Baseline for Concept, Assertion, and Relation Extraction from Clinical Text

Ribeka Tanaka, Fei Cheng, Sadao Kurohashi

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E4 / R2 (94%)
A Systematic Investigation of Document Chunking Strategies and Embedding Sensitivity

Muntasir Adnan, Carlos C. N. Kuhn, Muhammad Arslan Shaukat

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E5 / R3 (95%)
AI Steerability 360: A Toolkit for Steering Large Language Models

Pierre Dognin, Moninder Singh, Praveen Venkateswaran, Avinash Balakrishnan

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E8 / R4 (96%)
ASDA: Automated Skill Distillation and Adaptation for Financial Reasoning

Sum Yee Chan, Wenting Tan, Tak-Wah Lam, Tik Yu Yim

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Abductive Reasoning with Syllogistic Forms in Large Language Models

Mitsuhiro Okada, Koji Mineshima, Takanobu Morishita Kentaro Ozeki, Risako Ando

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E7 / R3 (95%)
AdaCultureSafe: Adaptive Cultural Safety Grounded by Cultural Knowledge in Large Language Models

Pengfei Bai, Zhirong Liao, Jiawei Jiang, Di Lin

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E5 / R3 (94%)
Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models

Yong Wang, Gurcan Comert, Eric Yocam, Paris Kalathas

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E5 / R3 (97%)
Adaptive Decoding via Test-Time Policy Learning for Self-Improving Generation

Eelaaf Zahid, Asmita Bhardwaj, Yuya Jeremy Ong, Basel Shbita

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models

Jaemin Kim, Jong Chul Ye

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Aligning Large Language Models with Searcher Preferences

Yan Gao, Hui Xiong, Chengqiang Lu, Peilun Zhou

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Aligning Paralinguistic Understanding and Generation in Speech LLMs via Multi-Task Reinforcement Learning

Zhaojiang Lin, Surya Teja Appini, Florian Metze, Rashi Rungta

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Aligning to Illusions: Choice Blindness in Human and AI Feedback

Wenbin Wu

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E5 / R4 (95%)
Alignment Makes Language Models Normative, Not Descriptive

Eilam Shapira, Moshe Tennenholtz, Roi Reichart

Year: 2026Area: cs.CLCitations: -

Tags: cscl, alignment-training, ai-safety, preprint

-
An Extreme Multi-label Text Classification (XMTC) Library Dataset: What if we took "Use of Practical AI in Digital Libraries" seriously?

Jennifer D'Souza, Maximilian Kähler, Luca Zaccagna, Osma Suominen

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Anonymous-by-Construction: An LLM-Driven Framework for Privacy-Preserving Text

Nicolás D'Ippolito, Pablo Ronco, Federico Albanese

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Artificial Intelligence for Sentiment Analysis of Persian Poetry

Arash Zargar, Mitra Shafaei, Abolfazl Moshiri, Shabnam Rahimi-Golkhandan

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Attention-guided Evidence Grounding for Spoken Question Answering

Bolin Chen, Yueping He, Bowen Li, Chengjun Mao

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Automatic Cardiac Risk Management Classification using large-context Electronic Patients Health Records

Bram van Es, Jacopo Vitale, Mario Merone, Leandro Pecchia

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E6 / R3 (97%)
BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization

Xianzhi Yu, Haoli Bai, Zhenhua Dong, Ji-Fu Li

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs

Ilias Aarab

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language

Adrian Gwoździej, Krzysztof Wróbel, Sergio P. Perez, Łukasz Flis

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases

Yunfei Ma, Minglu Liu, Rui Liu, Xiaona Xue

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E4 / R3 (96%)
CCTU: A Benchmark for Tool Use under Complex Constraints

Guoqiang Zhang, Qi Zhang, Junjie Ye, Xuanjing Huang

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents

Jaewoo Kang, Wonjune Jang, Junha Jung, Taeyun Roh

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
COGNAC at SemEval-2026 Task 5: LLM Ensembles for Human-Level Word Sense Plausibility Rating in Challenging Narratives

Azwad Anjum Islam, Tisa Islam Erana

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

-
CRIMSON: A Clinically-Grounded LLM-Based Metric for Generative Radiology Report Evaluation

Sung Eun Kim, Thibault Heintz, Pranav Rajpurkar, Mona Alhammad

Year: 2026Area: cs.CLCitations: -

Tags: cscl, ai-safety, safety-evaluation, preprint

E5 / R3 (94%)

Showing 30 of 147 papers on page 1.