Instant research discovery

Search and browse ingested papers with intelligence signals and fast filtering.

Showing 1-30 of 5578 papers (page 1 of 186)

PreviousNext
PaperIntel
"Better Ask for Forgiveness than Permission": Practices and Policies of AI Disclosure in Freelance Work

Senya Wong, Hyo Jin Do, Jessica He, Angel Hsing-Chi Hwang

Published: 2026-03-08Area: cs.HCCitations: -

Tags: ai-safety, cshc, preprint

E5 / R3 (94%)
"Dark Triad" Model Organisms of Misalignment: Narrow Fine-Tuning Mirrors Human Antisocial Behavior

Fiona Collins, Thilo Hagendorff, Sanaya Parekh, Jonas Kaplan

Published: 2026-03-06Area: cs.CLCitations: -

Tags: cscl, alignment-training, ai-safety, preprint

E5 / R4 (95%)
"I followed what felt right, not what I was told": Autonomy, Coaching, and Recognizing Bias Through AI-Mediated Dialogue

Patrick Carrington, Atieh Taheri, Hamza El Alaoui, Jeffrey P. Bigham

Published: 2026-03-11Area: cs.HCCitations: -

Tags: ai-safety, cshc, preprint

E5 / R3 (96%)
"I'm Not Reading All of That": Understanding Software Engineers' Level of Cognitive Engagement with Agentic Coding Assistants

Emily Kuang, Lheane Marie Dizon, Patricia Nicole Monderin, Carlos Rafael Catalan

Published: 2026-03-15Area: cs.HCCitations: -

Tags: ai-safety, cshc, preprint

E4 / R3 (93%)
$D^3$-RSMDE: 40$\times$ Faster and High-Fidelity Remote Sensing Monocular Depth Estimation

Haofei Zhang, Jie Song, Zunlei Feng, Mingli Song

Published: 2026-03-17Area: cs.CVCitations: -

Tags: ai-safety, cscv, preprint

E6 / R4 (94%)
$PA^3$: $\textbf{P}$olicy-$\textbf{A}$ware $\textbf{A}$gent $\textbf{A}$lignment through Chain-of-Thought

Benjamin Z. Yao, Chenlei Guo, Lichao Wang, Ruhi Sarikaya

Published: 2026-03-15Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E5 / R3 (97%)
$R$-equivalence on Cubic Surfaces I: Existing Cases with Non-Trivial Universal Equivalence

Dimitri Kanevsky, Julian Salazar, Matt Harvey

Published: 2026-03-19Area: math.AGCitations: -

Tags: ai-safety, mathag, preprint

E6 / R3 (94%)
$V_{0.5}$: Generalist Value Model as a Prior for Sparse RL Rollouts

Hongyan Hao, Xunliang Cai, Yueqing Sun, Han-Jia Ye

Published: 2026-03-11Area: cs.LGCitations: -

Tags: ai-safety, cslg, preprint

E5 / R3 (95%)
$\textbf{Re}^{2}$: Unlocking LLM Reasoning via Reinforcement Learning with Re-solving

Jianye Hao, Shuli Xu, Pinzheng Wang, Dong Li

Published: 2026-03-07Area: cs.AICitations: -

Tags: ai-safety, csai, preprint

E5 / R3 (94%)
$p^2$RAG: Privacy-Preserving RAG Service Supporting Arbitrary Top-$k$ Retrieval

Mingyue Wang, Cong Wang, Xiaohua Jia, Yulong Ming

Published: 2026-03-16Area: cs.CRCitations: -

Tags: ai-safety, cscr, preprint

E5 / R3 (96%)
100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

Pushkar Kadilkar, Thibaud Hottelier, Yves-Laurent Kom Samo, Jian He

Published: 2026-03-16Area: cs.DBCitations: -

Tags: ai-safety, csdb, preprint

E6 / R3 (97%)
360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method

Huyen T. T. Tran, Takayuki Okatani, Kang-Jun Liu, Farros Alferro

Published: 2026-03-17Area: cs.CVCitations: -

Tags: ai-safety, cscv, preprint

E5 / R3 (95%)
4D Synchronized Fields: Motion-Language Gaussian Splatting for Temporal Scene Understanding

Rasul Khanbayov, Erchin Serpedin, Hasan Kurban, Samir Abdaljalil

Published: 2026-03-15Area: cs.CVCitations: -

Tags: ai-safety, cscv, preprint

E5 / R3 (94%)
A Behavioral Fingerprint for Large Language Models: Provenance Tracking via Refusal Vectors

Victor S. Sheng, Zhenyu Xu

Published: 2026-02-10Area: Representation AnalysisCitations: -

Tags: empirical, representation-analysis, ai-safety

E5 / R3 (95%)
A Benchmark for Multi-Party Negotiation Games from Real Negotiation Data

Finale Doshi-Velez, Leo Benac, Jonas Raedler, Zilin Ma

Published: 2026-03-14Area: cs.MACitations: -

Tags: ai-safety, csma, preprint

E4 / R2 (91%)
A Blockchain-based Traceability System for AI-Driven Engine Blade Inspection

Yusra Abdulrahman, Khaled Salah, Mohammed A. Mohammed Eltoum, Eman Ouda

Published: 2026-03-09Area: cs.CRCitations: -

Tags: ai-safety, cscr, preprint

E8 / R4 (96%)
A Causal Graph Approach to Oppositional Narrative Analysis

Miguel Fernandez-de-Retana, Diego Revilla, Lingfeng Chen, Martin Fernandez-de-Retana

Published: 2026-03-06Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E5 / R3 (95%)
A Causal Perspective for Enhancing Jailbreak Attack and Defense

Kui Ren, Haozhe Feng, Licheng Pan, Hui Xue

Published: 2026-01-31Area: Adversarial RobustnessCitations: -

Tags: empirical, ai-safety, adversarial-robustness

E6 / R3 (94%)
A Closer Look into LLMs for Table Understanding

Jia Wang, Chuanyu Qin, Zheng Lin, Peize Li

Published: 2026-03-16Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E6 / R3 (94%)
A Computationally Efficient Learning of Artificial Intelligence System Reliability Considering Error Propagation

Larry Head, Fenglian Pan, Jian Liu, Yili Hong

Published: 2026-03-18Area: cs.AICitations: -

Tags: ai-safety, csai, preprint

E5 / R3 (96%)
A Concept is More Than a Word: Diversified Unlearning in Text-to-Image Diffusion Models

Duc Hao Pham, Dien Hy Ngo, Duy Khanh Dinh, Van Duy Truong

Published: 2026-03-19Area: cs.AICitations: -

Tags: ai-safety, csai, preprint

E5 / R3 (96%)
A Consensus-Driven Multi-LLM Pipeline for Missing-Person Investigations

Ravi Mukkamala, Joshua Castillo

Published: 2026-03-09Area: cs.AICitations: -

Tags: ai-safety, csai, preprint

E6 / R4 (94%)
A Context Alignment Pre-processor for Enhancing the Coherence of Human-LLM Dialog

Ding Wei

Published: 2026-03-17Area: cs.AICitations: -

Tags: alignment-training, ai-safety, csai, preprint

E5 / R4 (96%)
A Contextual Help Browser Extension to Assist Digital Illiterate Internet Users

Christos Koutsiaris

Published: 2026-03-18Area: cs.IRCitations: -

Tags: ai-safety, csir, preprint

E5 / R4 (99%)
A Cortically Inspired Architecture for Modular Perceptual AI

Prerna Luthra

Published: 2026-03-07Area: cs.AICitations: -

Tags: ai-safety, csai, preprint

E5 / R4 (94%)
A Diffusion Analysis of Policy Gradient for Stochastic Bandits

Tor Lattimore

Published: 2026-03-10Area: stat.MLCitations: -

Tags: ai-safety, statml, preprint

E5 / R2 (96%)
A Dual Certificate Approach to Sparsity in Infinite-Width Shallow Neural Networks

Christoph Brune, Marcello Carioni, Leonardo Del Grande

Published: 2026-03-18Area: math.OCCitations: -

Tags: ai-safety, mathoc, preprint

E5 / R3 (96%)
A Family of LLMs Liberated from Static Vocabularies

Max Meuer, Aleph Alpha, Jan Hendrik Metzen, Dylan Rodriquez

Published: 2026-03-16Area: cs.CLCitations: -

Tags: cscl, ai-safety, preprint

E5 / R3 (96%)
A Fragile Guardrail: Diffusion LLM's Safety Blessing and Its Failure Mode

Yupeng Chen, Philip Torr, Eric Sommerlade, Jialin Yu

Published: 2026-01-30Area: Adversarial RobustnessCitations: -

Tags: empirical, ai-safety, adversarial-robustness

E6 / R3 (95%)
A Framework and Prototype for a Navigable Map of Datasets in Engineering Design and Systems Engineering

Daniel R. Herber, H. Sinan Bank

Published: 2026-03-16Area: cs.SECitations: -

Tags: ai-safety, csse, preprint

E5 / R4 (97%)