← Back to papers

Paper deep dive

200 Concrete Open Problems in Mechanistic Interpretability

Neel Nanda

Year: 2022Venue: Alignment ForumArea: Surveys & ReviewsType: SurveyEmbeddings: 0

Intelligence

Status: not_run | Model: - | Prompt: - | Confidence: 0%

Entities (0)

No extracted entities yet.

Relation Signals (0)

No relation signals yet.

Cypher Suggestions (0)

No Cypher suggestions yet.

Abstract

Comprehensive list of 200 concrete, tractable research problems in mechanistic interpretability organized by topic area with difficulty ratings.

Tags

ai-safety (imported, 100%)interpretability (suggested, 80%)survey (suggested, 88%)surveys-reviews (suggested, 92%)

Links

Full Text

No full-text extraction is stored for this paper yet.