arxiv:2604.18519
Joseph Tang
lilvjosephtang
Recent Activity
authored a paper 1 day ago
LLM Safety From Within: Detecting Harmful Content with Internal Representations
authored a paper 1 day ago
Maia-2: A Unified Model for Human-AI Alignment in Chess