Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Mikhail Terekhov's picture
1 6

Mikhail Terekhov

terekhov
Gargaz's profile picture 21world's profile picture jasoncorkill's profile picture
·
  • MikhailTerekhov

AI & ML interests

Reinforcement Learning, Multi-objective Reinforcement Learning, RLHF

Recent Activity

liked a dataset about 1 month ago
RoganInglis/control-tax
upvoted a paper 7 months ago
Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
authored a paper 7 months ago
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
View all activity

Organizations

CLAIRE Lab @EPFL's profile picture

authored 3 papers 7 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 84

Control Tax: The Price of Keeping AI in Check

Paper • 2506.05296 • Published Jun 5, 2025

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Paper • 2510.09462 • Published Oct 10, 2025 • 6
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs