Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sean McLeish's picture
12 41 3

Sean McLeish PRO

smcleish
Fishtiks's profile picture astein0's profile picture KevinDavidHayes's profile picture
ยท
https://mcleish7.github.io/
  • SeanMcleish
  • mcleish7

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago
How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models
updated a model 17 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-4-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-2e-5
updated a collection 17 days ago
compression
View all activity

Organizations

Tom Goldstein's Lab at University of Maryland, College Park's profile picture Leon Sean Dev's profile picture University of Maryland's profile picture Gemstones ๐Ÿ’Ž: A Model Suite for Multi-Faceted Scaling Laws's profile picture Gemstones ๐Ÿ’Ž: A Model Suite for Multi-Faceted Scaling Laws (Cooldowns)'s profile picture Gemstones ๐Ÿ’Ž: A Model Suite for Multi-Faceted Scaling Laws (LR Ablation)'s profile picture Latent Context Language Model's profile picture

authored a paper 6 months ago

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper โ€ข 2511.07384 โ€ข Published Nov 10, 2025 โ€ข 19
authored a paper about 1 year ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper โ€ข 2502.05171 โ€ข Published Feb 7, 2025 โ€ข 155
authored 3 papers almost 2 years ago

Benchmarking ChatGPT on Algorithmic Reasoning

Paper โ€ข 2404.03441 โ€ข Published Apr 4, 2024

The CLRS-Text Algorithmic Reasoning Language Benchmark

Paper โ€ข 2406.04229 โ€ข Published Jun 6, 2024 โ€ข 4

Transformers Can Do Arithmetic with the Right Embeddings

Paper โ€ข 2405.17399 โ€ข Published May 27, 2024 โ€ข 54
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs