Al-Hussein

AlHussein

AI & ML interests

Knowledge Distillation, Self-Supervised Learning, Semi-Supervised Learning

Organizations

None yet

upvoted 3 papers 3 days ago

Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis

Paper • 2604.24198 • Published 19 days ago • 22

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Paper • 2604.22875 • Published 23 days ago • 35

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Paper • 2604.24763 • Published 19 days ago • 70

upvoted 2 papers 4 days ago

X2SAM: Any Segmentation in Images and Videos

Paper • 2605.00891 • Published 19 days ago • 25

Video Generation with Predictive Latents

Paper • 2605.02134 • Published 12 days ago • 24

upvoted a paper 2 months ago

Imagination Helps Visual Reasoning, But Not Yet in Latent Space

Paper • 2602.22766 • Published Feb 26 • 44

upvoted a paper 3 months ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 270

upvoted a paper 5 months ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Paper • 2512.17909 • Published Dec 19, 2025 • 37

upvoted 4 papers 6 months ago

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 550

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 148

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 514

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24, 2025 • 64

upvoted 4 papers 7 months ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14, 2025 • 130

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7, 2025 • 146

DreamOmni2: Multimodal Instruction-based Editing and Generation

Paper • 2510.06679 • Published Oct 8, 2025 • 74

Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR

Paper • 2509.18174 • Published Sep 17, 2025 • 134

upvoted 4 papers 8 months ago

LLMs4All: A Review on Large Language Models for Research and Applications in Academic Disciplines

Paper • 2509.19580 • Published Sep 23, 2025 • 14

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 100

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published Sep 19, 2025 • 58

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117