Building on HF

Not Lain

not-lain

chonkie-ai

https://not-lain.github.io

AI & ML interests

custom AI models with HF integration, HuggingFace fellow 🤗

Recent Activity

upvoted a paper 3 days ago

ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning

reacted to SeanLee97's post with 🚀 3 days ago

Our lab recently released a paper introducing ShadowPEFT, a new Parameter-Efficient Fine-Tuning (PEFT) paradigm tailored for edge computing scenarios. Traditional approaches such as LoRA and its variants inject trainable parameters directly into the Transformer's weights, which requires tight coupling with the backbone. ShadowPEFT instead enhances the frozen large base model by adding a lightweight, centralized, pretrainable, and detachable shadow network. This shadow network operates in parallel with the base model, delivering learned corrections to each decoder layer. Because the shadow module is architecturally decoupled from the backbone, it can be trained, stored, and deployed independently, which benefits edge computing and edge-cloud collaborative computing.

- HF Paper: https://huggingface.co/papers/2604.19254
- GitHub: https://github.com/ShadowLLM/shadow-peft
- HF Collection: https://huggingface.co/collections/shadow-llm/shadow-peft-models

reacted to SeanLee97's post with 🔥 3 days ago

Organizations

published an article 7 months ago

Article

Visualizing How VLMs Work

Oct 7, 2025

•

55

published an article over 1 year ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

305

published an article over 1 year ago

Article

Mastering Tensor Dimensions in Transformers

Jan 12, 2025

•

165

published an article over 1 year ago

Article

PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face

Nov 11, 2024

•

20

published an article almost 2 years ago

Article

RAG using huggingface tools

Jul 7, 2024

•

90

published an article almost 2 years ago

Article

Image-based search engine

Jul 4, 2024

•

32

published an article almost 2 years ago

Article

Train custom AI models with the trainer API and adapt them to 🤗

Jun 29, 2024

•

32

published an article about 2 years ago

Article

Custom architectures with HuggingFace 🤗

Apr 22, 2024

•

30