Advertisement ยท 728 ร— 90

Posts by Somnath Basu Roy Chowdhury

(9/n) Finally, I would like to thank all my amazing co-authors: Avinava, @abeirami.bsky.social , Rahul, Nicholas, Amr, Snigdha.

cc @unccs.bsky.social

1 year ago 1 0 0 0
[Somnath Basu Roy Chowdhury]Blogs

(8/n) Here is a blog post with a simplified overview of our work: www.cs.unc.edu/~somnath/blo...

Code: github.com/brcsomnath/pef
Paper link: arxiv.org/abs/2503.20098

1 year ago 2 0 1 0
Post image

(7/n) We would like to highlight previous great works, like LEACE, that perfectly erase concepts to protect against linear adversaries. In our work, we improve upon this method and present a technique that can protect against any adversary.

x.com/norabelrose/...

1 year ago 2 0 1 0
Post image

(6/n) We also visualize the learned representations from different erasure methods. We observe that PEF perfectly erasure group (or concept) information without losing other information (collapsing the representation space).

1 year ago 1 0 1 0
Post image

(5/n) Empirically, we observe that PEF reaches the theoretical limits of erasure even in challenging settings where other methods struggle, including both linear (INLP, LEACE) and non-linear techniques (FaRM, KRaM).

1 year ago 1 0 1 0
Post image

(4/n) When the distributions are unequal, we still achieve perfect erasure but with a slightly reduced utility. The erasure function in this setting is shown below.

1 year ago 1 0 1 0
Post image

(3/n) From the above limits, we show that optimally perfect concept erasure is only feasible when the underlying distributions are equal up to permutations. In such scenarios, the erasure function is shown in the diagram.

1 year ago 1 0 1 0
Post image

(2/n) We study the fundamental limits of concept erasure. Borrowing from the work of @FlavioCalmon et al in information theory literature, we characterize the erasure capacity and maximum utility that can be retained during concept erasure.

1 year ago 2 0 1 0
Post image

๐‡๐จ๐ฐ ๐œ๐š๐ง ๐ฐ๐ž ๐ฉ๐ž๐ซ๐Ÿ๐ž๐œ๐ญ๐ฅ๐ฒ ๐ž๐ซ๐š๐ฌ๐ž ๐œ๐จ๐ง๐œ๐ž๐ฉ๐ญ๐ฌ ๐Ÿ๐ซ๐จ๐ฆ ๐‹๐‹๐Œ๐ฌ?

Our method, Perfect Erasure Functions (PEF), erases concepts perfectly from LLM representations. We analytically derive PEF w/o parameter estimation. PEFs achieve pareto optimal erasure-utility tradeoff backed w/ theoretical guarantees. #AISTATS2025 ๐Ÿงต

1 year ago 37 8 2 3
Advertisement

Please stop by our posters if youโ€™re interested. Feel free to reach out if you're interested in AI safety, efficiency, and just want to chat!

CC: @unccs.bsky.social

1 year ago 0 0 0 0
Post image

(3/3) ๐“๐จ๐ฐ๐š๐ซ๐๐ฌ ๐’๐œ๐š๐ฅ๐š๐›๐ฅ๐ž ๐„๐ฑ๐š๐œ๐ญ ๐Œ๐š๐œ๐ก๐ข๐ง๐ž ๐”๐ง๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐”๐ฌ๐ข๐ง๐  ๐๐„๐…๐“

Iโ€™m also presenting my ongoing unlearning work at SafeGenAI Workshop. This uses a novel PEFT training approach to improve exact unlearning efficiency

arxiv.org/abs/2406.16257

1 year ago 1 0 1 0
Post image

(2/3) ๐…๐š๐ฌ๐ญ ๐“๐ซ๐ž๐ž-๐…๐ข๐ž๐ฅ๐ ๐ˆ๐ง๐ญ๐ž๐ ๐ซ๐š๐ญ๐จ๐ซ

An efficient method for graph field integration (a special case of matrix-vector mult.) using integrator trees. FTFI enables polylog-lin. time multiplication w/ performance boost in vision transformers

arxiv.org/abs/2406.15881

1 year ago 0 0 1 0
Post image

๐ŸšจIโ€™m traveling to #NeurIPS2024 next week to present these papers.

(1/3) ๐’๐ญ๐ซ๐ฎ๐œ๐ญ๐ฎ๐ซ๐ž๐ ๐”๐ง๐ซ๐ž๐ฌ๐ญ๐ซ๐ข๐œ๐ญ๐ž๐-๐‘๐š๐ง๐ค ๐Œ๐š๐ญ๐ซ๐ข๐œ๐ž๐ฌ ๐Ÿ๐จ๐ซ ๐๐„๐…๐“

A new PEFT method replacing low-rank matrices (LoRA) with more expressive structured matrices

arxiv.org/abs/2406.17740

1 year ago 6 1 1 0

Please stop by our posters if youโ€™re interested. Feel free to reach out if you're interested in AI safety, efficiency, and just want to chat!

CC: @unccs.bsky.social

1 year ago 0 0 0 0
Post image

(3/3) ๐“๐จ๐ฐ๐š๐ซ๐๐ฌ ๐’๐œ๐š๐ฅ๐š๐›๐ฅ๐ž ๐„๐ฑ๐š๐œ๐ญ ๐Œ๐š๐œ๐ก๐ข๐ง๐ž ๐”๐ง๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐”๐ฌ๐ข๐ง๐  ๐๐„๐…๐“

Iโ€™m also presenting my ongoing unlearning work at SafeGenAI Workshop. This uses a novel PEFT training approach to improve exact unlearning efficiency

arxiv.org/abs/2406.16257

1 year ago 0 0 1 0
Post image

(2/3) ๐…๐š๐ฌ๐ญ ๐“๐ซ๐ž๐ž-๐…๐ข๐ž๐ฅ๐ ๐ˆ๐ง๐ญ๐ž๐ ๐ซ๐š๐ญ๐จ๐ซ

An efficient method for graph field integration (a special case of matrix-vector mult.) using integrator trees. FTFI enables polylog-lin. time multiplication w/ performance boost in vision transformers

arxiv.org/abs/2406.15881

1 year ago 0 0 1 0
Advertisement