Link Search Menu Expand Document

Below, we show a list of selected publications and talks on the current research focus.

For the complete lists of papers, see Preprints, Conference Papers, and Journal Papers.

Research Focus

Theory and algorithms for deep learning with foundation models.

LinkTopic/TypeTLDRSummaryGithub
Arxiv’24LLM/TheoryDual Operating Modes of In-Context LearningSummaryGithub
Arxiv’24LLM/AlgorithmCan MLLMs Perform Text-to-Image In-Context Learning?SummaryGithub
ICLR’24PEFT/TheoryThe Expressive Power of Low-Rank Adaptation (LoRA)SummaryGithub
ICLR’24LLM/AlgorithmImage Clustering Conditioned on Text CriteriaSummaryGithub
ICLR’24LLM/AlgorithmTeaching arithmetic to a small TransformerSummaryGithub
ICLR’24LLM/AlgorithmA Looped-Transformer Architecture for Efficient Meta-learningSummaryGithub
NeurIPSW’23LLM/AlgorithmCoded Prompts for Large Language Models  
NeurIPSW’23CLIP/AlgorithmZero-shot Improvement of Object Counting with CLIP  
NeurIPSW’23Diffusion/AlgorithmSuper-Resolution Emulation of Large Cosmological Fields with a 3D Conditional Diffusion Model  
NeurIPS’23Diffusion/AlgorithmReinforcement learning for improved text-to-image alignmentSummaryGithub
ICML’23LLM/TheoryLooped Transformers as Programmable ComputersSummaryGithub
ICML’23Diffusion/AlgorithmReinforcement learning for faster DDPM samplingSummaryGithub
ACL’23 (Findings)LLM/AlgorithmAn LLM agent with memory for long-term conversationSummaryGithub
EMNLP’22 (Findings)LLM/AlgorithmUnsupervised word translation (via connecting two CLIP models)SummaryGithub
NeurIPS’22LLM/AlgorithmLIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning TasksSummaryGithub
NeurIPS’22Diffusion/TheoryScore-based Generative Modeling Secretly Minimizes the Wasserstein DistanceSummaryGithub
ICMLW’23CLIP/TheoryMini-Batch Optimization of Contrastive Loss Github
TMLRLLM/AlgorithmA compute-latency trade-off for language model decoding  
ACLW’22LLM/AlgorithmDebiasing language models via parameter-efficient fine-tuning  

Selected Invited Talks on Deep Learning with Foundation Models

  • (Dec. 2023) CSP Seminar @ University of Michigan
    Title: Towards a Theoretical Understanding of Parameter-Efficient Fine-Tuning (and Beyond)
  • (Nov. 2023) Efficient ML workshop @ Google Research New York
    Title: The Expressive Power of Low-Rank Adaptation (LoRA)
  • (Oct. 2023) Trust Perspectives in Machine Learning, Law, and Public Policy at the Institute for Data, Econometrics, Algorithms, and Learning (IDEAL) @ Northwestern University (Oct. 2023)
  • (Oct. 2023) AI in Imaging and Medicine: Breaking Silos, Building Bridges @ University of Wisconsin-Madison
  • (Sep. 2023) The Machine Learning for Medical Imaging (ML4MI) @ University of Wisconsin-Madison
  • (May 2023) KSEA Distinguished Guest Series
  • (Feb. 2023) Information Theory and Applications Workshop
  • (Feb. 2023) The Coordinated Science Laboratory Student Conference @ UIUC
  • (Jan. 2023) Information Theory and Data Science Workshop @ National University of Singapore
  • (Jan. 2023) Systems, Information, Learning and Optimization (SILO) Seminar @ University of Wisconsin-Madison
  • (Aug. 2022) Samsung Advanced Institute of Technology