Research
I'm interested in generative models, generalization, optimization, and the role of data in deep learning.
|
|
Generative Classifiers Avoid Shortcut Solutions
Alexander C. Li,
Ananya Kumar,
Deepak Pathak
ICML 2024 Workshop on Structured Probabilistic Inference & Generative Modeling (Oral Presentation)
openreview |
abstract
Discriminative approaches to classification often learn shortcuts that hold in-distribution but fail even under minor distribution shift. This failure mode stems from an overreliance on features that are spuriously correlated with the label. We show that classifiers based on class-conditional generative models avoid this issue by modeling all features, both causal and spurious, instead of mainly spurious ones. These generative classifiers are simple to train, avoiding the need for specialized augmentations, strong regularization, extra hyperparameters, or knowledge of the specific spurious correlations to avoid. We find that diffusion-based and autoregressive generative classifiers achieve state-of-the-art performance on standard image and text distribution shift benchmarks and reduce the impact of spurious correlations present in realistic applications, such as satellite or medical datasets. Finally, we carefully analyze a Gaussian toy setting to understand the data properties that affect when generative classifiers outperform discriminative ones.
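As a rough illustration of the generative-classifier recipe above, here is a minimal sketch in the kind of Gaussian toy setting the abstract mentions: fit a class-conditional Gaussian per class and classify with Bayes' rule. The dimensions, class means, and diagonal-covariance choice are illustrative and not taken from the paper.

```python
import numpy as np

# Fit a class-conditional Gaussian p(x | y) per class, then classify with
# Bayes' rule: argmax_y  log p(x | y) + log p(y).
rng = np.random.default_rng(0)

def fit_class_conditional_gaussians(X, y, n_classes):
    """Per-class mean, diagonal variance, and log prior."""
    params = []
    for c in range(n_classes):
        Xc = X[y == c]
        params.append((Xc.mean(0), Xc.var(0) + 1e-6, np.log(len(Xc) / len(X))))
    return params

def generative_predict(X, params):
    """Score every class by log-likelihood plus log prior and take the argmax."""
    scores = []
    for mu, var, log_prior in params:
        log_lik = -0.5 * (((X - mu) ** 2) / var + np.log(2 * np.pi * var)).sum(axis=1)
        scores.append(log_lik + log_prior)
    return np.argmax(np.stack(scores, axis=1), axis=1)

# Toy two-class data: the generative classifier models every feature dimension,
# rather than learning a decision boundary from whichever features separate best.
n = 500
X = np.vstack([rng.normal([0.0, 1.0], 1.0, size=(n, 2)),
               rng.normal([2.0, -1.0], 1.0, size=(n, 2))])
y = np.repeat([0, 1], n)
params = fit_class_conditional_gaussians(X, y, n_classes=2)
print("train accuracy:", (generative_predict(X, params) == y).mean())
```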
|
|
On the Surprising Effectiveness of Attention Transfer for Vision Transformers
Alexander C. Li,
Yuandong Tian,
Beidi Chen,
Deepak Pathak,
Xinlei Chen
NeurIPS 2024
arxiv |
pdf |
code |
abstract
Conventional wisdom suggests that pre-training Vision Transformers (ViT) improves downstream performance by learning useful representations. Is this actually true? We investigate this question and find that the features and representations learned during pre-training are not essential. Surprisingly, using only the attention patterns from pre-training (i.e., guiding how information flows between tokens) is sufficient for models to learn high-quality features from scratch and achieve comparable downstream performance. We show this by introducing a simple method called attention transfer, where only the attention patterns from a pre-trained teacher ViT are transferred to a student, either by copying or distilling the attention maps. Since attention transfer lets the student learn its own features, ensembling it with a fine-tuned teacher further improves accuracy on ImageNet. We systematically study various aspects of our findings on the sufficiency of attention maps, including distribution shift settings where they underperform fine-tuning. We hope our exploration provides a better understanding of what pre-training accomplishes and leads to a useful alternative to the standard practice of fine-tuning.
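A minimal single-head sketch of the "attention copy" variant described above: the student mixes its own value features using the teacher's attention map instead of its own query-key scores. The module and variable names are mine, and real ViTs use multi-head attention at every layer.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionWithExternalMap(nn.Module):
    """Single-head self-attention that can mix its own values with a supplied
    (e.g., teacher) attention map instead of its own query-key scores."""
    def __init__(self, dim):
        super().__init__()
        self.qkv = nn.Linear(dim, 3 * dim)
        self.proj = nn.Linear(dim, dim)
        self.scale = dim ** -0.5

    def forward(self, x, external_attn=None):
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        if external_attn is None:
            attn = F.softmax(q @ k.transpose(-2, -1) * self.scale, dim=-1)
        else:
            attn = external_attn            # "attention copy": reuse the teacher's map
        return self.proj(attn @ v), attn

# Toy usage: the teacher's attention pattern guides how the student's own
# (randomly initialized, trainable) value features are mixed across tokens.
tokens = torch.randn(2, 16, 64)             # (batch, tokens, dim)
teacher, student = AttentionWithExternalMap(64), AttentionWithExternalMap(64)
with torch.no_grad():
    _, teacher_attn = teacher(tokens)
student_out, _ = student(tokens, external_attn=teacher_attn)
```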
|
|
An Introduction to Vision-Language Modeling
Florian Bardes,
Richard Yuanzhe Pang,
Anurag Ajay,
Alexander C. Li, ... (41 total authors)
arxiv |
pdf |
abstract
Following the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them to the visual domain. From visual assistants that could guide us through unfamiliar environments to generative models that produce images using only a high-level text description, vision-language model (VLM) applications will significantly impact our relationship with technology. However, there are many challenges that need to be addressed to improve the reliability of those models. While language is discrete, vision evolves in a much higher-dimensional space in which concepts cannot always be easily discretized. To better understand the mechanics behind mapping vision to language, we present this introduction to VLMs, which we hope will help anyone who would like to enter the field. First, we introduce what VLMs are, how they work, and how to train them. Then, we present and discuss approaches to evaluate VLMs. Although this work primarily focuses on mapping images to language, we also discuss extending VLMs to videos.
|
|
Diffusion-TTA: Test-time Adaptation of Discriminative Models via Generative Feedback
Mihir Prabhudesai*,
Tsung-Wei Ke*,
Alexander C. Li,
Deepak Pathak,
Katerina Fragkiadaki
NeurIPS 2023
arxiv |
pdf |
project page |
code |
abstract
Our method, Diffusion-TTA, adapts pre-trained discriminative models, such as image classifiers, segmenters, and depth predictors, to each unlabeled example in the test set using generative feedback from a diffusion model. We achieve this by modulating the conditioning of the diffusion model using the output of the discriminative model. We then maximize the image likelihood objective by backpropagating the gradients to the discriminative model's parameters. We show that Diffusion-TTA significantly enhances the accuracy of various large-scale pre-trained discriminative models, such as ImageNet classifiers, CLIP models, image pixel labelers, and image depth predictors. Diffusion-TTA outperforms existing test-time adaptation methods, including TTT-MAE and TENT, and particularly shines in online adaptation setups, where the discriminative model is continually adapted to each example in the test set.
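A schematic of the adaptation loop described above, with toy stand-ins for both networks: the classifier's predicted probabilities weight the class embeddings that condition a denoiser, and the denoising loss is backpropagated into the classifier's parameters. Shapes, architectures, and the single noise level are placeholders, not the paper's setup.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

num_classes, dim = 10, 64
classifier = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, num_classes))
class_embed = nn.Embedding(num_classes, dim)                   # kept fixed in this sketch
denoiser = nn.Sequential(nn.Linear(3 * 32 * 32 + dim, 512), nn.ReLU(),
                         nn.Linear(512, 3 * 32 * 32))          # predicts the added noise
optimizer = torch.optim.SGD(classifier.parameters(), lr=1e-3)  # only the classifier is adapted

x = torch.randn(4, 3, 32, 32)                                  # unlabeled test images
for _ in range(5):                                             # a few adaptation steps
    probs = classifier(x).softmax(dim=-1)                      # (B, C) predicted class probabilities
    cond = probs @ class_embed.weight                          # probability-weighted conditioning
    noise = torch.randn_like(x)
    noisy = (x + noise).flatten(1)                             # toy one-level "forward diffusion"
    pred_noise = denoiser(torch.cat([noisy, cond], dim=-1))
    loss = F.mse_loss(pred_noise, noise.flatten(1))            # denoising loss as likelihood proxy
    optimizer.zero_grad(); loss.backward(); optimizer.step()   # gradients flow into the classifier
```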
|
|
Your Diffusion Model is Secretly a Zero-Shot Classifier
Alexander C. Li,
Mihir Prabhudesai,
Shivam Duggal,
Ellis Brown,
Deepak Pathak
ICCV 2023
arxiv |
pdf |
project page |
code |
abstract
We show that density estimates from text-to-image diffusion models like Stable Diffusion can be used for zero-shot classification without any additional training. Our generative approach to classification (Diffusion Classifier) outperforms alternative methods of extracting knowledge from diffusion models and has stronger multimodal reasoning abilities than competing discriminative approaches. Finally, we use Diffusion Classifier to extract standard classifiers from class-conditional diffusion models trained on ImageNet. Even though these diffusion models are trained with weak augmentations and no regularization, we find that they approach the performance of SOTA discriminative ImageNet classifiers.
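A minimal sketch of the scoring rule behind this generative approach: score each candidate class by its average noise-prediction error under that class's conditioning and pick the lowest. The forward-process schedule here is simplified, and `denoise_fn` stands in for a text-conditional diffusion model such as Stable Diffusion.

```python
import torch

@torch.no_grad()
def diffusion_classify(x, class_conditions, denoise_fn, n_trials=16):
    """Pick the class whose conditioning gives the lowest average noise-prediction
    error. `denoise_fn(noisy_x, t, cond)` is assumed to return the predicted noise."""
    errors = []
    for cond in class_conditions:
        trial_errors = []
        for _ in range(n_trials):
            t = torch.rand(())                                 # random noise level in [0, 1)
            noise = torch.randn_like(x)
            noisy = (1 - t).sqrt() * x + t.sqrt() * noise      # simplified forward process
            pred = denoise_fn(noisy, t, cond)
            trial_errors.append((pred - noise).pow(2).mean())
        errors.append(torch.stack(trial_errors).mean())
    return int(torch.stack(errors).argmin())                   # predicted class index
```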
|
|
Internet Explorer: Targeted Representation Learning on the Open Web
Alexander C. Li*,
Ellis Brown*,
Alexei A. Efros,
Deepak Pathak
ICML 2023
arxiv |
pdf |
project page |
code |
abstract
We propose dynamically utilizing the Internet to quickly train a small-scale model that does extremely well on the task at hand. Our approach, called Internet Explorer, explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desired target dataset. It cycles between searching for images on the Internet with text queries, self-supervised training on downloaded images, determining which images were useful, and prioritizing what to search for next. We evaluate Internet Explorer across several datasets and show that it outperforms or matches CLIP oracle performance by using just a single GPU desktop to actively query the Internet for 40 hours.
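A skeleton of the search-train-prioritize loop described above. The search, download, and self-supervised training steps are stubbed out with placeholder helpers of my own naming; only the query-prioritization bookkeeping is spelled out.

```python
import numpy as np

rng = np.random.default_rng(0)
concepts = ["golden retriever", "jet engine", "lichen", "violin"]   # candidate text queries
scores = np.zeros(len(concepts))                                    # running usefulness estimates

def search_and_download(query, n_images=8):
    return [f"{query}_{i}.jpg" for i in range(n_images)]            # placeholder for image search

def ssl_train_and_score(images):
    return rng.uniform(size=len(images)).mean()                     # placeholder usefulness signal

for step in range(20):
    probs = np.exp(scores) / np.exp(scores).sum()                   # prioritize promising concepts
    idx = rng.choice(len(concepts), p=probs)
    images = search_and_download(concepts[idx])
    reward = ssl_train_and_score(images)                            # e.g., similarity to target data
    scores[idx] = 0.9 * scores[idx] + 0.1 * reward                  # exponential moving average
```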
|
|
Understanding Collapse in Non-Contrastive Siamese Representation Learning
Alexander C. Li,
Alexei A. Efros,
Deepak Pathak
ECCV 2022
arxiv |
pdf |
project page |
code |
bibtex |
abstract
We empirically analyze non-contrastive self-supervised methods and find that SimSiam is extraordinarily sensitive to model size. In particular, SimSiam representations undergo partial dimensional collapse if the model is too small relative to the dataset size. We propose a metric to measure the degree of this collapse and show that it can be used to forecast the downstream task performance without any fine-tuning or labels. Finally, we demonstrate that shifting to a continual learning setting acts as a regularizer, prevents collapse, and can improve linear probe accuracy by up to 18 percentage points with ResNet-18 on ImageNet.
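One common way to quantify the dimensional collapse discussed above is the effective rank of the embedding covariance; the sketch below computes that, though it is not necessarily the exact metric proposed in the paper.

```python
import torch

def effective_rank(features, eps=1e-12):
    """Entropy-based effective rank of the feature covariance: close to the full
    embedding dimension for well-spread representations, much lower under
    partial dimensional collapse."""
    feats = features - features.mean(0)
    cov = feats.T @ feats / (feats.shape[0] - 1)
    eigvals = torch.linalg.eigvalsh(cov).clamp_min(eps)
    p = eigvals / eigvals.sum()
    return torch.exp(-(p * p.log()).sum()).item()

embeddings = torch.randn(4096, 512)          # e.g., SimSiam outputs on a validation set
print(effective_rank(embeddings))            # near 512 for isotropic features, lower if collapsed
```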
|
|
Functional Regularization for Reinforcement Learning via Learned Fourier Features
Alexander C. Li,
Deepak Pathak
NeurIPS 2021
arxiv |
pdf |
project page |
code |
bibtex |
abstract
We propose a simple architecture for RL that controls how quickly the network fits different frequencies in the training data. We explain this behavior using its neural tangent kernel, and use this to prioritize learning low-frequency functions and reduce networks' susceptibility to noise during optimization. Experiments on state-based and image-based RL benchmarks show improved sample-efficiency, as well as robustness to added bootstrap noise.
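A sketch of a learned Fourier feature front end of the kind the abstract describes: the initialization scale of the first layer controls which frequencies the downstream MLP fits first. The exact parameterization and scales used in the paper may differ.

```python
import torch
import torch.nn as nn

class LearnedFourierFeatures(nn.Module):
    """First layer mapping inputs to [sin(Wx + b), cos(Wx + b)] with learnable W, b.
    The initialization scale of W biases the network toward fitting low or high
    frequencies first."""
    def __init__(self, in_dim, n_features, init_scale=1.0):
        super().__init__()
        self.linear = nn.Linear(in_dim, n_features)
        nn.init.normal_(self.linear.weight, std=init_scale / in_dim ** 0.5)

    def forward(self, x):
        h = self.linear(x)
        return torch.cat([torch.sin(h), torch.cos(h)], dim=-1)

# Toy Q-network for state-based RL with a learned-Fourier-feature front end.
state_dim, n_actions = 17, 6
q_net = nn.Sequential(LearnedFourierFeatures(state_dim, 128, init_scale=0.5),
                      nn.Linear(256, 256), nn.ReLU(), nn.Linear(256, n_actions))
q_values = q_net(torch.randn(32, state_dim))
```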
|
|
Generalized Hindsight for Reinforcement Learning
Alexander C. Li,
Lerrel Pinto,
Pieter Abbeel
NeurIPS 2020
arxiv |
pdf |
project page |
code |
bibtex |
abstract
We present Generalized Hindsight: an approximate inverse reinforcement learning technique for relabeling behaviors with the right tasks. Given a behavior generated under one task, Generalized Hindsight finds a different task that the behavior is better suited for.
Relabeling a trajectory with this different task and training with an off-policy RL algorithm improves performance on a suite of multi-task navigation and manipulation tasks.
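A hedged sketch of the relabeling idea described above: given a set of candidate tasks, assign the trajectory the one under which it earns the highest return. Generalized Hindsight uses an approximate inverse-RL criterion rather than this plain argmax, so treat the selection rule here as a stand-in.

```python
import numpy as np

def relabel_with_best_task(trajectory, candidate_tasks, reward_fn):
    """Pick the candidate task under which this behavior earns the highest return.
    A plain argmax over returns is the simplest stand-in for the paper's
    approximate-IRL relabeling criterion."""
    returns = [sum(reward_fn(s, a, task) for s, a in trajectory)
               for task in candidate_tasks]
    return candidate_tasks[int(np.argmax(returns))]

# Toy usage: tasks are 2-D goal locations, reward is negative distance to the goal.
rng = np.random.default_rng(0)
trajectory = [(rng.normal(size=2), None) for _ in range(10)]       # (state, action) pairs
candidate_tasks = [rng.normal(size=2) for _ in range(8)]           # sampled goals
reward_fn = lambda s, a, goal: -np.linalg.norm(s - goal)
new_task = relabel_with_best_task(trajectory, candidate_tasks, reward_fn)
```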
|
|
Sub-policy Adaptation for Hierarchical Reinforcement Learning
Alexander C. Li*,
Carlos Florensa*,
Ignasi Clavera,
Pieter Abbeel
ICLR 2020
arxiv |
pdf |
project page |
code |
bibtex |
abstract
We develop a new hierarchical RL algorithm that can efficiently adapt pre-trained skills on related tasks, and directly learn effective emergent skills by simultaneously training the entire hierarchy.
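A minimal sketch of a two-level hierarchy in which both the skill selector and the skills hold trainable parameters, in the spirit of training the entire hierarchy jointly; the skill-commitment length, sizes, and naming are illustrative, and this is not the paper's update rule.

```python
import torch
import torch.nn as nn

class HierarchicalPolicy(nn.Module):
    """Two-level policy: a manager picks one of n_skills every skill_len steps,
    and the selected low-level skill maps observations to actions. Both levels
    are trainable, so the whole hierarchy can be optimized together."""
    def __init__(self, obs_dim, act_dim, n_skills=4, skill_len=10):
        super().__init__()
        self.manager = nn.Linear(obs_dim, n_skills)
        self.skills = nn.ModuleList(nn.Linear(obs_dim, act_dim) for _ in range(n_skills))
        self.skill_len = skill_len
        self._step, self._skill = 0, 0

    def forward(self, obs):
        if self._step % self.skill_len == 0:          # commit to a skill for skill_len steps
            self._skill = int(self.manager(obs).argmax(dim=-1))
        self._step += 1
        return self.skills[self._skill](obs)

policy = HierarchicalPolicy(obs_dim=11, act_dim=3)
action = policy(torch.randn(11))                      # one environment step
```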
|
|
Teaching
Co-head TA, CS 294-158: Deep Unsupervised Learning, Spring 2020
Head Content TA, EECS 126: Stochastic Processes, Fall 2019
TA, CS 188: Artificial Intelligence, Spring 2019
TA, CS 188: Artificial Intelligence, Fall 2018
Academic Intern, CS 189: Machine Learning, Spring 2018
Reader, CS 70: Discrete Mathematics & Probability, Fall 2017
|
|
Outreach
Mentor, Google Code Corps 2017
|
|