Alex L. Zhang

Incoming PhD Student at MIT CSAIL, Princeton CS '24

Alex_Zhang_2024.png

Hi 👋! I broadly work on problems related to evaluating language model capabilities, systems programming for machine learning, and AI for science. I co-lead the GPU MODE leaderboard, which most recently hosted a $100k competition with AMD.

I will be a PhD student at MIT CSAIL starting Fall 2025. For the past year I’ve been working as a researcher at VantAI working on AI-based drug discovery.

Before that, I graduated as the top student of the Princeton CS department, where I was blessed with amazing mentors: Professor Karthik Narasimhan, Dr. Khanh Nguyen, Dr. Ofir Press, and Professor Kai Li. Before that, I used to make and sell PC games, one of which was mildly successful (~100k+ players). [example]



research highlights:


Feel free to reach out to talk about anything through my email at [x]@mit.edu where [x]=altzhang. It’s not obvious from my research, but I spent most of my undergrad studying math + CS theory and love to chat about it! On that topic, check out my college roommate Evan’s math reading list! Finally, I’m also very active in the GPU MODE community and co-lead the GPU programming leaderboard.


latest posts


selected publications

  1. videogamebench.png
    VideoGameBench: Can Vision-Language Models complete popular video games?
    Alex Zhang, Thomas L. Griffiths, Karthik R. Narasimhan, and Ofir Press
    Under review
  2. NLP
    kernelbench.png
    KernelBench: Can LLMs Write Efficient GPU Kernels?
    Anne Ouyang*, Simon Guo*, Simran Arora, Alex Zhang , and 3 more authors
    ICML 2025, DL4C (Best Paper) & SSI-FM Workshop @ ICLR 2025
  3. NLP
    teaser_mm.png
    SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
    John Yang*, Carlos E. Jimenez*Alex Zhang, Kilian Lieret , and 9 more authors
    ICLR 2025
  4. NLP
    lgwm.png
    Language-guided World Models: A Model-Based Approach to AI Control.
    Alex Zhang*, Khanh Nguyen*, Jens Tuyls, Albert Lin , and 1 more author
    SpLU-RoboNLP Workshop @ ACL 2024 (Oral)

news

Jun 12, 2025 Went on stage with Lisa Su, CEO of AMD, who recognized our AMD GPU programming competition and announced the winners! [picture].
Nov 01, 2024 Started as a researcher at VantAI under the wonderful Luca Naef!
Jun 03, 2024 Starting a research internship at Snapchat focused on large-scale recommendation systems!
Mar 17, 2023 Organized and went on AI TigerTrek with my close (and brilliant!) friends Evan Dogariu, Michael Tang, and Jiatong Yu! We visited and had Q&A’s at OpenAI, Google, Anthropic, Redwood Research, Stanford HAI, Nuro, and more!
Aug 29, 2022 Finished my internship in Seattle at Apple INFI working on LMs in Siri!